PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > Troubleshooting

Reply
 
Thread Tools
Old 07-15-2004, 03:17 AM   #1
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
no spider my file links

hello

there 's something wrong that i can't explian:

i've put somes doculents(MSWORD) in tree file of my wwwroot apache.

http://quito.citipro.fr/documents


i run an index from this url: phpdif sees first dir but not the MSWORD in this dir ??
lolodev is offline   Reply With Quote
Old 07-15-2004, 04:46 AM   #2
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
ok it's good
lolodev is offline   Reply With Quote
Old 07-15-2004, 09:19 AM   #3
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
quelqu'un a t il une solution pour eviter que phpdig indexe autre chose que l'url donnée ...

si je lui donne quito.citipro.fr/documents, il remonte ensuite Ã* la racine du site quito.citipro.fr et indexe les pages en dessous ...

SITE : http://quito.citipro.fr/
Chemins exclus :
- @NONE@
1:http://quito.citipro.fr/documents/
(temps : 00:00:05)
+ + +
niveau 1...
2:http://quito.citipro.fr/index/pages/fr/20.htm
(temps : 00:00:16)
+ + + + + + + + + + + +
3:http://quito.citipro.fr/documents/lolo/
(temps : 00:00:21)
+
4:http://quito.citipro.fr/documents/lolo2/
(temps : 00:00:26)
+
niveau 2...
5:http://quito.citipro.fr/index/pages/fr/101.htm
(temps : 00:00:36)
+ + + + + + + + + + + + + + + +
6:http://quito.citipro.fr/index/pages/fr/99.htm
(temps : 00:00:42)
+ + + + +
7:http://quito.citipro.fr/index/pages/...tter/index.php
(temps : 00:00:47)
8:http://quito.citipro.fr/index/pages/.../news/news.php
(temps : 00:00:52)
niveau 3...
9:http://quito.citipro.fr/index/pages/fr/119.htm
(temps : 00:01:02)
+ + + + + +
10:http://quito.citipro.fr/index/pages/fr/41.htm
(temps : 00:01:08)
+ + +
11:http://quito.citipro.fr/index/pages/...ipt=affdoc.php
(temps : 00:01:13)
+ + +
12:http://quito.citipro.fr/index/pages/fr/120.
(temps : 00:01:18)
niveau 4...
13:http://quito.citipro.fr/index/pages/fr/127.htm
(temps : 00:01:29)
14:http://quito.citipro.fr/index/pages/fr/130.htm%20clas
(temps : 00:01:34)
15:http://quito.citipro.fr/index/pages/fr/129.htm%20clas
(temps : 00:01:39)
16:http://quito.citipro.fr/index/pages/fr/128.htm%20clas
(temps : 00:01:44)
lolodev is offline   Reply With Quote
Old 07-15-2004, 09:29 AM   #4
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. If LIMIT_TO_DIRECTORY is true then you need to have an ending slash to stay within that directory:

e.g., http://quito.citipro.fr/documents/lolo/ (only indexes documents within documents/lolo/)

e.g., http://quito.citipro.fr/documents/lolo2/lolo3/ (only indexes documents within documents/lolo2/lolo3/)

PHP Code:
//for limit to directory, URL format must either have file at end or ending slash at end
//e.g., http://www.domain.com/dirs/ (WITH ending slash) or http://www.domain.com/dirs/dirs/index.php
define('LIMIT_TO_DIRECTORY',true);      //limit index to given (sub)directory, no sub dirs of dirs are indexed 
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 07-15-2004, 11:20 PM   #5
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
ok

LIMIT_TO_DIRECTORY was always TRUE

:-
lolodev is offline   Reply With Quote
Old 07-15-2004, 11:23 PM   #6
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
but this define limit index to given (sub)directory, no sub dirs of dirs are indexed - My pb is not in sub dir but prevously dir
lolodev is offline   Reply With Quote
Old 07-15-2004, 11:26 PM   #7
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
with TRUE or FALSE , i've the same result
lolodev is offline   Reply With Quote
Old 07-15-2004, 11:38 PM   #8
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Hi. Make sure the tempspider table is empty and then index http://quito.citipro.fr/directory/ (with ending slash).
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 07-15-2004, 11:49 PM   #9
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
hi- my temspider table is empty
lolodev is offline   Reply With Quote
Old 07-16-2004, 12:11 AM   #10
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
There has to be links to the WORD documents, but all http://quito.citipro.fr/documents/ has in it is folders:
Code:
Index of /documents
 Name                    Last modified       Size  Description
--------------------------------------------------------------------------------
 Parent Directory        15-Jul-2004 14:20      -  
 lolo/                   13-Jul-2004 21:23      -  
 lolo2/                  13-Jul-2004 21:38      -  
--------------------------------------------------------------------------------
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 07-16-2004, 12:21 AM   #11
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
i don't unsterstand so good ...

PHPDIG can crawl directory, sub-directory and doc.
i've tested that, and it runs -


(do you speak french ?)
lolodev is offline   Reply With Quote
Old 07-16-2004, 12:23 AM   #12
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
folder and sub folders are like pages or link in a html page
lolodev is offline   Reply With Quote
Old 07-16-2004, 12:29 AM   #13
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
What version are you using?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 07-16-2004, 12:30 AM   #14
lolodev
Orange Mole
 
Join Date: Apr 2004
Location: Nancy (54)
Posts: 38
1.8.3
lolodev is offline   Reply With Quote
Old 07-16-2004, 12:32 AM   #15
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
Quelle URI voulez-vous indexer?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
spider ignores links Maarten Wijnen Troubleshooting 2 03-17-2005 03:23 PM
Spider From A File Thru Web Interface vinyl-junkie Mod Requests 3 12-15-2004 04:15 AM
Spider site with links dell_10 External Binaries 10 09-20-2004 07:42 AM
spider only one site/file jdc32 Troubleshooting 2 07-02-2004 06:49 AM
phpdig spider hangs (a powerpoint file problem) davideyre Troubleshooting 1 03-29-2004 01:35 PM


All times are GMT -8. The time now is 06:37 AM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.