|
04-26-2004, 06:18 AM | #1 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
it just doesn't want to spider...
hi!
i just installed phpdig for a new website. unfortunately it doesn't spider anything at all. the url is http://www.fussball24.de and the spider always says: "no links found" and "has just been indexed". any ideas? |
04-26-2004, 06:48 AM | #2 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
there's something that i should probably add:
i took the line "set_time_limit(86400); // 1 full day" out of spider.php because it produced an error (safe mode). could that be the reason? |
04-26-2004, 07:11 AM | #3 |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
A few possibilities come to mind.
|
04-26-2004, 07:24 AM | #4 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
no, it's none of that. there must be another reason. where can i find that log?
concerning the safe mode: i just installed phpdig on another server with safemode=off, so that's not the reason either. |
04-26-2004, 07:33 AM | #5 |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
Your spider log is the display that happens when you try to spider your site. Sometimes it helps to take a look at that. Then again, the problem could be something completely different.
|
04-26-2004, 08:01 AM | #6 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
ah, that's what you mean:
SITE : http://www.fussball24.de/
Exclude paths :
- - @NONE@
No link in the temporary table
--------------------------------------------------------------------------------
Links found : 0
...Has just been indexed
Optimizing tables...
Indexing complete!

i tried about 20 times, something's really wrong with it... |
04-26-2004, 08:34 AM | #7 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Hi. Try applying the code change in this post and fix your robots.txt file so that it reads:
    User-agent: *
    Disallow: /go.php

instead of just:

    Disallow: /go.php

Also, PhpDig is set to fully function when safe_mode is off.
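For anyone who wants to check their own file before spidering, here is a minimal sketch in Python (not part of PhpDig, and not how PhpDig itself parses the file) that flags Allow/Disallow rules appearing before any User-agent line. The function name find_orphan_rules is made up for this illustration.

```python
def find_orphan_rules(robots_txt):
    """Return (line_number, line) pairs for Allow/Disallow rules
    that appear before any User-agent line in a robots.txt text.

    Hypothetical helper for illustration only.
    """
    orphans = []
    seen_user_agent = False
    for number, raw in enumerate(robots_txt.splitlines(), start=1):
        # Drop trailing comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field == "user-agent":
            seen_user_agent = True
        elif field in ("allow", "disallow") and not seen_user_agent:
            orphans.append((number, line))
    return orphans


broken = "Disallow: /go.php\n"
fixed = "User-agent: *\nDisallow: /go.php\n"

print(find_orphan_rules(broken))  # the rule on line 1 is reported
print(find_orphan_rules(fixed))   # nothing is reported
```

An empty result only means every rule belongs to some User-agent group. It says nothing about whether the spider will accept the rest of the file.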
|
04-26-2004, 08:46 AM | #8 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
yeah! thanx charter, it works now. i only added the line in the robots.txt. great!
|
04-26-2004, 11:29 AM | #9 |
Green Mole
Join Date: Apr 2004
Posts: 4
|
I'm encountering the same problem, but creating a robots.txt file does not seem to help.
Here is my robots.txt:

    User-agent: *
    Disallow: /include/

When I tell it to spider localhost or the hostname of the local server (depth=5), it always results in this:

    Spidering in progress...
    --------------------------------------------------------------------------------
    SITE : http://localhost/
    Exclude paths :
    - include/

No CPU cycles are being used and the page is fully loaded. What else can I check? How will I know that the spider process is actually doing something? Thanks |
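One thing that can be ruled out independently of PhpDig is whether this robots.txt blocks the start page. The sketch below uses Python's standard urllib.robotparser, which is a different parser from the one PhpDig uses, so it only shows how a standard parser reads these two lines. The path /include/config.php is a made-up example.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt content quoted in the post above.
robots_lines = [
    "User-agent: *",
    "Disallow: /include/",
]

parser = RobotFileParser()
parser.parse(robots_lines)

# The start page is outside /include/, so a standard parser allows it.
root_allowed = parser.can_fetch("*", "http://localhost/")

# Anything under /include/ is excluded (example path, assumed for illustration).
include_allowed = parser.can_fetch("*", "http://localhost/include/config.php")

print("root allowed:", root_allowed)
print("include allowed:", include_allowed)
```

If the root is allowed here, the file as written is unlikely to be what stops the spider, and the cause is probably somewhere else.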