PhpDig.net

Go Back   PhpDig.net > PhpDig Forums > Troubleshooting

Reply
 
Thread Tools
Old 01-21-2005, 11:07 AM   #1
tajmahal
Green Mole
 
Join Date: Jan 2005
Posts: 16
Indexing Issue

I appologize for starting another thread on what seems to be a very common topic, but i have looked through all other threads and have not been able to resolve my problem. My PHPdig wont index any sites i put into it. First, i put in my domain, and it seemed to pick up my home page and one subdirectory. I have tried to re-index for the whole site, but it has been unable to do so. I tried indexing other websites, and haven't even been able to get any pages from them. I have chmod-ed all of the required directories to 777, and my config file is as follows:

PHP Code:
define('SEARCH_DEFAULT_LIMIT',20);    
define('SPIDER_MAX_LIMIT',100);    
define('RESPIDER_LIMIT',100);          
define('LINKS_MAX_LIMIT',100);      
define('RELINKS_LIMIT',100);           
define('LIMIT_TO_DIRECTORY',false);
define('PHPDIG_IN_DOMAIN',true); 
also my control panel gives me:
Spidering in progress... [Stop spider]
SITE : http://www.mysite.org
Exclude paths :
- @NONE@
links found : 0
...Was recently indexed
Optimizing tables...
Indexing complete !

Any ideas?
tajmahal is offline   Reply With Quote
Old 01-21-2005, 02:49 PM   #2
Dave A
Purple Mole
 
Dave A's Avatar
 
Join Date: Aug 2004
Location: North Island New Zealand
Posts: 170
It may be because you have recently indexed the web site, you could try to delete the domain from the admin panel and then re index it.
There are some related functions in the config.php script that relates to re indexing, for my part I usualy drop the domain and then tidy up the index, keywords and common words and then spider it again.
Dave A is offline   Reply With Quote
Old 01-21-2005, 04:14 PM   #3
tajmahal
Green Mole
 
Join Date: Jan 2005
Posts: 16
i tried deleting and re-indexing, but it is to no avail. I'm getting similar results. One thing i forgot to mention, my pages are all .shtml - would that make any difference?

Another question:
The footers on my website hold links to other areas of the website. I would greatly desire for the footer to be spidered. Unfortunately, I insert the footer into the pages via Server-Side Includes (SSI). Is there any way i can spider these dynamically-generated links?
tajmahal is offline   Reply With Quote
Old 01-21-2005, 08:02 PM   #4
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
The SHTML pages and SSI should not matter. You'd need to spider your site from a page with links to other pages. Starting from the main Flash page, without links elsewhere, won't index the site, as PhpDig follows links. Enter a link from an inner page in the textbox, click the 'no' radio button, set 'Search depth' to a large value, set 'Links per' to zero, and give that a shot. Also, as an aside, the home.shtml page seems to be missing an end DIV tag somewhere.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 01-22-2005, 06:20 AM   #5
tajmahal
Green Mole
 
Join Date: Jan 2005
Posts: 16
Thanks! the spider seems to be working well now. How often will it respider? By the way, how'd you know about that div tag?
tajmahal is offline   Reply With Quote
Old 01-22-2005, 01:26 PM   #6
Charter
Head Mole
 
Charter's Avatar
 
Join Date: May 2003
Posts: 2,539
PhpDig respiders as often as you do it, or as often as you set a cron job to do it. As for the DIV tag, I am magic... ha, ha... just a view of the HTML source is all.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension.
Charter is offline   Reply With Quote
Old 01-23-2005, 07:24 AM   #7
tajmahal
Green Mole
 
Join Date: Jan 2005
Posts: 16
set cron job to do it ? Can i have a link to the proper documentation on that? And by the way magic man, how'd you know my domain in the first place?
tajmahal is offline   Reply With Quote
Old 01-23-2005, 07:26 AM   #8
jmitchell
Orange Mole
 
Join Date: Dec 2004
Location: Tennessee
Posts: 60
you hit the nail on the head - charter is magic
__________________
60,000 pages indexed!!!!! http://www.sharemylink.com
jmitchell is offline   Reply With Quote
Old 02-19-2005, 12:03 PM   #9
tajmahal
Green Mole
 
Join Date: Jan 2005
Posts: 16
ok so when my cron job reindexes, does it delete the old index and then make a new one? if i rearrange some pages by moving them to different folders, it seems that the new page as well as the old page is listed in the index. any way around this?
tajmahal is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
pdftotext issue JonnyNoog External Binaries 6 07-15-2006 12:40 AM
Spidering issue cefiro How-to Forum 0 02-28-2005 10:01 AM
config issue baskamer Troubleshooting 2 12-18-2004 01:33 PM
Installation issue... again jinx Script Installation 1 06-14-2004 09:31 PM
pstotext issue killer27 External Binaries 7 05-12-2004 02:28 PM


All times are GMT -8. The time now is 06:25 PM.


Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.