|
01-18-2005, 03:59 PM | #1 |
Purple Mole
Join Date: Aug 2004
Location: North Island New Zealand
Posts: 170
|
Indexing dynamically generated web pages
I wonder if there is a way of getting Phpdig to index dynaically generated web pages?
Each time I try to spider a web site that has dynanic pages generation it would appear that the spider doesn't find any content and can't index it. Perhaps there may be a few things within the configuration files that need amending? So if any one has any ides could they please post an answer to the forum. Many thanks from Dave Downunder |
01-18-2005, 04:57 PM | #2 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
What version of PhpDig are you using? Would you provide an example link?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
01-18-2005, 07:10 PM | #3 |
Purple Mole
Join Date: Aug 2004
Location: North Island New Zealand
Posts: 170
|
Relpy
Hi Charter,
firstly many thanks for getting back to me so quickly. One example of the type of dynamic files I can't seem to index can be found via www.hastings.co.nz The spider visits and tries for a few moments then replies with Indexed 0 files and then no link found in Temporary folder. I have contacted the people who designed the web site and they have said that each page is dynamically generated. Thanks for your help with this, typing is a little hard because I had a couple of cataract ops yesterday and things seem just a little fuzzy around the edges until the swelling has gone. Many regards Dave Andrews |
01-18-2005, 07:47 PM | #4 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
What version of PhpDig are you using?
If you use the latest version, you should see the following type output: Spidering in progress... [Stop spider] SITE : http://www.hastings.co.nz/ Exclude paths : - @NONE@ 1:http://www.hastings.co.nz/ (time : 00:00:08) HTTP/1.1 403 Forbidden - http://www.hastings.co.nz/editable/welcome.shtml/ See http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html for explanation. HTTP/1.1 403 Forbidden - http://www.hastings.co.nz/editable/welcome.shtml See http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html for explanation. + level 1... HTTP/1.1 403 Forbidden - http://www.hastings.co.nz/editable/welcome.shtml See http://www.w3.org/Protocols/rfc2616/rfc2616-sec10.html for explanation. 2:http://www.hastings.co.nz/editable/welcome.shtml (time : 00:00:20) No link in temporary table links found : 2 http://www.hastings.co.nz/ http://www.hastings.co.nz/editable/welcome.shtml Optimizing tables... Indexing complete ! [Back] to admin interface. It has nothing to do with dynamic files. That site is giving 403s, meaning forbidden, not allowed, go away. Hope your eyes feel better soon.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
02-01-2005, 12:03 PM | #5 | |
Green Mole
Join Date: Jan 2005
Location: Sacramento
Posts: 8
|
Quote:
|
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Extracting search results and using them in your own web pages | ciaran@clissman | Mod Submissions | 1 | 11-26-2005 11:14 AM |
Indexing stops after a few pages | Sibona | Troubleshooting | 1 | 05-03-2005 11:27 AM |
is it real to inrease indexing time with web interface? | zaartix | How-to Forum | 1 | 07-14-2004 09:13 PM |
converted from html pages to php pages now no pages will index!!! help!! | bigals | Troubleshooting | 24 | 04-01-2004 10:34 AM |
Understanding logs from web indexing | kenazo | How-to Forum | 1 | 03-15-2004 12:06 PM |