|
09-19-2004, 12:05 AM | #1 |
Green Mole
Join Date: Sep 2004
Posts: 11
|
PHPDig indexing certain pages
Hi there,
I have a website with more than 65 pages. It is hosted on Server A, a Windows 2000 Advanced Server behind a proxy on the Internet. PHPDig is installed on Server B, a Unix server in the LAN. Servers A and B have a trust relationship behind the proxy, but for some reason PHPDig indexes only 7 pages of the website. PHPDig has no problem converting documents, since it indexes my intranet website on Server C (900+ documents) perfectly. Could you please give me a hint on what to do, or what troubleshooting is needed? Thanks for your continuous help and support. |
09-20-2004, 02:40 AM | #2 |
Green Mole
Join Date: Sep 2004
Posts: 11
|
More clarifications
Hi there,
My log file shows that the only documents indexed are the ones that received a GET request. All the others, which received only a HEAD request and no GET, were not indexed; the log shows just a HEAD with status 200 for them. These are the entries from my log file:
2004-09-20 06:07:48 HEAD /robots.txt - 404 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:48 HEAD /internet/College-of-Medicine.doc_cvt.htm - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:48 GET /internet/College-of-Medicine.doc_cvt.htm - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:53 HEAD /internet/CSS/style_internet.css - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:53 HEAD /internet/index.htm - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
Last edited by asanad; 09-20-2004 at 02:49 AM. |
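For anyone comparing their own server log against the one in the post above, here is a minimal Python sketch that lists the paths which received a HEAD with status 200 but never a GET. The field layout (date, time, method, path, query, status, user agent) is an assumption read off the sample lines, and the function name `head_only_paths` is made up for illustration; it is not part of PhpDig.

```python
from collections import defaultdict

# Sample lines copied from the log in the post above.
LOG = """\
2004-09-20 06:07:48 HEAD /robots.txt - 404 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:48 HEAD /internet/College-of-Medicine.doc_cvt.htm - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:48 GET /internet/College-of-Medicine.doc_cvt.htm - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:53 HEAD /internet/CSS/style_internet.css - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
2004-09-20 06:07:53 HEAD /internet/index.htm - 200 PhpDig/1.8.3+(+http://www.phpdig.net/robot.php)
"""


def head_only_paths(log_text):
    """Return paths that got a HEAD 200 but no GET, in first-seen order."""
    seen = defaultdict(set)  # path -> set of (method, status) pairs
    order = []               # paths in the order they first appear
    for line in log_text.splitlines():
        fields = line.split()
        # Assumed layout: date time method path query status user-agent
        if len(fields) < 6:
            continue
        method, path, status = fields[2], fields[3], fields[5]
        if path not in seen:
            order.append(path)
        seen[path].add((method, status))
    return [
        path
        for path in order
        if ("HEAD", "200") in seen[path]
        and not any(method == "GET" for method, _ in seen[path])
    ]


if __name__ == "__main__":
    for path in head_only_paths(LOG):
        print(path)
```

Run against the sample, this reports the stylesheet and `/internet/index.htm`. The stylesheet is not a page one would expect in a search index anyway, so the entry worth investigating is the HTML page that was never fetched with GET.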
09-20-2004, 02:46 PM | #3 |
Green Mole
Join Date: Sep 2004
Posts: 13
|
I have the same problem!
Hi all,
I have the same problem. Any hints? |
09-20-2004, 08:23 PM | #4 |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
There's a thread about this same sort of issue that was posted not too long ago. I don't remember the outcome of it though. You might want to do a forum search and see what turns up.
|
09-21-2004, 02:41 AM | #5 |
Green Mole
Join Date: Sep 2004
Posts: 11
|
vinyl-junkie,
Thank you for your co-operation. Could I please ask you for a favour? I tried to search for the thread but had no luck. Could you give me a tip on where to find it? Thanks |
09-21-2004, 03:00 PM | #6 |
Green Mole
Join Date: Sep 2004
Posts: 13
|
Thanks vinyl-junkie,
but I still can't find any thread that helps me solve this problem. Could you give me a link to a thread that discusses the same problem? |
09-21-2004, 03:05 PM | #7 |
Green Mole
Join Date: Sep 2004
Posts: 13
|
Last edited by dell_10; 09-21-2004 at 03:19 PM. |
09-21-2004, 07:32 PM | #8 | |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
Quote:
You can speed up your search a little by hovering your mouse over the subject of each post and viewing the first few lines of it. That might give you a clue as to whether you'd want to read that post further, if searching by keywords isn't finding it for you. |
|
09-21-2004, 07:55 PM | #9 | |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
Quote:
Would you like for me to PM that to you or post it here? |
|
09-21-2004, 10:43 PM | #10 |
Green Mole
Join Date: Sep 2004
Posts: 13
|
thanks vinyl-junkie
Yes, it includes around 122 pages. Could you PM the spider log to me or post it here? Thanks again. Last edited by dell_10; 09-21-2004 at 10:49 PM. |
09-22-2004, 12:39 AM | #11 |
Green Mole
Join Date: Sep 2004
Posts: 11
|
I am really puzzled
I am a bit puzzled now.
If PhpDig could index the website over an Internet connection, why doesn't it index it locally from my machines? Currently, in my LAN, PhpDig indexes many websites on the intranet, but for the Internet website (behind a proxy) it indexes only the documents directly under the root. For example, "homepage/test.htm" gets indexed, but "homepage/about-us/test.htm" does not. Why does it follow links only to documents directly under the root of the website? Do you have any hints? |
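One way to check the observation in the post above (pages directly under the root get indexed, pages one folder deeper do not) is to group the site's URLs by directory depth and see whether every missing page sits at the same depth. The Python sketch below only does that grouping; it says nothing about the cause, and the names `directory_depth` and `group_by_depth` are hypothetical helpers, not PhpDig functions.

```python
from posixpath import dirname
from urllib.parse import urlsplit


def directory_depth(url):
    """Count the directories between the site root and the document."""
    path = urlsplit(url).path           # works for full URLs and bare paths
    parent = dirname(path)              # e.g. "homepage/about-us"
    return len([part for part in parent.split("/") if part])


def group_by_depth(urls):
    """Map each directory depth to the URLs found at that depth."""
    groups = {}
    for url in urls:
        groups.setdefault(directory_depth(url), []).append(url)
    return groups


if __name__ == "__main__":
    # The two example paths from the post above.
    urls = ["homepage/test.htm", "homepage/about-us/test.htm"]
    for depth, members in sorted(group_by_depth(urls).items()):
        print(depth, members)
```

If all the pages that were skipped fall in the deeper groups, that points toward a depth or link-following limit somewhere between the spider and the site, rather than a problem with individual documents.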
09-22-2004, 04:37 AM | #12 | |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
Quote:
I'm not gonna keep posting like this, but I did want to point out that the reason it's so slow is most likely because you're on a Windows server. Phpdig just doesn't perform very well there, in my experience. |
|
09-22-2004, 03:48 PM | #13 |
Green Mole
Join Date: Sep 2004
Posts: 13
|
Thanks vinyl-junkie,
I'm really confused about why you could index it and I could not. My partner described our situation: we have 2 servers, intranet and internet (both Windows servers). PhpDig has already indexed the intranet (900 pages), but it indexes only 7 pages (only the root folder) on the internet server. Maybe it's because the internet server is behind a proxy? |
09-22-2004, 08:31 PM | #14 | |
Purple Mole
Join Date: Jan 2004
Posts: 694
|
Quote:
I've attached the zip file for your spider log. Hope it helps. |
|
09-23-2004, 10:02 AM | #15 |
Green Mole
Join Date: Sep 2004
Posts: 13
|
Thanks vinyl-junkie,
Could you help me by posting your config.php and robots_function.php files so I can compare them with mine? Thank you |
|
|