![]() |
speciffically slow spidering at fgets()
I've read the other posts re: slow spidering behavior and found nothing matching my situation. Please help!
After inserting traces and such into the code, I've found a consistent delay of 10 - 15 seconds for each page being indexed which occurs across a specific function call: Code:
FILE: robot_functions.php Code:
OS: Win 2000 Thanks much! |
PS - One more helpful(?) bit of info: while PhpDig spidering is going on, I've watched my CPU activity which is mostly nothing, with occasional spikes (every 10 - 15 seconds, BTW). To me, this points to a timeout issue - but I don't know where / what layer to consider. (Also, I've reduced all PhpDig sleeps to 1 or 2 seconds and this is NOT the problem at all). Thanks again!
|
Are you able to spider from shell? That might be a way around the problem.
|
Vinyl J -
Good idea (and it made me solve some incidental installation problems), yet no go (i.e. same problem and with harder-to-read output <lol>). Anyway, as I mentioned above, the wget mirroring program doesn't have any trouble like this - it's quite zippy! That points away from the httpd software / configuration. It has all the smell of a communication timeout issue, but how do I investigate beyond the sticking fgets() ? |
Hi. Can't say I've experienced fgets problems. Perhaps something here might help?
|
Well, I've exactly found the problem: the code doesn't respect the Content-Length header (or when chunked, the chunk sizes). Thus, it will always attempt an over-read. I suppose on some configurations that doesn't make a difference, but on mine it surely does! I've fully solved the problem in the test script and partially moved that solution into my own PhpDig code. If anyone cares to know more, get in touch...
Cheers! |
Will you post your mod in the Mod Submissions forum?
|
just to throw in my two cents worth...
i'm already communicating with slintz, but this isn't a problem specific only to him...the exact same thing happens to me when i try and spider my site...i always get between 10-15 seconds (sometimes up to 20) of delay / page here is my server info: OS: Solaris 5.8 PHP: 4.3.8 Apache: 2.0.50 MySQL: 4.0.13 PhpDig: 1.8.3 yes, i realize that some of those are older versions, but i have no control over that...i just write the webpages :) |
All times are GMT -8. The time now is 03:49 PM. |
Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.