|
03-14-2004, 01:47 PM | #1 |
Green Mole
Join Date: Mar 2004
Posts: 9
|
multiple crawlers
Does anyone think this is a good way to get phpDig to run multiple crawlers for the sites in the database? When I ran spider.php I noticed that it only does one site at a time, and when it came across a site such as dmoz.org it would take days, if not weeks, to index all of it.
http://rbhs.ath.cx/~reza/phpdig/wrapper.php |
03-14-2004, 04:54 PM | #2 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
PHP Code:
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
03-14-2004, 09:11 PM | #3 |
Green Mole
Join Date: Mar 2004
Posts: 9
|
fix
I made the following fix.
PHP Code:
|
03-15-2004, 08:59 AM | #4 |
Green Mole
Join Date: Mar 2004
Posts: 19
|
I think it's a nice option.
But where must the file be installed? And how do I get it to work? Marten |
03-15-2004, 09:37 AM | #5 |
Green Mole
Join Date: Mar 2004
Posts: 9
|
how to work?
Hey, I usually just run it in the same directory as spider.php
%pwd
/usr/home/reza/public_html/phpdig/admin
%php -f wrapper.php
screen -A -m -d -S freebsd.org_phpdig php -f spider.php http://freebsd.org/
screen -A -m -d -S openbsd.org_phpdig php -f spider.php http://openbsd.org/
%screen -list
There are screens on:
13219.daily.daemonnews.org_phpdig (Detached)
10700.freebsd.org_phpdig (Detached)
13241.openbsd.org_phpdig (Detached)
88053.staff.daemonnews.org_phpdig (Detached)
88057.seclists.org_phpdig (Detached)
6 Sockets in /tmp/screens/S-reza.
%
and you can add it to crontab to have it run whenever you want |
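For anyone who wants to see the idea without the PHP source, here is a rough sketch in plain shell of what a wrapper like this could do. It is not the original wrapper.php. It assumes a hypothetical text file, sites.txt, with one URL per line, standing in for the site list that the real script reads from the phpDig database. The function names are made up for illustration. The screen flags and the host_phpdig session naming are copied from the session shown above.

```shell
#!/bin/sh
# Hypothetical sketch: one detached screen session per site URL.
# sites.txt (one URL per line) is an assumption; the real wrapper.php
# takes its site list from the phpDig database instead.

# Derive a screen session name such as "freebsd.org_phpdig" from a URL.
session_name() {
    host=${1#*://}      # strip the scheme, e.g. "http://"
    host=${host%%/*}    # strip everything after the host name
    printf '%s_phpdig\n' "$host"
}

# Print the screen command for one site, with the flags used in the post.
spider_command() {
    printf 'screen -A -m -d -S %s php -f spider.php %s\n' \
        "$(session_name "$1")" "$1"
}

# Print one command per non-empty line of the given file.
spider_commands() {
    while IFS= read -r url; do
        if [ -n "$url" ]; then
            spider_command "$url"
        fi
    done < "$1"
}

# Example use (run from the directory that holds spider.php):
#   spider_commands sites.txt        # only print the commands
#   spider_commands sites.txt | sh   # actually start the sessions
```

Printing the commands first and piping them to sh only when they look right makes it easy to check what would be started. For cron, an entry along the lines of "0 3 * * * cd /usr/home/reza/public_html/phpdig/admin && php -f wrapper.php" should work, where the 3 AM schedule is only an example.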
01-06-2005, 07:18 PM | #6 |
Orange Mole
Join Date: Dec 2004
Location: Tennessee
Posts: 60
|
I guess I'm not totally understanding how this works...
If I'm correct, you install this script, and run it via cron jobs, and it will see if there are sites to be indexed, and add multiple spiders to handle it - right? jmitchell
__________________
60,000 pages indexed!!!!! http://www.sharemylink.com |
01-06-2005, 07:21 PM | #7 |
Orange Mole
Join Date: Dec 2004
Location: Tennessee
Posts: 60
|
__________________
60,000 pages indexed!!!!! http://www.sharemylink.com |