![]() |
robots.txt
With the following robots.txt, no indexing, I always get: links found: 0, ... Was recently indexed:
User-agent: phpdig Disallow: User-agent: * Disallow: / After removing this robots.txt, all goes fine. My intention was to allow PhpDig to index, but tell the others to go away. Did I get the syntax wrong? |
|
I've taken this example from the quoted source, fr. Anonymus. In my opinion it shows that it should be possible (sorry for the lost alignment):
# /robots.txt for http://www.fict.org/ # comments to webmaster@fict.org User-agent: unhipbot Disallow: / User-agent: webcrawler User-agent: excite Disallow: User-agent: * Disallow: /org/plans.html Allow: /org/ Allow: /serv Allow: /~mak Disallow: / The following matrix shows which robots are allowed to access URLs: unhipbot webcrawler-excite other http://www.fict.org/ No Yes No http://www.fict.org/index.html No Yes No http://www.fict.org/robots.txt Yes Yes Yes http://www.fict.org/server.html No Yes Yes http://www.fict.org/services/fast.html No Yes Yes http://www.fict.org/services/slow.html No Yes Yes http://www.fict.org/orgo.gif No Yes No http://www.fict.org/org/about.html No Yes Yes http://www.fict.org/org/plans.html No Yes No http://www.fict.org/%7Ejim/jim.html No Yes No http://www.fict.org/%7Emak/mak.html No Yes Yes |
Hi. I haven't tested the below code, but it should get around the following case:
Quote:
In robot_functions.php find the phpdigReadRobotsTxt function and in this function find: PHP Code:
PHP Code:
|
All times are GMT -8. The time now is 09:53 AM. |
Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.