![]() |
Exclude list?
First of all, PhpDig looks like an awesome product. I've been looking for a new search engine for ages, and I think I've found it!!
One question: In the docs for phpdig 1.8.0, it says: Quote:
PHP Code:
/developers/community/forums/... -Antun |
Hi antun, and welcome to PhpDig.net!
The BANNED constant is meant to prevent the following of certain links in pages that get crawled. To prevent certain directories from being crawled altogether, set a robots.txt file in your web root. If a directory has already been crawled and you want to exclude it, just click the red circle noway symbol from the admin panel. |
Thanks, but if I use a robots.txt file to exclude certain directories, won't that prevent those dirs from being indexed by public search engines too (e.g. Google?).
I'm only trying to fine-tune our search - for example, I'd like to exclude our forums from all searches, and I'd like to remove our Developers area from all non-tech releated searches. Should I be excluding different directories and running separate indexes, or should I be running one large index and (if possible?) excluding parts of the site at search-time? -Antun |
Hi. A robots.txt file with the following should exclude the directories from PhpDig prior to index:
Code:
User-agent: PhpDig |
Got it! That will work for the "/developers/community/forums/", which I never want indexed.
However, in my case, I'd like to have separate configurations: - The entire website (excluding /developers/). - All of /developers/, but nothing in the rest of the site. - Just /lps-2.0/docs/lzx-reference/, but nothing else. I presume the best way would be to have each one as a separate website, right?. You see I want to give people an option as to what to search (using a pull-down) most likely. (You can see what I mean here: http://www.laszlosystems.com/developers/). -Antun |
Hi. Perhaps this thread might help.
|
All times are GMT -8. The time now is 03:45 AM. |
Powered by vBulletin® Version 3.7.3
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
Copyright © 2001 - 2005, ThinkDing LLC. All Rights Reserved.