![]() |
|
![]() |
#1 |
Green Mole
Join Date: Nov 2005
Posts: 2
|
Exclude files by pattern
I have a directory tree which contains blah_nn.html and blah_nn_1.html, where blah_nn_1.html are the "Printable" versions of blah_nn.html. Can phpDig be configured to exclude files matching /blah_\d+_1\.html/i when spidering?
Thanks, Pi |
![]() |
![]() |
![]() |
#2 |
Green Mole
Join Date: Nov 2005
Posts: 2
|
Hmm.
Now that I think about it, I could mess with FORBIDDEN_EXTENSIONS. Code:
define('FORBIDDEN_EXTENSIONS','(\d+_1\.html)|\.(rm|ico|cab|swf|css|gz|z|tar|zip|tgz|msi|arj|zoo|rar|r[0-9]+|exe|bin|pkg|rpm|deb|bz2)$'); |
![]() |
![]() |
![]() |
#3 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Yes, very reasonable. Try [0-9]+ though as eregi is used. See this thread for another example.
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
![]() |
![]() |
![]() |
|
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Can't exclude few pages | mleray | Troubleshooting | 2 | 11-19-2004 12:25 AM |
Exclude paths : -'*' -@NONE@ | BootsWalker | Troubleshooting | 2 | 10-20-2004 06:12 PM |
exclude metatags | tomas | How-to Forum | 5 | 08-15-2004 03:22 PM |
How can i exclude pages?? | onlytrue | How-to Forum | 2 | 03-19-2004 02:47 PM |
exclude after spidering | baskamer | Troubleshooting | 2 | 03-01-2004 02:17 AM |