getting '400 Bad Request' in some search results titles
Hi,
I saw another thread about this problem relating to XHTML and some other stuff, but I don't have any of that. If you have a look at my search site http://casat.unr.edu/search/ and enter a search term of 'FRN' (you can also try 'PIC'), you will see results with this in the title. However, I've reviewed the pages very carefully and can't see any bad links or other problems that might cause this.
The only thing I can think to suspect is that I have 2 similar URLs:
http://casat.unr.edu/frn/
http://casat.unr.edu/frn.html
and I have the setting PHPDIG_DEFAULT_INDEX set to TRUE.
Any ideas?
Best Regards,
John |
Also, I'm not seeing any errors in the spider.log
|
It looks like the server is rejecting certain requests. I don't know why, but here's a quick and dirty way to strip that text out of the title.
In the phpdigCleanHtml function of robot_functions.php, find:
Code:
$title = trim($regs[1]);
and replace it with:
Code:
$title = trim(str_replace("400 Bad Request","",$regs[1])); |
Cool! Thanks for the quick/dirty fix.
Do you have any suggestions for how I might try to debug the real cause of the problem? I'm happy to dig in and I'm fairly proficient with PHP.
Server is Linux/Apache 1.3.33/PHP 4.3.10.
Regards,
John |
Hmm, perhaps start with the phpdigGetUrl function, echo the requests, and try them by hand via shell.
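If it helps, here is a rough sketch of replaying a request by hand from a standalone PHP script. The helper names (build_raw_request, parse_status_code, replay_request) are made up for illustration and are not part of PhpDig, and the request it builds (HTTP/1.0 HEAD with a Host header) is only an assumption. Whatever phpdigGetUrl actually sends may differ, so compare the echoed text against this one. One thing worth checking is the Host header: per the HTTP/1.1 spec, a server must answer 400 to an HTTP/1.1 request that has none.

```php
<?php
// Hypothetical helpers for debugging; none of these exist in PhpDig.

// Build the raw request text for a URL so it can be echoed and inspected.
// Assumes a plain HTTP/1.0 request with a Host header.
function build_raw_request($url, $method = 'HEAD')
{
    $parts = parse_url($url);
    $host  = $parts['host'];
    $path  = isset($parts['path']) ? $parts['path'] : '/';
    if (isset($parts['query'])) {
        $path .= '?' . $parts['query'];
    }
    return "$method $path HTTP/1.0\r\n"
         . "Host: $host\r\n"
         . "Connection: close\r\n\r\n";
}

// Extract the numeric status code from a status line, or 0 if it does not look like one.
function parse_status_code($status_line)
{
    if (preg_match('#^HTTP/\d+\.\d+\s+(\d{3})#', $status_line, $m)) {
        return (int) $m[1];
    }
    return 0;
}

// Send the request over a plain socket; returns array(request text, first response line).
function replay_request($url, $method = 'HEAD', $timeout = 10)
{
    $parts   = parse_url($url);
    $port    = isset($parts['port']) ? $parts['port'] : 80;
    $request = build_raw_request($url, $method);

    $fp = @fsockopen($parts['host'], $port, $errno, $errstr, $timeout);
    if (!$fp) {
        return array($request, "connect failed: $errstr ($errno)");
    }
    fwrite($fp, $request);
    $line = fgets($fp, 1024);
    fclose($fp);
    return array($request, ($line === false) ? '' : rtrim($line));
}

// Example use (needs network access, so it is left commented out):
// list($req, $status) = replay_request('http://casat.unr.edu/frn/');
// echo $req, $status, "\n", parse_status_code($status), "\n";
```

Run it against both http://casat.unr.edu/frn/ and http://casat.unr.edu/frn.html and see which one comes back with a 400. Then change the request text a piece at a time to match what the spider echoes until the error shows up.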