|
10-09-2003, 06:09 AM | #1 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
need help: phpdig suddenly reads html-comments!
hi!
i've been using phpdig for about 1 year now and it has always worked fine. but now suddenly - and i haven't changed anything - it has started to read html-comments in the source-code and puts them into the description. that of course has no business in the results page for the user, so i'd like to get rid of it. has anyone else ever run into this problem and knows a "cure"? |
10-09-2003, 03:32 PM | #2 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
How about looking at this and modifying the if statement?
__________________
Responses are offered on a voluntary if/as time is available basis, no guarantees. Double posting or bumping threads will not get your question answered any faster. No support via PM or email, responses not guaranteed. Thank you for your comprehension. |
10-10-2003, 05:49 AM | #3 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
no, that is not my problem. it doesn't only read that exclude-comment, but all html-comments! so i get stuff like "main table starts here" etc. in my results-page.
that really sucks! any ideas why that is? |
10-11-2003, 06:20 AM | #4 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Hi. What version of PHP do you have? Try running the following. What are the results when viewing the HTML source?
PHP Code:
Code:
<!-- test -->A<br>
<!-- test -->B<br>
<!-- test -->C<br>
<!-- test -->D<br>
<!-- test -->E<br>
<!-- test -->F<br>
<! test >G<br>
< test >H<br>
I<br>
J<br>
10-11-2003, 08:50 AM | #5 |
Purple Mole
Join Date: Sep 2003
Location: Kassel, Germany
Posts: 119
|
Hmm, this is the same problem i also posted about here:
PHP 4.3.2 - Result:
Code:
<!-- test -->A<br>
<!-- test -->B<br>
<!-- test -->C<br>
<!-- test -->D<br>
<!-- test -->E<br>
<!-- test -->F<br>
<! test >G<br>
< test >H<br>
< test >I<br>
< test >J<br>
-Roland- |
10-11-2003, 10:26 AM | #6 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Hi. It seems that strip_tags in PHP 4.3.2 has been reworked, making it so that it doesn't eliminate as much as before. The following will remove everything between the < and > symbols.
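In robot_functions.php, replace:
PHP Code:
$text = ereg_replace("[[:blank:]]+"," ",strip_tags($text));
with:
PHP Code:
$text = ereg_replace("[[:blank:]]+"," ",eregi_replace("<[^>]*>","",$text));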
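For anyone reading this on a newer PHP: the ereg_* functions used in this thread were deprecated in PHP 5.3 and removed in PHP 7.0. The same idea (drop everything between < and >, then collapse blanks) can be sketched with preg_replace. The helper name strip_angle_spans is made up for illustration and is not part of phpdig.

```php
<?php
// Hypothetical helper, not part of phpdig: a preg_* sketch of the
// "remove everything between < and >" idea described above.
function strip_angle_spans($text)
{
    // Remove each "<" up to the next ">", which also covers
    // comments and spans such as "< test >".
    $text = preg_replace('/<[^>]*>/', '', $text);
    // Collapse runs of spaces and tabs into one space,
    // the same job [[:blank:]]+ does in the ereg version.
    return preg_replace('/[ \t]+/', ' ', $text);
}

echo strip_angle_spans("<!-- test -->A<br> < test >B\t\tC"), "\n"; // prints "A B C"
```

Like the original workaround, this removes any text that happens to sit between a literal < and a later >, so it is blunt rather than HTML-aware.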
10-11-2003, 10:40 AM | #7 |
Purple Mole
Join Date: Sep 2003
Location: Kassel, Germany
Posts: 119
|
I also found something:
Code:
$text = preg_replace('/<.*>/U', '', $text);
echo $text . "K<br>\n";
-Roland- |
10-11-2003, 11:28 AM | #8 | |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Quote:
PHP Code:
|
10-12-2003, 03:01 PM | #9 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
@charter:
that really seems to be the problem. my hoster must have updated php. what i get is
<!-- test -->A<br>
<!-- test -->B<br>
<!-- test -->C<br>
<!-- test -->D<br>
<!-- test -->E<br>
<!-- test -->F<br>
<! test >G<br>
< test >H<br>
< test >I<br>
< test >J<br>
the server is running PHP Version 4.3.3. i'm now gonna try indexing with the
$text = ereg_replace("[[:blank:]]+"," ",eregi_replace("<[^>]*>","",$text));
workaround you posted. thanks! :-) |
10-12-2003, 03:08 PM | #10 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
and that seems to work. great. thanks again!
|
10-12-2003, 05:01 PM | #11 |
Purple Mole
Join Date: Sep 2003
Location: Kassel, Germany
Posts: 119
|
Hm okay, which is the better solution for the future?
Code:
//replace any group of blank characters by an unique space
$text = ereg_replace("[[:blank:]]+"," ",strip_tags($text));
Code:
//replace any group of blank characters by an unique space
$text = preg_replace('/<.*>/sU', '', $text); |
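One practical difference between the regex variants in this thread is how they treat a comment that spans more than one line. The snippet below is only a small sketch with a made-up sample string; it compares '/<.*>/U', '/<.*>/sU' and '/<[^>]*>/' on the same input.

```php
<?php
// Made-up sample: a comment that spans two lines.
$text = "A<!-- spans\ntwo lines -->B";

// Ungreedy (U), but "." does not match a newline, so nothing is removed.
$noDotall = preg_replace('/<.*>/U', '', $text);
// With the "s" modifier "." also matches newlines, so the comment goes away.
$dotall = preg_replace('/<.*>/sU', '', $text);
// A negated character class matches newlines without any modifier.
$negated = preg_replace('/<[^>]*>/', '', $text);

var_dump($noDotall === $text); // bool(true)
var_dump($dotall === 'AB');    // bool(true)
var_dump($negated === 'AB');   // bool(true)
```

So if comments can contain line breaks, the version without "s" is the one to avoid.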
10-12-2003, 05:32 PM | #12 |
Head Mole
Join Date: May 2003
Posts: 2,539
|
Hi. My personal preference would be to use
PHP Code:
PHP Code:
10-12-2003, 05:40 PM | #13 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
i just used this line
$text = ereg_replace("[[:blank:]]+"," ",eregi_replace("<[^>]*>","",$text));
and it seems to work perfectly. and that's good enough i guess. ;-) |
10-13-2003, 06:30 AM | #14 |
Orange Mole
Join Date: Oct 2003
Location: hamburg, germany
Posts: 52
|
dammit. it still doesn't work. after completely reindexing the page (that takes some hours) i got comments in the results page again.
the original comment-line was
<!----------sub-navbar table ends here---------->
and in the html-source of the results page i still find this:
< sub-navbar table ends here >
all my comments start with <!---------- and end with ---------->. unfortunately i'm no php expert, but it somehow has to be possible to get rid of that and everything in between. is it? |
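Getting rid of a comment and everything in between can be sketched as one regex pass over the raw HTML, before any other cleanup touches the dashes. This is an assumption-laden sketch: the helper name remove_html_comments is made up, and where such a call would belong inside phpdig is not established in this thread.

```php
<?php
// Hypothetical helper, not part of phpdig.
function remove_html_comments($html)
{
    // Match "<!--" up to the first following "-->".
    // ".*?" is lazy; the "s" modifier lets it cross line breaks.
    return preg_replace('/<!--.*?-->/s', '', $html);
}

$html = "<td>text</td><!----------sub-navbar table ends here---------->\n<p>more</p>";
echo remove_html_comments($html), "\n";
```

Comments written with extra dashes, like the one above, still begin with <!-- and end with -->, so the pattern covers them too.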
10-13-2003, 07:58 AM | #15 | |
Purple Mole
Join Date: Sep 2003
Location: Kassel, Germany
Posts: 119
|
Quote:
This is quite expected behaviour. The SGML specification doesn't allow whitespaces to appear right after the less than sign.
http://bugs.php.net/bug.php?id=25730
-Roland- |