
Re: How to exclude HTML tags in searching ?



Hi Christian, hi all

I think I know what the problem is here. Sorry, I had emailed David
separately without copying the group!

The way I have it set up right now, glimpse is run without the -z option
(and therefore without using the filters) unless the <input name=nonascii ...>
tag is set to "on" or "yes".  The reason is that -z was slowing down a lot
of sites, since we're still not defaulting to the shared libs.
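
To make the rule above concrete, here is a rough sketch of the decision in
Python. This is only an illustration, not the actual Webglimpse code: the
function name glimpse_args and the dictionary standing in for the submitted
form fields are made up, and a real glimpse call needs more options (such as
the index location), which are left out here.

```python
def glimpse_args(form, pattern):
    """Build an illustrative glimpse argument list (hypothetical helper)."""
    args = ["glimpse"]
    # Assumption: the filters are enabled only when the form field
    # "nonascii" is exactly "on" or "yes", as described in the message.
    if form.get("nonascii") in ("on", "yes"):
        args.append("-z")
    args.append(pattern)
    return args
```

With no nonascii field in the form, -z is left off and the filters are not
used.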

Instead, an old hack takes effect to suppress HTML tags in the actual
result output, but it relies on recognizing a tag by the enclosing < and >
chars within each record.  Since on this site there was a line break in the
middle of the tag, the tag wasn't suppressed.
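
A small Python sketch shows why a record-by-record approach misses such a
tag. It is not the Webglimpse hack itself, only a stand-in under the
assumption that each record is one line and that a tag is anything between
< and > in that line. The sample text and the file name shot1.jpg are made
up, modeled on the result David reported.

```python
import re

# A tag is "<", then any characters other than < or >, then ">".
TAG = re.compile(r"<[^<>]*>")

def strip_tags_per_record(text):
    """Strip tags one line (record) at a time, like the hack described."""
    return [TAG.sub("", line) for line in text.splitlines()]

def strip_tags_whole(text):
    """Strip tags across the whole text, so line breaks inside a tag are fine."""
    return TAG.sub("", text)

# Hypothetical sample: the IMG tag is split across two lines.
sample = (
    '<IMG SRC="shot1.jpg"\n'
    'ALT="Nightmare on Elm Street screenshot 1" BORDER="0">\n'
    'Nightmare on Elm Street review'
)
```

In the per-record version neither half of the split tag contains both a <
and a >, so both halves pass through untouched, and the second half is what
ends up in the search results.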

So to make a long story short, I suggested that David include the tag
<input name=nonascii type=hidden value=on> in his search form.  Did you
try that, David?

I think that will fix the problem, simply by turning on the filter.

Talk to you soon!

--Golda

At 10:31 AM 1/25/01 -0500, Christian Vogler wrote:
>On Thu, Jan 25, 2001 at 12:32:50AM -0800, David W. Anderson wrote:
>> I just upgraded to Webglimpse 1.7.11.  Now when I do a search I noticed 
>> it's searching HTML tags.
>> 
>> For example, I search for 'Nightmare on Elm Street' and one of the
>> results is:
>> 
>> ALT="Nightmare on Elm Street screenshot 1" BORDER="0">
>> 
>> I don't want ANY HTML tags being included in the search.  How can I set 
>> Webglimpse/Glimpse to not search HTML tags?
>
>There are several possibilities. First, could you please check or
>provide the contents of the .wgfilter_index file in your archive
>directory? These define what filters are used.
>
>Depending on the configuration in this file, there may either be a
>configuration problem, a bug in the perl filter, or a bug in the
>shared library filter.
>
>Regards,
>Christian
>
>
------------------------------------------------------------
Golda Velez         gvelez@iwhome.com	        626-792-9277
Internet Workshop                          http://iwhome.com
Webglimpse Search Software             http://webglimpse.net
		~~~~~~~~~~~~~~~~~~~~~~~~~~~
 Help organize the world - index your own corner of the web