[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: How to exclude HTML tags in searching ?
I included the tag and the -z option. Works great.
Thank you.
At 06:42 PM 1/25/01 -0800, you wrote:
>Hi Christian, hi all
>
>I think I have an idea what the problem is here - sorry I had emailed David
>separately without copying the group!
>
>The way I have it set right now, glimpse is run without the -z option (and
>therefore without using the filters) unless the <input name=nonascii ...>
>tag is set to "on" or "yes". The reason is the -z was slowing down a lot
>of sites since we're still not defaulting to the shared libs.
>
>Instead, there is an old hack that takes effect to suppress HTML tags in
>the actual result output, but it relies on just recognizing a tag by the
>enclosing < and > chars in each record. Since in this site, there was a
>return in the middle of the tag, it wasn't suppressed.
>
>So to make a long story short, I suggested that David include the tag
><input name=nonascii type=hidden value=on> in his search form ...did you
>try that, David?
>
>I think it will fix the problem by just turning on the filter.
>
>Talk to you soon!
>
>--Golda
>
>At 10:31 AM 1/25/01 -0500, Christian Vogler wrote:
> >On Thu, Jan 25, 2001 at 12:32:50AM -0800, David W. Anderson wrote:
> >> I just upgraded to Webglimpse 1.7.11. Now when I do a search I noticed
> >> it's searching HTML tags.
> >>
> >> For example, I search for 'Nightmare on Elm Street' and one of the
>results is:
> >>
> >> ALT="Nightmare on Elm Street screenshot 1" BORDER="0">
> >>
> >> I don't want ANY HTML tags being included in the search. How can I set
> >> Webglimpse/Glimpse to not search HTML tags?
> >
> >There are several possibilities. First, could you please check or
> >provide the contents of the .wgfilter_index file in your archive
> >directory? These define what filters are used.
> >
> >Depending on the configuration in this file, there may either be a
> >configuration problem, a bug in the perl filter, or a bug in the
> >shared library filter.
> >
> >Regards,
> >Christian
> >
> >
>------------------------------------------------------------
>Golda Velez gvelez@iwhome.com 626-792-9277
>Internet Workshop http://iwhome.com
>Webglimpse Search Software http://webglimpse.net
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Help organize the world - index your own corner of the web
David W. Anderson - dave@horrordvds.com
Webmaster - Horrordvds.com