[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Webglimpse Home]
Re: Search results titles
On Fri, Mar 02, 2001 at 04:08:03PM -0500, andy hopkins wrote:
> I've tried to fool it by adding:
>
> *.pl* /usr/local/webglimpse/lib/html2txt <
>
> to .glimpse_filters with no luck.
The glimpse filters are just responsible to converting HTML to plain
text, so that it can be searched and indexed.
The extraction of the HTML title depends on the suffix and is
hardcoded into the glimpseindex executable. One possible solution for
you would be to add .pl as an HTML-like suffix. To do that you would
need tomodify glimpse.h in the glimpse source tree and change
these lines from
#define EXTRACT_INFO_SUFFIX {".htm", ".html", ".shtm", ".shtml"}
#define NUM_EXTRACT_INFO_SUFFIX 4
#define EXTRACT_INFO_SUFFIX {".htm", ".html", ".shtm", ".shtml", ".pl" }
#define NUM_EXTRACT_INFO_SUFFIX 5
then recompile.
If recompiling is not an option, we may be able to do some hackery in
one of the webglimpse perl scripts, but it is not exactly pretty.
In that case, or if the recompile did not solve your problem, it would
help if you could send me the contents of your .wg_toindex and
archive.cfg files.
Regards
- Christian