[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Webglimpse Home]

Re: Searching part of the server?



On Wed, Oct 04, 2000 at 01:50:50PM -0400, Sherman, Rob wrote:
> When I tried restricting webglimpse to our subdirectory on the server, it
> was following links and not recognizing other locations as being "foreign"
> and not indexing them.

This might not be the best or only solution, but you can define
regular expression patterns for the URLs that are rejected and
accepted. Basically, you only want to accept links that point to the
subdirectory and reject everything else. Try editing .wgfilter-index
in the archive directory and putting the following in the file:

Allow http:\/\/[^/]+\/hcil\/.*\.s?html?$
Allow http:\/\/[^/]+\/hcil\/.*\.txt$
Deny .*

This allows indexing of *.html, *.htm, *.shtml, *.shtm, and *.txt in
the hcil directory. If you need other types, you need to add them in a
similar manner as above.

HTH
- Christian