Re: The format of the .wgfilter-box
At 01:37 PM 12/11/99 +0200, Ilyes Attila wrote:
>Hello !
>
>I can't find anything about the format of the file .wgfilter-box.
>The Harvest(harvest.transarc.com) link is dead.
>Is this right:
>Deny http:/server.name/dir1/
>if I want to exclude the whole directory named dir1 ?
Actually, I think you mean .wgfilter-index, if you want to exclude a
directory. You can use a regular expression such as
Deny (^|/)dir1(/|$)
to exclude any file on your server whose path contains "/dir1/" (or has
dir1 at the beginning or end of the path). Currently, .wgfilter-index only
affects files on your own server; it does not affect the gathering of
remote links.
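To check what that pattern does and does not match, here is a minimal sketch in Python. Python's re module is only a stand-in for whatever matcher Webglimpse uses internally, and I am assuming the pattern is searched for anywhere in the path. The is_denied helper and the sample paths are made up for illustration.

```python
import re

# Pattern copied from the Deny line above. Python's `re` is used here only
# as a stand-in; these simple constructs (^, $, |, grouping) behave the
# same way in Perl-style regular expressions.
DENY_DIR1 = re.compile(r"(^|/)dir1(/|$)")


def is_denied(path: str) -> bool:
    """Return True if `path` contains dir1 as a complete path component.

    Hypothetical helper: assumes the pattern may match anywhere in the path.
    """
    return DENY_DIR1.search(path) is not None


# Hypothetical sample paths, not taken from any real server.
samples = [
    "dir1/index.html",         # dir1 at the beginning of the path
    "/docs/dir1/page.html",    # "/dir1/" in the middle of the path
    "/docs/dir1",              # dir1 at the end of the path
    "/docs/dir10/page.html",   # different directory that starts with "dir1"
    "/docs/mydir1/page.html",  # different directory that ends with "dir1"
]

for path in samples:
    print(path, is_denied(path))
```

The first three paths match and the last two do not, because the (^|/) and (/|$) groups require dir1 to be a whole path component rather than a substring of a longer name.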
For information about the Harvest project, see
http://www.tardis.ed.ac.uk/harvest/
--Golda