[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Webglimpse Home]

Re: The format of the .wgfilter-box



At 01:37 PM 12/11/99 +0200, Ilyes Attila wrote:
>Hello !
>
>I can't find anything about the format of the file .wgfilter-box.
>The Harvest(harvest.transarc.com) link is dead.
>Is this right: 
>Deny http:/server.name/dir1/ 
>if I want to exclude the whole directory named dir1 ?

Actually, I think you mean .wgfilter-index, if you want to exclude a
directory.  You can use a regular expression such as

Deny	(^|/)dir1(/|$)

to exclude any file on your server that contains "/dir1/" in the path (or
dir1 at the beginning/ending of the path).  Currently, .wgfilter-index only
affects files on your own server, it does not affect the gathering of
remote links.

For information about the Harvest project, see

	http://www.tardis.ed.ac.uk/harvest/

--Golda