[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Webglimpse Home]

Using Glimpse for a FTP server?




Hello,

We like to use Glimpse/WebGlimpse for indexing a ftp server.
Did anybody use Glimpse for indexing a ftp server content?

Many problems arise in this area:

0. Glimpse / WebGlimpse do not index file names

How can I index the names of files, too. Glimpse
does only index the contents of a file!
(I do not like to create an extra file <file>.txt for each
file named <file> which simply contains the file name ...) 

1. Long-term operations

Files are normally tar.gz- or zip-compressed, to index them,
a very "long-running" filter must be used.
How can I index all the files in a tar-archive separatedly without
really extracting files to a temp dir?

(How can I maintain the relationship to the archived file,
not to the archive as a whole?)

2. File names to index

There are many different file names to index, e.g.

*.ps
*.1
.man
README
*.pdf
*.texinfo
*.dvi
*.doc (sometimes Word DOC, sometimes plain text)
... (and many more)
- so many, many filters are needed.

3. Index new files only

Does Glimpseindex and WebGlimpse by default
support a time-stamp indexing, e.g. only
files newer than a given time-stamp will be indexed?

Any comments, suggestions are appreciated.
(Or a web site using Glimpse for this purpose?)
Before I start, I'd like to know not to be the first to try.

Yours sincerely
Frank Elsner

#-------------------------------------------------------#
Dipl.-Math. Frank Elsner
Universitaet Osnabrueck (University of Osnabrueck)
- Rechenzentrum - (Computing Center)
Albrechstrasse 28, AVZ
D-49076 Osnabrueck
Deutschland (Germany)

Tel. (Phone): ++49 (0)541/969-2343 Fax: -2470
E-Mail: Frank.Elsner@rz.uni-osnabrueck.de
#-------------------------------------------------------#