Our server license includes both Glimpse, the fast C fulltext search engine,
and Webglimpse, the flexible Perl spider and indexer.
Both binaries and source code are distributed.
Platforms: Linux, Solaris, SunOS, HP-UX, FreeBSD, AIX, IRIX, OSF, Mach, Rhapsody (Mac OS X).
For licensed users:
- NEW in v2.0: Web interface for managing multiple archives
- Customized Output using flexible template format
- Built-in templates for Hebrew, German, Spanish, Finnish and Portuguese
- Result Caching and "Next N Hits" toolbar for navigating result pages
- Rank Hits by customizable formula, which may include date, keyword frequency, and META tags
- Input Syntax translator allows user to type AND/OR/NOT instead of symbols
- Log Searches in format compatible with wusage software
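As a rough illustration of the input syntax translator idea, the sketch below rewrites AND/OR/NOT words into symbolic operators. It is not Webglimpse's own code: the function name translate_query is hypothetical, the mapping (; for AND, , for OR, ~ for NOT) is an assumption about Glimpse's Boolean symbols that should be checked against the Glimpse manual for your version, and treating only uppercase words as operators is a simplification.

```python
# Assumed mapping from operator words to Glimpse-style Boolean symbols.
OPERATORS = {"AND": ";", "OR": ",", "NOT": "~"}


def translate_query(query: str) -> str:
    """Rewrite uppercase AND/OR/NOT words as symbolic operators.

    Hypothetical helper for illustration only; not part of Webglimpse.
    """
    result = ""
    need_space = False  # True when the previous token was a search term
    for token in query.split():
        if token in ("AND", "OR"):
            # Binary operators join the terms on either side directly.
            result += OPERATORS[token]
            need_space = False
        elif token == "NOT":
            # NOT becomes a prefix on the term that follows it.
            if need_space:
                result += " "
            result += OPERATORS[token]
            need_space = False
        else:
            # Ordinary search term: keep a space between adjacent terms.
            if need_space:
                result += " "
            result += token
            need_space = True
    return result


if __name__ == "__main__":
    print(translate_query("cats AND dogs"))
    print(translate_query("cats AND NOT dogs"))
```

With this mapping, a query typed as "cats AND NOT dogs" would be passed on as "cats;~dogs".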
For all users (licensed and edu/nonprofit/gov):
- Index by subdirectory OR traverse links to a specified depth.
- Free for nonprofits (EDU version).
- Flexible rules for choosing links to index and for filtering and excluding files.
- Can be used on Internet or Intranet.
- Full text indexing, robot spider and file system indexing.
- Index HTML documents, Word, PDF, and any other documents that can be filtered to plaintext.
- All single-byte languages can be indexed
- HTML character code support (ü,í,à, etc.).
- Reindex from crontab.
- Limit search to recent files.
- Supports standard Boolean operators and wildcards.
- HTTP and FTP support.
- Custom ranking; "keywords" and "description" META tags support.
- Robots exclusion standard support.
- Ability to search the "neighborhood" of any indexed page
(if traversing links). You configure how many "hops" define the neighborhood.
- Option to add neighborhood search boxes to all local pages indexed.
- Configurable for multiple domains on a single server.
- Hit highlighting and results in context.
- Uses the fast Glimpse search engine with many configurable options:
Unix server, telnet access, user should have some Unix knowledge (experience installing other command line
Disk space requirements
About 15% of the indexed filespace will be required for the index.
Remote files must be retrieved and stored locally for indexing. The program itself takes about 5 MB.
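The figures above can be turned into a quick estimate. This is a minimal sketch, assuming that retrieved remote files count toward the indexed filespace and that all sizes are given in megabytes; the function name estimate_disk_mb and its parameters are hypothetical, and the 15% and 5 MB values are the approximate figures quoted above, not guarantees.

```python
INDEX_FRACTION = 0.15  # index is about 15% of the indexed filespace
PROGRAM_MB = 5.0       # approximate size of the program itself


def estimate_disk_mb(local_mb: float, remote_mb: float = 0.0) -> float:
    """Estimate the extra disk space needed, in MB.

    local_mb:  size of files already on this server that will be indexed
    remote_mb: size of remote files that must be fetched and stored locally
    """
    indexed_mb = local_mb + remote_mb        # everything that gets indexed
    index_mb = INDEX_FRACTION * indexed_mb   # space for the index itself
    # Local files are already on disk, so only the index, the stored
    # copies of remote files, and the program add to disk usage.
    return index_mb + remote_mb + PROGRAM_MB


if __name__ == "__main__":
    # 200 MB of local files plus 100 MB of remote files
    print(estimate_disk_mb(200, 100))
```

For 200 MB of local files and 100 MB of remote files, this gives 45 MB for the index, 100 MB for the stored remote copies, and 5 MB for the program, or about 150 MB in total.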
A comparison with HtDig