Webglimpse HowTos
|
NEW 12/12/07:: Returning Good URLs as Search Result Pages Merge SQL query results with full text search. (Some coding required) NEW 2/15/08:: Eliminate repetitive text from search results. Install the Webglimpse cPanel Addon Search Tips for the End User: advanced query syntax Configuring an Archive: detailed description of how to create a new archive using the web interface. popular Customize Results Output: make search results page blend in with your site Newdoc 10/10/07 Customize Ranking Order of Search Results: create your own formula for sorting results Newdoc 10/16/07 Achieve 100% Uptime: by use of alternative indexes |
Include/Exclude Files from the index. Index files with spaces in the names Index MS Word and Excel Documents popular Index PDF documents using Xpdf Index PDF documents using Ghostscript Index upper-ASCII International Characters Convert HTML codes to upper-ASCII characters Field-based searching: recognize structured text Search by directory: allow users to search within specific subdirectories. Prefilter files for greater Speed BibGlimpse: setup a light-weight PDF reprint manager for scientific literature, featuring automated bibliography retrieval. |
Additional documents are available in the context-sensitive help screens in the Webministration interface. The best way to view these files is to run wgarcmin.cgi from your own web server (something like http://yourserver.com/cgi-bin/wg2/wgarcmin.cgi) and press on the 'Help' links. This will give you exact paths to the files that require editing on your server.
|
Allow or Deny indexing of filetypes Add Search Boxes automatically to all web pages Search on neighborhood of a page
Prefiltering and caching for greater search speed
|
Glimpse and Glimpseindex Man Pages
An article describing the ideas behind the design of glimpse.
Search the Harvest Docs from http://www.tardis.ed.ac.uk/harvest/docs/
Webglimpse 1.X Docs
Installing Webglimpse v1.6 and above
Configuring an archive with confarc
Understanding the vhost option with confarc
Removing an archive with rmarc
Academic papers & articles
some of the files below come from pre-publication copies of articles, so we do not have full author list and publication dates available. All of the links below are in PDF format unless noted otherwise.
Webglimpse - Combining Browsing and Searching 1997 Usenix Technical Conference
PS format
GLIMPSE: A Tool to Search Through Entire Filesystems White paper, Udi Manber & Sun Wu, 1993.
PS format
Siff - Finding Similar Files in a Large File System Udi Manber, Oct 1993
PS format
A Text Compression Scheme That Allows Fast Searching Directly in the Compressed File Udi Manber, 1993 ACM TOIS
PS format
A Fast Algorithm for Multi-Pattern Searching Sun Wu & Udi Manber, 1994
PS format
A Simple Scheme to Make Passwords Based on One-Way Functions Much Harder to Crack Udi Manber 1994
PS format
Approximate Multiple String Search Robert Muth & Udi Manber
PS format
An Algorithm for Approximate Membership Checking With Application to Password Security Udi Manber & Sun Wu, 1992. Information Processing Letters vol 50
PS format
Scalable Internet Resource Discovery: Research Problems and Approaches
PS format
Suffix arrays: A new method for on-line string searches Udi Manber & Gene Meyers, 1989
PS format | text format
Connecting Diverse Web Facilities Udi Manber & Peter Bigot
PS format
Harvest: A Scalable, Customizable Discovery and Access System
PS format
Troubleshooting Notes
It is probably also worth searching the Sympa mailing list archives.|
Indexing: 0 files indexed archive appears to build successfully, but reports zero files indexed |
[ Home ] [ Purchase ] [ Downloads ] [ Docs ] [ Support ] [ Contact Us ] [ Web Hosting ] [ Top of Page ]
see also our sites for: [ Tucson, Arizona ] [ Dallas (Garland), Texas ] [ bTeaching.com : Ideas for Everyday Learning]
[ Webglimpse Advanced Site Search Software : providing flexible local search since 1997 ]
Copyright © Internet WorkShop, 2002. All Rights Reserved.