[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Webglimpse Home]

RE: Excessive logging by the webserver



Hi Charles

On Tue, 20 Mar 2001, Charles Hall wrote:

> Mario, can you post a few dozen sample log entries? Off hand I can't
> imagine why webglimpse should be generating anything special in the way
> of log activity.

I'm pasting some lines of the access_log file from the only machine I have
access. Unfortunately there are just a few HTML files.

Note: Webglimpse is scanning the same machine where it is installed, but
through the http server, not a "raw" directory scan.

< From access_log >

200.19.254.126 - - [20/Mar/2001:11:27:40 -0300] "GET /proxy/ie4/ie4.html
HTTP/1.0" 200 1216
200.19.254.126 - - [20/Mar/2001:11:27:40 -0300] "GET
/proxy/netscape/netscape.html HTTP/1.0" 200 2102
200.19.254.126 - - [20/Mar/2001:11:27:54 -0300] "GET /robots.txt
HTTP/1.0" 404 274
200.19.254.126 - - [20/Mar/2001:11:27:54 -0300] "GET / HTTP/1.0" 200 2566
200.19.254.126 - - [20/Mar/2001:11:27:54 -0300] "GET /proxy/ie4/ie4.html
HTTP/1.0" 200 1216
200.19.254.126 - - [20/Mar/2001:11:27:54 -0300] "GET
/proxy/netscape/netscape.html HTTP/1.0" 200 2102
200.19.254.126 - - [20/Mar/2001:11:28:53 -0300] "GET /robots.txt
HTTP/1.0" 404 274
200.19.254.126 - - [20/Mar/2001:11:28:53 -0300] "GET / HTTP/1.0" 200 2566
200.19.254.126 - - [20/Mar/2001:11:28:53 -0300] "GET /proxy/ie4/ie4.html
HTTP/1.0" 200 1216
200.19.254.126 - - [20/Mar/2001:11:28:53 -0300] "GET
/proxy/netscape/netscape.html HTTP/1.0" 200 2102

< end >

Entries like that full up the log files in the scanned machines. They are
normal access logs, I know, but I was wondering if there is a way to avoid
logging accesses from the machine running webglimpse. That's because the
other servers have a lot of HTML files, so the access generate a lot of
information in the log files.

That's the screen output when I run wgreindex: 

proxy:/usr/local/apache/logs# /usr/local/apache/htdocs/wgreindex 
No search boxes used
Getting local links by hops, 2 hops...
Neighborhood will be 1 hops.
Got url http://proxy.furg.br into file
/usr/local/apache/htdocs/.remote/1.br.
Traversing 2 hops...
Url http://proxy.furg.br/proxy/ie4/ie4.html is remote...
Getting remote url: http://proxy.furg.br/proxy/ie4/ie4.html
Got url http://proxy.furg.br/proxy/ie4/ie4.html into file
/usr/local/apache/htdocs/.remote/2.html.
Url http://proxy.furg.br/proxy/netscape/netscape.html is remote...
Getting remote url: http://proxy.furg.br/proxy/netscape/netscape.html
Got url http://proxy.furg.br/proxy/netscape/netscape.html into file
/usr/local/apache/htdocs/.remote/3.html.
No more links to traverse.
Got url http://www.ecomp.furg.br into file
/usr/local/apache/htdocs/.remote/4.br.
Traversing 2 hops...

Regards
Mario