Friday, March 04, 2005
Scratchy - The Apache Log Parser and HTML Report Generator for Python
Scratchy - The Apache Log Parser and HTML Report Generator for Python: "Scratchy is a set of scripts to parse Apache web server log files and extract useful information. From this data, Scratchy will create HTML reports so that website administrators can easily view the information and determine trends and their typical audience.
Scratchy began as a proof of concept that let me compile stats about my personal website. Over time I kept adding features and improvements, and I came to feel that it would be useful to others.
Why Scratchy?
Well, the name of the project of course comes from the Simpsons' 'Itchy and Scratchy Show'. The project aims to be a complete log parsing and report generating tool. There also seemed to be a need for such a project in Python: the other Apache log parsers I have seen were developed in other languages (such as Perl and C). One goal of this project is to be extensible. To that end, most of the report appearance can be modified by tweaking a single config file.
What information does Scratchy report?
# Accessed web pages
# Hosts accessing your website
# Operating systems
# Browsers and versions
# Search engines
# Robots
# File types accessed
# Errors
# Country name lookups (if enabled).
# Charts of most data (if enabled).
# A trace of pages accessed by each IP address (if enabled)."
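To give a feel for the kind of parsing involved, here is a minimal sketch in Python. It is not Scratchy's own code: the regular expression, the `summarize` function, and the sample log lines are all assumptions made for illustration, and the sketch only assumes the log uses Apache's common or combined format. It tallies accessed pages, hosts, and error status codes, which covers three of the report categories listed above.

```python
import re
from collections import Counter

# Assumed layout (Apache common/combined format):
# host ident user [time] "METHOD path protocol" status bytes ["referer" "user-agent"]
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) (?P<size>\S+)'
    r'(?: "(?P<referer>[^"]*)" "(?P<agent>[^"]*)")?'
)


def summarize(lines):
    """Count successful page hits, requesting hosts, and error statuses."""
    pages, hosts, errors = Counter(), Counter(), Counter()
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        hosts[match["host"]] += 1
        status = int(match["status"])
        if status >= 400:
            errors[status] += 1
        else:
            pages[match["path"]] += 1
    return pages, hosts, errors


# Hypothetical sample lines, made up for this example.
sample = [
    '192.0.2.1 - - [04/Mar/2005:10:00:00 -0500] "GET /index.html HTTP/1.1" 200 1043 "-" "Mozilla/5.0"',
    '192.0.2.2 - - [04/Mar/2005:10:00:05 -0500] "GET /index.html HTTP/1.1" 200 1043 "-" "Mozilla/5.0"',
    '192.0.2.1 - - [04/Mar/2005:10:00:09 -0500] "GET /missing.html HTTP/1.1" 404 210 "-" "Mozilla/5.0"',
]
pages, hosts, errors = summarize(sample)
print(pages.most_common(5))
print(hosts.most_common(5))
print(errors.most_common(5))
```

A real tool would go on to classify the user-agent string into browsers, operating systems, and robots, and then render the counters as HTML; the sketch stops at the counting step.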