Code
Free software.
Thursday, February 20, 2003
« filestats 1.1 »
I just uploaded filestats.py version 1.1 to fix some stupid stuff in 1.0 -- it wouldn't run unless the dns.cache file already existed.
« filestats »
I have just uploaded version 1.0 of a specialized web server log analysis program called filestats. Not a big deal, but it does something that Analog and Webalizer doesn't do, AFAIK. Given a list of one or more files on a server, filestats builds a report that shows who (i.e., what client machines) have accessed those files, by referer (the default) or by UserAgent. So, for example, if you'd like to see who is downloading your RSS feed, grouped by referer, you would say something like this (except all on one line):
To see the same data, but grouped by UserAgent, just add the switch "-a".
I decided to use OPML as the output format, because I use Radio and ActiveRenderer, which gives me editing, rendering, and publishing without any extra work. And besides, filestats produces hierarchical data, so using an outline format just makes sense.
It's written in Python and relies on Mark Nottingham's very useful Weblog package.
I'm publishing it under the Creative Commons Attribution License, which (as I understand it) means that it's free for any use, commercial or non-commercial, and can be modified or derived from, you just have to give me credit.
I'm open to suggestions and bug reports.
Free software.
« filestats 1.1 »
I just uploaded filestats.py version 1.1 to fix some stupid stuff in 1.0 -- it wouldn't run unless the dns.cache file already existed.
« filestats »
I have just uploaded version 1.0 of a specialized web server log analysis program called filestats. Not a big deal, but it does something that Analog and Webalizer doesn't do, AFAIK. Given a list of one or more files on a server, filestats builds a report that shows who (i.e., what client machines) have accessed those files, by referer (the default) or by UserAgent. So, for example, if you'd like to see who is downloading your RSS feed, grouped by referer, you would say something like this (except all on one line):
python filestats -i access_log
-o feedstats.opml
-s http://www.example.com
-f /weblog/rss.xml
To see the same data, but grouped by UserAgent, just add the switch "-a".
I decided to use OPML as the output format, because I use Radio and ActiveRenderer, which gives me editing, rendering, and publishing without any extra work. And besides, filestats produces hierarchical data, so using an outline format just makes sense.
It's written in Python and relies on Mark Nottingham's very useful Weblog package.
I'm publishing it under the Creative Commons Attribution License, which (as I understand it) means that it's free for any use, commercial or non-commercial, and can be modified or derived from, you just have to give me credit.
I'm open to suggestions and bug reports.
