Using Webalizer
Summary
Webalizer is a free log file analysis program that can be used to analyze your current extended_log data. It can produce configurable numerical and graphical reports based on your Web site's traffic. Step-by-step instructions for basic Webalizer use are included below; references are provided for more advanced use. This article assumes a familiarity with the Personal Log File Service.
Included on this page:
Introduction
The Personal Log File Service delivers raw log file data to your personal extended_log file. Webalizer helps you analyze this data by producing reports you can view with a Web browser. (Sample reports can be found on the Webalizer home page.)
In its most basic use, Webalizer produces a report for each calendar month it finds in your log file data. Highlights from each month report include:
- total hits, files, pages and visitors
- top 30 requested URLs
- top 30 visiting sites
- top 30 referring URLs
- top 10 entry and exit pages
- top 15 browser types
Note: before you can use Webalizer you need log file data. The extended_log format is recommended, although Webalizer can also analyze the access_log format. Refer to the Personal Log File Service to learn how to turn on your extended_log file and start accruing data.
When to Use Webalizer
Although the Personal Log File service delivers new data every night, you must run Webalizer each time you want to generate or update your reports. Use Webalizer whenever you want, as frequently as you want. Just be careful when you roll over your log file.
Starting a New Log File:
If you delete or archive your current extended_log file and start a new,
empty one on any other day of the month except the first, you will lose
Webalizer information. For this reason, it is recommended that you roll
over your log file on the first day of the month or use the more
advanced "incremental processing" option mentioned below.
Basic Step-by-Step Instructions
Overview:
The following instructions help you:
- Create a subdirectory to store Webalizer reports
- Run Webalizer on your current extended_log file
- Open the new Webalizer report in your Web browser
- Start a new extended_log file (optional)
Instructions:
-
Log in to your Homer or Dante account.
-
Press the O key for Other, the press the W key to drop into the Web development environment.
-
At the prompt, enter the following command to change directories to your Web directory:
cd public_html
Your personal extended_log is in this directory.
-
Make a subdirectory where Webalizer can deposit its reports. To do so, enter the following command:
mkdir stats
This command creates a subdirectory called stats. You can choose a different name if you like, but, if you do, make sure to substitute appropriately in the following steps.
-
At the prompt, enter the following command to process your extended_log file with Webalizer:
webalizer -n depts.washington.edu -o stats extended_log
Substitute depts with faculty, staff, students or courses according to the server you use. This command tells Webalizer to put its usage statistics in the subdirectory you just created.
-
Since the stats subdirectory is in your Web directory, you can open your new Webalizer usage statistics with a Web browser. The URL address is:
http://servername.washington.edu/youruwnetid/stats/
Substitute servername and youruwnetid appropriately.
-
Optional: Whenever possible, it is a good idea to start a new, empty extended_log file. It will help you stay under your disk quota. A good time to do this is the first day of the month right after using Webalizer. Why? Because the first day of each month is the only day when all of the previous month's data and none of the current month's data has been delivered to you. Warning:If you start a new extended_log on any other day of the month than the first, you will lose information.
To remove your current extend_log file and start a new one, enter the following commands:
rm extended_log
touch extended_log
More Advanced Use
Webalizer is highly configurable and supports many command line options. Some useful variations include:
-
Specify your own report title and exclude the hostname
webalizer -t 'My Webalizer Stats' -n '' -o stats extended_log
-
Using "incremental processing". A powerful option, but an advanced one. Useful if you frequently need to roll over your log file or if you invoke it using the cron utility. Read the man page before using!
webalizer -p -o stats extended_log
To display all the available command line options enter:
webalizer -h
To read the Webalizer man page (which is quite good) enter:
man webalizer
To learn even more, refer to the Webalizer home page.
