Managing Server Monitoring Checks

Checks are the key element in managing your server or website monitoring. They tell the service what addresses and services you want it to monitor.

Main Status View

The main screen lists all of your checks and their current status. The status of each check is shown on the right side of the list. Since most checks run once a minute, the status view updates about that often as well. When you add a new check, in most cases the first result should be available and displayed in less than 90 seconds. The list shows the type of check, the label or target URL (if you didn't assign the check a label), the interval that the check is set to run at, the time of the most recent result, and the current status. Note that the time is not the time at which the status on the screen was updated, it's the actual time that the check was last run. If you have the correct access level to edit check settings, an edit button will also appear on each line of the list.

The list is fully sortable, and will remember your sort choice from the previous time you visited the page. You can also search the list by typing in the Search box at the upper right. The list will be filtered as you type, so you can see the list narrow as you enter your search term. It is often a good idea to label your checks to enhance your search capabilities. For example, if several of your checks are monitoring routers, including "router" in the label makes it easy to see all of your router checks, or you could identify physical locations in your lables to indicate where hosts are located. Careful labeling is helpful both for search and sorting.

Clicking on a check in the label column will display details about that check, including the results of the last five check runs and links to the public reports (if they are enabled). There is also a link to a summary of results and chart of the response times for that check. This same chart view is also available if you click on the Results column (click on PASS or FAIL). That view also shows the total uptime percentage for the check.

Check Configuration

Clicking on the Edit button for a check, or on the link to create a new check, will bring up the Edit Check dialog. The fields on the form will vary depending on what the Type field is set to. If you are creating a new check, the type is the first thing to set. The specific fields for each type of check are described on the Documentation menu.

The dialog shown here is for an HTTP check, the most common check for website monitoring. The dialog for HTTP is very similar to several of our other simpler server monitoring checks. In fact, most of the fields in the HTTP dialog are present in all check types, except that the URL is called "Target" or "Address" for some check types. Several check types have additional fields specific to that protocol.

The label for a monitoring check can be just about anything. The label will be used in the status list, on reports, and in notifications, so it should be something that is meaningful particularly to the people getting notifications. If you leave it blank the system will use a reasonable default, which for most check types is the URL or Target address.

Checks can be individually enabled or disabled. Often if you are taking a server down for maintenance it is nice to disable the check temporarily to avoid a shower of notifications for an outage you already know about. You can also enable or disable checks in the detail view of the status list (accessed by clicking on the label in the list).

The check frequency sets how often (in minutes) the check will be run. The system will run the check at whatever interval you set here. The API can be used to set a number of minutes different from what is available in the drop down in the dialog. For most checks one minute is appropriate. Running monitoring checks does create a connection to the host being monitored, which often shows up in the logs, so if that is an issue either because of the logging, bandwidth usage, or some other reason you can set the check to run less frequently. Some server monitoring checks, such as the SSL certificate check, don't really need to run more often than once a day or so. In the case of the SSL certificate check, it can be set to warn you some days before the certificate expiration date.

The timeout setting sets how long the check should wait for a response before concluding that the remote service is "down." In most cases 5 seconds is a good setting for the timeout. Most web sites should respond much more quickly than that, but setting this timeout much lower will usually cause you to get more notifications than you want to get. This is particularly true on website monitoring of sites on shared hosting providers, but even for well performing sites there will be times when general Internet conditions might cause response times to go over two or three seconds. Remember that this is the time to get a response back to our monitoring servers, not the server's local response time that appears in the web server's own logs. For monitoring own site we set this to 5 seconds, even though our web server responds to most page requests in a quarter second or less and often responds within a couple milliseconds.