Log Management for the University of California: Issues and Recommendations

Table of Contents

1 Introduction
2 Candidate Sources of Logged Information
3 Recommended Log Management Practices

3.1 Log Generation
3.1.1 Application Logs
3.1.2 System Logs
3.1.3 Network Logs
3.1.4 Time Synchronization
3.2 Use of Log Information
3.2.1 Baseline Behavior
3.2.2 Investigation

4 Appropriate Use of Log Information
5 Log Record Life-Cycle Management

5.1 Retention
5.2 Log Management Infrastructure

6 Summary of Recommendations
7 Acknowledgments

1 Introduction

Most components of an information technology infrastructure are capable of producing logs chronicling their activity over time. These logs often contain very detailed information about the activities of applications and the layers of software and hardware that support those applications. Examples include:
With proper management, these logs can be of great benefit in a variety of scenarios, to enhance security, system performance and resource management, and regulatory compliance. In particular, a log management infrastructure can capture information and aid analysis about the following:
The following are example scenarios illustrating how the information in logs can be critical to resolving a security or operational issue:
The University must manage these logs in a manner that facilitates these benefits while protecting the privacy and integrity of the information contained in these logs. This document recommends appropriate practices for log management within a campus's overall IT infrastructure.

2 Candidate Sources of Logged Information

There are many sources of log information:
These logs have the potential of being very large, depending on the volume of activity and the amount of information in each log entry. The cost of storage and processing should be considered when determining which sources should be incorporated into the log management infrastructure, as well as the potential benefit of having that information at some time in the future.

3 Recommended Log Management Practices

3.1 Log Generation

This section describes information that might be included in various types of logs. It should be noted, however, that the information in logs often cannot be controlled by application, system, or network administrators, so the items listed here, while often highly desirable, should not be viewed as absolute requirements.

3.1.1 Application Logs

Applications should log their activity in a manner that correlates well with the business processes the applications support, particularly any operations that modify permissions or access rights. These logs should include:
It should be noted that the "application" may actually be a more generic service, such as a web, file, or print server, or even a PBX. In this case, it may be difficult to relate the more generic logs to business processes. When this is the case, appropriate documentation may need to be maintained describing the relationship between the logs and the supported business processes.

3.1.2 System Logs

Many components of the IT infrastructure generate logs. Examples of these components include:
In general, all of these logs have potential value and should be maintained. These logs should include the following types of information:

Host-based firewalls also generate valuable log information. This is described in the "Network Logs" section of this document.

It should be noted that client workstations also generate system logs that are of interest, particularly for local authentication, malware detection, and host-based firewalls

3.1.3 Network Logs

Many components of the network infrastructure generate logs. Examples of these components include:

These logs typically describe flows of information through the network, but not the individual packets contained in that flow. (A flow is the traffic that corresponds to a logical connection between two processes in the network. Examples of flows include a connection to a web server, a remote login session, or a Domain Name System lookup.) Information logged for a flow should include:

Other components of the network infrastructure, such as DHCP and DNS servers, provide valuable information about network configuration elements, such as IP addresses, that change over time. The logging requirements for these servers are covered in the "System Logs" section of this document.

3.1.4 Time Synchronization

One of the important functions of a log management infrastructure is to relate records from various sources by time. Because of this, it is important that all components of the IT infrastructure have synchronized clocks. Use of a time service, such as NTP, is highly recommended.

3.2 Use of Log Information

3.2.1 Baseline Behavior

It is essential that a baseline of activity within the IT infrastructure be established and tracked as it changes over time.

Understanding baseline behavior allows for the detection of anomalous behavior, which could indicate a security incident or a change in normal usage patterns. Procedures should be in place to ensure that this information is reviewed on a regular and timely basis.

3.2.2 Investigation

When an incident occurs, various ad hoc questions will need to be answered. These incidents may be security related, or they may be due to a malfunction, a change in the IT infrastructure, or a change in usage patterns. Whatever the cause of the incident, though, it will be necessary to retrieve and report log records based on a variety of selection criteria, such as:

When matching records from multiple sources, time and network address will be the most valuable for matching records. Application, service, and user may also be desired for matching, but it is likely that they will need to be associated with network address and time in order to accomplish this.

4 Appropriate Use of Log Information

Logs often contain information that, if misused, could represent an invasion of the privacy of members of the UC community. While it is necessary for the University to perform regular collection and monitoring of these logs, this activity should be consistent with the provision of "least perusal" described in UC's Electronic Communication Policy.

5 Log Record Life-Cycle Management

When logs document or contain valuable information related to activities of the University's information resources or the people who manage those resources, they are University Administrative Records, subject to the requirements of the University Records Management Program to ensure that they are "...appropriately managed and preserved, and can be retrieved as needed." See the University Business and Finance Bulletins Records Management and Privacy Series for more information, particularly:
The following are specific issues that should be considered with respect to such log records.

5.1 Retention

In order to facilitate investigation as well as to protect privacy, the retention of log records should be well-defined to provide an appropriate balance among the following:

The records required to support investigation often contain information about specific individuals' activities and must, therefore, be protected adequately against unauthorized disclosure. It is also the case that records required for long-term analysis often contain information about specific individuals' activities, but that specific information is not needed. When the retention period required for analysis is significantly longer than that for investigation, new records can be created by aggregating or redacting the original records to reduce the cost of managing the records and protecting them against unauthorized disclosure.

Care should be taken not to retain log records that are not needed. The cost of long-term retention can be significant, and could expose the University to high costs of retrieving and reviewing the otherwise unneeded records in the event of litigation.

5.2 Log Management Infrastructure

As a best practice, a log management infrastructure should be established to provide common management of log records. This infrastructure will:

The following diagram illustrates such an infrastructure.


In order to facilitate the creation of campus-based log management infrastructures, it is recommended that system-wide groups be established to address the following issues:

6 Summary of Recommendations

Section

Recommendations

Introduction

The University should manage logs in a manner that facilitates the benefits described in this document, while protecting the privacy and integrity of the information contained in these logs.

Candidate Sources of Logged Information

The cost of storage, processing, management, etc., as well as the benefit benefit, should be considered when selecting sources for incorporation into the log management infrastructure.

Application Logs

Applications should log their activity in a manner that correlates well with the business processes the applications support, particularly any operations that modify permissions or access rights. These logs should include, at a minimum:
  • The business operation that was requested
  • Whether the request was accepted or denied
  • The time and date the operation was performed (Start and end times may be appropriate for long operations.)
  • Who initiated the operation
  • System and network resources used
  • Any information needed for business process controls
  • Client hardware and software characteristics

System Logs

System logs should include the following types of information:
  • The server operation that was requested
  • Whether the request was accepted or denied
  • The time and date the operation was performed (Start and end times, or duration, may be appropriate for long operations.)
  • Who and/or what system initiated the operation
  • System and network resources used

Network Logs

Information logged for a network flow should include:
  • Network (IP) addresses of the end points
  • Service identifiers (port numbers) for each of the end points
  • Whether the flow was accepted or denied
  • Date, time, and duration of the flow
  • Number of packets and bytes used by the flow

Time Synchronization

One of the important functions of a log management infrastructure is to relate records from various sources by time. Because of this, it is important that all components of the IT infrastructure have synchronized clocks. Use of a time service, such as NTP, is highly recommended.

Baseline Behavior

The baseline of activity within the IT infrastructure should be established and tracked as it changes over time.
  • For system and network administrators, this should include the volume of activity for major applications and systems, as well as traffic volume over the network, and should be presented over a common time scale.
  • It may also be desirable to present application activity to business managers in a manner that enables them to correlate the information with business volume.
  • Procedures should be in place to ensure that this information is reviewed on a regular and timely basis.

Investigation

When conducting an investigation, it will be necessary to retrieve and report log records based on a variety of selection criteria. Preparations should be made to perform ad hoc queries based on criteria, such as the following:
  • Source(s) of the log records
  • Time
  • Network address
  • Application or service
  • User

When matching records from multiple sources, time and network address will be the most valuable for matching records. Application, service, and user may also be desired for matching, but it is likely that they will need to be associated with network address and time in order to accomplish this.

Appropriate Use of Log Information

While it is necessary for the University to perform regular collection and monitoring of these logs, this activity should be consistent with the provision of "least perusal" described in UC's Electronic Communication Policy.

Retention

In order to facilitate investigation as well as to protect privacy, the retention of log records should be well-defined to provide an appropriate balance among the following
  • confidentiality of specific individuals' activities,
  • the need to support investigations, and
  • the cost of retaining the records

Log Management Infrastructure

Each campus should establish a log management infrastructure to do the following:
  • move log records into the infrastructure,
  • provide secure storage for the records,
  • implement record retention policies,
  • provide analysis tools that enable correlations among records from multiple sources, and
  • protect the chain of evidence for the possibility that log records are used in legal proceedings.
System-wide groups should be established to address the following issues:
  • Technology solutions that can be used to build log management infrastructures.
  • Typical retention periods for common examples of logged information.

7 Acknowledgments

This document is the result of work done by a subgroup of the University of California Information Technology Policy and Security Officers. The work group members were:
Jacqueline Craig, UC Office of the President
Jon Good, UC Office of the President
Karl Heins, UC Office of the President
Binh Nguyen, UC San Francisco Medical Center
Carl Tianen, UC San Francisco
David Walker, UC Office of the President

May 1, 2006