Lesson 2: Data Quality Assessment

The rest of this course explains different attributes of data quality to help you think through all the components of data quality in a systematic way. This lesson, however, provides you with a bird’s-eye view of what you would need to do to assess and improve the quality of the data in your system. This lesson can help you think through what things you can (and can’t) do in your system, and where you might want to start.

There is a lot to consider when conducting a data quality assessment, but the steps below serve as a good starting point.

Step 1: Gather background information

Before you begin, it’s important to gather background information about the data to ensure that you can accurately evaluate its quality. If supporting materials do not exist, then you’ll probably want to begin building some. Conducting a data quality assessment will help you identify which materials will be the most helpful in improving the quality of your data. These materials may include the following:

  • Codebooks, data maps, etc.
  • Instructions or training materials on how to enter data into the data system
  • Internal audits or audit processes
  • Reports generated using the data

Step 2: Evaluate data compared to data quality attributes

Lessons 3–6 explore dimensions and attributes of data quality. As you can see, and as you’ll learn, there is a lot more to data quality than whether numbers are right or wrong. This table summarizes the main points:

Dimension Definition Attributes
Intrinsic
Lesson 3
The data is accurate, reliable, credible, and impartial. Accuracy
Believability
Reputation
Objectivity
Contextual
Lesson 4
The data meets the needs of a particular use case. Relevance
Completeness
Timeliness
Representational
Lesson 5
The data is concise, consistent, interpretable, and easy to understand. Understandability
Interpretability
Consistency
Accessibility
Lesson 6
The data is available and easy to access and use. Accessibility
Access Security

Use the following checklist to understand your agency’s data quality strengths and weaknesses. Then, start thinking about which dimensions and which attributes of data quality you might want to focus on.

Intrinsic data quality (Lesson 3)

Accuracy attribute

Believability attribute

Reputation attribute

Objectivity attribute

Contextual data quality (Lesson 4)

Relevance attribute

Completeness attribute

Timeliness attribute

Representational data quality (Lesson 5)

Understandability attribute

Interpretability

Consistency

Accessibility data quality (Lesson 6)

Accessibility attribute

Access security attribute