Share

DATA QUALITY CONCEPTS - Part 2

Christopher Wagner • Jan 07, 2022

You have to know where you are before you know where you are going

Before we go too deep into Data Quality, we must first establish standard verbiage and the breadth and depth of the concepts at hand. These brief definitions of the topics need to be addressed in data quality. By no means is this an in-depth list of Data Quality terms, nor are they fully defined. The below definitions are just enough to get you started. 


Completeness - Do you have a complete record or dataset? 

 

Accuracy- Does the data match the real world? 

 

Consistency- Is the data consistently cleansed and formatted? (Chris, CHRIS, chris, cHrIS) 

 

Validity- Does the data represent real-world expected values? (ZIP codes are ZIP Codes, Countries are Countries, etc.) 

 

Uniqueness- Is each record suitably unique? (You should only have one business key in a type one dimension.) 

 

Integrity- As data moves through a system, are you losing or duplicating records? 

 

Accessibility- Do the right people have access to the correct data in the right way? (Business users via reports and dashboards, analysts via query tools, data engineers via sources, transformation, errors) 

 

Timeliness- Do you have the data within the acceptable SLA? (Real-time/ near real-time, micro-batch, batch)

 

Relevance- Is the data relevant for the business needs? 

 

Actionable- Do you have a complete record or dataset? Is the data Actionable, and is the data recommending the right actions?



DATA QUALITY BLOG SERIES

Each day the Data Quality Blog post will be released at 8:45 AM each day.


DATA QUALITY - Part 1 January 6th

DATA QUALITY CONCEPTS - Part 2 January 7th

DATA QUALITY FOR EVERYONE - Part 3 January 10th

DATA QUALITY FRAMEWORK - Part 4 January 11th

DATA QUALITY DEVELOPMENT - Part 5 January 12th

QUALITY DATA - Part 6 January 13th



CHRIS WAGNER, MBA MVP

Analytics Architect, Mentor, Leader, and Visionary

Chris has been working in the Data and Analytics space for nearly 20 years. Chris has dedicated his professional career to making data and information accessible to the masses. A significant component in making data available is continually learning new things and teaching others from these experiences. To help people keep up with this ever-changing landscape, Chris frequently posts on LinkedIn and to this blog.
By Christopher Wagner 15 Nov, 2023
In a dynamic data engineering scenario, Sam, a skilled professional, adeptly navigates urgent requests using Microsoft Fabric. Collaborating with Data Steward Lisa and leveraging OneLake, Sam streamlines data processes, creating a powerful collaboration between engineering and stewardship. With precision in Azure Data Factory and collaboration with a Data Scientist, Sam crafts a robust schema, leading to a visually appealing Power BI report.
By Christopher Wagner 28 Apr, 2023
NOTE: This is the first draft of this document that was assembled yesterday as a solo effort. If you would like to contribute or have any suggestions, check out my first public GIT repository - KratosDataGod/LakehouseToPowerBI: Architectural design for incorporating a Data Lakehouse architecture with an Enterprise Power BI Deployment (github.com) This article is NOT published, reviewed, or approved by ANYONE at Microsoft. This content is my own and is what I recommend for architecture and build patterns.
Work Hard - Let's GO!
By Christopher Wagner 14 Apr, 2023
Work Hard - Let's GO!
Data God Guide to Learning DAX
By Christopher Wagner 22 Jan, 2023
DAX is more than the name of my new puppy
Show More
Share by: