The Pillars of Data Quality Management: A Guide to Mastering Them All
According to Gartner, poor data quality issues can generate an additional $15 million in annual costs on average. In fact, it is not only about financial losses, but also affects other levels, such as less reliable analysis, poor governance and risk of non-compliance, loss of brand value, slowing down corporate growth, etc.
For all these reasons, having quality data has become a fundamental value for companies that want to continue to innovate and stand out from the competition. We analyze its principles, best practices, and the keys to avoid falling into poor-quality data.
What is Data Quality
Data quality refers to the degree of accuracy, consistency, completeness, reliability, and relevance of data collected, stored, and used in an organization or in a specific context.
High-quality data is essential for making informed decisions, performing accurate analyses, and developing effective strategies. It is also necessary to properly function other technologies, such as artificial intelligence or IoT solutions.
Maintaining their high quality is a crucial factor for companies to obtain valuable and correct information, make the best decisions, and achieve their objectives. In fact, data quality has a direct influence on operational efficiency, as it gives departments the accurate information they need for day-to-day tasks, such as inventory management and order processing. It also affects customer satisfaction and new business opportunities by enabling more effective marketing and sales strategies based on accurate customer segmentation and targeting.
Data Quality Dimensions
Data quality dimensions are critical aspects used to assess the health and usability of each organization’s data. They provide a framework for effectively identifying and correcting quality problems.
The most important dimensions are:
- Completeness: refers to whether a data set contains all the necessary records, as a complete data set allows for more comprehensive analysis and decision-making.
- Accuracy: refers to the degree to which the data accurately represents real-world values or events. To ensure accuracy, it is necessary to identify and correct errors in the data set, such as incorrect entries or misrepresentations. To improve it, data validation rules can be implemented to help prevent inaccurate information from being entered into the system.
- Consistency: represents whether the same information stored and used in multiple instances matches. It ensures that analyses correctly capture and leverage the value of the data. It is difficult to assess requires planned testing across multiple data sets, and is often associated with the accuracy of the data.
- Timeliness and topicality: these ensure that data is current and relevant when used for purposes such as analysis or decision-making. Outdated information can lead to incorrect conclusions, so it is essential to keep data sets up to date.
- Singularity: refers to the absence of duplicate entries in a data set. Duplicate entries can distort the analysis by overrepresenting specific data points or trends. The primary action taken to improve the uniqueness of a dataset is to identify and remove duplicates.
- Granularity and relevance: these two ensure that the level of detail in the dataset is fit for purpose. Too much granularity can lead to unnecessary complexity, while too little can render the data useless for specific analyses. Striking a balance between these two aspects ensures that you get relevant and actionable information from the data.
Data Quality and Governance
Both data quality and data governance are two indispensable factors for companies wishing to become a data-driven enterprise. They may be independent practices, but they are highly related.
In summary, you cannot have data quality without good governance. In fact, organizations need proper data governance before even considering an enterprise-scale data quality tool.
Data governance affects security, privacy, accuracy, compliance, roles and responsibilities, management, integration, and so on. It is used for different tasks such as increasing transparency around data, standardizing systems, policies, and procedures, solving problems, and ensuring regulatory and organizational compliance.
All these tasks are necessary to improve and monitor data quality, as good governance allows creators and users to work on the same platform, which enables better communication and shared understanding of data quality.
Although the data may need a massive overhaul to improve its quality, this experience can be leveraged to adjust data governance policies and procedures to incorporate new data. Thus, using this overlapping perspective is the most useful when designing joint strategies on data governance and data quality.
To achieve successful incorporation of both practices, data teams must ask themselves questions (Where to start? Which data to focus on? Which data may be out of scope? Which has the greatest business impact?) from two different angles:
- Critical data elements: identify what is critical to the business, either through a regulatory report, a KPI, etc.
- Value of data: calculate the lifetime of poor-quality data or the risk associated with it, focusing first on those areas with the highest risk.
In both cases, once organizations identify and prioritize areas of concern, they can use data governance to create a collaborative framework for managing and defining policies, business rules and assets to provide the necessary level of data quality control.
Once it is clear how data flows through the organization and what the standards are, it is easier to ask the data quality team to translate these standards into data quality rules and enforce them on the data in those systems.
Data Quality Monitoring
To maintain and improve data quality, it is necessary to incorporate techniques and best practices into daily data management routines.
The most effective techniques include:
- Data profiling: review existing data to detect anomalies, patterns, or inconsistencies.
- Standardization: applying uniform formats across all data sets.
- Cleaning: correcting or removing inaccurate, incomplete, or irrelevant data records.
- Data enrichment: enhancing data from internal and external sources for greater context and value.
Regarding best practices:
- Periodic data quality assessments to proactively detect and address problems.
- Clear business rules that guide data inputs and avoid common data errors.
- Hire experts as data analysts who can use advanced analytics tools.
- Zero-defect data approach to achieve data quality that borders on perfection.
Data Quality Management
Establishing data quality standards is essential to ensure consistency and accountability in your organization’s data. Some of the principles of data quality management are as follows:
- Focus on business needs: The primary focus of data quality is to meet the requirements of the data quality dimensions according to business needs.
- Leadership: It is important that leaders from all departments align on a common set of strategies, policies, processes, and resources.
- Stakeholder engagement: Data quality is everyone’s responsibility and to achieve this, all employees must work within a framework where they can raise issues that cause poor data quality and have clear ways to address and prevent them.
- Process-based approach: A comprehensive and successful data quality and management program must take into account all business and technical processes that acquire, produce, maintain, transform, or disseminate data. Understanding how they interact with each other and what results they produce will be key to optimizing the data ecosystem.
- Continuous improvement: data management should be understood as a program that needs to be continually re-evaluated and adapted to keep up with internal and external conditions.
- Data-driven decision-making: Decision-making can be challenging, but with useful data, facts, evidence, and reliable analysis, more objective decisions can be made.
- Relationship management: Data quality management not only encompasses internal stakeholders but also extends to data management tool providers, suppliers, and consumers.
These data quality management principles can be applied in many different ways. As such, how each organization implements them will depend on the specific nature and challenges they face. What is common to all is that they will find many benefits in establishing a management program based on these principles.
Data Quality Framework
A data quality framework provides a structured approach to managing and improving data quality across all business operations. It ensures that data is accurate, complete, and reliable.
To create a data quality framework, you will need to consider aspects such as:
- Define roles and responsibilities
- Establish data quality rules
- Periodic evaluations
This framework must be adaptable to changing business needs while remaining robust to the challenges posed by new types of data and emerging technologies.
Implementing a comprehensive data quality framework ensures a reliable foundation for your information systems, fostering confidence in your data and the decisions derived from it. That’s why at Plain Concepts we offer you a Data Adoption Framework to become a data-driven enterprise.
We help you discover how to get value from your data, control and analyze all your data sources, and use data to make smart decisions and accelerate your business:
- Data analytics and strategy assessment: we evaluate data technology for architecture synthesis and implementation planning.
- Modern analytics and data warehouse assessment: we provide you with a clear view of the modern data warehousing model through understanding best practices on how to prepare data for analysis.
- Exploratory data analysis assessment: we look at the data before making assumptions so you get a better understanding of the available data sets.
- Digital Twin Accelerator and Smart Factory: we create a framework to deliver integrated digital twin manufacturing and supply chain solutions in the cloud.

We will formalize the strategy that best suits you and its subsequent technological implementation. Our advanced analysis services will help you unleash the full potential of your data and turn it into actionable information, identifying patterns and trends that can condition your decisions and boost your business.
Configuration ACCEPT Reject all
Privacy Overview
More information in the Cookies Policy.
Cookie | Duration | Description |
---|---|---|
__cfduid | 1 year | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
__cfduid | 29 days 23 hours 59 minutes | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
__cfduid | 1 year | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
__cfduid | 29 days 23 hours 59 minutes | The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_ga | 1 year | This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors. |
_gat_UA-326213-2 | 1 year | No description |
_gat_UA-326213-2 | 1 year | No description |
_gat_UA-326213-2 | 1 year | No description |
_gat_UA-326213-2 | 1 year | No description |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
_gid | 1 year | This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the wbsite is doing. The data collected including the number visitors, the source where they have come from, and the pages viisted in an anonymous form. |
attributionCookie | session | No description |
cookielawinfo-checkbox-analytics | 1 year | Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category . |
cookielawinfo-checkbox-necessary | 1 year | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-necessary | 1 year | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
cookielawinfo-checkbox-non-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-non-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-non-necessary | 11 months | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-non-necessary | 1 year | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Non Necessary". |
cookielawinfo-checkbox-performance | 1 year | Set by the GDPR Cookie Consent plugin, this cookie is used to store the user consent for cookies in the category "Performance". |
cppro-ft | 1 year | No description |
cppro-ft | 7 years 1 months 12 days 23 hours 59 minutes | No description |
cppro-ft | 7 years 1 months 12 days 23 hours 59 minutes | No description |
cppro-ft | 1 year | No description |
cppro-ft-style | 1 year | No description |
cppro-ft-style | 1 year | No description |
cppro-ft-style | session | No description |
cppro-ft-style | session | No description |
cppro-ft-style-temp | 23 hours 59 minutes | No description |
cppro-ft-style-temp | 23 hours 59 minutes | No description |
cppro-ft-style-temp | 23 hours 59 minutes | No description |
cppro-ft-style-temp | 1 year | No description |
i18n | 10 years | No description available. |
IE-jwt | 62 years 6 months 9 days 9 hours | No description |
IE-LANG_CODE | 62 years 6 months 9 days 9 hours | No description |
IE-set_country | 62 years 6 months 9 days 9 hours | No description |
JSESSIONID | session | The JSESSIONID cookie is used by New Relic to store a session identifier so that New Relic can monitor session counts for an application. |
viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
viewed_cookie_policy | 1 year | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
viewed_cookie_policy | 1 year | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
viewed_cookie_policy | 11 months | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
VISITOR_INFO1_LIVE | 5 months 27 days | A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface. |
wmc | 9 years 11 months 30 days 11 hours 59 minutes | No description |
Cookie | Duration | Description |
---|---|---|
__cf_bm | 30 minutes | This cookie, set by Cloudflare, is used to support Cloudflare Bot Management. |
sp_landing | 1 day | The sp_landing is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content. |
sp_t | 1 year | The sp_t cookie is set by Spotify to implement audio content from Spotify on the website and also registers information on user interaction related to the audio content. |
Cookie | Duration | Description |
---|---|---|
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjAbsoluteSessionInProgress | 1 year | No description |
_hjFirstSeen | 29 minutes | No description |
_hjFirstSeen | 29 minutes | No description |
_hjFirstSeen | 29 minutes | No description |
_hjFirstSeen | 1 year | No description |
_hjid | 11 months 29 days 23 hours 59 minutes | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjid | 11 months 29 days 23 hours 59 minutes | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjid | 1 year | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjid | 1 year | This cookie is set by Hotjar. This cookie is set when the customer first lands on a page with the Hotjar script. It is used to persist the random user ID, unique to that site on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID. |
_hjIncludedInPageviewSample | 1 year | No description |
_hjIncludedInPageviewSample | 1 year | No description |
_hjIncludedInPageviewSample | 1 year | No description |
_hjIncludedInPageviewSample | 1 year | No description |
_hjSession_1776154 | session | No description |
_hjSessionUser_1776154 | session | No description |
_hjTLDTest | 1 year | No description |
_hjTLDTest | 1 year | No description |
_hjTLDTest | session | No description |
_hjTLDTest | session | No description |
_lfa_test_cookie_stored | past | No description |
Cookie | Duration | Description |
---|---|---|
loglevel | never | No description available. |
prism_90878714 | 1 month | No description |
redirectFacebook | 2 minutes | No description |
YSC | session | YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages. |
yt-remote-connected-devices | never | YouTube sets this cookie to store the video preferences of the user using embedded YouTube video. |
yt-remote-device-id | never | YouTube sets this cookie to store the video preferences of the user using embedded YouTube video. |
yt.innertube::nextId | never | This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen. |
yt.innertube::requests | never | This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen. |