Data quality assurance
| This article does not cite any references or sources. (December 2009) |
Data quality assurance is the process of profiling the data to discover inconsistencies, and other anomalies in the data and performing data cleansing activities (e.g. removing outliers, missing data interpolation) to improve the data quality .
These activities can be undertaken as part of data warehousing or as part of the database administration of an existing piece of applications software.
Criticism of existing tools and processes
The main reasons cited are:
- Project costs: costs typically in the hundreds of thousands of dollars
- Time: lack of enough time to deal with large-scale data-cleansing software
- Security: concerns over sharing information, giving an application access across systems, and effects on legacy systems
