Application of An Ontology for Characterizing Data Quality For a Secondary Use of EHR Data

Appl Clin Inform. 2016 Feb 10;7(1):69-88. doi: 10.4338/ACI-2015-08-RA-0107. eCollection 2016.

Abstract

Objective: The goal of this study is to apply an ontology based assessment process to electronic health record (EHR) data and determine its usefulness in characterizing data quality for calculating an example eMeasure (CMS178).

Methods: The process uses a data quality ontology that references separate data quality, domain and task ontologies to compute measures based on proportions of constraints that are satisfied. These quantities indicate how well the data conforms to the domain and how well it fits the task.

Results: The process was performed on a de-identified 200,000 encounter sample from a hospital EHR. CodingConsistency was poor (44%) but DomainConsistency (97%) and TaskRelevance (95%) were very good. Improvements in the data quality Measures correlated with improvements in the eMeasure.

Conclusion: This approach can encourage the development of new detailed Domain ontologies that can be reused for data quality purposes across different organizations' EHR data. Automating the data quality assessment process using this method can enable sharing of data quality metrics that may aid in making research results that use EHR data more transparent and reproducible.

Keywords: Data quality; data validation and verification; electronic health record; ontology.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Accuracy*
  • Electronic Health Records / statistics & numerical data*
  • Hospitals
  • Humans
  • Medical Informatics / methods*