Evaluating the Impact of OMOP-CDM on Data Quality Insight Generation in Respiratory Disease Management.

dc.contributor.authorYankam, B.M.
dc.contributor.authorLuc Baudoin, F. T.
dc.contributor.authorAndeso P.
dc.contributor.authorOnana Akoa, F. A.
dc.contributor.authorEbimbe, J. B.
dc.contributor.authorBarasa, M.
dc.contributor.authorOnana, M.
dc.contributor.authorIddi, S.
dc.contributor.authorKiragga, A.
dc.contributor.authorMbatchou Ngahane, B. H
dc.contributor.authorData Science Without Borders Project
dc.date.accessioned2026-05-15T19:12:28Z
dc.date.issued2026
dc.description.abstractThe increasing volume and heterogeneity of patient care data present significant challenges for comprehensive analysis and the generation of insights, particularly in specific areas such as respiratory diseases. Standardizing diverse health data is crucial for enabling large-scale observational research and ensuring data readiness. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) provides a widely adopted standard for harmonizing such data. However, evaluating the quality of data transformed into the OMOP CDM format is a critical step before its use in research or clinical decision support. This study evaluates the impact of the OMOP CDM standardization process on generating data quality insights for a respiratory disease dataset. The source dataset was initially paper-based, converted to an electronic format, and translated from French into English. This historical dataset covers the years 2009-2023 and contains 108 variables and 2,154 records. The data underwent the standard Extract, Transform, and Load (ETL) process to convert into the OMOP CDM format. Following this transformation, the quality of the resulting OMOP CDM instance was assessed. The Data Quality Dashboard (DQD) was utilized to evaluate the quality of the OMOP CDM database before and after ETL verification, with checks on completeness, plausibility, and conformance. Overall, the assessment conducted 2,344 checks, of which 2,269 passed and 75 failed, resulting in a corrected pass rate of 96% before ETL verification. After ETL verification, the assessment conducted 2,374 checks, of which 2,356 passed and 40 failed, resulting in a 100% corrected pass rate. Standardizing respiratory disease data using the OMOP CDM enabled a structured and transparent evaluation of data quality, demonstrating the utility of OMOP CDM in generating meaningful data quality insights, and highlighting the model's potential to enhance data readiness and support evidence-based decision-making in respiratory disease management.
dc.identifier.urihttps://doi.org/10.3389/fdata.2026.1744885
dc.identifier.urihttps://knowhub.aphrc.org/handle/123456789/3039
dc.publisherFrontiers
dc.subjectOMOP Common Data Model
dc.subjectHealth data standardization
dc.subjectData quality assessment
dc.subjectRespiratory disease informatics
dc.subjectETL process in health data
dc.subjectEvidence-based decision-making in Africa
dc.titleEvaluating the Impact of OMOP-CDM on Data Quality Insight Generation in Respiratory Disease Management.

Files

Collections