| Literature DB >> 25848565 |
Emily Beth Devine1, Daniel Capurro1, Erik van Eaton1, Rafael Alfonso-Cristancho1, Allison Devlin1, N David Yanez1, Meliha Yetisgen-Yildiz1, David R Flum1, Peter Tarczy-Hornoch1.
Abstract
BACKGROUND: The field of clinical research informatics includes creation of clinical data repositories (CDRs) used to conduct quality improvement (QI) activities and comparative effectiveness research (CER). Ideally, CDR data are accurately and directly abstracted from disparate electronic health records (EHRs), across diverse health-systems.Entities:
Keywords: CERTAIN; Comparative Effectiveness; Health Information Technology; Informatics; Quality Improvement
Year: 2013 PMID: 25848565 PMCID: PMC4371452 DOI: 10.13063/2327-9214.1025
Source DB: PubMed Journal: EGEMS (Wash DC) ISSN: 2327-9214
Figure 1.Data Flow Diagram for the CERTAIN Automation and Validation Project
CERTAIN=Comparative Effectiveness Research Translation Network; EHR=electronic health record; NLP=natural language processing; SCOAP=Surgical Care Outcomes & Assessment Program; VPN=virtual private network. The illustration is consistent with all existing data use agreements, business associate agreements, and memoranda of understanding between CERTAIN investigators and staff, and participating sites.
Figure 2.Comparison of the ARMUS and Amalga Databases for the SCOAP CERTAIN Automation and Validation Project
EHR=electronic heath record; ETL=extract, transform, load;
NLP=natural language processing
Example of data elements in combined ARMUS and Amalga dataset for specified time frame
| 1 | 1 | HbA1c | 1/1/13 | 7.4 | 7.3 | 7.4 |
| 1 | 1 | SCr | 1/1/13 | 0.7 | 0.8 | 0.8 |
| 2 | 3 | Age | 2/3/13 | 26 | 26 | 26 |
| … | ||||||
| 3 | 375 | Smoker | 3/31/13 | Yes | No | Yes |
HbA1c=hemoglobin A1c; SCr=serum creatinine; ID=Identifier.
Source #1 or #2=data from ARMUS or Amalga, each blinded.
Note: Identifiers are at the hospital and patient level but are coded and do not reflect actual hospital or patient identifiers.
Example of performance measures of validity, for each time frame
| Source #1 to EHR | Source #2 to EHR | Source #1 to EHR | Source #2 to EHR | Source #1 to EHR | Source #2 to EHR | Source #1 to EHR | Source #1 to EHR | Source #1 to EHR | Source #1 to EHR | |
|---|---|---|---|---|---|---|---|---|---|---|
| HbA1c | 67 | 60 | 91 | 90 | 7 | 6 | 0.4 | 0.4 | 9 | 7 |
| SCr | 73 | 53 | 92 | 92 | 49 | 7 | 0.3 | 0.2 | 10 | 8 |
| Age | 67 | 80 | 91 | 91 | 7 | 9 | 0.4 | 0.5 | 10 | 12 |
| Gender | 91 | 90 | 67 | 60 | 3 | 2 | 0.1 | 0.2 | 48 | 47 |
EHR=electronic health record; HbA1c=hemoglobin A1c; LR+=likelihood ratio positive; OR−=likelihood ratio negative; SCr=serum creatinine; Source #1 or #2=data from ARMUS or Amalga, each blinded
Figure 3.Characteristics of SCOAP Data Collection Elements