| Literature DB >> 25911572 |
Katherine P Liao1, Tianxi Cai2, Guergana K Savova3, Shawn N Murphy4, Elizabeth W Karlson5, Ashwin N Ananthakrishnan6, Vivian S Gainer7, Stanley Y Shaw8, Zongqi Xia9, Peter Szolovits10, Susanne Churchill11, Isaac Kohane12.
Abstract
Entities:
Mesh:
Year: 2015 PMID: 25911572 PMCID: PMC4707569 DOI: 10.1136/bmj.h1885
Source DB: PubMed Journal: BMJ ISSN: 0959-8138

Fig 1 Overview of the two main types of EMR data, structured and unstructured, and how these data can be integrated for research studies. In this instance, the figure illustrates the development of a phenotype algorithm for rheumatoid arthritis. *Including ICD-9 (international classification of diseases, 9th revision) codes and CPT (current procedural terminology) codes
Useful web resources for EMR phenotype development*
| Resource | Description | URL |
|---|---|---|
| UMLS | Unified Medical Language System | |
| RxNorm | Normalized names for clinical drugs | |
| SNOMED CT | Systemized Nomenclature of Medicine-Clinical Terms | |
| cTAKES | Apache clinical Text Analysis and Knowledge Extraction System | |
| HITex | Health Information Text Extraction | |
| eMERGE | Electronic Medical Records and Genomics Network | |
| i2b2 | Informatics for Integrating Biology and the Bedside project | |
*This table lists examples of resources used by the i2b2 team for EMR phenotype development or mentioned in this article; it is not a comprehensive list.

Fig 2 Overview of methods used to develop EMR phenotype algorithms
Comparison of performance algorithms using different types of data to classify phenotypes*
| Phenotype and performance characteristic | Performance algorithm | ||
|---|---|---|---|
| Structured data only | NLP data only | Structured and NLP data | |
| Crohn’s disease | |||
| Sensitivity (%) | 64 | 64 | 72 |
| PPV (%) | 98 | 98 | 98 |
| Ulcerative colitis | |||
| Sensitivity (%) | 60 | 68 | 73 |
| PPV (%) | 97 | 97 | 97 |
| Sensitivity (%) | 68 | 68 | 78 |
| PPV (%) | 94 | 94 | 95 |
| Sensitivity (%) | 51 | 56 | 63 |
| PPV (%) | 88 | 89 | 94 |
*Specificity cut-off for all phenotypes was set at 97%.

Fig 3 Proportion of patients in data marts for rheumatoid arthritis (n=28 982) and ulcerative colitis (n=14 335) who have been classified to have the phenotype. Numbers over each bar=EMR cohort size