| Literature DB >> 24303255 |
Hongfang Liu1, Suzette J Bielinski, Sunghwan Sohn, Sean Murphy, Kavishwar B Wagholikar, Siddhartha R Jonnalagadda, K E Ravikumar, Stephen T Wu, Iftikhar J Kullo, Christopher G Chute.
Abstract
Information extraction (IE), a natural language processing (NLP) task that automatically extracts structured or semi-structured information from free text, has become popular in the clinical domain for supporting automated systems at point-of-care and enabling secondary use of electronic health records (EHRs) for clinical and translational research. However, a high performance IE system can be very challenging to construct due to the complexity and dynamic nature of human language. In this paper, we report an IE framework for cohort identification using EHRs that is a knowledge-driven framework developed under the Unstructured Information Management Architecture (UIMA). A system to extract specific information can be developed by subject matter experts through expert knowledge engineering of the externalized knowledge resources used in the framework.Entities:
Year: 2013 PMID: 24303255 PMCID: PMC3845757
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.
System architecture of the IE framework under cTAKES.
Statistics of contingency matrix on the implementation of PAD algorithm.
|
| |||
|---|---|---|---|
|
| Class I (Positive or Probable) | Class II (Negative or Unknown) | |
| Class I (Positive or Probable) | 263 | 10 | |
| Class II (Negative or UnKnown) | 46 | 136 | |
Statistics of patients identified by the HF algorithm.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| HF Case Cohort | 706 | 586 (83%) | 72 (10%) | 41 (6%) | 7 (1%) |
| MayoGC/eMERGE Cohort | 6923 | 535 (8%) | 94 (1%) | 684 (10%) | 5610 (81%) |