| Literature DB >> 22533507 |
Makoto Jones1, Scott L DuVall, Joshua Spuhl, Matthew H Samore, Christopher Nielson, Michael Rubin.
Abstract
BACKGROUND: Accurate information is needed to direct healthcare systems' efforts to control methicillin-resistant Staphylococcus aureus (MRSA). Assembling complete and correct microbiology data is vital to understanding and addressing the multiple drug-resistant organisms in our hospitals.Entities:
Mesh:
Year: 2012 PMID: 22533507 PMCID: PMC3394221 DOI: 10.1186/1472-6947-12-34
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Figure 1Data Network Diagram. RPC - remote procedure call. CPRS - Computerized Patient Record System. RDW- regional data warehouse. CDW-corporate data warehouse. PCS – VA Patient Care Services. NLP- natural language processing. Laboratory data are generated when tests on patient samples are processed in laboratories and entered into VistA. From there, they may be accessed through CPRS and PCS through RPCs. They may also be extracted through other processes (journaling and M). Because PCS microbiology data are free-text, information must be extracted into an analyzable form. Figure courtesy of Kiyoshi Jones.
Figure 2Information Extraction Strategy. VINCI -VA Informatics and Computing Infrastructure. SNOMED – Systematized Nomenclature of Medicine. RxNorm – a standardized nomenclature of clinical drugs developed by the National Library of Medicine.
Figure 3Sample of a Microbiology Report. A typical microbiology report is represented here. Metadata about the specimen taken are described first. There are two sections here: ‘Culture Results’ and ‘Antibiotic Susceptibility Test Results.’ The organisms listed in both are linked by numbers because the same genus and species may be isolated more than once. Susceptibilities may appear in the ‘Culture Results’ section with the organism name, in the same section in a ‘comments’ subsection as depicted above, or in a matrix in the ‘Antibiotic Susceptibility Test Results’ section as depicted above.
Information extraction accuracy
| | ||||||
| | | | Sensitivity | Specificity | PPV | NPV |
| | 99.6 (4026/4044) | 99.9 (4828/4829) | 99.9 (4026/4027) | 99.6 (4828/4846) | ||
| | | Methicillin Resistance | 99.9 (2789/2790) | 99.9 (59701/59710) | 99.7 (2786/2795) | 99.9 (59701/59705) |
| | | | | | ||
| | | | | |||
| | | | Sensitivity | Specificity | PPV | NPV |
| | 100 (2739/2739) | 99.9 (3185/3188) | 99.9 (2739/2742) | 100 (3185/3185) | ||
| | | Methicillin Resistance | 100 (1460/1460) | 99.9 (4465/4467) | 99.9 (1460/1462) | 100 (4465/4465) |
| | | | | |||
| | | | Sensitivity | Specificity | PPV | NPV |
| | 98.3 (1348/1372) | 99.7 (1714/1720) | 99.6 (1348/1354) | 98.6 (1714/1738) | ||
| Methicillin Resistance | 99.2 (703/710) | 99.4 (2368/2383) | 97.9 (703/718) | 99.8 (2368/2374) | ||
PPV - positive predictive value, NPV - negative predictive value depicts the accuracy of the extraction process on the training set (both electronic and expert-reviewed data sets combined), as well as on the validation set (reported separately). Both numbers and percentages are supplied.