| Literature DB >> 25601137 |
Tomasz Adamusiak1, Naoki Shimoyama, Mary Shimoyama.
Abstract
BACKGROUND: Structured information within patient medical records represents a largely untapped treasure trove of research data. In the United States, privacy issues notwithstanding, this has recently become more accessible thanks to the increasing adoption of electronic health records (EHR) and health care data standards fueled by the Meaningful Use legislation. The other side of the coin is that it is now becoming increasingly more difficult to navigate the profusion of many disparate clinical terminology standards, which often span millions of concepts.Entities:
Keywords: CPT; HCPCS; ICD-10; ICD-9; LOINC; RxNorm; SNOMED CT; UMLS; meaningful use; semantic interoperability
Year: 2014 PMID: 25601137 PMCID: PMC4288084 DOI: 10.2196/medinform.3172
Source DB: PubMed Journal: JMIR Med Inform
Common MU Dataset defined in Stage 2 MU Final Rule (Federal Register Vol. 77, No. 171, September 4, 2012) and corresponding vocabulary standards.
| Common MU Dataset | Vocabulary standard |
| 1. Patient name | N/A |
| 2. Sex | N/A |
| 3. Date of birth | N/A |
| 4. Race | The OMBa Standards for Maintaining, Collecting, and Presenting Federal Data on Race and Ethnicity, Statistical Policy Directive No. 15, as revised, October 30, 1997 |
| 5. Ethnicity | OMB |
| 6. Preferred language | As specified by the Library of Congress, ISOb639-2 alpha-3 codes limited to those that also have a corresponding alpha-2 code in ISO 639-1 |
| 7. Smoking status | Any of the following SNOMED CTc codes- |
| 8. Problems | At a minimum, SNOMED CT International Release July 2012 and US Extension to SNOMED CT March 2012 Release |
| 9. Medications | RxNorm, August 6, 2012 Release |
| 10. Medication allergies | RxNorm, August 6, 2012 Release |
| 11. Laboratory tests | LOINCd version 2.40 |
| 12. Laboratory values/results | N/A |
| 13. Vital signs (height, weight, BPe, BMIf) | N/A |
| 14. Care plan fields including goals and instructions | N/A |
| 15. Procedures | At a minimum, SNOMED CT International Release, July 2012 with US Extension to SNOMED CT March 2012 or the combination of HCPCSg and CPTh 4 |
| 16. Care team members | N/A |
aOMB=Office of Management and Budget
bISO=International Organization for Standardization
cSNOMED CT=Systematized Nomenclature of Medicine, Clinical Terms
dLOINC=Logical Observation Identifiers Names and Codes
eBP=blood pressure
fBMI=body mass index
gHCPCS=Health care Common Procedure Coding System
hCPT=Current Procedural Terminology
iCDT=Code on Dental Procedures and Nomenclature
jICD-10-PCS=International Classification of Diseases, Tenth Revision, Procedure Coding System
Clinical Avatars data mapped to the UMLS via MU ontologies.
| Clinical Avatars | MU source mapping | UMLS mapping | Term label | ||
|
| None |
|
| ||
|
| F |
| C0015780 | Female | |
|
| M |
| C0024554 | Male gender | |
|
| OMB standard |
|
| ||
|
| African American |
| C0085756 | African American | |
|
| Native American |
| C1515945 | American Indian or Alaska Native | |
|
| Asian |
| C0078988 | Asians | |
|
| White |
| C0043157 | Caucasians | |
|
| (no data) |
| C0086409 | Hispanic or Latino | |
|
| Pacific Islander |
| C1513907 | Native Hawaiian or other Pacific Islander | |
|
| Other/unknown |
| C1532697 | Unknown racial group | |
| Height | LOINC:3137-7 | C0365282 | Body height measured | ||
| Weight | LOINC:3141-9 | C0365286 | Body weight measured | ||
| BSAa | LOINC:3139-3 | C0365285 | Body surface area measured | ||
| INRb | LOINC:34714-6 | C1369580 | INR in blood by coagulation assay value | ||
|
|
|
|
| ||
|
| Y | SNOMED CT:77176002 | C0337664 | Smoker | |
|
| N | SNOMED CT:8392000 | C0337672 | Nonsmoker | |
|
|
|
|
| ||
|
| Y | SNOMED CT:128053003 | C0149871 | Deep venous thrombosis | |
|
| N | SNOMED CT:413076004 | C1446197 | No past history of venous thrombosis | |
|
|
|
|
| ||
|
| Y | SNOMED CT:57054005 | C0155626 | Acute myocardial infarction | |
|
| N | SNOMED CT:301121007 | C0577811 | Myocardial perfusion normal | |
| CYP2C9 | LNC:46724-1 | C1830800 | cyp2c9 gene mutations found [identifier] in blood or tissue by molecular genetics method nominal | ||
| CYP2C92 | LNC:56164-7 | C2734139 | cyp2c9 gene allele 2 [identifier] in blood by molecular genetics method nominal | ||
| CYP2C93 | LNC:56165-4 | C2734141 | cyp2c9 gene allele 3 [identifier] in blood by molecular genetics method nominal | ||
| VKORC1 | LNC:50722-8 | C1978717 | vkorc1 gene mutations found [identifier] in blood or tissue by molecular genetics method nominal | ||
| VKORC1A | LNC:50722-8 | C1978717 | vkorc1 gene mutations found [identifier] in blood or tissue by molecular genetics method nominal | ||
| VKORC1G | LNC:50722-8 | C1978717 | vkorc1 gene mutations found [identifier] in blood or tissue by molecular genetics method nominal | ||
| Warfarin | RxNorm:11289 | C0043031 | Warfarin | ||
aBSA=body surface area
bINR=international normalized ratio
cDVT=deep vein thrombosis
dAMI=acute myocardial infarction
Figure 1Screenshot of ClinMiner’s integrated terminology browser. The tabs allow switching between different terminologies and the integrated MU 360 view, default choice (A). Searching. Typing a query into the input field (B) brings up autosuggestions. Selecting a particular string populates the middle panel (D) with search results. Selecting a search result brings back the hierarchical view with the selected term (Warfarin) highlighted in yellow (G). Browsing. Parents of the active term are displayed in the left pane and child terms are displayed in the right pane (F). Meta data for the active term including semantic types, definitions, and non-isa relations to other concepts are displayed in a vignette directly below (H). A plus sign (+) after the term label denotes concepts with children, and the number in brackets reflects the number of participants annotated to a particular term (or its children) in the database. Selecting a study from the drop-down list (C) enables the data driven perspective that displays a compact terminology tree limited to only relevant concepts.
Figure 2An overview of the extract, load, and transform (ELT) process. Data is extracted from multiple sources including disease registries, hospital’s EHR system, and clinical notes.
Figure 3An example of the transformation stage in the extract, load, and transform (ELT) process (for a higher resolution image, see Multimedia Appendix 1). The SNOMED CT annotation "Deep venous thrombosis" made originally in the EHR, is mapped in the UMLS to the "Deep Vein Thrombosis" concept, and can be further remapped into the UMLS source concepts such as the ICD-10-CM "Acute embolism and thrombosis of unspecified deep veins of lower extremity" concept shown in the lower right portion of the figure. The UMLS concept "Deep Vein Thrombosis" is then expanded across a set of parent concepts that are within the same UMLS Semantic Network (solid lines). The concepts characterized by a different semantic type are not included in the expansion (dotted lines). In this example, two parent concepts of "Deep Vein Thrombosis", "Thrombophlebitis and Venous Thrombosis" have semantic types "Disease" or "Syndrome and Pathologic Function" respectively. Thus, the expansion does not include the term "Venous Thrombosis", as the semantic type is different from the originating concept’s semantic type ("Disease or Syndrome"), but does include "Thrombophlebitis", which share the same semantic type. There were four high level concepts that were additionally highlighted at the top of the figure, out of which "Disease" is the only one included in the expansion.