| Literature DB >> 27370271 |
Danielle L Mowery1, Brett R South2, Lee Christensen2, Jianwei Leng2, Laura-Maria Peltonen3, Sanna Salanterä3, Hanna Suominen4, David Martinez5,6, Sumithra Velupillai7, Noémie Elhadad8, Guergana Savova9, Sameer Pradhan9, Wendy W Chapman2.
Abstract
BACKGROUND: The ShARe/CLEF eHealth challenge lab aims to stimulate development of natural language processing and information retrieval technologies to aid patients in understanding their clinical reports. In clinical text, acronyms and abbreviations, also referenced as short forms, can be difficult for patients to understand. For one of three shared tasks in 2013 (Task 2), we generated a reference standard of clinical short forms normalized to the Unified Medical Language System. This reference standard can be used to improve patient understanding by linking to web sources with lay descriptions of annotated short forms or by substituting short forms with a more simplified, lay term.Entities:
Keywords: Abbreviations; Acronyms; Consumer health information; Natural language processing; Unified Medical Language System
Mesh:
Year: 2016 PMID: 27370271 PMCID: PMC4930590 DOI: 10.1186/s13326-016-0084-y
Source DB: PubMed Journal: J Biomed Semantics
aParticipant system performances from [32, 34] compared against a majority sense baseline performance
| Short form normalization system | Unique predictions by the system | Annotations comparable with reference standard | Accuracy |
|---|---|---|---|
| aUTHealthCCB.B.1 | 3,774 | 3,774 | 71.9* |
| Majority Sense Baseline | 3,774 | 3,774 | 69.6 |
| aUTHealthCCB.B.2 | 3,774 | 3,774 | 68.3 |
| aLIMSI.1 | 3,896 | 3,611 | 66.4 |
| aTHCIB.B.1 | 3,774 | 3,774 | 65.7* |
| aTeamHealthLanguageLABS | 2,987 | 2,633 | 46.7* |
| aWVU.1 | 3,068 | 2,359 | 42.6 |
*Indicates that the difference in accuracy is statistically significant with the system immediately below (p < 0.01)
Top 10 most frequent lexical variants with two or more senses according to distribution type
| Short form term | Total count | Senses according to concept unique identifiers | Distribution of senses |
|---|---|---|---|
|
| |||
| ‘pt’ | 137 | C0030705: Patients | 89 % |
| C0949766: Physical therapy procedure | 4 % | ||
| C0086835: Structure of the posterior tibial artery | 4 % | ||
| 3 more senses | 8 % | ||
| ‘ct’ | 82 | C0040405: X-Ray computed tomography | 95 % |
| C1274037: Cardiothoracic surgery | 2 % | ||
| C0008034: Thoracic drain | 2 % | ||
| 1 more sense | 1 % | ||
| ‘m’ | 62 | C0024554: Male gender | 81 % |
| C0018808: Heart murmur | 16 % | ||
| C0026591: Mother | 2 % | ||
| 1 more sense | 2 % | ||
| ‘ekg’ | 41 | C0013798: Electrocardiogram | 98 % |
| C1623258: Electrocardiographic procedure | 2 % | ||
| ‘f’ | 37 | C0015780: Female | 92 % |
| C0015967: Fever | 5 % | ||
| CUI-less | 3 % | ||
| ‘cath’ | 33 | C0007430: Catheterization | 97 % |
| C0085590: Catheter | 3 % | ||
| ‘lad’ | 33 | C0226032: Anterior descending branch of left coronary artery | 85 % |
| C0497156: Lymphadenopathy | 15 % | ||
| ‘pcp’ | 31 | C0033131: Primary care physicians | 84 % |
| C0032305: Pneumonia, Pneumocystis carinii | 16 % | ||
| ‘cad’ | 31 | C1956346: Coronary artery disease | 97 % |
| C0010068: Coronary heart disease | 3 % | ||
| ‘abd’ | 29 | C0562238: Examination of abdomen | 90 % |
| C0000726: Abdominal | 10 % | ||
|
| |||
| ‘bp’ | 53 | C1271104: Blood pressure finding | 68 % |
| C0005823: Blood pressure | 32 % | ||
| ‘r’ | 43 | C0205090: Right | 58 % |
| C0232267: Pericardial rub | 23 % | ||
| C0035508: Rhonchi | 11 % | ||
| 2 more senses | 9 % | ||
| ‘hr’ | 40 | C0577802: Finding of heart rate | 68 % |
| C0018810: Heart rate | 33 % | ||
| ‘neuro’ | 34 | C0027853: Neurologic examination | 79 % |
| C0205494: Neurologic (qualifier value) | 6 % | ||
| C0221571: Nervous system problem | 6 % | ||
| 3 more senses | 9 % | ||
| ‘pod’ | 28 | CUI-less | 79 % |
| C0032790: Postoperative period | 21 % | ||
| ‘ra’ | 26 | C2709070: On room air | 62 % |
| C0225844: Right sided atrium | 35 % | ||
| C0456165: Right atrial pressure | 4 % | ||
| ‘bs’ | 26 | C0232693: Bowel sounds | 77 % |
| C0035234: Respiratory Sounds | 23 % | ||
| ‘pa’ | 19 | C1996865: Postero-anterior | 53 % |
| C0034052: Pulmonary artery structure | 37 % | ||
| C0428642: Pulmonary artery pressure | 11 % | ||
| ‘rrr’ | 18 | C0232185: Cardiac rhythm AND/OR rate finding | 67 % |
| C0232188: Normal heart right | 28 % | ||
| C0513693: Monitor rate, rhythm, depth, and effort of respirations | 6 % | ||
| ‘mr’ | 18 | C0026266: Mitral valve insufficiency | 78 % |
| C0024485: Magnetic resonance imaging | 22 % | ||
|
| |||
| ‘c’ | 25 | C0010520: Cyanosis of skin | 32 % |
| C0149651: Clubbing | 32 % | ||
| C0205064: Cervical | 24 % | ||
| 2 more senses | 12 % | ||
| ‘trach’ | 9 | C0040590: Tracheostomy procedure | 33 % |
| C0184159: Tracheostomy tube | 33 % | ||
| C0040591: Tracheotomy procedure | 11 % | ||
| 2 more senses | 22 % | ||
| ‘meds’ | 9 | C0013227: Pharmaceutical preparations | 44 % |
| C0025118: Medicine | 33 % | ||
| C0033081: Drug prescriptions | 22 % | ||
| ‘cont’ | 8 | C0549178: Continuous | 38 % |
| CUI-less | 38 % | ||
| C0584669: Recommendation to continue with treatment | 13 % | ||
| 1 more sense | 13 % | ||
| ‘v’ | 6 | C0042963: Vomiting | 33 % |
| C0348013: Venous | 33 % | ||
| C2228490: Examination of trigeminal nerve | 33 % | ||
| ‘d/c’ | 3 | C0030685: Patient discharge | 33 % |
| C1444662: Discontinued | 33 % | ||
| C1548175: On discharge | 33 % | ||
| ‘pos’ | 3 | C0205531: Oral route | 33 % |
| C0518037: Oral food intake | 33 % | ||
| C1446409: Positive | 33 % | ||
| ‘cvp’ | 3 | C0199666: Measurement of central venous pressure | 33 % |
| C0428640: Central venous pressure | 33 % | ||
| C1321771: Central venous pressure finding | 33 % | ||
Fig. 1Accuracies of participating systems and Majority Sense Baseline for each majority sense distribution category
Accuracy of normalizing short forms with concept unique identifiers shared between the ShARe test set and the Consumer Health Vocabulary
| Short form normalization system | Accuracy |
|---|---|
| UTHealthCCB.B.1 | 75.0 |
| Majority Sense Baseline | 73.2 |
| THCIB.B.1 | 73.1 |
| UTHealthCCB.B.2 | 70.4 |
| LIMSI.1 | 69.6 |
| TeamHealthLanguageLABS | 50.9 |
| WVU.1 | 50.1 |