| Literature DB >> 21798029 |
Solweig Gerbier1, Olga Yarovaya, Quentin Gicquel, Anne-Laure Millet, Véronique Smaldore, Véronique Pagliaroli, Stefan Darmoni, Marie-Hélène Metzger.
Abstract
BACKGROUND: The identification of patients who pose an epidemic hazard when they are admitted to a health facility plays a role in preventing the risk of hospital acquired infection. An automated clinical decision support system to detect suspected cases, based on the principle of syndromic surveillance, is being developed at the University of Lyon's Hôpital de la Croix-Rousse. This tool will analyse structured data and narrative reports from computerized emergency department (ED) medical records. The first step consists of developing an application (UrgIndex) which automatically extracts and encodes information found in narrative reports. The purpose of the present article is to describe and evaluate this natural language processing system.Entities:
Mesh:
Year: 2011 PMID: 21798029 PMCID: PMC3158541 DOI: 10.1186/1472-6947-11-50
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Figure 1Processing of natural language data extracted from emergency department medical records by UrgIndex (University of Lyon's Hôpital de la Croix-Rousse, Lyon, France).
Figure 2Processing of a medical narrative report section by UrgIndex. Example of processing the narrative section "clinical observation" of a patient going to the emergency department of Hôpital de la Croix-Rousse (a University of Lyon hospital) with a flu-like syndrome.
Figure 3Processing of a medical narrative report section by UrgIndex. Example of processing narrative section "clinical observation", when there is more than one phrase, of a patient going to the emergency department of Hôpital de la Croix-Rousse (a University of Lyon hospital) with a bronchitis.
Recall and precision of UrgIndex in the test phase (n = 100 medical records)
| Number of medical concepts evaluated | Number of correctly processed concepts | Number of medical concepts proposed by UrgIndex | Recall * | Precision § | |||
|---|---|---|---|---|---|---|---|
| (%) | 95% confidence interval | (%) | 95% confidence interval | ||||
| Respiratory syndrome | 966 | 816 | 1,002 | 84.5 | 82.0-86.7 | 81.4 | 78.9-83.8 |
| Cutaneous syndrome | 986 | 858 | 1,115 | 87.0 | 84.8-89.1 | 77.0 | 74.4-79.4 |
| Total | 1,952 | 1,674 | 2,117 | 85.8 | 84.1-87.3 | 79,1 | 77.3-80.8 |
UrgIndex- Processing of emergency services medical narrative records (DMU) of the University of Lyon's Hôpital de la Croix-Rousse, France
*Recall = Number of relevant processed concepts (true positives)/number of medical concepts evaluated (manually coded by the medical epidemiologist)
§Precision = Number of relevant processed concepts (true positives)/number of medical concepts proposed by UrgIndex
Evaluation of the processing quality of concepts by type of variable on the test set
| Chief complaint | Observation | Specialists' notes | Procedures | Prescriptions | Total | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N* | % | 95% CI | N | % | 95% CI | N | % | 95% CI | N | % | 95% CI | N | % | 95% CI | N | % | 95% CI | |
| Correctly-processed concepts | 117 | 91.4 | 85.1-95.6 | 1,481 | 85.3 | 83.6-86.9 | 11 | 73.3 | 44.9-92.2 | 53 | 88.3 | 77.4-95.2 | 12 | 92.3 | 64.0-99.8 | 1,674 | 85.8 | 84.1-87.3 |
| Missing code in the ECMT § | 0 | - | - | 6 | 0.3 | 0.1-0.8 | 0 | - | - | 0 | - | - | 0 | - | 6 | 0.3 | 0.1-0.7 | |
| Missing code in UrgIndex filters | 0 | - | - | 16 | 0.9 | 0.5-1.5 | 0 | - | - | 0 | - | - | 1 | 7.7 | 0.2-36.0 | 17 | 0.9 | 0.5-1.4 |
| Term not recognized by UrgIndex † | 6 | 4.7 | 1.7-9.9 | 176 | 10.1 | 8.8-11.7 | 1 | 6.7 | 0.2-31.9 | 7 | 11.7 | 4.8-22.6 | 0 | - | 190 | 9.7 | 8.5-11.1 | |
| Negation not recognized by UrgIndex | 0 | - | - | 9 | 0.5 | 0.2-1.0 | 0 | - | - | 0 | - | - | 0 | - | 9 | 0.5 | 0.2-0.9 | |
| Other UrgIndex error | 5 | 3.9 | 1.3-8.9 | 48 | 2.8 | 2.0-3.6 | 3 | 20 | 0.4-48.1 | 0 | - | - | 0 | - | 56 | 2.9 | 2.2-3.7 | |
| Total number of concepts | 128 | 100 | - | 1,736 | 100 | - | 15 | 100 | - | 60 | 100 | - | 13 | 100 | - | 1,952 | 100 | - |
UrgIndex- Processing of emergency services medical narrative records (DMU) of the University of Lyon's Hôpital de la Croix-Rousse, Lyon, France
*N = Number of evaluated concepts
§ECMT = French-language medical multi-terminology indexer
† Abbreviation, spelling, synonym
Evaluation of reasons for coding false positives concepts by type of variable on the test set
| Chief complaint | Observation | Specialists' notes | Procedures | Prescriptions | Total | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N* | % | 95% CI | N* | % | 95% CI | N* | % | 95% CI | N* | % | 95% CI | N* | % | 95% CI | N* | % | 95% CI | |
| Temporality of event not recognized (antecedents) | 0 | - | - | 157 | 35.8 | 31.3-40.5 | 0 | - | - | 0 | - | - | 0 | - | - | 157 | 35.4 | 31.0-40.1 |
| False disambiguation of concept | 2 | 66.7 | 9.4-99.2 | 253 | 57.8 | 53.0-62.4 | 1 | 100 | - | 1 | 100 | - | 0 | - | - | 257 | 58.0 | 53.3-62.7 |
| Negation not detected | 0 | 0 | - | 19 | 4.3 | 2.6-6.7 | 0 | - | - | 0 | - | - | 0 | - | - | 19 | 4.3 | 2.6-6.6 |
| False disambiguation of abbreviation or acronym | 1 | 33.3 | 0.8-90.6 | 6 | 1.4 | 0.5-3.0 | 0 | - | - | 0 | - | - | 0 | - | - | 7 | 1.6 | 0.6-3.2 |
| Other UrgIndex error | 0 | - | - | 3 | 0.7 | 0.1-2.0 | 0 | - | - | 0 | - | - | 0 | - | - | 3 | 0.7 | 0.1-2.0 |
| Total number of false positive concepts | 3 | 100 | - | 438 | 100 | - | 1 | 100 | - | 1 | 100 | - | 0 | - | 443 | 100 | - | |
UrgIndex- Processing of emergency services medical narrative records (DMU) of the University of Lyon's Hôpital de la Croix-Rousse, Lyon, France
*N = Number of evaluated concepts