| Literature DB >> 35179507 |
Carmen Montoto1, Javier P Gisbert2,3,4,5, Iván Guerra6, Rocío Plaza7, Ramón Pajares Villarroya8, Luis Moreno Almazán9, María Del Carmen López Martín10, Mercedes Domínguez Antonaya11, Isabel Vera Mendoza12, Jesús Aparicio1, Vicente Martínez1, Ignacio Tagarro1, Alonso Fernandez-Nistal1, Lea Canales13, Sebastian Menke14, Fernando Gomollón15,16,17,18.
Abstract
BACKGROUND: The exploration of clinically relevant information in the free text of electronic health records (EHRs) holds the potential to positively impact clinical practice as well as knowledge regarding Crohn disease (CD), an inflammatory bowel disease that may affect any segment of the gastrointestinal tract. The EHRead technology, a clinical natural language processing (cNLP) system, was designed to detect and extract clinical information from narratives in the clinical notes contained in EHRs.Entities:
Keywords: Crohn disease; artificial intelligence; electronic health records; inflammatory bowel disease; linguistic validation; natural language processing
Year: 2022 PMID: 35179507 PMCID: PMC8900906 DOI: 10.2196/30345
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Extracting and organizing unstructured clinical data into a structured database. The EHRead technology is a clinical NLP system that detects and extracts clinically relevant information contained in deidentified EHRs. The extracted information from participating sites is organized in a structured study database. From this database, patients that fulfill the study criteria based on the study inclusion and exclusion criteria make up the target population. In this case, clinical data from the population with a diagnosis of Crohn disease were used. EHR: electronic health record; NLP: natural language processing.
Figure 2Linguistic evaluation process. To validate the output of the EHRead technology, a statistical comparison was performed between its output and a gold standard consisting of a subset of EHRs annotated by expert physicians. The validation metrics calculated are expressed in terms of precision, recall, and F1 score. See text for further details. EHR: electronic health record.
Interannotator agreement (F1 score) per participating site.
|
| F1 score | ||
|
| Crohn disease | Crohn disease flare | Vedolizumab |
| Site 1 | 0.93 | 0.86 | 1.00 |
| Site 2 | 1.00 | 0.87 | 1.00 |
| Site 3 | 1.00 | 1.00 | 1.00 |
| Site 4 | 0.93 | 1.00 | 1.00 |
| Site 5 | 0.93 | 0.83 | 1.00 |
| Site 6 | 0.93 | 1.00 | 1.00 |
| Site 7 | 1.00 | 1.00 | 1.00 |
| Site 8 | 1.00 | 0.85 | 1.00 |
| Average | 0.97 | 0.93 | 1.00 |
Performance of the EHRead technology.
| Variable | Precision (95% CI) | Recall (95% CI) | F1 score (95% CI) |
| Crohn disease | 0.88 (0.85-0.91) | 0.98 (0.95-0.99) | 0.93 (0.90-0.95) |
| Crohn disease flare | 0.91 (0.85-0.95) | 0.71 (0.63-0.77) | 0.80 (0.72-0.85) |
| Vedolizumab | 0.86 (0.76-0.93) | 0.94 (0.86-0.98) | 0.90 (0.81-0.96) |