| Literature DB >> 31094361 |
Aryan Arbabi1,2, David R Adams3, Sanja Fidler1, Michael Brudno1,2.
Abstract
BACKGROUND: Automatic recognition of medical concepts in unstructured text is an important component of many clinical and research applications, and its accuracy has a large impact on electronic health record analysis. The mining of medical concepts is complicated by the broad use of synonyms and nonstandard terms in medical documents.Entities:
Keywords: biomedical ontologies; concept recognition; human phenotype ontology; machine learning; medical text mining; phenotyping
Year: 2019 PMID: 31094361 PMCID: PMC6533869 DOI: 10.2196/12596
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Architecture of the neural dictionary model. The encoder is shown at the top, and the procedure for computing the embedding for a concept is illustrated at the bottom. Encoder: a query phrase is first represented by its word vectors, which are then projected by a convolution layer into a new space. Then, a max-over-time pooling layer is used to aggregate the set of vectors into a single one. Thereafter, a fully connected layer maps this vector into the final representation of the phrase. Concept embedding: a matrix of raw embeddings is learned, where each row represents one concept. The final embedding of a concept is retrieved by summing the raw embeddings for that concept and all of its ancestors in the ontology. FC: fully connected.
Synonym classification experiments on 607 phenotypic phrases extracted from 228 PubMed abstracts. Largest values for each category are italicized.
| Method | Accuracy (%) | |
| R@1a | R@5b | |
| PhenoTips | 28.9 | 49.3 |
| NCRc | 51.6 | |
| NCR-Hd | 45.5 | 69.8 |
| NCR-Ne | 78.2 | |
| NCR-HNf | 50.2 | 71.8 |
aR@1: recall using top 1 result from each method.
bR@5: recall using top 5 results from each method.
cNCR: Neural Concept Recognizer.
dNCR-H: variation of the NCR model that ignores taxonomic relations.
eNCR-N: variation of the NCR model that has not been trained on negative samples.
fNCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples.
Micro and macro measurements for concept recognition experiments on 188 PubMed abstracts. Neural Concept Recognizer models were trained on Human Phenotype Ontology. Largest values for each category are italicized.
| Method | Micro (%) | Macro (%) | ||||
| Precision | Recall | F1-score | Precision | Recall | F1-score | |
| BioLarK | 78.5 | 60.5 | 68.3 | 76.6 | 66.0 | 70.9 |
| cTAKESa | 72.2 | 55.6 | 62.8 | 74.0 | 61.4 | 67.1 |
| OBOb | 78.3 | 53.7 | 63.7 | 79.5 | 58.6 | 67.5 |
| NCBOc | 44.0 | 57.2 | 79.5 | 48.7 | 60.4 | |
| NCRd | 80.3 | 62.4 | 68.2 | |||
| NCR-He | 74.4 | 61.5 | 67.3 | 72.2 | 67.1 | 69.6 |
| NCR-Nf | 78.1 | 69.4 | 76.6 | 72.2 | ||
| NCR-HNg | 77.1 | 57.2 | 65.7 | 76.5 | 63.4 | 69.3 |
acTAKES: Clinical Text Analysis and Knowledge Extraction System.
bOBO: Open Biological and Biomedical Ontologies
cNCBO: National Center for Biomedical Ontology.
dNCR: Neural Concept Recognizer.
eNCR-H: variation of the NCR model that ignores taxonomic relations.
fNCR-N: variation of the NCR model that has not been trained on negative samples.
gNCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples.
Micro and macro measurements for concept recognition experiments on 39 Undiagnosed Diseases Program clinical notes. Neural Concept Recognizer models were trained on Human Phenotype Ontology. Largest values for each category are italicized.
| Method | Micro (%) | Macro (%) | ||||
| Precision | Recall | F1-score | Precision | Recall | F1-score | |
| BioLarK | 27.6 | 21.0 | 23.9 | 28.7 | 21.6 | 24.6 |
| cTAKESa | 31.5 | 18.9 | 23.6 | 20.2 | 26.2 | |
| OBOb | 26.8 | 20.5 | 23.2 | 28.8 | 20.1 | 23.7 |
| NCBOc | 16.9 | 22.5 | 37.1 | 19.9 | 25.9 | |
| NCRd | 24.5 | 27.2 | 25.8 | 26.5 | 27.6 | 27.0 |
| NCR-He | 25.1 | 26.8 | 25.9 | 26.2 | 27.0 | 26.6 |
| NCR-Nf | 24.3 | 26.2 | 27.0 | |||
| NCR-HNg | 25.5 | 27.2 | 27.4 | 27.7 | 27.6 | |
acTAKES: Clinical Text Analysis and Knowledge Extraction System.
bOBO: Open Biological and Biomedical Ontologies
cNCBO: National Center for Biomedical Ontology.
dNCR: Neural Concept Recognizer.
eNCR-H: variation of the NCR model that ignores taxonomic relations.
fNCR-N: variation of the NCR model that has not been trained on negative samples.
gNCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples.
Extended measurements for concept recognition experiments on 188 PubMed abstracts. Neural Concept Recognizer models were trained on Human Phenotype Ontology. Largest values for each category are italicized.
| Method | Extended value (%) | Jaccard value (%) | ||
| Precision | Recall | F1-score | ||
| BioLarK | 91.5 | 80.8 | 85.8 | 76.9 |
| cTAKESa | 95.6 | 73.9 | 83.3 | 72.1 |
| OBOb | 92.4 | 77.9 | 84.5 | 74.4 |
| NCBOc | 65.4 | 77.7 | 64.3 | |
| NCRd | 93.3 | 82.1 | ||
| NCR-He | 86.5 | 85.1 | 76.7 | |
| NCR-Nf | 90.6 | 83.1 | 86.7 | 78.2 |
| NCR-HNg | 89.7 | 78.9 | 83.9 | 73.2 |
acTAKES: Clinical Text Analysis and Knowledge Extraction System.
bOBO: Open Biological and Biomedical Ontologies
cNCBO: National Center for Biomedical Ontology.
dNCR: Neural Concept Recognizer.
eNCR-H: variation of the NCR model that ignores taxonomic relations.
fNCR-N: variation of the NCR model that has not been trained on negative samples.
gNCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples.
Extended measurements for concept recognition experiments on 39 Undiagnosed Diseases Program clinical notes. Neural Concept Recognizer models were trained on Human Phenotype Ontology. Largest values for each category are italicized.
| Method | Extended value (%) | Jaccard index (%) | ||
| Precision | Recall | F1-score | ||
| BioLarK | 58.9 | 42.6 | 49.5 | 29.5 |
| cTAKESa | 68.5 | 36.7 | 47.8 | 27.3 |
| OBOb | 59.2 | 46.4 | 52.0 | 31.3 |
| NCBOc | 37.2 | 48.5 | 27.2 | |
| NCRd | 57.1 | 49.4 | ||
| NCR-He | 54.0 | 49.4 | 51.6 | 30.5 |
| NCR-Nf | 54.7 | 52.5 | 31.4 | |
| NCR-HNg | 56.5 | 49.0 | 52.5 | 31.3 |
acTAKES: Clinical Text Analysis and Knowledge Extraction System.
bOBO: Open Biological and Biomedical Ontologies
cNCBO: National Center for Biomedical Ontology.
dNCR: Neural Concept Recognizer.
eNCR-H: variation of the NCR model that ignores taxonomic relations.
fNCR-N: variation of the NCR model that has not been trained on negative samples.
gNCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples.
Results for concept recognition experiments on 1800 Multiparameter Intelligent Monitoring in Intensive Care documents. The Neural Concept Recognizer models were trained on a subset of the Systematized Nomenclature of Medicine - Clinical Terms ontology. Largest values for each category are italicized.
| Method | Micro (%) | Macro (%) | ||||
| Precision | Recall | F1-score | Precision | Recall | F1-score | |
| cTAKESa | 9.1 | 14.6 | 8.7 | 14.1 | ||
| NCRb | 10.9 | 26.7 | 10.6 | 26.9 | 15.2 | |
| NCR-Hc | 10.0 | 30.6 | 15.1 | 9.6 | 30.4 | 14.6 |
| NCR-Nd | 24.8 | 15.4 | 25.3 | |||
| NCR-HNe | 9.6 | 28.6 | 14.4 | 9.2 | 28.9 | 13.9 |
acTAKES: Clinical Text Analysis and Knowledge Extraction System.
bNCR: Neural Concept Recognizer.
cNCR-H: variation of the NCR model that ignores taxonomic relations.
dNCR-N: variation of the NCR model that has not been trained on negative samples.
eNCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples.
Figure 2Visualization of the representations learned for Human Phenotype Ontology concepts. The representations are embedded into two dimensions using t-SNE. The colors denote the high-level ancestors of the concepts. The plot on the left shows the representations learned in NCR-N, where the taxonomy information was used in training, and the plot on the right shows representations learned for NCR-HN, where the taxonomy was ignored. NCR-HN: variation of the NCR model that ignores the taxonomy and has not been trained on negative examples; NCR-N: variation of the NCR model that has not been trained on negative samples; t-SNE: t-distributed stochastic neighbor embedding.