| Literature DB >> 35672982 |
Vinh Nguyen1, Olivier Bodenreider1.
Abstract
BACKGROUND: Terminology integration at the scale of the UMLS Metathesaurus (i.e., over 200 source vocabularies) remains challenging despite recent advances in ontology alignment techniques based on neural networks.Entities:
Keywords: Computer; Neural Networks; Unified Medical Language System
Mesh:
Year: 2022 PMID: 35672982 PMCID: PMC9484765 DOI: 10.3233/SHTI220043
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630
Training and generalization test datasets (number of pairs of terms)
|
|
|
|
|
|
| |||
| ALL (training) | 147,750,794 | 22,324,834 | 170,075,628 |
|
| |||
|
|
|
|
|
|
| |||
| TOPN_SIM | 54,752,228 | 5,581,209 | 60,333,437 |
| RAN_SIM | 54,445,899 | 5,581,209 | 60,027,108 |
| RAN_NOSIM | 58,256,526 | 5,581,209 | 63,837,735 |
| ALL (testing) | 167,454,653 | 5,581,209 | 173,035,862 |
Figure 1:The proposed neural network architectures with three variants: V0 as the original architecture with LSTM alone, V1 with an attention layer on top of the LSTM layer, and V2 with the LSTM layer replaced by an attention layer.
Performance of the three models for testing using the ALL generalization test dataset
| Variant | Accuracy | Precision | Recall | F1 |
|---|---|---|---|---|
|
| ||||
| V0 | 0.9938 | 0.8875 | 0.9254 | 0.9061 |
| V1 |
|
|
|
|
| V2 | 0.9928 | 0.8876 | 0.8908 | 0.8892 |