| Literature DB >> 31711540 |
Mercedes Arguello-Casteleiro1, Robert Stevens2, Julio Des-Diz3, Chris Wroe4, Maria Jesus Fernandez-Prieto5, Nava Maroto6, Diego Maseda-Fernandez7,8, George Demetriou2, Simon Peters9, Peter-John M Noble10, Phil H Jones10, Jo Dukes-McEwan11, Alan D Radford10, John Keane2,12, Goran Nenadic2,12,13.
Abstract
BACKGROUND: Deep Learning opens up opportunities for routinely scanning large bodies of biomedical literature and clinical narratives to represent the meaning of biomedical and clinical terms. However, the validation and integration of this knowledge on a scale requires cross checking with ground truths (i.e. evidence-based resources) that are unavailable in an actionable or computable form. In this paper we explore how to turn information about diagnoses, prognoses, therapies and other clinical concepts into computable knowledge using free-text data about human and animal health. We used a Semantic Deep Learning approach that combines the Semantic Web technologies and Deep Learning to acquire and validate knowledge about 11 well-known medical conditions mined from two sets of unstructured free-text data: 300 K PubMed Systematic Review articles (the PMSB dataset) and 2.5 M veterinary clinical notes (the VetCN dataset). For each target condition we obtained 20 related clinical concepts using two deep learning methods applied separately on the two datasets, resulting in 880 term pairs (target term, candidate term). Each concept, represented by an n-gram, is mapped to UMLS using MetaMap; we also developed a bespoke method for mapping short forms (e.g. abbreviations and acronyms). Existing ontologies were used to formally represent associations. We also create ontological modules and illustrate how the extracted knowledge can be queried. The evaluation was performed using the content within BMJ Best Practice.Entities:
Keywords: CBOW; Deep learning; Module extraction; One health; Ontology; PubMed; SNOMED CT; Semantic deep learning; Skip-gram; Veterinary clinical narratives
Mesh:
Year: 2019 PMID: 31711540 PMCID: PMC6849172 DOI: 10.1186/s13326-019-0212-6
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1Flowchart of the short form detector introduced – a diagrammatic representation outlining how the short form detector assigns the labels {SF-U, SF-NU, SF}. If no label is assigned, this means that the n-gram has no clinically meaningful short form(s)
Fig. 2Overview of the extended version of the lemon core ontology (called here the lemonEXT) used for this study
Fig. 3Overview of the modified version of the OBAN core ontology (called here the OBANmod) used for this study
The axiom patterns in Manchester OWL Syntax for populating the main classes of the two core ontologies: the extended lemon core ontology (lemonEXT); and the modified OBAN core ontology (OBAMmod). Each axiom pattern in the second column contains variables, which can be easily identified as they start with the character “?”. The third column exemplifies OWL individuals that are the result of populating the axiom patterns introduced in the second column
| Brief description of axiom pattern | Axiom pattern in | Example of populating the Axiom pattern in |
|---|---|---|
| Individual of the OWL class lemon:LexicalTopic | Individual:? x Annotations: label? lbx@en Types: ‘Lexical topic’ | Individual: heart_failure Annotations: label “heart_failure”@en Types: ‘Lexical topic’ |
| Individual of the OWL class lemon:LexicalEntry | Individual:? y Annotations: label? lby@en Types: ‘Lexical entry’ Facts: denotes? CUI | Individual: beta-blockers Annotations: label “beta-blockers”@en Types: ‘Lexical entry’ Facts: denotes ‘Adrenergic beta-Antagonists’ |
| Individual of the OWL class lemon:Lexicon | Individual:? z Annotations: label? lbz@en Types: Lexicon Facts: ‘correlated with’? dataset, ‘correlated with’? model, entry? y1, … … … entry? y20 | Individual: PMSB_CBOW_heart_failure Annotations: label “PMSB_CBOW_heart_failure”@en Types: Lexicon Facts: 'correlated with’ ‘PMSB dataset’, 'correlated with’ CBOW, entry beta-blockers, … … … entry ‘cardiac_resynchronization_therapy_(CRT)’ |
| Individual of the OWL class oban:association | Individual:? Cpair Annotations: rdfs:label? lbCpair@en Types: association Facts: oban:association_has_object? CUI1, oban:association_has_subject? CUI2 | Individual: ‘(C0017601,C0020581)’ Annotations: label “(C0017601,C0020581)”@en Types: association Facts: 'association has object’ Hyphema, 'association has subject’ Glaucoma |
| Individual of the OWL class oban: provenance | Individual:? ECpair Annotations: rdfs:label? lbECpair@en Types: provenance Facts: 'has source’? BMJdoc, 'has excerpt’? BMJterm, 'has evidence’? lbEvidence, 'has excerpt’? EEexcerpt, 'is about’? Cpair, 'date creation association’? d1^^string, 'source date issued’? d2^^string | Individual: ‘(C0017601,C0020581,Relatedness by inexact match (background knowledge))’ Annotations: label “(C0017601,C0020581,Relatedness by inexact match (background knowledge))”@en Types: provenance Facts: 'has source’ ‘BMJ Best Practice: Open-angle glaucoma’, 'has excerpt’ trabeculotomy, 'has evidence’ ‘Relatedness by inexact match (background knowledge)’, 'has excerpt’ EEexcerpt_70, 'is about’ ‘(C0017601,C0020581)’, 'date creation association’ “May-2018″^^string, 'source date issued’ “09-Dec-2016″^^string |
The target terms for PMSB and VetCN datasets
| Target terms for this study and their concept identifiers in UMLS and SNOMED CT | BMJ Best Practice document | |||
|---|---|---|---|---|
| UMLS CUI | SNOMED CT identifier | VetCN dataset | PMSB dataset | |
| C0018801 | 84,114,007 | heart_failure (1292) | heart_failure (4615) | Chronic congestive heart failure |
| C0004096 | 195,967,001 | asthma (1194) | asthma (8891) | Asthma in adults |
| C0014544 | 84,757,009 | epilepsy (1164) | epilepsy (3521) | Generalised seizure |
| C0017601 | 23,986,001 | glaucoma (1657) | glaucoma (1635) | Open-angle glaucoma |
| C1561643 | 709,044,004 | ckd (2698) | CKD (1550) | Chronic kidney disease |
| C0029408 | 396,275,006 | osteoarthritis (1765) | osteoarthritis (1991) | Osteoarthritis |
| C0002871 | 271,737,000 | anaemia (1414) | anaemia (1154) | Assessment of anaemia |
| C0003864 | 3,723,001 | arthritis (8276) | arthritis (1023) | Rheumatoid arthritis |
| C0011849 | 73,211,009 | diabetes (3660) | diabetes (12846) | Type 2 diabetes in adults |
| C0020538 | 38,341,003 | hypertension (1132) | hypertension (8365) | Essential hypertension |
| C0028754 | 414,916,001 | obesity (1763) | obesity (10030) | Obesity in adults |
The last column contains the names and references of BMJ Best Practice documents used for validation in Step 5 (see details within the section Materials and methods). The first column contains the UMLS CUI mapped to a target term (n-gram) with the aid of MetaMap. The second column shows the SNOMED CT identifier mapped to the UMLS CUI with the aid of the UMLS API. The third column displays the target terms from the VetCN dataset, i.e. the n-grams with their frequency counts in the corpus appear within brackets. The fourth column shows the target terms from PMSB dataset with the same format of the third column. All target terms (i.e. n-grams) are identical for both datasets except one. The well-known medical condition “chronic kidney disease” with UMLS CUI = “C1561643” has the n-gram “CKD” (i.e. a short form with all the characters in upper case) in the PMSB dataset; while in VetCN dataset it has the n-gram “ckd”. The difference in these two target terms “CKD” and “ckd” happens as in Step 1, VetCN corpus is transformed to lower case while PMSB corpus is not
The 20 n-grams that are the only common candidate terms among the 880 term pairs from both VetCN and PMSB datasets
The character ‘|’ that appears in the first column separates the different neural language models. The grey background indicates that the target term and the candidate term has the same focus concept, i.e. same UMLS CUI
Performance of the short form detector with VetCN and PMSB datasets
| Data set | Unique candidate terms | SF-U + | SF-I | SF-NF | n-grams with no clinically meaningful short forms | Detect n-grams with one or more clinically meaningful short forms | Detect n-grams with no clinically meaningful short forms | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| P | R | F | P | R | F | ||||||
| VetCN | 300 | 57 | 1 | 14 | 228 | 98.28 | 80.28 | 88.37 | 94.21 | 99.56 | 96.82 |
| PMSB | 333 | 75 | 2 | 0 | 256 | 97.40 | 100 | 98.68 | 100 | 99.22 | 99.61 |
| 97.78 | 90.41 | 93.95 | 97.07 | 99.36 | 98.20 | ||||||
To assess the capability of the short form detector to identify candidate terms (n-grams) with one or more clinically meaningful short forms: the value of the column “SF-U + SF-NU + SF” is interpreted as TP; the value of the column “SF-I” is interpreted as FP; and the value of the column “SF-NF” is interpreted as FN. To assess the capability of the short form detector to identify candidate terms (n-grams) with no clinically meaningful short forms: the value of the column “n-grams with no clinically meaningful short forms” is interpreted as TP; the value of the column “SF-I” is interpreted as FN; and the value of the column “SF-NF” is interpreted as FP. The last row shows the micro-averaging values taking into account the total of 613 unique candidate terms (n-grams) for the 880 term pairs. Abbreviations: P = precision; R = recall; and F = F measure
MetaMap performance for the candidate terms from VetCN dataset
| Target term | Candidate terms (20 top-ranked n-grams) from CBOW neural embeddings for a target term | Candidate terms (20 top-ranked n-grams) from Skip-gram neural embeddings for a target term | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MetaMap Experiment 1 | MetaMap Experiment 2 | MetaMap Experiment 1 | MetaMap Experiment 2 | |||||||||
| P | R | F | P | R | F | P | R | F | P | R | F | |
| anaemia | 84.21 | 94.12 | 88.89 | 95.00 | 100.00 | 97.44 | 94.74 | 94.74 | 94.74 | 95.00 | 100.00 | 97.44 |
| arthritis | 93.33 | 73.68 | 82.35 | 100.00 | 75.00 | 85.71 | 94.44 | 89.47 | 91.89 | 100.00 | 90.00 | 94.74 |
| asthma | 100.00 | 90.00 | 94.74 | 100.00 | 95.00 | 97.44 | 89.47 | 94.44 | 91.89 | 100.00 | 100.00 | 100.00 |
| ckd | 68.75 | 73.33 | 70.97 | 100.00 | 95.00 | 97.44 | 62.50 | 71.43 | 66.67 | 100.00 | 100.00 | 100.00 |
| diabetes | 76.47 | 81.25 | 78.79 | 100.00 | 90.00 | 94.74 | 88.89 | 88.89 | 88.89 | 94.74 | 94.74 | 94.74 |
| epilepsy | 100.00 | 90.00 | 94.74 | 100.00 | 95.00 | 97.44 | 100.00 | 90.00 | 94.74 | 100.00 | 95.00 | 97.44 |
| glaucoma | 87.50 | 77.78 | 82.35 | 94.74 | 94.74 | 94.74 | 93.33 | 73.68 | 82.35 | 94.74 | 94.74 | 94.74 |
| heart_failure | 73.68 | 93.33 | 82.35 | 95.00 | 100.00 | 97.44 | 84.21 | 94.12 | 88.89 | 100.00 | 100.00 | 100.00 |
| hypertension | 71.43 | 62.50 | 66.67 | 100.00 | 95.00 | 97.44 | 72.22 | 86.67 | 78.79 | 100.00 | 100.00 | 100.00 |
| obesity | 75.00 | 100.00 | 85.71 | 85.00 | 100.00 | 91.89 | 84.21 | 94.12 | 88.89 | 89.47 | 94.44 | 91.89 |
| osteoarthritis | 94.74 | 94.74 | 94.74 | 100.00 | 95.00 | 97.44 | 85.00 | 100.00 | 91.89 | 85.00 | 100.00 | 91.89 |
| 84.10 | 84.61 | 83.85 | 97.25 | 94.07 | 95.38 | 86.27 | 88.87 | 87.24 | 96.27 | 97.17 | 96.63 | |
The table shows the performance of MetaMap in Experiment 1 (applying MetaMap to the candidate terms) and Experiment 2 (short form detection and expansion into long form before applying MetaMap to the candidate terms) for each target term (n-gram for a well-known medical condition). The candidate terms are a list of the 20 top-ranked terms (highest cosine value) obtained from the created neural embeddings with CBOW or Skip-gram taking the vector for a target term. The last row shows the average of each evaluation measure over all 11 medical conditions under study to get an overall measure of performance (a.k.a. macro-averaging). Abbreviations: P = precision; R = recall; and F = F measure
MetaMap performance for the candidate terms from PMSB dataset
| Target term (n-gram) | Candidate terms (20 top-ranked n-grams) from CBOW neural embeddings for a target term | Candidate terms (20 top-ranked n-grams) from Skip-gram neural embeddings for a target term | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| MetaMap Experiment 1 | MetaMap Experiment 2 | MetaMap Experiment 1 | MetaMap Experiment 2 | |||||||||
| P | R | F | P | R | F | P | R | F | P | R | F | |
| anaemia | 90.00 | 100.00 | 94.74 | 85.00 | 100.00 | 91.89 | 84.21 | 94.12 | 88.89 | 94.74 | 94.74 | 94.74 |
| arthritis | 88.89 | 88.89 | 88.89 | 89.47 | 94.44 | 91.89 | 100.00 | 100.00 | 100.00 | 95.00 | 100.00 | 97.44 |
| asthma | 76.47 | 81.25 | 78.79 | 72.22 | 86.67 | 78.79 | 63.16 | 92.31 | 75.00 | 68.42 | 92.86 | 78.79 |
| CKD | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 90.00 | 100.00 | 94.74 | 90.00 | 100.00 | 94.74 |
| diabetes | 63.16 | 92.31 | 75.00 | 68.42 | 92.86 | 78.79 | 75.00 | 100.00 | 85.71 | 80.00 | 100.00 | 88.89 |
| epilepsy | 85.00 | 100.00 | 91.89 | 95.00 | 100.00 | 97.44 | 90.00 | 100.00 | 94.74 | 95.00 | 100.00 | 97.44 |
| glaucoma | 90.00 | 100.00 | 94.74 | 100.00 | 100.00 | 100.00 | 84.21 | 94.12 | 88.89 | 100.00 | 100.00 | 100.00 |
| heart_failure | 85.00 | 100.00 | 91.89 | 90.00 | 100.00 | 94.74 | 73.68 | 93.33 | 82.35 | 90.00 | 100.00 | 94.74 |
| hypertension | 95.00 | 100.00 | 97.44 | 100.00 | 100.00 | 100.00 | 84.21 | 94.12 | 88.89 | 95.00 | 100.00 | 97.44 |
| obesity | 100.00 | 95.00 | 97.44 | 100.00 | 100.00 | 100.00 | 94.74 | 94.74 | 94.74 | 95.00 | 100.00 | 97.44 |
| osteoarthritis | 90.00 | 100.00 | 94.74 | 100.00 | 100.00 | 100.00 | 90.00 | 100.00 | 94.74 | 100.00 | 100.00 | 100.00 |
| 87.59 | 96.13 | 91.41 | 90.92 | 97.63 | 93.96 | 84.47 | 96.61 | 89.88 | 91.20 | 98.87 | 94.70 | |
The table shows the performance of MetaMap in Experiment 1 (applying MetaMap to the candidate terms) and Experiment 2 (short form detection and expansion into long form before applying MetaMap to the candidate terms) for each target term (n-gram for a well-known medical condition). The candidate terms are a list of the 20 top-ranked terms (highest cosine value) obtained from the created neural embeddings with CBOW or Skip-gram taking the vector for a target term. The last row shows the average of each evaluation measure over all 11 medical conditions under study to get an overall measure of performance (a.k.a. macro-averaging). Abbreviations: P = precision; R = recall; and F = F measure
The 25 unique UMLS Metathesaurus concept pairs that are common to both VetCN and PMSB datasets and have validation labels different to “itself”
The validation label from the third column denotes the relatedness of BMJ Best Practice term (the column previous to the last) to the UMLS Metathesaurus concept representing the focus concept for the candidate term (the last column). The rows with grey background correspond to UMLS Metathesaurus concept pairs that are retrieved by the SPARQL queries q1VU, or q2VU, or q3VU (see the cells with grey background in Table 11 for details)
Results of the “One Health” queries performed that intend to acquire validated knowledge about the diagnosis and management of well-known medical conditions – The SPARQL SELECT queries appear within the Additional file 4 and the description of the queries appear within the subsection “Extracting locality-based modules with SNOMED CT and enabling One Health queries” of the section Materials and methods
Each query qiVU, with i = {1,2,3}, is the union of the results obtained for the query qiV over VetCN dataset and the query qiV over PMSB dataset (see Table 9 for details). The cells with grey background indicate that there are common UMLS Metathesaurus concept pairs in both VetCN and PMSB datasets, and therefore, the total number of results for the query qiVU is lower that the summation of the results obtained for the query qiV in each dataset (see the rows with grey background in Table 7 for details of the common UMLS Metathesaurus concept pairs retrieved). As more than one SNOMED CT concept can map one UMLS Metathesaurus concept, the number of results for the query qiVM is equal to or lower than the number of results for the query qiVS, with i = {1,2,3}. Each SPARQL SELECT query qiVR, with i = {1,2,3}, retrieves the asserted and inferred descendants (with FaCT++) of those SNOMED CT concepts mapped to candidate concepts of the SNOMED CT pairs retrieved from the SPARQL SELECT query qiVS
The 11 UMLS Metathesaurus concept pairs identified by the domain experts as unrelated, as they do not have an up-to-date clinically meaningful association for human medicine.
The rows with grey background correspond to UMLS Metathesaurus concept pairs that are retrieved for some of the SPARQL queries q1 to q3 (see the cells with grey background in Table 9 for details)
Results of the SPARQL SELECT queries performed over the lemonEXT ontology and the OBANmod ontology – The table shows the number of UMLS Metathesaurus concept pairs (target, candidate) retrieved for the SPARQL SELECT queries q1 to q3 as well as q1V to q3V
The SPARQL SELECT queries appear within the Additional file 4 and the description of the queries appear within the Step 6 of the section Materials and methods. Each UMLS Metathesaurus concept pair represents the focus concepts of the term pair (target term, candidate term). The difference in the number of results between the query qi and the query qiV, with i = {1,2,3}, indicates that there are UMLS Metathesaurus concept pairs that have not passed the evaluation with BMJ Best Practice, and thus, the query qi (see cells with grey background) has a higher number of results than the query qiV
Fig. 4Exemplifying the population of both the lemonEXT and OBANmod core ontologies – the figure illustrates how the 3-tuple (C0017601|Glaucoma, C0020581|Hyphema, “Relatedness by Inexact match (background knowledge)”) is represented as a modified version of the OBAN association. This 3-tuple has two bibo:excerpts: 1) the term “trabeculotomy” from BMJ Best Practice for open-angle glaucoma; and 2) few lines of text from the PubMed article with identifier PMID = 29,035,912. The top of the figure shows the lexical entry from the lemonEXT ontology corresponding to the focus concept C0020581|Hyphema
Locality-based modules extracted from the SNOMED CT ontology for the 11 well-known medical conditions
| Target term | Number of SNOMED CT concept identifiers for the signature | Total number of axioms | Number of OWL Classes | Number of OWL object properties | Number of SubClassOf axioms | Number of EquivalentClass axioms |
|---|---|---|---|---|---|---|
| anaemia | 34 | 105,205 | 22,227 | 46 | 7092 | 15,134 |
| arthritis | 33 | 74,398 | 16,180 | 31 | 7470 | 8709 |
| asthma | 37 | 97,647 | 20,804 | 39 | 7611 | 13,192 |
| ckd | 19 | 51,890 | 10,929 | 40 | 3899 | 7029 |
| diabetes | 29 | 463,437 | 100,119 | 44 | 49,818 | 50,300 |
| epilepsy | 19 | 10,072 | 2085 | 23 | 1055 | 1029 |
| glaucoma | 39 | 101,997 | 21,278 | 44 | 11,047 | 10,230 |
| heart_failure | 37 | 68,006 | 14,634 | 44 | 5490 | 9143 |
| hypertension | 31 | 152,283 | 32,270 | 48 | 16,425 | 15,844 |
| obesity | 28 | 90,224 | 18,994 | 40 | 12,376 | 6617 |
| osteoarthritis | 38 | 73,496 | 15,903 | 37 | 8922 | 6980 |
The second column just reports the total number of SNOMED CT identifiers for the ontological signature. The worksheet “signatures” within the Additional file 3 contains the list of SNOMED CT identifiers (as signature) for each target term. From the third to the last column ontology metrics information for the locality-based module created per target term is provided. The last two columns indicate the number of descriptions and definitions extracted from the SNOMED CT ontology for each locality-based module, respectively