Olga Lyudovyk, Chunhua Weng.
Abstract
SNOMED Clinical Terms (SNOMED CT) defines over 70,000 diseases, including many rare ones. Meanwhile, descriptions of rare conditions are missing from online educational resources. SNOMEDtxt converts ontological concept definitions and relations contained in SNOMED CT into narrative disease descriptions using Natural Language Generation techniques. Generated text is evaluated using both computational methods and clinician and lay user feedback. User evaluations indicate that lay people prefer generated text to the original SNOMED content, find it more informative, and understand it significantly better. This method promises to improve access to clinical knowledge for patients and the medical community and to assist in ontology auditing through natural language descriptions.
Keywords: Access to Information; Natural Language Processing; Systematized Nomenclature of Medicine
Year: 2019 PMID: 31438128 PMCID: PMC6852688 DOI: 10.3233/SHTI190429
Source DB: PubMed Journal: Stud Health Technol Inform ISSN: 0926-9630
Figure 1 – Counts of Diseases in cdc.gov, medlineplus.gov, uptodate.com, webmd.com, mayoclinic.org, rarediseases.info.nih.gov, medscape.com, SNOMED [2] (Nov. 10, 2018)
Figure 2 – Framework for Disease Description Generation
Organizing Relationships
| Group | Relationship | Lexical Pattern |
|---|---|---|
| Definition | IS-A | “is a kind of” |
| | Finding site | “that affects the” |
| | Has definitional manifestation | “It manifests itself in” |
| | Associated morphology | “The associated morphology is” |
| | Pathological process | “Pathological process associated with … is” |
| | Children: IS-A, searched term=destination | “An example of … is” / “Examples of … are” |
| Causality | Causative agent | “is caused by” |
| | Due to | “occurs due to” |
| | Associated with | “is associated with” |
| Temporality | Occurrence | “presents in” (period) |
| | During/Following/After | “can occur during / following / after” |
| | Temporally related | “can be temporally related to” |
| Diagnosis | Finding method | “is discovered by” |
| | Finding informer | “<is discovered> through” |
| Clinical Course | Clinical course | “Clinical course is” |
| | Severity | “The severity of … is” |
| | Episodicity | “The episodicity of … is” |
| Other | Interprets | “interprets or evaluates” |
| | Has interpretation | “… as” |
| | Other | “Other related concepts include…” |
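
The table above pairs each relationship with a lexical pattern, which suggests a template-based generation step. The following is a minimal sketch of that idea, not the SNOMEDtxt implementation: the `PATTERNS` dictionary, the `describe` function, and the example concept and relation values are all hypothetical, and only four of the listed patterns are included.

```python
# Hypothetical subset of the relationship-to-pattern table above.
PATTERNS = {
    "IS-A": "is a kind of",
    "Finding site": "that affects the",
    "Causative agent": "is caused by",
    "Occurrence": "presents in",
}


def describe(concept, relations):
    """Build a short description from (relationship, value) pairs.

    Assumes at most one value per relationship and that an IS-A
    parent is always present; a real ontology needs more care.
    """
    rel = dict(relations)
    # Definition sentence: IS-A parent, optionally followed by the finding site.
    first = f"{concept} {PATTERNS['IS-A']} {rel['IS-A']}"
    if "Finding site" in rel:
        first += f" {PATTERNS['Finding site']} {rel['Finding site']}"
    sentences = [first + "."]
    # Causality and temporality each become a separate follow-up sentence.
    for name in ("Causative agent", "Occurrence"):
        if name in rel:
            sentences.append(f"It {PATTERNS[name]} {rel[name]}.")
    return " ".join(sentences)


# Illustrative input only; these values are not taken from SNOMED CT.
example = describe(
    "Viral pneumonia",
    [
        ("IS-A", "infectious pneumonia"),
        ("Finding site", "lung structure"),
        ("Causative agent", "virus"),
    ],
)
print(example)
```

Grouping relationships (definition first, then causality, then temporality) fixes the sentence order, which is one plausible reading of why the table organizes patterns by group.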
User Evaluation: Comparison with OntoVerbal
| | SNOMEDtxt | OntoVerbal | SNOMED CT | No Difference |
|---|---|---|---|---|
| Easier to read | 49% | 43% | 3.9% | 3.9% |
| Preferred | 52% | 31% | 11.8% | 3.9% |
| Easier to read | 50% | 50% | 0% | 0% |
| Preferred | 50% | 17% | 17% | 17% |
Evaluation with Computed Metrics
| | FK (readability) | ARI (readability) | Words | Unique/All (redundancy) |
|---|---|---|---|---|
| Top 20 most searched diseases | | | | |
| SNOMEDtxt | 14.3 | 12.0 | 49.3 | 0.74 |
| SNOMED CT | 17.9 | 15.0 | 64.1 | 0.55 |
| Reference | 6.6 | 6.1 | 263 | 0.77 |
| Random 20 SNOMED CT disease concepts | | | | |
| SNOMEDtxt | 11.7 | 9.7 | 47.3 | 0.69 |
| SNOMED CT | 15.7 | 13.8 | 69.7 | 0.56 |
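
For readers unfamiliar with the metrics in this table, the sketch below computes them for a plain-text string, taking FK to be the Flesch-Kincaid grade level and ARI the Automated Readability Index, both in their standard published forms. It is an illustration under stated assumptions, not the authors' evaluation code: the tokenization, the vowel-group syllable heuristic, and the names `count_syllables` and `text_metrics` are all hypothetical choices, so scores will differ somewhat from those of dedicated readability tools.

```python
import re


def count_syllables(word):
    # Crude heuristic (assumption): one syllable per group of consecutive vowels.
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))


def text_metrics(text):
    """Return FK grade, ARI, word count, and the unique/all word ratio."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z0-9'-]+", text)
    n_sent, n_words = len(sentences), len(words)
    n_chars = sum(len(w) for w in words)
    n_syll = sum(count_syllables(w) for w in words)
    return {
        # Flesch-Kincaid grade level.
        "fk": 0.39 * n_words / n_sent + 11.8 * n_syll / n_words - 15.59,
        # Automated Readability Index (character-based, no syllables needed).
        "ari": 4.71 * n_chars / n_words + 0.5 * n_words / n_sent - 21.43,
        "words": n_words,
        # Lower values mean more repeated words, i.e. more redundancy.
        "unique_all": len({w.lower() for w in words}) / n_words,
    }


metrics = text_metrics("The cat sat. The cat ran.")
print(metrics)
```

Both readability scores approximate a US school grade level, so lower values indicate easier text, while a higher Unique/All ratio indicates less repetition.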
User Evaluation: SNOMEDtxt vs. SNOMED
| | SNOMEDtxt | SNOMED CT | No Difference |
|---|---|---|---|
| Easier to read | 76.5% | 14.4% | 9.2% |
| Preferred | 68.6% | 21.6% | 9.8% |
| Easier to read | 83% | 11% | 6% |
| Preferred | 44% | 28% | 28% |