| Literature DB >> 34604710 |
Syed-Amad Hussain1, Emre Sezgin1, Katelyn Krivchenia2,3, John Luna1, Steve Rust1, Yungui Huang1.
Abstract
OBJECTIVES: Patient-generated health data (PGHD) are important for tracking and monitoring out of clinic health events and supporting shared clinical decisions. Unstructured text as PGHD (eg, medical diary notes and transcriptions) may encapsulate rich information through narratives which can be critical to better understand a patient's condition. We propose a natural language processing (NLP) supported data synthesis pipeline for unstructured PGHD, focusing on children with special healthcare needs (CSHCN), and demonstrate it with a case study on cystic fibrosis (CF).Entities:
Keywords: artificial intelligence; chronic disease; cystic fibrosis; natural language processing; patient notes
Year: 2021 PMID: 34604710 PMCID: PMC8480545 DOI: 10.1093/jamiaopen/ooab084
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Process flow of note processing and information extraction.
Figure 2.An example for drug dosage extraction using dependency trees. The NER model identifies the drug within the sentence, that is, Tylenol (green). Once identified, we move up the dependency tree until we either find an NUM or NOUN. If we find an NOUN (blue), we see if there is an NUM as a child to the NOUN. It is assumed that this NUM child (yellow) is the quantity and the NOUN (blue) is the unit for the drug’s dosage. If no NOUN or NUM occurs in the same clause of the sentence, or subsection of the dependency tree, then no dosage information is extracted. NER: named entity recognition.
Information extraction pipeline focusing on extracting 4 categories of entities alongside quantitative and qualitative descriptors for each
| Category | Type of information extracted | Entity recognition method |
|---|---|---|
| Symptom | Qualitative | NER and manually defined |
| Drug | Quantitative | NER and manually defined |
| Other qualitative entities (OQLEs) | Qualitative | Manually defined |
| Other quantitative entities (OQNEs) | Quantitative and qualitative | Manually defined |
Note: These entities can be manually defined or detected automatically using NER linked to ontologies.
NER: named entity recognition.
Examples of extracted entities, definitions, types, and sentences in each category
| Extracted entities | Entity definitiona | Related information | Type | Example sentences |
|---|---|---|---|---|
| Pediasure | Manual | – | Drug/supplement | “Took his Pediasure before bed time, and had 3 vitamins.” |
| Vitamins | Manual | Quantity: “3” | ||
| Vest | Manual | Quantity: “2” | OQNE | “Jon had the vest two times today.” |
| Albuterol | Automatic | Dosage: “5 mg” | Drug | “Took his albuterol, pulmozyme, and tobi.” |
| Pulmozyme | Automatic | – | ||
| “I gave him 5 mg of albuterol today.” | ||||
| Tobramycin | Manual | – | ||
| Diaper | Manual | Diaper details: “wet” | OQLE | “Changed the wet diaper twice with loose stool before noon.” |
| Stool | Manual | Stool details: “loose” | ||
| Coughing | Automatic | Coughing details: “brief” | Symptom | “Jon had a brief coughing fit just now.” |
aEntity definitions are “Automatic” if they are present within the ontologies used (SNOMED and RXNORM). “Manual” refers to entities that were added manually via interactive customization to target this specific cohort and simulated patient.
OQLE: other qualitative entity; OQNE: other quantitative entity; SNOMED: Systematized Nomenclature of Medicine.