| Literature DB >> 32049282 |
Elizabeth A Campbell1,2, Ellen J Bass1,3, Aaron J Masino1,4.
Abstract
OBJECTIVE: This study introduces a temporal condition pattern mining methodology to address the sparse nature of coded condition concept utilization in electronic health record data. As a validation study, we applied this method to reveal condition patterns surrounding an initial diagnosis of pediatric asthma.Entities:
Keywords: asthma; data mining; data science; electronic health records
Mesh:
Year: 2020 PMID: 32049282 PMCID: PMC7075539 DOI: 10.1093/jamia/ocaa005
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.The primary methodological steps involved in the study. SPADE: Sequential PAttern Discovery using Equivalence classes.
Figure 2.(A) A sample sequence database of clinical information (ie, expanded diagnostic cluster groupings for clinical diagnoses) for 3 patients, formatted for the SPADE (Sequential PAttern Discovery using Equivalence classes) algorithm. (B) The output of frequent sequential patterns that SPADE uncovers from the clinical information of the 3 patients in panel A. The expanded diagnostic cluster codes correspond to the following diagnostic categories: ALL04: asthma, without status asthmaticus; EAR08: deafness, hearing loss; EYE07: conjunctivitis, keratitis; GAS03: constipation; MUS02: acute sprains and strains; MUS08: fractures and dislocations/digits only.
Figure 3.Sample temporal diagnostic sequences discoverable by SPADE (Sequential PAttern Discovery using Equivalence classes). (A) A sequence that includes a clinical diagnosis in the preindex visit and the postindex visit. (B) A sequence that includes a clinical diagnosis in the index visit and the postindex visit. (C) A sequence that includes a clinical diagnosis in the preindex visit and the index visit. (D) A sequence that includes a clinical diagnosis in all 3 timing classes. Clinical diagnoses observed in a single timing class may also be considered common sequences.
Top 20 most prevalent conditions patterns surrounding initial diagnosis of pediatric asthma (EDC dataset)
| Preindex visit condition(s) | Index visit condition(s) | Postindex visit condition(s) | Support |
|---|---|---|---|
| Asthma without status asthmaticus | 0.942 | ||
| Asthma without status asthmaticus | 0.436 | ||
| Asthma without status asthmaticus | Asthma without status asthmaticus | 0.417 | |
| Acute upper respiratory tract infection | 0.186 | ||
| Acute upper respiratory tract infection | Asthma without status asthmaticus | 0.175 | |
| Allergic rhinitis | 0.164 | ||
| Allergic rhinitis, asthma without status asthmaticus | 0.158 | ||
| Acute upper respiratory tract infection | 0.145 | ||
| Respiratory signs and symptoms | 0.139 | ||
| Acute upper respiratory tract infection | 0.139 | ||
| Otitis media | 0.137 | ||
| Asthma without status asthmaticus | Acute upper respiratory tract infection | 0.136 | |
| Respiratory signs and symptoms | Asthma without status asthmaticus | 0.134 | |
| Asthma without status asthmaticus, Acute upper respiratory tract infection | 0.132 | ||
| Otitis media | Asthma without status asthmaticus | 0.131 | |
| Otitis media | 0.120 | ||
| Asthma without status asthmaticus | Otitis media | 0.116 | |
| Allergic rhinitis | 0.102 | ||
| Otitis media | 0.101 | ||
| Asthma without status asthmaticus, Otitis media | 0.098 |
EDC: expanded diagnostic cluster.
Conditions observed in prevalent sequences identified by SPADE (EDC dataset), by timing class
| All visit classes | Preindex visits | Preindex and index visits | Preindex and postindex visits | Index visits | Index and postindex visits |
|---|---|---|---|---|---|
| Acute lower respiratory tract infection | Allergic reactions | Chronic pharyngitis and tonsillitis | Abdominal pain | Seizure disorder | Asthma without status asthmaticus |
| Acute upper respiratory tract infection | Dermatophytosis | Musculoskeletal disorders, other | Acute sprains and strains | Asthma, with status asthmaticus | |
| Administrative concerns and nonspecific laboratory abnormalities | Exanthems | Contusions and abrasions | |||
| Allergic rhinitis | Nausea, vomiting | Deafness, hearing loss | |||
| Attention-deficit disorder | Nonfungal infections of skin and subcutaneous tissue | Musculoskeletal signs and symptoms | |||
| Conjunctivitis, keratitis | Urinary symptoms | ||||
| Constipation | |||||
| Cough | |||||
| Dermatitis and eczema | |||||
| Developmental disorder | |||||
| ENT disorders, other | |||||
| Failure to thrive | |||||
| Gastroenteritis | |||||
| Gastroesophageal reflux | |||||
| Nonspecific signs and symptoms | |||||
| Obesity | |||||
| Otitis media | |||||
| Respiratory signs and symptoms | |||||
| Sinusitis | |||||
| Viral syndromes |
EDC: expanded diagnostic cluster; ENT: ear, nose, and throat; SPADE: Sequential PAttern Discovery using Equivalence classes.
Top 20 most prevalent conditions patterns surrounding initial diagnosis of pediatric asthma (ICD dataset)
| Preindex visit condition(s) | Index visit condition(s) | Postindex visit condition(s) | Support |
|---|---|---|---|
| Asthma without status asthmaticus | 0.538 | ||
| Asthma without status asthmaticus | 0.251 | ||
| Asthma without status asthmaticus | Asthma without status asthmaticus | 0.183 | |
| Asthma without status asthmaticus | 0.149 | ||
| Asthma without status asthmaticus | 0.137 | ||
| Allergic rhinitis | 0.125 | ||
| Respiratory signs and symptoms | 0.108 | ||
| Acute upper respiratory tract infection | 0.096 | ||
| Acute upper respiratory tract infection | 0.088 | ||
| Cough | 0.081 | ||
| Asthma without status asthmaticus | 0.079 | ||
| Allergic rhinitis | 0.077 | ||
| Allergic rhinitis, asthma without status asthmaticus | 0.075 | ||
| Acute upper respiratory tract infection | 0.072 | ||
| Respiratory signs and symptoms | Asthma without status asthmaticus | 0.064 | |
| Asthma without status asthmaticus | 0.061 | ||
| Acute upper respiratory tract infection | 0.059 | ||
| Cough | 0.059 | ||
| Otitis media | 0.057 | ||
| Asthma without status asthmaticus | 0.055 |
ICD: International Classification of Diseases.
Conditions observed in prevalent sequences identified by SPADE (ICD dataset), by timing class
| All visit classes | Preindex visits | Preindex and postindex visits | Index visits | Index and postindex visits |
|---|---|---|---|---|
| Acute lower respiratory tract infection | Gastroenteritis | Deafness, hearing loss | Attention-deficit disorder | Asthma without status asthmaticus |
| Acute upper respiratory tract infection | Asthma, with status asthmaticus | |||
| Administrative concerns and nonspecific laboratory abnormalities | Obesity | |||
| Allergic rhinitis | ||||
| Constipation | ||||
| Cough | ||||
| Dermatitis and eczema | ||||
| Failure to thrive | ||||
| Gastroesophageal reflux | ||||
| Otitis media | ||||
| Respiratory signs and symptoms | ||||
| Sinusitis | ||||
| Viral syndromes |
ICD: International Classification of Diseases; SPADE: Sequential PAttern Discovery using Equivalence classes.