| Literature DB >> 29777125 |
Uri Kartoun1,2,3, Rahul Aggarwal1,2, Andrew L Beam2,4, Jennifer K Pai5, Arnaub K Chatterjee5,6, Timothy P Fitzgerald7, Isaac S Kohane2,4, Stanley Y Shaw8,9,10.
Abstract
We developed an insomnia classification algorithm by interrogating an electronic medical records (EMR) database of 314,292 patients. The patients received care at Massachusetts General Hospital (MGH), Brigham and Women's Hospital (BWH), or both, between 1992 and 2010. Our algorithm combined structured variables (such as International Classification of Diseases 9th Revision [ICD-9] codes, prescriptions, laboratory observations) and unstructured variables (such as text mentions of sleep and psychiatric disorders in clinical narrative notes). The highest classification performance of our algorithm was achieved when it included a combination of structured variables (billing codes for insomnia, common psychiatric conditions, and joint disorders) and unstructured variables (sleep disorders and psychiatric disorders). Our algorithm had superior performance in identifying insomnia patients compared to billing codes alone (area under the receiver operating characteristic curve [AUROC] = 0.83 vs. 0.55 with 95% confidence intervals [CI] of 0.76-0.90 and 0.51-0.58, respectively). When applied to the 314,292-patient population, our algorithm classified 36,810 of the patients with insomnia, of which less than 17% had a billing code for insomnia. In conclusion, an insomnia classification algorithm that incorporates clinical notes is superior to one based solely on billing codes. Compared to traditional methods, our study demonstrates that a classification algorithm that incorporates physician notes can more accurately, comprehensively, and quickly identify large cohorts of insomnia patients.Entities:
Mesh:
Year: 2018 PMID: 29777125 PMCID: PMC5959894 DOI: 10.1038/s41598-018-25312-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Variables selected for physician-documented insomnia algorithm. OR = odds ratio; CI = confidence interval.
Figure 2AUROCs of the algorithms for physician-documented insomnia using varying combinations of structured and unstructured (narrative) data. AUROC = area under the receiver operating characteristic curve.
Characteristics of physician-documented insomnia cohort. The top 20 conditions of prevalence are shown.
| Variable and category | Overall (n = 36,810) |
|---|---|
|
| 62.0 (16.3) |
|
| |
| Male | 40.6 |
| Female | 59.4 |
|
| |
| Caucasian | 72.9 |
| African American | 10.4 |
| Asian | 2.0 |
| Hispanic | 10.8 |
| Other | 1.1 |
| Unknown | 2.8 |
|
| |
| Married or partner | 44.4 |
| Other | 53.5 |
| Unknown | 2.1 |
| Medicaid | 6.9 |
| Medicare | 56.3 |
| Other | 99.4 |
| Body mass index (kg/m2); Mean (Standard Deviation) | 30 (8.2) |
|
| |
| Current | 16.9 |
| Past | 23.7 |
| Never | 50.3 |
| Unknown | 9.1 |
|
| |
| Joint disorder | 79.3 |
| Hypertension | 75.4 |
| Disorders of lipid metabolism | 66.5 |
| Diabetes (either type I or II) | 55.9 |
| Gastrointestinal disorder | 53.0 |
| Anxiety or depression | 46.9 |
| Psychiatric disorder | 38.4 |
| Pneumonia | 37.1 |
| Obesity | 34.2 |
| Congestive heart failure | 32.7 |
| Coronary artery disease | 27.9 |
| Asthma | 23.9 |
| Chronic obstructive pulmonary disease | 23.5 |
| Cerebrovascular disease | 22.9 |
| Atrial fibrillation/Atrial flutter | 21.8 |
| Cancer | 21.8 |
| Peripheral vascular disease | 19.7 |
| Osteoporosis | 18.1 |
| Chronic kidney disease/end stage renal disease | 16.3 |
| Renal failure | 12.3 |
Figure 3Insomnia algorithm and cohort development. A total of 600 patients were manually labeled: 230 patients had insomnia, 270 patients did not have insomnia, and 100 patients did not have a clear insomnia status. The 500 patients with known insomnia status were used to develop the algorithm; 7 of these 500 patients were excluded because their age was below 18 (date of death or date of the end of the study). Additionally, two-thirds of these (328) served as a training set, while the rest (165) served as a validation set.