Literature DB >> 32414887

Natural Language Processing Improves Detection of Nonsevere Hypoglycemia in Medical Records Versus Coding Alone in Patients With Type 2 Diabetes but Does Not Improve Prediction of Severe Hypoglycemia Events: An Analysis Using the Electronic Medical Record in a Large Health System.

Anita D Misra-Hebert1,2, Alex Milinovich2, Alex Zajichek2, Xinge Ji2, Todd D Hobbs3, Wayne Weng3, Paul Petraro3, Sheldon X Kong3, Michelle Mocarski3, Rahul Ganguly3, Janine M Bauman2, Kevin M Pantalone4, Robert S Zimmerman4, Michael W Kattan2.   

Abstract

OBJECTIVE: To determine if natural language processing (NLP) improves detection of nonsevere hypoglycemia (NSH) in patients with type 2 diabetes and no NSH documentation by diagnosis codes and to measure if NLP detection improves the prediction of future severe hypoglycemia (SH). RESEARCH DESIGN AND METHODS: From 2005 to 2017, we identified NSH events by diagnosis codes and NLP. We then built an SH prediction model.
RESULTS: There were 204,517 patients with type 2 diabetes and no diagnosis codes for NSH. Evidence of NSH was found in 7,035 (3.4%) of patients using NLP. We reviewed 1,200 of the NLP-detected NSH notes and confirmed 93% to have NSH. The SH prediction model (C-statistic 0.806) showed increased risk with NSH (hazard ratio 4.44; P < 0.001). However, the model with NLP did not improve SH prediction compared with diagnosis code-only NSH.
CONCLUSIONS: Detection of NSH improved with NLP in patients with type 2 diabetes without improving SH prediction.
© 2020 by the American Diabetes Association.

Entities:  

Mesh:

Year:  2020        PMID: 32414887      PMCID: PMC7372042          DOI: 10.2337/dc19-1791

Source DB:  PubMed          Journal:  Diabetes Care        ISSN: 0149-5992            Impact factor:   19.112


Introduction

The risk of hypoglycemia (glucose <70 mg/dL) with diabetes treatment is well recognized, and its prevention is considered a “critical component of diabetes management” (1). Episodes of severe hypoglycemia (SH), defined in this study as requiring hospitalization or emergency department visit in patients with type 2 diabetes, can be identified through structured electronic medical record (EMR) or administrative data (2,3). However, nonsevere hypoglycemia (NSH), not requiring assistance for recovery (1), is often reported by patients during outpatient visits and captured only if an EMR hypoglycemia diagnosis code is entered. Natural language processing (NLP) combines computer science and linguistics methods (4) and can be used to extract nonstructured EMR text data. The improvement of EMR hypoglycemia detection using NLP had been previously reported (5). We aimed to describe capture of NSH in patients with type 2 diabetes using both diagnosis codes and NLP and to create a predictive model for SH events.

Research Design and Methods

We used a modified version of the algorithm used by Kho et al. (6) to identify patients in the Cleveland Clinic Health System with type 2 diabetes during 2005 to 2017. Hypoglycemia events were identified using ICD-9/ICD-10 codes (Supplementary Appendix 1); events not associated with hospitalization/emergency department visit were categorized as NSH. A randomly selected subset of patient records stratified by years was reviewed to confirm code identification of NSH. The study protocol was approved by the Insitutional Review Board at Cleveland Clinic (Cleveland, OH). Using clinical progress notes, an NLP algorithm was developed. The cTAKES program (7) was used to break down sentences into phrases to identify hypoglycemia-related Unified Medical Language System concepts. Regular expressions were written to classify polarity (event or no event) of phrases by pattern matching. Using word embeddings, the remaining phrases were classified with additional pattern matching (see Supplementary Appendix 2 for detailed descriptions). For each year, 100 events were randomly selected, with 2005 to 2006 combined given the lower number of notes. The phrases were reviewed by practicing clinicians and confirmed through record review of a subset of patients as representing true events. The weighted proportion of actual events in a set of 1,200 notes was used to define the positive predictive value. We then limited our data set to include patients with at least one primary care or endocrinology department visit each year and created a Cox proportional hazards prediction model for SH (5 years’ duration) using the following variables based upon our previous work (2): sex, race, median income (8), history of comorbidities (cardiovascular disease, congestive heart failure [CHF], depression, other psychiatric disorders, dementia, cognitive impairment, chronic kidney disease [CKD], and alcohol or substance abuse), and time-dependent varables including age, insurance type (Medicare, Medicaid, commercial, or other), glycosylated hemoglobin (HbA1c), BMI, and diabetes medications (insulin, sulfonylurea, glucagon-like peptide 1 receptor agonists [GLP-1RAs], sodium–glucose cotransporter 2 inhibitors, dipeptidyl peptidase 4 inhibitors, metformin, and α-glucosidase inhibitors). We had previously identified NSH by ICD codes as a variable associated with SH but for this study added two variables: ever-history of NSH and history of NSH within the past 3 months detected by codes and NLP. We also included interaction variables for HbA1c, with insulin, and with sulfonylureas.

Results

The number of patients with type 2 diabetes in this data set increased from 12,706 in 2005 to 176,001 in 2017 (210,191 unique patients). A total of 1,177,590 progress notes were processed. Upon clinician chart review, 1,111 of 1,200 randomly selected events classified as NSH by NLP were confirmed (93% positive predictive value). From 2005 to 2017, 10,205 NSH events were captured by codes and 14,763 events by NLP, with overlap of only 5 events. Among 204,517 patients with no codes for NSH, evidence of NSH was found in 7,035 (3.4%) using NLP. The chart review confirmed NSH episodes were often documented under general diabetes diagnosis categories. The incidence proportion of patients with SH by ICD codes increased from 0.3 to 1.7% and those with NSH increased from 0.4 to 1.3% from 2005 to 2017. When NLP was added, the incidence proportion of patients with NSH increased from 0.8% (2005) to 2.6% (2017). There were 47,280 patients included to create the prediction model for SH (Supplementary Appendix 3). The models using NSH codes alone and codes plus NLP (Table 1) had C-statistics of 0.812 and 0.806, respectively. The codes plus NLP model showed increased risk for SH with NSH (ever-history of NSH: hazard ratio [HR] 4.44, P < 0.001; and NSH in past 3 months: HR 1.65, P < 0.001), black race (HR 1.81; P < 0.001), Medicaid insurance (HR 1.35; P = 0.008), history of cardiovascular disease (HR 1.58; P < 0.001), CHF (HR 2.35; P < 0.001), depression (HR 1.28; P = 0.03), psychiatric disorders other than depression (HR 1.55; P < 0.001), alcohol or substance abuse (HR 1.55; P = 0.04), and CKD (HR 1.86; P < 0.001). The effect of insulin use on SH was greater at lower HbA1c (HR of 3.32 when HbA1c is 6% [42 mmol/mol]; HR of 2.04 when HbA1c is 9% [75 mmol/mol]). There was decreased risk of SH with higher BMI (HR 0.69 when BMI was 30 kg/m2; HR 0.63 when BMI was 35 kg/m2) and decreased risk with metformin (HR 0.53; P < 0.001) and GLP-1RA (HR 0.36; P < 0.001). Sulfonylurea use had a mixed effect on the risk of SH (HR of 1.61 when HbA1c was 6% [42 mmol/mol]; HR of 0.69 when HbA1c was 9% [75 mmol/mol]). The effect of HbA1c varied at the extremes. With a reference HbA1c of 6% (42 mmol/mol), the HR for SH was 1.59 at an HbA1c of 5% (31 mmol/mol), HR was 0.73 at an HbA1c of 7% (53 mmol/mol), and HR was 1.39 when HbA1c was 9% (75 mmol/mol). Of note, NSH within the past 3 months was only a significant predictor in the the model including NLP. Receiver operating characteristic curves comparing code-detected NSH, NLP-detected NSH, and code plus NLP–detected NSH models were similar (Supplementary Appendix 4), while Kaplan-Meier curves for probability of being free from SH showed lower probability for code-detected NLP (Supplementary Appendix 5).
Table 1

Prediction models for risk of SH according to method used for capturing NSH events: diagnosis codes only or diagnosis codes plus NLP

VariableNSH using diagnosis codes (C index = 0.812)NSH using diagnosis codes plus NLP (C index = 0.806)
Adjusted HR*95% CIP valueAdjusted HR95% CIP value
History of NSH in past 3 months1.1160.805, 1.5460.511.6471.239, 2.189<0.001
Ever-history of NSH8.8727.464, 10.546<0.0014.4413.745, 5.268<0.001
Age1.0040.996, 1.0120.301.0070.999, 1.0150.08
Sex, male0.9060.78, 1.0540.200.8660.745, 1.0070.06
Race<0.001<0.001
 WhiteReferenceReference
 Black1.8271.527, 2.1861.8131.515, 2.169
 Other1.0310.741, 1.4351.0600.762, 1.475
Median income (per 1,000 U.S. dollars), based on patient’s zip code0.9990.995, 1.0030.570.9990.996, 1.0030.76
Insurance0.0090.008
 MedicareReferenceReference
 Medicaid1.3591.013, 1.8221.3511.007, 1.813
 Commercial0.8460.676, 1.0580.8460.676, 1.058
 Other0.7870.582, 1.0650.7720.571, 1.045
HbA1c, %§<0.001<0.001
 51.4831.200, 1.8321.5931.289, 1.969
 6ReferenceReference
 70.7700.650, 0.9120.7250.611, 0.859
 81.0070.832, 1.2190.9320.770, 1.129
 91.5021.207, 1.8701.3851.113, 1.724
BMI, kg/m2<0.001<0.001
 25ReferenceReference
 300.7120.638, 0.7950.6900.618, 0.770
 350.6440.569, 0.7290.6300.557, 0.713
Cardiovascular disease1.6251.359, 1.944<0.0011.5791.321, 1.889<0.001
CHF2.2611.821, 2.807<0.0012.3491.893, 2.916<0.001
Depression1.2641.005, 1.5890.0451.2821.02, 1.6110.03
Other psychiatric disorders1.4861.253, 1.763<0.0011.5491.307, 1.835<0.001
Dementia1.4100.905, 2.1950.131.3670.88, 2.1240.16
Cognitive impairment1.0000.633, 1.581.01.0510.667, 1.6550.83
CKD1.8431.529, 2.222<0.0011.8581.54, 2.241<0.001
Alcohol or substance abuse1.4580.96, 2.2150.081.5511.021, 2.3540.04
Insulin§<0.001<0.001
 HbA1c 5%3.1081.976, 4.8882.6371.675, 4.153
 HbA1c 6%3.7973.063, 4.7063.3232.671, 4.134
 HbA1c 7%4.2383.388, 5.3003.7943.024, 4.761
 HbA1c 8%3.2972.638, 4.1212.9212.330, 3.661
 HbA1c 9%2.3441.846, 2.9772.0371.600, 2.594
Sulfonylurea§<0.001<0.001
 HbA1c 5%2.9341.765, 4.8782.4841.498, 4.120
 HbA1c 6%1.8061.434, 2.2731.6101.280, 2.026
 HbA1c 7%1.1600.907, 1.4831.0850.849, 1.386
 HbA1c 8%0.8850.698, 1.1230.8510.671, 1.079
 HbA1c 9%0.7050.533, 0.9320.6940.525, 0.917
GLP-1RA0.3640.228, 0.581<0.0010.3610.227, 0.576<0.001
DPP-40.9240.723, 1.1810.530.8690.68, 1.1120.26
SGLT2i0.5390.255, 1.1420.110.5760.272, 1.220.15
Metformin0.5510.465, 0.653<0.0010.5320.449, 0.63<0.001
AGI1.2790.527, 3.1050.591.3730.566, 3.3350.48

AGI, α-glucosidase inhibitor; DPP-4, dipeptidyl peptidase 4 inhibitor; SGLT2i, sodium–glucose cotransporter 2 inhibitor.

Adjusted for NSH based on diagnosis code.

Wald test. For HbA1c, BMI, insulin, and sulfonylurea, it tests the linearity of the variable, all interactions, and all nonlinear terms.

Adjusted for NSH based on diagnosis code and NLP.

The restricted cubic spline term was used. The knots are placed at 6%, 7%, and 8% of HbA1c.

The restricted cubic spline term was used. The knots are placed at 25, 30, and 35 kg/m2 of BMI.

Prediction models for risk of SH according to method used for capturing NSH events: diagnosis codes only or diagnosis codes plus NLP AGI, α-glucosidase inhibitor; DPP-4, dipeptidyl peptidase 4 inhibitor; SGLT2i, sodium–glucose cotransporter 2 inhibitor. Adjusted for NSH based on diagnosis code. Wald test. For HbA1c, BMI, insulin, and sulfonylurea, it tests the linearity of the variable, all interactions, and all nonlinear terms. Adjusted for NSH based on diagnosis code and NLP. The restricted cubic spline term was used. The knots are placed at 6%, 7%, and 8% of HbA1c. The restricted cubic spline term was used. The knots are placed at 25, 30, and 35 kg/m2 of BMI.

Conclusions

Building upon previous work identifying SH in our health system, we found that applying NLP to progress notes improved NSH detection and that NSH is a significant predictor for SH. However, SH prediction using NSH diagnosis codes did not improve with adding NLP, possibly related to the strength of the other variables in contributing to the model and possible greater severity of coded NSH as the Kaplan-Meier curves suggest. There was little overlap between NSH events by codes plus NLP, suggesting that while NSH may be reported by patients, providers may not enter a hypoglycemia diagnosis code, highlighting the benefit of NLP. While it is known that NSH may have significant adverse consequences (9), treatment deintensification is uncommon (10,11). Identifying patients with increased risk for hypoglycemia (12,13) and alerting providers may affect provider behavior to adjust treatment for at-risk patients (13). Our prediction model identifies demographic characteristics and comorbidities predicting increased SH risk. Modifiable predictors include history of NSH and sulfonylurea and insulin use predicting increased risk of SH at lower HbA1c levels and metformin and GLP-1RAs predicting lower risk. The HbA1c effect is of higher predicted risk overall at very low HbA1c levels at which treatment deintensification may be relevant, but also at very high HbA1c, at which fluctuating blood sugars may increase SH risk and limit ability to attain a lower HbA1c. While our data set is robust, we recognize these limitations: 1) the EMR from one health system does not capture NSH or SH events outside of the system, and 2) lack of diabetes duration information in EMR for prediction. Future work will focus on notifying providers at the point of care of captured NSH events to consider deintensification, medication adjustments to lower SH risk, or meal and activity timing, especially in patients with multiple comorbidities, to reduce risk of future SH.
  11 in total

1.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications.

Authors:  Guergana K Savova; James J Masanz; Philip V Ogren; Jiaping Zheng; Sunghwan Sohn; Karin C Kipper-Schuler; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2010 Sep-Oct       Impact factor: 4.497

2.  Severe Hypoglycemia Requiring Medical Intervention in a Large Cohort of Adults With Diabetes Receiving Care in U.S. Integrated Health Care Delivery Systems: 2005-2011.

Authors:  Ram D Pathak; Emily B Schroeder; Elizabeth R Seaquist; Chan Zeng; Jennifer Elston Lafata; Abraham Thomas; Jay Desai; Beth Waitzfelder; Gregory A Nichols; Jean M Lawrence; Andrew J Karter; John F Steiner; Jodi Segal; Patrick J O'Connor
Journal:  Diabetes Care       Date:  2015-12-17       Impact factor: 19.112

3.  The impact of non-severe hypoglycemic events on work productivity and diabetes management.

Authors:  Meryl Brod; Torsten Christensen; Trine L Thomsen; Donald M Bushnell
Journal:  Value Health       Date:  2011 Jul-Aug       Impact factor: 5.725

4.  Assessing occurrence of hypoglycemia and its severity from electronic health records of patients with type 2 diabetes mellitus.

Authors:  Anthony P Nunes; Jing Yang; Larry Radican; Samuel S Engel; Karen Kurtyka; Kaan Tunceli; Shengsheng Yu; Kristy Iglay; Michael C Doherty; David D Dore
Journal:  Diabetes Res Clin Pract       Date:  2016-09-21       Impact factor: 5.602

5.  Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study.

Authors:  Abel N Kho; M Geoffrey Hayes; Laura Rasmussen-Torvik; Jennifer A Pacheco; William K Thompson; Loren L Armstrong; Joshua C Denny; Peggy L Peissig; Aaron W Miller; Wei-Qi Wei; Suzette J Bielinski; Christopher G Chute; Cynthia L Leibson; Gail P Jarvik; David R Crosslin; Christopher S Carlson; Katherine M Newton; Wendy A Wolf; Rex L Chisholm; William L Lowe
Journal:  J Am Med Inform Assoc       Date:  2011-11-19       Impact factor: 4.497

6.  Treatment Deintensification Is Uncommon in Adults With Type 2 Diabetes Mellitus: A Retrospective Cohort Study.

Authors:  Finlay A McAlister; Erik Youngson; Dean T Eurich
Journal:  Circ Cardiovasc Qual Outcomes       Date:  2017-04

7.  Patient Characteristics Associated With Severe Hypoglycemia in a Type 2 Diabetes Cohort in a Large, Integrated Health Care System From 2006 to 2015.

Authors:  Anita D Misra-Hebert; Kevin M Pantalone; Xinge Ji; Alex Milinovich; Tanujit Dey; Kevin M Chagin; Janine M Bauman; Michael W Kattan; Robert S Zimmerman
Journal:  Diabetes Care       Date:  2018-03-16       Impact factor: 19.112

8.  Overtreatment and Deintensification of Diabetic Therapy among Medicare Beneficiaries.

Authors:  Matthew L Maciejewski; Xiaojuan Mi; Jeremy Sussman; Melissa Greiner; Lesley H Curtis; Judy Ng; Samuel C Haffer; Eve A Kerr
Journal:  J Gen Intern Med       Date:  2017-09-13       Impact factor: 5.128

Review 9.  6. Glycemic Targets: Standards of Medical Care in Diabetes-2020.

Authors: 
Journal:  Diabetes Care       Date:  2020-01       Impact factor: 19.112

10.  A Quality Improvement Program to Reduce Potential Overtreatment of Diabetes Among Veterans at High Risk of Hypoglycemia.

Authors:  Varsha G Vimalananda; Kristine DeSotto; TeChieh Chen; Jenny Mullakary; James Schlosser; Cliona Archambeault; Jordan Peck; Hannah Cassidy; Paul R Conlin; Stewart Evans; Mark McConnell; Eric Shirley
Journal:  Diabetes Spectr       Date:  2017-08
View more
  3 in total

Review 1.  Identifying Patients With Hypoglycemia Using Natural Language Processing: Systematic Literature Review.

Authors:  Yaguang Zheng; Victoria Vaughan Dickson; Saul Blecker; Jason M Ng; Brynne Campbell Rice; Gail D'Eramo Melkus; Liat Shenkar; Marie Claire R Mortejo; Stephen B Johnson
Journal:  JMIR Diabetes       Date:  2022-05-16

2.  Using Natural Language Processing to Measure and Improve Quality of Diabetes Care: A Systematic Review.

Authors:  Alexander Turchin; Luisa F Florez Builes
Journal:  J Diabetes Sci Technol       Date:  2021-03-19

3.  Deep transfer learning and data augmentation improve glucose levels prediction in type 2 diabetes patients.

Authors:  Yixiang Deng; Lu Lu; Laura Aponte; Angeliki M Angelidi; Vera Novak; George Em Karniadakis; Christos S Mantzoros
Journal:  NPJ Digit Med       Date:  2021-07-14
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.