| Literature DB >> 30535584 |
Qiu-Yue Zhong1, Leena P Mittal2, Margo D Nathan2, Kara M Brown2, Deborah Knudson González3, Tianrun Cai4, Sean Finan5, Bizu Gelaye6, Paul Avillach6,5,7, Jordan W Smoller6,8, Elizabeth W Karlson4, Tianxi Cai7,9, Michelle A Williams6.
Abstract
We developed algorithms to identify pregnant women with suicidal behavior using information extracted from clinical notes by natural language processing (NLP) in electronic medical records. Using both codified data and NLP applied to unstructured clinical notes, we first screened pregnant women in Partners HealthCare for suicidal behavior. Psychiatrists manually reviewed clinical charts to identify relevant features for suicidal behavior and to obtain gold-standard labels. Using the adaptive elastic net, we developed algorithms to classify suicidal behavior. We then validated algorithms in an independent validation dataset. From 275,843 women with codes related to pregnancy or delivery, 9331 women screened positive for suicidal behavior by either codified data (N = 196) or NLP (N = 9,145). Using expert-curated features, our algorithm achieved an area under the curve of 0.83. By setting a positive predictive value comparable to that of diagnostic codes related to suicidal behavior (0.71), we obtained a sensitivity of 0.34, specificity of 0.96, and negative predictive value of 0.83. The algorithm identified 1423 pregnant women with suicidal behavior among 9331 women screened positive. Mining unstructured clinical notes using NLP resulted in a 11-fold increase in the number of pregnant women identified with suicidal behavior, as compared to solely reliance on diagnostic codes.Entities:
Keywords: Classification algorithm; Electronic medical; Natural language processing; Pregnant women; Records; Suicidal behavior
Mesh:
Year: 2018 PMID: 30535584 PMCID: PMC6370493 DOI: 10.1007/s10654-018-0470-0
Source DB: PubMed Journal: Eur J Epidemiol ISSN: 0393-2990 Impact factor: 8.082