Chengyi Zheng1, Nazia Rashid2, River Koblick2, JaeJin An3. 1. Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, California. Electronic address: Chengyi.X.Zheng@kp.org. 2. Drug Information Services, Kaiser Permanente Southern California, Downey, California. 3. Department of Pharmacy Practice and Administration, Western University of Health Sciences, Pomona, California.
Abstract
PURPOSE: The purpose of this study was to investigate whether aspirin use can be captured from the clinical notes in a nonvalvular atrial fibrillation population. METHODS: A total of 29,507 patients with newly diagnosed nonvalvular atrial fibrillation were identified from January 1, 2006, through December 31, 2011, and were followed up through December 31, 2012. More than 3 million clinical notes were retrieved from electronic medical records. A training data set of 2949 notes was created to develop a computer-based method to automatically extract aspirin use status and dosage information using natural language processing (NLP). A gold standard data set of 5339 notes was created using a blinded manual review. NLP results were validated against the gold standard data set. The aspirin data from the structured medication databases were also compared with the results from NLP. Positive and negative predictive values, along with sensitivity and specificity, were calculated. FINDINGS: NLP achieved 95.5% sensitivity and 98.9% specificity when compared with the gold standard data set. The positive predictive value was 93.0%, and the negative predictive value was 99.3%. NLP identified aspirin use for 83.8% of the study population, and 70% of the low dose aspirin use was identified only by the NLP method. IMPLICATIONS: We developed and validated an NLP method specifically designed to identify low dose aspirin use status from the clinical notes with high accuracy. This method can be a valuable tool to supplement existing structured medication data.
PURPOSE: The purpose of this study was to investigate whether aspirin use can be captured from the clinical notes in a nonvalvular atrial fibrillation population. METHODS: A total of 29,507 patients with newly diagnosed nonvalvular atrial fibrillation were identified from January 1, 2006, through December 31, 2011, and were followed up through December 31, 2012. More than 3 million clinical notes were retrieved from electronic medical records. A training data set of 2949 notes was created to develop a computer-based method to automatically extract aspirin use status and dosage information using natural language processing (NLP). A gold standard data set of 5339 notes was created using a blinded manual review. NLP results were validated against the gold standard data set. The aspirin data from the structured medication databases were also compared with the results from NLP. Positive and negative predictive values, along with sensitivity and specificity, were calculated. FINDINGS: NLP achieved 95.5% sensitivity and 98.9% specificity when compared with the gold standard data set. The positive predictive value was 93.0%, and the negative predictive value was 99.3%. NLP identified aspirin use for 83.8% of the study population, and 70% of the low dose aspirin use was identified only by the NLP method. IMPLICATIONS: We developed and validated an NLP method specifically designed to identify low dose aspirin use status from the clinical notes with high accuracy. This method can be a valuable tool to supplement existing structured medication data.
Authors: Margaret Mahan; Daniel Rafter; Hannah Casey; Marta Engelking; Tessneem Abdallah; Charles Truwit; Mark Oswood; Uzma Samadani Journal: PLoS One Date: 2020-07-01 Impact factor: 3.240
Authors: Jinhe Shi; Xiangyu Gao; William C Kinsman; Chenyu Ha; Guodong Gordon Gao; Yi Chen Journal: Artif Intell Med Date: 2021-12-02 Impact factor: 5.326
Authors: Chengyi Zheng; Benjamin C Sun; Yi-Lin Wu; Ming-Sum Lee; Ernest Shen; Rita F Redberg; Maros Ferencik; Shaw Natsui; Aniket A Kawatkar; Visanee V Musigdilok; Adam L Sharp Journal: J Am Heart Assoc Date: 2020-02-21 Impact factor: 5.501
Authors: Ranier Bustamante; Ashley Earles; James D Murphy; Alex K Bryant; Olga V Patterson; Andrew J Gawron; Tonya Kaltenbach; Mary A Whooley; Deborah A Fisher; Sameer D Saini; Samir Gupta; Lin Liu Journal: Med Care Date: 2019-10 Impact factor: 2.983
Authors: Seyedmostafa Sheikhalishahi; Riccardo Miotto; Joel T Dudley; Alberto Lavelli; Fabio Rinaldi; Venet Osmani Journal: JMIR Med Inform Date: 2019-04-27