Jonathan Bates1, Samah J Fodeh2, Cynthia A Brandt3, Julie A Womack4. 1. Yale School of Medicine, New Haven, CT VA Connecticut Healthcare System, West Haven, CT jonathan.bates@yale.edu. 2. Yale School of Medicine, New Haven, CT. 3. Yale School of Medicine, New Haven, CT VA Connecticut Healthcare System, West Haven, CT. 4. Yale School of Nursing, West Haven, CT VA Connecticut Healthcare System, West Haven, CT.
Abstract
OBJECTIVE: To identify patients in a human immunodeficiency virus (HIV) study cohort who have fallen by applying supervised machine learning methods to radiology reports of the cohort. METHODS: We used the Veterans Aging Cohort Study Virtual Cohort (VACS-VC), an electronic health record-based cohort of 146 530 veterans for whom radiology reports were available (N=2 977 739). We created a reference standard of radiology reports, represented each report by a feature set of words and Unified Medical Language System concepts, and then developed several support vector machine (SVM) classifiers for falls. We compared mutual information (MI) ranking and embedded feature selection approaches. The SVM classifier with MI feature selection was chosen to classify all radiology reports in VACS-VC. RESULTS: Our SVM classifier with MI feature selection achieved an area under the curve score of 97.04 on the test set. When applied to all the radiology reports in VACS-VC, 80 416 of these reports were classified as positive for a fall. Of these, 11 484 were associated with a fall-related external cause of injury code (E-code) and 68 932 were not, corresponding to 29 280 patients with potential fall-related injuries who could not have been found using E-codes. DISCUSSION: Feature selection was crucial to improving the classifier's performance. Feature selection with MI allowed us to select the number of discriminative features to use for classification, in contrast to the embedded feature selection method, in which the number of features is chosen automatically. CONCLUSION: Machine learning is an effective method of identifying patients who have suffered a fall. The development of this classifier supplements the clinical researcher's toolkit and reduces dependence on under-coded structured electronic health record data.
OBJECTIVE: To identify patients in a human immunodeficiency virus (HIV) study cohort who have fallen by applying supervised machine learning methods to radiology reports of the cohort. METHODS: We used the Veterans Aging Cohort Study Virtual Cohort (VACS-VC), an electronic health record-based cohort of 146 530 veterans for whom radiology reports were available (N=2 977 739). We created a reference standard of radiology reports, represented each report by a feature set of words and Unified Medical Language System concepts, and then developed several support vector machine (SVM) classifiers for falls. We compared mutual information (MI) ranking and embedded feature selection approaches. The SVM classifier with MI feature selection was chosen to classify all radiology reports in VACS-VC. RESULTS: Our SVM classifier with MI feature selection achieved an area under the curve score of 97.04 on the test set. When applied to all the radiology reports in VACS-VC, 80 416 of these reports were classified as positive for a fall. Of these, 11 484 were associated with a fall-related external cause of injury code (E-code) and 68 932 were not, corresponding to 29 280 patients with potential fall-related injuries who could not have been found using E-codes. DISCUSSION: Feature selection was crucial to improving the classifier's performance. Feature selection with MI allowed us to select the number of discriminative features to use for classification, in contrast to the embedded feature selection method, in which the number of features is chosen automatically. CONCLUSION: Machine learning is an effective method of identifying patients who have suffered a fall. The development of this classifier supplements the clinical researcher's toolkit and reduces dependence on under-coded structured electronic health record data.
Authors: Berry de Bruijn; Ann Cranney; Siobhan O'Donnell; Joel D Martin; Alan J Forster Journal: J Am Med Inform Assoc Date: 2006-08-23 Impact factor: 4.497
Authors: Michael T Yin; Qiuhu Shi; Donald R Hoover; Kathryn Anastos; Anjali Sharma; Mary Young; Alexandra Levine; Mardge H Cohen; Elizabeth Shane; Elizabeth T Golub; Phyllis C Tien Journal: AIDS Date: 2010-11-13 Impact factor: 4.177
Authors: James A McCart; Donald J Berndt; Jay Jarman; Dezon K Finch; Stephen L Luther Journal: J Am Med Inform Assoc Date: 2012-12-15 Impact factor: 4.497
Authors: Julie A Womack; Joseph L Goulet; Cynthia Gibert; Cynthia Brandt; Chung Chou Chang; Barbara Gulanski; Liana Fraenkel; Kristin Mattocks; David Rimland; Maria C Rodriguez-Barradas; Janet Tate; Michael T Yin; Amy C Justice Journal: PLoS One Date: 2011-02-16 Impact factor: 3.240
Authors: Guido Zuccon; Amol S Wagholikar; Anthony N Nguyen; Luke Butt; Kevin Chu; Shane Martin; Jaimi Greenslade Journal: AMIA Jt Summits Transl Sci Proc Date: 2013-03-18
Authors: Daniel J Feller; Jason Zucker; Oliver Bear Don't Walk; Bharat Srikishan; Roxana Martinez; Henry Evans; Michael T Yin; Peter Gordon; Noémie Elhadad Journal: AMIA Annu Symp Proc Date: 2018-12-05
Authors: Julie A Womack; Terrence E Murphy; Christopher T Rentsch; Janet P Tate; Harini Bathulapalli; Alexandria C Smith; Jonathan Bates; Samah Jarad; Cynthia L Gibert; Maria C Rodriguez-Barradas; Phyllis C Tien; Michael T Yin; Thomas M Gill; Gary Friedlaender; Cynthia A Brandt; Amy C Justice Journal: J Acquir Immune Defic Syndr Date: 2019-11-01 Impact factor: 3.731
Authors: Julie A Womack; Terrence E Murphy; Linda Leo-Summers; Jonathan Bates; Samah Jarad; Alexandria C Smith; Thomas M Gill; Evelyn Hsieh; Maria C Rodriguez-Barradas; Phyllis C Tien; Michael T Yin; Cynthia A Brandt; Amy C Justice Journal: J Acquir Immune Defic Syndr Date: 2022-10-01 Impact factor: 3.771
Authors: Xing Song; Lemuel R Waitman; Yong Hu; Alan S L Yu; David C Robbins; Mei Liu Journal: J Am Med Inform Assoc Date: 2019-03-01 Impact factor: 4.497
Authors: Julie A Womack; Terrence E Murphy; Harini Bathulapalli; Alexandria Smith; Jonathan Bates; Samah Jarad; Nancy S Redeker; Stephen L Luther; Thomas M Gill; Cynthia A Brandt; Amy C Justice Journal: J Am Geriatr Soc Date: 2020-08-28 Impact factor: 5.562
Authors: Julie A Womack; Terrence E Murphy; Christine Ramsey; Harini Bathulapalli; Linda Leo-Summers; Alexandria C Smith; Jonathan Bates; Samah Jarad; Thomas M Gill; Evelyn Hsieh; Maria C Rodriguez-Barradas; Phyllis C Tien; Michael T Yin; Cynthia Brandt; Amy C Justice Journal: J Acquir Immune Defic Syndr Date: 2021-10-01 Impact factor: 3.771
Authors: Sebastian Gehrmann; Franck Dernoncourt; Yeran Li; Eric T Carlson; Joy T Wu; Jonathan Welt; John Foote; Edward T Moseley; David W Grant; Patrick D Tyler; Leo A Celi Journal: PLoS One Date: 2018-02-15 Impact factor: 3.240