Rina Kagawa1, Yoshimasa Kawazoe2, Yusuke Ida2, Emiko Shinohara2, Katsuya Tanaka1, Takeshi Imai3, Kazuhiko Ohe1,2. 1. 1 Department of Biomedical Informatics, Graduate School of Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan. 2. 2 Department of Healthcare Information Management, The University of Tokyo Hospital, Bunkyo-ku, Tokyo, Japan. 3. 3 Center for Disease Biology and Integrative Medicine, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.
Abstract
BACKGROUND: Phenotyping is an automated technique that can be used to distinguish patients based on electronic health records. To improve the quality of medical care and advance type 2 diabetes mellitus (T2DM) research, the demand for T2DM phenotyping has been increasing. Some existing phenotyping algorithms are not sufficiently accurate for screening or identifying clinical research subjects. OBJECTIVE: We propose a practical phenotyping framework using both expert knowledge and a machine learning approach to develop 2 phenotyping algorithms: one is for screening; the other is for identifying research subjects. METHODS: We employ expert knowledge as rules to exclude obvious control patients and machine learning to increase accuracy for complicated patients. We developed phenotyping algorithms on the basis of our framework and performed binary classification to determine whether a patient has T2DM. To facilitate development of practical phenotyping algorithms, this study introduces new evaluation metrics: area under the precision-sensitivity curve (AUPS) with a high sensitivity and AUPS with a high positive predictive value. RESULTS: The proposed phenotyping algorithms based on our framework show higher performance than baseline algorithms. Our proposed framework can be used to develop 2 types of phenotyping algorithms depending on the tuning approach: one for screening, the other for identifying research subjects. CONCLUSIONS: We develop a novel phenotyping framework that can be easily implemented on the basis of proper evaluation metrics, which are in accordance with users' objectives. The phenotyping algorithms based on our framework are useful for extraction of T2DM patients in retrospective studies.
BACKGROUND: Phenotyping is an automated technique that can be used to distinguish patients based on electronic health records. To improve the quality of medical care and advance type 2 diabetes mellitus (T2DM) research, the demand for T2DM phenotyping has been increasing. Some existing phenotyping algorithms are not sufficiently accurate for screening or identifying clinical research subjects. OBJECTIVE: We propose a practical phenotyping framework using both expert knowledge and a machine learning approach to develop 2 phenotyping algorithms: one is for screening; the other is for identifying research subjects. METHODS: We employ expert knowledge as rules to exclude obvious control patients and machine learning to increase accuracy for complicated patients. We developed phenotyping algorithms on the basis of our framework and performed binary classification to determine whether a patient has T2DM. To facilitate development of practical phenotyping algorithms, this study introduces new evaluation metrics: area under the precision-sensitivity curve (AUPS) with a high sensitivity and AUPS with a high positive predictive value. RESULTS: The proposed phenotyping algorithms based on our framework show higher performance than baseline algorithms. Our proposed framework can be used to develop 2 types of phenotyping algorithms depending on the tuning approach: one for screening, the other for identifying research subjects. CONCLUSIONS: We develop a novel phenotyping framework that can be easily implemented on the basis of proper evaluation metrics, which are in accordance with users' objectives. The phenotyping algorithms based on our framework are useful for extraction of T2DM patients in retrospective studies.
Entities:
Keywords:
phenotyping; positive predictive value (PPV); sensitivity; support vector machine (SVM); type 2 diabetes mellitus (T2DM)
Authors: Wei-Qi Wei; Pedro L Teixeira; Huan Mo; Robert M Cronin; Jeremy L Warner; Joshua C Denny Journal: J Am Med Inform Assoc Date: 2015-09-02 Impact factor: 4.497
Authors: Jie Xu; Luke V Rasmussen; Pamela L Shaw; Guoqian Jiang; Richard C Kiefer; Huan Mo; Jennifer A Pacheco; Peter Speltz; Qian Zhu; Joshua C Denny; Jyotishman Pathak; William K Thompson; Enid Montague Journal: J Am Med Inform Assoc Date: 2015-07-29 Impact factor: 4.497
Authors: Rachel L Richesson; Shelley A Rusincovitch; Douglas Wixted; Bryan C Batch; Mark N Feinglos; Marie Lynn Miranda; W Ed Hammond; Robert M Califf; Susan E Spratt Journal: J Am Med Inform Assoc Date: 2013-09-11 Impact factor: 4.497
Authors: Wei-Qi Wei; Cynthia L Leibson; Jeanine E Ransom; Abel N Kho; Pedro J Caraballo; High Seng Chai; Barbara P Yawn; Jennifer A Pacheco; Christopher G Chute Journal: J Am Med Inform Assoc Date: 2012-01-16 Impact factor: 4.497
Authors: Catherine A McCarty; Rex L Chisholm; Christopher G Chute; Iftikhar J Kullo; Gail P Jarvik; Eric B Larson; Rongling Li; Daniel R Masys; Marylyn D Ritchie; Dan M Roden; Jeffery P Struewing; Wendy A Wolf Journal: BMC Med Genomics Date: 2011-01-26 Impact factor: 3.063
Authors: Gregory A Nichols; Jay Desai; Jennifer Elston Lafata; Jean M Lawrence; Patrick J O'Connor; Ram D Pathak; Marsha A Raebel; Robert J Reid; Joseph V Selby; Barbara G Silverman; John F Steiner; W F Stewart; Suma Vupputuri; Beth Waitzfelder Journal: Prev Chronic Dis Date: 2012-06-07 Impact factor: 2.830
Authors: Silvio E Inzucchi; Richard M Bergenstal; John B Buse; Michaela Diamant; Ele Ferrannini; Michael Nauck; Anne L Peters; Apostolos Tsapas; Richard Wender; David R Matthews Journal: Diabetes Care Date: 2012-04-19 Impact factor: 19.112
Authors: Nicoletta Musacchio; Annalisa Giancaterini; Giacomo Guaita; Alessandro Ozzello; Maria A Pellegrini; Paola Ponzani; Giuseppina T Russo; Rita Zilich; Alberto de Micheli Journal: J Med Internet Res Date: 2020-06-22 Impact factor: 5.428
Authors: Kavishwar B Wagholikar; Christina M Fischer; Alyssa P Goodson; Christopher D Herrick; Taylor E Maclean; Katelyn V Smith; Liliana Fera; Thomas A Gaziano; Jacqueline R Dunning; Joshua Bosque-Hamilton; Lina Matta; Eloy Toscano; Brent Richter; Layne Ainsworth; Michael F Oates; Samuel Aronson; Calum A MacRae; Benjamin M Scirica; Akshay S Desai; Shawn N Murphy Journal: J Clin Med Res Date: 2019-05-10