The need to approximate the use-case in clinical machine learning.

Sohrab Saeb, Luca Lonini, Arun Jayaraman, David C Mohr, Konrad P Kording

Abstract

The availability of smartphone and wearable sensor technology is leading to a rapid accumulation of human subject data, and machine learning is emerging as a technique to map those data into clinical predictions. As machine learning algorithms are increasingly used to support clinical decision making, it is vital to reliably quantify their prediction accuracy. Cross-validation (CV) is the standard approach, in which the accuracy of such algorithms is evaluated on a part of the data that the algorithm has not seen during training. However, for this procedure to be meaningful, the relationship between the training and the validation set should mimic the relationship between the training set and the dataset expected in clinical use. Here, we compared two popular CV methods: record-wise and subject-wise. While the subject-wise method mirrors the clinically relevant use-case scenario of diagnosis in newly recruited subjects, the record-wise strategy has no such interpretation. Using both a publicly available dataset and a simulation, we found that record-wise CV often massively overestimates the prediction accuracy of the algorithms. We also conducted a systematic review of the relevant literature, and found that this overly optimistic method was used by almost half of the retrieved studies that used accelerometers, wearable sensors, or smartphones to predict clinical outcomes. As we move towards an era of machine learning-based diagnosis and treatment, using proper methods to evaluate the accuracy of these algorithms is crucial, as inaccurate results can mislead both clinicians and data scientists.
© The Author 2017. Published by Oxford University Press.
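
The difference between the two CV methods described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code or dataset: the data generator, the 1-nearest-neighbor classifier, and all names (`simulate`, `predict_1nn`, `cv_accuracy`) are assumptions chosen for illustration. Each simulated subject has one label and a private feature offset drawn independently of that label, so the features identify the subject but carry no generalizable information about the label.

```python
import random

def simulate(n_subjects=20, n_records=30, n_features=5, seed=0):
    """Hypothetical data: (subject_id, features, label) for every record."""
    rng = random.Random(seed)
    records = []
    for subject in range(n_subjects):
        label = subject % 2  # one label per subject, balanced classes
        # Subject-specific offset, drawn independently of the label.
        offset = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        for _ in range(n_records):
            features = [o + rng.gauss(0.0, 0.2) for o in offset]
            records.append((subject, features, label))
    return records

def predict_1nn(train, features):
    """Return the label of the nearest training record (squared Euclidean)."""
    nearest = min(
        train,
        key=lambda r: sum((a - b) ** 2 for a, b in zip(r[1], features)),
    )
    return nearest[2]

def cv_accuracy(records, folds, n_folds):
    """Pooled accuracy over all folds; folds[i] is the fold of record i."""
    correct = 0
    for k in range(n_folds):
        train = [r for r, f in zip(records, folds) if f != k]
        test = [r for r, f in zip(records, folds) if f == k]
        correct += sum(predict_1nn(train, x) == y for _, x, y in test)
    return correct / len(records)

N_FOLDS = 5
records = simulate()
rng = random.Random(1)

# Record-wise CV: records are shuffled and split regardless of subject,
# so records of the same subject appear in both training and validation sets.
order = list(range(len(records)))
rng.shuffle(order)
record_folds = [0] * len(records)
for position, index in enumerate(order):
    record_folds[index] = position % N_FOLDS

# Subject-wise CV: all records of a subject go to the same fold,
# so validation subjects are never seen during training.
subjects = sorted({s for s, _, _ in records})
rng.shuffle(subjects)
fold_of_subject = {s: i % N_FOLDS for i, s in enumerate(subjects)}
subject_folds = [fold_of_subject[s] for s, _, _ in records]

record_acc = cv_accuracy(records, record_folds, N_FOLDS)
subject_acc = cv_accuracy(records, subject_folds, N_FOLDS)
print(f"record-wise accuracy:  {record_acc:.2f}")
print(f"subject-wise accuracy: {subject_acc:.2f}")
```

In this setup the classifier can only succeed by recognizing a subject it has already seen, so the record-wise estimate should come out well above the subject-wise one, which is expected to stay near chance level. With scikit-learn, the same contrast is commonly obtained by using `KFold` for the record-wise split and `GroupKFold` (with subject identifiers as groups) for the subject-wise split.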

Keywords:  Machine learning; clinical outcomes; cross-validation; diagnosis; prediction accuracy; rehabilitation outcomes; smartphones; wearable technology

Year: 2017; PMID: 28327985; PMCID: PMC5441397; DOI: 10.1093/gigascience/gix019

Source DB: PubMed; Journal: Gigascience; ISSN: 2047-217X; Impact factor: 6.524


