Literature DB >> 33846489

Inflated prediction accuracy of neuropsychiatric biomarkers caused by data leakage in feature selection.

Miseon Shim1, Seung-Hwan Lee2,3, Han-Jeong Hwang4,5.   

Abstract

In recent years, machine learning techniques have been frequently applied to uncovering neuropsychiatric biomarkers with the aim of accurately diagnosing neuropsychiatric diseases and predicting treatment prognosis. However, many studies did not perform cross validation (CV) when using machine learning techniques, or others performed CV in an incorrect manner, leading to significantly biased results due to overfitting problem. The aim of this study is to investigate the impact of CV on the prediction performance of neuropsychiatric biomarkers, in particular, for feature selection performed with high-dimensional features. To this end, we evaluated prediction performances using both simulation data and actual electroencephalography (EEG) data. The overall prediction accuracies of the feature selection method performed outside of CV were considerably higher than those of the feature selection method performed within CV for both the simulation and actual EEG data. The differences between the prediction accuracies of the two feature selection approaches can be thought of as the amount of overfitting due to selection bias. Our results indicate the importance of correctly using CV to avoid biased results of prediction performance of neuropsychiatric biomarkers.

Entities:  

Year:  2021        PMID: 33846489     DOI: 10.1038/s41598-021-87157-3

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  4 in total

1.  Alteration of Neural Network Activity With Aging Focusing on Temporal Complexity and Functional Connectivity Within Electroencephalography.

Authors:  Momo Ando; Sou Nobukawa; Mitsuru Kikuchi; Tetsuya Takahashi
Journal:  Front Aging Neurosci       Date:  2022-02-04       Impact factor: 5.750

2.  Machine Learning-Based 30-Day Hospital Readmission Predictions for COPD Patients Using Physical Activity Data of Daily Living with Accelerometer-Based Device.

Authors:  Vijay Kumar Verma; Wen-Yen Lin
Journal:  Biosensors (Basel)       Date:  2022-08-05

3.  Accelerated functional brain aging in major depressive disorder: evidence from a large scale fMRI analysis of Chinese participants.

Authors:  Yunsong Luo; Wenyu Chen; Jiang Qiu; Tao Jia
Journal:  Transl Psychiatry       Date:  2022-09-21       Impact factor: 7.989

4.  Research on Classification of Open-Pit Mineral Exploiting Information Based on OOB RFE Feature Optimization.

Authors:  Yi Zhou; Shufang Tian; Jianping Chen; Yao Liu; Chaozhu Li
Journal:  Sensors (Basel)       Date:  2022-03-02       Impact factor: 3.576

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.