Literature DB >> 35145205

Machine learning-based diagnosis and risk factor analysis of cardiocerebrovascular disease based on KNHANES.

Taeseob Oh1, Dongkyun Kim2, Siryeol Lee3, Changwon Won1, Sunyoung Kim1, Ji-Soo Yang1, Junghwa Yu1, Byungsung Kim4, Joohyun Lee5.   

Abstract

The prevalence of cardiocerebrovascular disease (CVD) is continuously increasing, and it is the leading cause of human death. Since it is difficult for physicians to screen thousands of people, high-accuracy and interpretable methods need to be presented. We developed four machine learning-based CVD classifiers (i.e., multi-layer perceptron, support vector machine, random forest, and light gradient boosting) based on the Korea National Health and Nutrition Examination Survey. We resampled and rebalanced KNHANES data using complex sampling weights such that the rebalanced dataset mimics a uniformly sampled dataset from overall population. For clear risk factor analysis, we removed multicollinearity and CVD-irrelevant variables using VIF-based filtering and the Boruta algorithm. We applied synthetic minority oversampling technique and random undersampling before ML training. We demonstrated that the proposed classifiers achieved excellent performance with AUCs over 0.853. Using Shapley value-based risk factor analysis, we identified that the most significant risk factors of CVD were age, sex, and the prevalence of hypertension. Additionally, we identified that age, hypertension, and BMI were positively correlated with CVD prevalence, while sex (female), alcohol consumption and, monthly income were negative. The results showed that the feature selection and the class balancing technique effectively improve the interpretability of models.
© 2022. The Author(s).

Entities:  

Mesh:

Year:  2022        PMID: 35145205      PMCID: PMC8831514          DOI: 10.1038/s41598-022-06333-1

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  31 in total

Review 1.  Receiver Operating Characteristic (ROC) Curve Analysis for Medical Diagnostic Test Evaluation.

Authors:  Karimollah Hajian-Tilaki
Journal:  Caspian J Intern Med       Date:  2013

Review 2.  Prevention of Cardiovascular Disease.

Authors:  Brian V Reamy; Pamela M Williams; Daniel P Kuckel
Journal:  Prim Care       Date:  2018-03       Impact factor: 2.907

Review 3.  When to use the Bonferroni correction.

Authors:  Richard A Armstrong
Journal:  Ophthalmic Physiol Opt       Date:  2014-04-02       Impact factor: 3.117

Review 4.  Machine learning in heart failure: ready for prime time.

Authors:  Saqib Ejaz Awan; Ferdous Sohel; Frank Mario Sanfilippo; Mohammed Bennamoun; Girish Dwivedi
Journal:  Curr Opin Cardiol       Date:  2018-03       Impact factor: 2.161

Review 5.  Vegetarian Dietary Patterns and Cardiovascular Disease.

Authors:  Hana Kahleova; Susan Levin; Neal D Barnard
Journal:  Prog Cardiovasc Dis       Date:  2018-05-22       Impact factor: 8.194

6.  Cardiovascular Event Prediction by Machine Learning: The Multi-Ethnic Study of Atherosclerosis.

Authors:  Bharath Ambale-Venkatesh; Xiaoying Yang; Colin O Wu; Kiang Liu; W Gregory Hundley; Robyn McClelland; Antoinette S Gomes; Aaron R Folsom; Steven Shea; Eliseo Guallar; David A Bluemke; João A C Lima
Journal:  Circ Res       Date:  2017-08-09       Impact factor: 17.367

7.  Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants.

Authors:  Ahmed M Alaa; Thomas Bolton; Emanuele Di Angelantonio; James H F Rudd; Mihaela van der Schaar
Journal:  PLoS One       Date:  2019-05-15       Impact factor: 3.240

8.  Machine Learning Outperforms ACC / AHA CVD Risk Calculator in MESA.

Authors:  Ioannis A Kakadiaris; Michalis Vrigkas; Albert A Yen; Tatiana Kuznetsova; Matthew Budoff; Morteza Naghavi
Journal:  J Am Heart Assoc       Date:  2018-11-20       Impact factor: 5.501

9.  Conditional variable importance for random forests.

Authors:  Carolin Strobl; Anne-Laure Boulesteix; Thomas Kneib; Thomas Augustin; Achim Zeileis
Journal:  BMC Bioinformatics       Date:  2008-07-11       Impact factor: 3.169

10.  Exploring feature selection and classification methods for predicting heart disease.

Authors:  Robinson Spencer; Fadi Thabtah; Neda Abdelhamid; Michael Thompson
Journal:  Digit Health       Date:  2020-03-29
View more
  1 in total

1.  Puerarin attenuates isoproterenol‑induced myocardial hypertrophy via inhibition of the Wnt/β‑catenin signaling pathway.

Authors:  Xiaoying Wang; Kai He; Linlin Ma; Lan Wu; Yan Yang; Yanfei Li
Journal:  Mol Med Rep       Date:  2022-08-10       Impact factor: 3.423

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.