Literature DB >> 28943335

Single Nucleotide Polymorphism relevance learning with Random Forests for Type 2 diabetes risk prediction.

Beatriz López1, Ferran Torrent-Fontbona2, Ramón Viñas3, José Manuel Fernández-Real4.   

Abstract

OBJECTIVE: The use of artificial intelligence techniques to find out which Single Nucleotide Polymorphisms (SNPs) promote the development of a disease is one of the features of medical research, as such techniques may potentially aid early diagnosis and help in the prescription of preventive measures. In particular, the aim is to help physicians to identify the relevant SNPs related to Type 2 diabetes, and to build a decision-support tool for risk prediction.
METHODS: We use the Random Forest (RF) technique in order to search for the most important attributes (SNPs) related to diabetes, giving a weight (degree of importance), ranging between 0 and 1, to each attribute. Support Vector Machines and Logistic Regression have also been used since they are two other machine learning techniques that are well-established in the health community. Their performance has been compared to that achieved by RF. Furthermore, the relevance of the attributes obtained through the use of RF has then been used to perform predictions with k-Nearest Neighbour method weighting attributes in the similarity measure according to the relevance of the attributes with RF.
RESULTS: Testing is performed on a set of 677 subjects. RF is able to handle the complexity of features' interactions, overfitting, and unknown attribute values, providing the SNPs' relevance with an up to 0.89 area under the ROC curve in terms of risk prediction. RF outperforms all the other tested machine learning techniques in terms of prediction accuracy, and in terms of the stability of the estimated relevance of the attributes.
CONCLUSIONS: The Random Forest is a useful method for learning predictive models and the relevance of SNPs without any underlying assumption.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Feature learning; Gini importance; Predictive model; Random Forest; Type 2 diabetes

Mesh:

Year:  2017        PMID: 28943335     DOI: 10.1016/j.artmed.2017.09.005

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  6 in total

1.  Single-nucleotide Polymorphisms in Medical Nutritional Weight Loss: Challenges and Future Directions.

Authors:  Moxi Chen; Wei Chen
Journal:  J Transl Int Med       Date:  2022-04-09

2.  The Times they Are a-Changin' - Healthcare 4.0 Is Coming!

Authors:  Chiehfeng Chen; El-Wui Loh; Ken N Kuo; Ka-Wai Tam
Journal:  J Med Syst       Date:  2019-12-23       Impact factor: 4.460

3.  Improved Functional Causal Likelihood-Based Causal Discovery Method for Diabetes Risk Factors.

Authors:  Xiue Gao; Wenxue Xie; Zumin Wang; Bo Chen; Shengbin Zhou
Journal:  Comput Math Methods Med       Date:  2021-05-14       Impact factor: 2.238

Review 4.  Artificial Intelligence in Clinical Decision Support: a Focused Literature Survey.

Authors:  Stefania Montani; Manuel Striani
Journal:  Yearb Med Inform       Date:  2019-08-16

5.  Benchmarking machine learning models for late-onset alzheimer's disease prediction from genomic data.

Authors:  Javier De Velasco Oriol; Edgar E Vallejo; Karol Estrada; José Gerardo Taméz Peña; The Alzheimer's Disease Neuroimaging Initiative
Journal:  BMC Bioinformatics       Date:  2019-12-16       Impact factor: 3.169

6.  Non-linear machine learning models incorporating SNPs and PRS improve polygenic prediction in diverse human populations.

Authors:  Michael Elgart; Genevieve Lyons; Santiago Romero-Brufau; Nuzulul Kurniansyah; Jennifer A Brody; Xiuqing Guo; Henry J Lin; Laura Raffield; Yan Gao; Han Chen; Paul de Vries; Donald M Lloyd-Jones; Leslie A Lange; Gina M Peloso; Myriam Fornage; Jerome I Rotter; Stephen S Rich; Alanna C Morrison; Bruce M Psaty; Daniel Levy; Susan Redline; Tamar Sofer
Journal:  Commun Biol       Date:  2022-08-22
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.