Literature DB >> 30204480

Machine Learning-Based Method for Obesity Risk Evaluation Using Single-Nucleotide Polymorphisms Derived from Next-Generation Sequencing.

Hsin-Yao Wang1,2, Shih-Cheng Chang1,3, Wan-Ying Lin4, Chun-Hsien Chen5, Szu-Hsien Chiang1, Kai-Yao Huang6, Bo-Yu Chu7, Jang-Jih Lu1,3, Tzong-Yi Lee7,8,9.   

Abstract

Obesity is a major risk factor for many metabolic diseases. To understand the genetic characteristics of obese individuals, single-nucleotide polymorphisms (SNPs) derived from next-generation sequencing (NGS) provide comprehensive insight into genome-wide genetic investigation. However, interpretation of these SNP data for clinical application is difficult given the high complexity of NGS data. Hence, in this study, obesity risk prediction models based on SNPs were designed using machine learning (ML) methods, namely support vector machine (SVM), k-nearest neighbor, and decision tree (DT). This investigation obtained clinicopathological features, including 130 SNPs, sex, and age, from 139 eligible individuals. Various feature selection methods, such as stepwise multivariate linear regression (MLR), DT, and genetic algorithms, were applied to select informative features for generating obesity prediction models. Multivariate logistic regression was used to evaluate the importance of the selected features. The models trained from various features evaluated their predictive performances based on fivefold cross-validation. Three measures, namely accuracy, sensitivity, and specificity, were used to examine and compare the predictive power among various models. To design obesity prediction models using ML methods, nine SNPs, including rs10501087, rs17700144, rs2287019, rs534870, rs660339, rs7081678, rs718314, rs9816226, and rs984222, were selected based on stepwise MLR. In evaluation of model performance, the SVM model significantly outperformed other classifiers based on the same training features. The SVM model exhibits 70.77% accuracy, 80.09% sensitivity, and 63.02% specificity. This investigation has demonstrated that the selected SNPs were effective in the detection of obesity risk. Additionally, the ML-based method provides a feasible mean for conducting preliminary analyses of genetic characteristics of obesity.

Entities:  

Keywords:  machine learning; next-generation sequencing (NGS); obesity; single-nucleotide polymorphisms (SNPs)

Mesh:

Year:  2018        PMID: 30204480     DOI: 10.1089/cmb.2018.0002

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  A survey on data mining techniques used in medicine.

Authors:  Saba Maleki Birjandi; Seyed Hossein Khasteh
Journal:  J Diabetes Metab Disord       Date:  2021-08-31

2.  Single-nucleotide Polymorphisms in Medical Nutritional Weight Loss: Challenges and Future Directions.

Authors:  Moxi Chen; Wei Chen
Journal:  J Transl Int Med       Date:  2022-04-09

Review 3.  Identifying Key Determinants of Childhood Obesity: A Narrative Review of Machine Learning Studies.

Authors:  Madison N LeCroy; Ryung S Kim; June Stevens; David B Hanna; Carmen R Isasi
Journal:  Child Obes       Date:  2021-03-04       Impact factor: 2.867

4.  Rapid classification of group B Streptococcus serotypes based on matrix-assisted laser desorption ionization-time of flight mass spectrometry and machine learning techniques.

Authors:  Hsin-Yao Wang; Wen-Chi Li; Kai-Yao Huang; Chia-Ru Chung; Jorng-Tzong Horng; Jen-Fu Hsu; Jang-Jih Lu; Tzong-Yi Lee
Journal:  BMC Bioinformatics       Date:  2019-12-24       Impact factor: 3.169

5.  Machine learning random forest for predicting oncosomatic variant NGS analysis.

Authors:  Eric Pellegrino; Coralie Jacques; Nathalie Beaufils; Isabelle Nanni; Antoine Carlioz; Philippe Metellus; L'Houcine Ouafik
Journal:  Sci Rep       Date:  2021-11-08       Impact factor: 4.379

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.