| Literature DB >> 35571045 |
Xinping Jiang1, Zhang Yang2, Shuai Wang2, Shuanglin Deng3.
Abstract
Metabolic syndrome (MetS) is characterized by the concurrence of multiple metabolic disorders resulting in the increased risk of a variety of diseases related to disrupted metabolism homeostasis. The prevalence of MetS has reached a pandemic level worldwide. In recent years, extensive amount of data have been generated throughout the research targeted or related to the condition with techniques including high-throughput screening and artificial intelligence, and with these "big data", the prevention of MetS could be pushed to an earlier stage with different data source, data mining tools and analytic tools at different levels. In this review we briefly summarize the recent advances in the study of "big data" applications in the three-level disease prevention for MetS, and illustrate how these technologies could contribute tobetter preventive strategies.Entities:
Keywords: big data and analytics; genome and epigenome; metabolic syndrome; methodology of computational research; pathogenesis and pathophysiology
Year: 2022 PMID: 35571045 PMCID: PMC9095427 DOI: 10.3389/fgene.2022.810152
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
SNPs associated with MetS&MetS components from recent GWAS studies in different races and ethnicities.
| Race and ethnicity | Number of samples | Identified genes | Identified SNPs | Race or ethnicity specific SNPs |
|---|---|---|---|---|
| European ancestry | 22,161 | LPL | rs13702 (LPL)--- HDLC, TG | NA |
| CETP | rs9939224 (CETP)--- HDLC, TG | |||
| APOA5 | rs2266788 (APOA5)---HDLC,TG | |||
| ZNF259 | rs2075290 (ZNF259)---HDLC, TG | |||
| BUD13 | rs10790162 (BUD13)---HDLC,TG | |||
| European ancestry (Finnish population) | 11,616 | APOA1/C3/A4/A5 LRP1B | rs964184 (APOA1/C3/A4/A5)---VLDL, TG, HDL, rs17771092 (LRP1B)---TG, insulin | NA |
| African ancestry | 4,820 | CA10 | rs73989312 (CA10), rs73989319(CA10) | rs73989312 rs77244975 |
| CTNNA3 | rs77244975 (CTNNA3)--- Waist circumference | |||
| RALYL | rs76822696 (RALYL) | |||
| KSR2 | rs7964157 (KSR2)--- Systolic BP | |||
| MBNL1 | rs146816516 (MBNL1)--- Systolic BP | |||
| BAI3 | rs9354671 (BAI3) | |||
| EDEM1-GRM7 | rs2061117 (BAI3) | |||
| LPL | rs149307971 ( EDEM1-GRM7)--- HDL | |||
| CETP | rs294 (LPL) | |||
| rs4523270 (LPL) | ||||
| rs2165558 (LPL) | ||||
| rs35237252 (LPL) | ||||
| rs4783961 (CETP) | ||||
| Eastern Asian ancestry (Han ethnicity) | 1,994 | APOA5ALDH2BUD13 | rs651821(APOA5)---TG,HDL-C | rs671rs180326 |
| rs671(ALDH2)---BMI, WHR, SBP and TG in alcohol drinkers | ||||
| rs445925---LDL-C | ||||
| rs180326 (BUD13)---TG | ||||
| Eastern Asian ancestry (Korean ethnicity) | 24 | APOA5 | rs662799 (APOA5)---MetS, TG,HDL | rs1260326, s1260333 |
| GCKR | rs2075291 (APOA5)----TG,HDL | rs1919127, rs964184 | ||
| C2orF16 | ,rs2266788 (APOA5)---TG rs780092, rs780093, rs780094 | rs2075295, rs1558861 | ||
| ZPR1 | rs1260326, rs1260333 (GCKR)---TG | rs4775041, rs10468017 rs1800588 | ||
| BUD13 | rs1919127, rs1919128 (C2orf16)---TG | rs72786786, rs173539, rs247616 | ||
| ALDH1A2 | rs603446, rs964184 (ZPR1)---TG | rs247617, rs3764261 | ||
| LIPC | rs2075295, rs11216126, rs1558861 (BUD13)---TG | rs708272, rs7499892 | ||
| HERPUD1, CETP | rs4775041, rs10468017 (ALDH1A2)---HDL | |||
| MTNR1B | rs1800588(LIPC)---HDL-C | |||
| rs72786786, rs173539, rs247616 | ||||
| rs247617, rs3764261 (HERPUD1, CETP)---HDL-C | ||||
| rs708272, rs7499892, rs2303790 (CETP)---HDL-C | ||||
| rs10830962, rs10830963 (MTNR1B)---FBG |
Machine learning for MetS risk/prediction with clinical data.
| Data source | Sample size | Studied machine learning tools | Clinical data type | Optimal algorithm | Associated/significant clinical index | |
|---|---|---|---|---|---|---|
| Retrospective Cohort, Cheng-Sheng Yu, 2020 | Health examination | 1,333 | Decision tree---classification and regression trees, C5.0, chi-square automatic interaction detection, conditional interference trees, evolutionary learning of globally optimal trees, generalized linear model trees, random forest | Anthropometrics, laboratory tests, medical imaging | NA | Obesity, serum GOT, serum GPT, CAP score, HbA1c |
| Cheng-Sheng Yu, 2021, Retrospective Cohort+3 year follow-up | Health examination | 1,129 | K-nearest neighbor classification (KNN), linear discriminant analysis (LDA), logistic regression for classification, ensemble learning:random forest, adaptive boosting, support vector machine (SVM), naive Bayes classification (NB), and hierarchical clustering analysis (HCA) | Anthropometrics, laboratory tests, medical imaging | Random forest | Body mass index, HbA1c, CAP score |
| Ji-Eun Park, 2021, Retrospective Cohor | Korean Genome and Epidemiology Study | 2,871 | K-nearest neighbor (KNN), naive Bayes, random forest, decision tree, multilayer perceptron (MLP), support vector machine (SVM) | Anthropometrics, life style data | Naive Bayes (most sensitive) | Age, stress (potential predictors included age, sex, education level, marital status, body mass index (BMI), physical activity, alcohol consumption, and smoking) |
| Shu-jie Xia, 2021, Retrospective Cohort | In-patient | 586 | Decision tree (DT), support vector machine (SVM) and random forest (RF) | Anthropometrics, laboratory tests, TCM indexes | Random forest (RF) (best performance) | Waist circumference, fasting blood-glucose, BMI, alkaline phosphatase creatinine, blood urea nitrogen, AST/ALT, weight, TCM indexes: body fat, wiry pulse, chest tightness, spontaneous perspiration, greasy tongue coating, snoring sleep |
| Perry J. Pickhardt, 2021, retrospective cohort | HIPAA-compliant investigation | 7,785 | Convolutional neural network (3D U-Net), region-based convolutional neural network (R-CNN) | CT-based biomakers | NA | Univariate L1-level total abdominal fat** (80.1% sensitivity, 85.4% specificity), L3-level skeletal muscle index, volumetric liver attenuation |
FIGURE 1Conceptual framework of prevention comparison on metabolic syndrome (MetS).