| Literature DB >> 29099854 |
Xiaoyun Lei1, Shi Huang1.
Abstract
Type 2 diabetes (T2D) is a complex disorder characterized by high blood sugar, insulin resistance, and relative lack of insulin. The collective effects of genome wide minor alleles of common SNPs, or the minor allele content (MAC) in an individual, have been linked with quantitative variations of complex traits and diseases. Here we studied MAC in T2D using previously published SNP datasets and found higher MAC in cases relative to matched controls. A set of 357 SNPs was found to have the best predictive accuracy in a British population. A weighted risk score calculated by using this set produced an area under the curve (AUC) score of 0.86, which is comparable to risk models built by phenotypic markers. These results identify a novel genetic risk element in T2D susceptibility and provide a potentially useful genetic method to identify individuals with high risk of T2D.Entities:
Mesh:
Year: 2017 PMID: 29099854 PMCID: PMC5669465 DOI: 10.1371/journal.pone.0187644
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Basic characteristics of samples used in the study.
| Training | Validation | ||
|---|---|---|---|
| WTCCC | WTCCC | phs000091 | |
| 829 | 820 | 1,707 | |
| 1,270 | 1,279 | 2,042 | |
| 411,165 | 411,165 | 703,407 |
Fig 1Average MAC (MAF < 0.4) values.
Average MAC values of case and control group in UK individuals of European ancestry from WTCCC dataset (A and C) and EA samples from phs000091 dataset (B and D) using SNPs either before (A and B) or after LD clumping (C and D). Student’s t test was used for comparing average MAC. Symbol *** means P value < 0.001.
Multivariate logistic regression analyses of T2D in phs000091.
| Factors | Explanations | Estimate | SE | P |
|---|---|---|---|---|
| Minor allele content of all SNPs | 0.01046 | 0.0036 | ||
| Family history of diabetes among first degree relatives | 1.197 | 0.0826 | ||
| Reported high blood pressure at/before blood draw | 0.8283 | 0.08562 | ||
| Reported high blood cholesterol at/before blood draw | 0.5204 | 0.09443 | ||
| Cigarette smoking. | 0.28 | 0.05942 | ||
| Total physical activity | -0.002863 | 0.001401 | ||
| BMI in kg/m2 | 0.1592 | 0.009498 | ||
| Age in years | 0.01201 | 0.005349 | ||
| Alcohol intake in G/day | -0.01058 | 0.003113 | ||
| Polyunsaturated fat intake | -0.03732 | 0.02658 | - | |
| Trans fat intake | 0.05584 | 0.07996 | - | |
| Magnesium intake in Mg/day | -0.00002306 | 0.0005684 | - | |
| Cereal fiber intake in G/day | -0.006541 | 0.01303 | - | |
| Heme iron intake in Mg/day | 0.3035 | 0.08606 |
The multivariate logistic regression was analyzed with R “glm” function. SE denotes standard error.
*** P value < 0.001
** P value < 0.01
* P value < 0.05. P value > 0.05 is indicated by—sign. Where a positive regression coefficient increases the risk of T2D, a negative one decreases the risk of T2D.
Fig 2The AUC and TPR values of models in external-cross-validation.
Shown are AUC (A) and TPR (B) values of different models consisting of different sets of SNPs at different P values from logistic regression test and different r2 values at LD clumping.
Fig 3The receiver operating curve for the risk prediction model.
(A) WTCCC refers to British population; (B) phs000091 refers the EA samples.
The annotation of genes.
| Genes of 357 SNPs for risk prediction | Genes of 357 SNPs chosen at random | P-value | |
|---|---|---|---|
| 180 (36.44%) | 159 (29.94%) | 0.03223 | |
| 106 (21.46%) | 87 (19.59%) | 0.04593 | |
| 105 (21.26%) | 39 (7.34%) | 2.72E-10 | |
| 33 (6.68%) | 0 (0%) | 4.18E-09 | |
| 28 (5.67%) | 18 (3.39%) | 0.1075 | |
| 42 (8.50%) | 228 (42.94%) | < 2.2e-16 | |
| 494 (100%) | 531 (100%) | - |
P value came from chi square test. BMI, body height and body weight are so closely tied to each other that they are put in a category (i.e. BMI related). Cholesterol, cholesterol HDL and cholesterol LDL are so closely tied to each other that they are put in a category (i.e. cholesterol related).