| Literature DB >> 28510587 |
Hoang V Lam1, Dat T Nguyen2, Cao D Nguyen3,4.
Abstract
We presented a risk assessment model to distinguish between type 1 diabetes (T1D) affected and unaffected siblings using only three single nucleotide polymorphism (SNP) genotypes. In addition we calculated the heritability from genome-wide identity-by-descent (IBD) sharing between full siblings. We analyzed 1,253 pairs of affected individuals and their unaffected siblings (750 pairs from a discovery set and 503 pairs from a validation set) from the T1D Genetics Consortium (T1DGC), applying a logistic regression to analyze the area under the receiver operator characteristic (ROC) curve (AUC). To calculate the heritability of T1D we used the Haseman-Elston regression analysis of the squared difference between the phenotypes of the pairs of siblings on the estimate of their genome-wide IBD proportion. The model with only 3 SNPs achieving an AUC of 0.75 in both datasets outperformed the model using the presence of the high-risk DR3/4 HLA genotype, namely AUC of 0.60. The heritability on the liability scale of T1D was approximately from 0.53 to 0.92, close to the results obtained from twin studies, ranging from 0.4 to 0.88.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28510587 PMCID: PMC5433695 DOI: 10.1371/journal.pone.0176341
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Histogram of the genome-wide additive genetic relationships of full-sib pairs estimated from genetic markers using Plink.
The heritability on liability scale (h) of T1D estimates using the well-known Haseman-Elston regression analysis.
| 0.003 (general population) | 0.53 (0.0017) |
AUC on T1D discovery and validation datasets generated from logistic regression models.
| Model | Discovery set | Validation set |
|---|---|---|
| Presence / Absence DR3/4 | 0.61 (s.e. 0.014) | 0.61 (s.e. 0.018) |
| 3 SNPs | 0.75 (s.e. 0.013) | 0.75 (s.e. 0.015) |
Fig 2ROC analyses from logistic regression models on A) T1D’s discovery set n = 750 pairs (1,500 samples) and B) T1D’s validation dataset n = 503 pairs (1,006 samples). Each point represents a test defined by a different logit score.