| Literature DB >> 27977676 |
Oriol Canela-Xandri1, Konrad Rawlik1, John A Woolliams1, Albert Tenesa1,2.
Abstract
Genome-wide association studies (GWAS) promised to translate their findings into clinically beneficial improvements of patient management by tailoring disease management to the individual through the prediction of disease risk. However, the ability to translate genetic findings from GWAS into predictive tools that are of clinical utility and which may inform clinical practice has, so far, been encouraging but limited. Here we propose to use a more powerful statistical approach, the use of which has traditionally been limited due to computational requirements and lack of sufficiently large individual level genotyped cohorts, but which improve the prediction of multiple medically relevant phenotypes using the same panel of SNPs. As a proof of principle, we used a shared panel of 319,038 common SNPs with MAF > 0.05 to train the prediction models in 114,264 unrelated White-British individuals for height and four obesity related traits (body mass index, basal metabolic rate, body fat percentage, and waist-to-hip ratio). We obtained prediction accuracies that ranged between 46% and 75% of the maximum achievable given the captured heritable component. For height, this represents an improvement in prediction accuracy of up to 68% (184% more phenotypic variance explained) over SNPs reported to be robustly associated with height in a previous GWAS meta-analysis of similar size. Across-population predictions in White non-British individuals were similar to those in White-British whilst those in Asian and Black individuals were informative but less accurate. We estimate that the genotyping of circa 500,000 unrelated individuals will yield predictions between 66% and 82% of the SNP-heritability captured by common variants in our array. Prediction accuracies did not improve when including rarer SNPs or when fitting multiple traits jointly in multivariate models.Entities:
Mesh:
Year: 2016 PMID: 27977676 PMCID: PMC5157980 DOI: 10.1371/journal.pone.0166755
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Prediction accuracies on related White-British and self-reported White-British.
| Traits | self-reported White-British (95% CI) | related White-British (95% CI) |
|---|---|---|
| 0.51 (0.49–0.52) | 0.53 (0.52–0.55) | |
| 0.27 (0.25–0.29) | 0.28 (0.26–0.30) | |
| 0.25 (0.24–0.27) | 0.27 (0.26–0.29) | |
| 0.20 (0.19–0.22) | 0.23 (0.21–0.25) | |
| 0.32 (0.31–0.34) | 0.34 (0.32–0.36) |
Across-population prediction accuracies.
| Traits | White non British (95% CI) | Asian/Asian-British (95% CI) | Black/Black-British (95% CI) |
|---|---|---|---|
| 0.50 (0.48–0.51) | 0.34 (0.30–0.38) | 0.18 (0.14–0.23) | |
| 0.26 (0.24–0.28) | 0.21 (0.16–0.25) | 0.12 (0.07–0.17) | |
| 0.25 (0.23–0.27) | 0.22 (0.18–0.26) | 0.11 (0.06–0.16) | |
| 0.21 (0.19–0.23) | 0.14 (0.10–0.19) | 0.07 (0.02–0.12) | |
| 0.32 (0.30–0.34) | 0.22 (0.18–0.26) | 0.12 (0.07–0.17) |
Fig 1Prediction accuracy as a function of sample size for height.
Inverse of the square of the prediction accuracy as a function of the inverse of the training sample size. Blue dots indicate prediction accuracies achieved on several trials. The dashed straight line shows the linear regression fit to the blue dots. The regression intercept indicates the maximum accuracy achievable using common variants represented in the array. The red dot is the expected prediction accuracy with a training sample size of 500,000 individuals.
Fig 2Prediction accuracies from GWAS analyses.
Predictions on self reported White-British obtained using independently estimated SNP effects from a GWAS. We plot the accuracies obtained for subsets of SNPs selected based on a particular p-value threshold against this threshold value. Different colours indicate different traits. Dashed lines indicate maximum accuracies obtained when the effects of all SNP were estimated jointly (SNP-BLUP) using DISSECT.