| Literature DB >> 34983942 |
Oliver Pain1,2, Alexandra C Gillett3, Jehannine C Austin4, Lasse Folkersen5, Cathryn M Lewis3,6,7.
Abstract
There is growing interest in the clinical application of polygenic scores as their predictive utility increases for a range of health-related phenotypes. However, providing polygenic score predictions on the absolute scale is an important step for their safe interpretation. We have developed a method to convert polygenic scores to the absolute scale for binary and normally distributed phenotypes. This method uses summary statistics, requiring only the area-under-the-ROC curve (AUC) or variance explained (R2) by the polygenic score, and the prevalence of binary phenotypes, or mean and standard deviation of normally distributed phenotypes. Polygenic scores are converted using normal distribution theory. We also evaluate methods for estimating polygenic score AUC/R2 from genome-wide association study (GWAS) summary statistics alone. We validate the absolute risk conversion and AUC/R2 estimation using data for eight binary and three continuous phenotypes in the UK Biobank sample. When the AUC/R2 of the polygenic score is known, the observed and estimated absolute values were highly concordant. Estimates of AUC/R2 from the lassosum pseudovalidation method were most similar to the observed AUC/R2 values, though estimated values deviated substantially from the observed for autoimmune disorders. This study enables accurate interpretation of polygenic scores using only summary statistics, providing a useful tool for educational and clinical purposes. Furthermore, we have created interactive webtools implementing the conversion to the absolute ( https://opain.github.io/GenoPred/PRS_to_Abs_tool.html ). Several further barriers must be addressed before clinical implementation of polygenic scores, such as ensuring target individuals are well represented by the GWAS sample.Entities:
Mesh:
Year: 2022 PMID: 34983942 PMCID: PMC8904577 DOI: 10.1038/s41431-021-01028-z
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Comparison between observed and estimated values on the absolute scale across polygenic score quantiles for binary and normally distributed phenotypes.
| Phenotype | Mean Abs. Diff. | Mean Abs. Diff. of SD | Skewness | |||
|---|---|---|---|---|---|---|
| Binary | ||||||
| Depression | 1.5% | NA | 49,999 | 24,999 | 25,000 | NA |
| T2D | 2.4% | NA | 49,999 | 14,888 | 35,111 | NA |
| CAD | 1.5% | NA | 49,999 | 25,000 | 24,999 | NA |
| IBD | 6.8% | NA | 49,999 | 3461 | 46,538 | NA |
| MultiScler | 12.6% | NA | 49,999 | 1137 | 48,862 | NA |
| RheuArth | 6.8% | NA | 49,999 | 3408 | 46,591 | NA |
| Breast_Cancer | 4.6% | NA | 49,999 | 8512 | 41,487 | NA |
| Prostate_Cancer | 8.7% | NA | 50,000 | 2927 | 47,073 | NA |
| Continuous | ||||||
| Intelligence | 0.3% | 1.3% | 50,000 | NA | NA | 0.144 |
| Height | 0.1% | 1.5% | 49,999 | NA | NA | 0.117 |
| BMI | 0.2% | 6% | 49,999 | NA | NA | 0.592 |
Estimated values are based on the observed AUC/R2 of the polygenic score, the prevalence of binary phenotypes, and mean and standard deviation (SD) of continuous phenotypes in UKB.
Mean Abs. Diff. mean absolute difference between expected and observed case probability (binary) or trait mean (continuous), Mean Abs. Diff. of SD mean absolute difference between expected and observed trait standard deviation, N sample size, Ncas number of cases, Ncon number of controls.
Fig. 1Comparison of observed and estimated probability of being a case across 20 DBSLMM polygenic score quantiles.
Estimated values are based on either the observed polygenic score AUC, or the lassosum estimated AUC. Figures are available in colour online.
Fig. 2Comparison of observed and estimated phenotype mean and standard deviation across 20 DBSLMM polygenic score quantiles.
Estimated values are either based on the observed polygenic score R2, or the lassosum estimated R2. Figures are available in colour online.
Fig. 3Shiny app implementing absolute scale conversion for binary phenotypes.
Parameters reflect prevalence of schizophrenia and AUC of the schizophrenia polygenic score [36]. Figures are available in colour online.
Fig. 4Shiny app implementing absolute scale conversion for normally distributed phenotypes.
Parameters reflect mean and SD of IQ, and R2 of educational attainment polygenic score for IQ [37]. Figures are available in colour online.
Comparison between polygenic score AUC/R2 observed in UKB and estimated using lassosum.
| Binary | |||
| Phenotype | Observed AUC | Estimated AUC | Difference |
| Depression | 0.559 | 0.555 | −0.004 |
| T2D | 0.642 | 0.651 | 0.010 |
| CAD | 0.597 | 0.624 | 0.027 |
| IBD | 0.677 | 0.792 | 0.115 |
| MultiScler | 0.657 | 0.785 | 0.128 |
| RheuArth | 0.632 | 0.500 | −0.132 |
| Breast_Cancer | 0.658 | 0.698 | 0.040 |
| Prostate_Cancer | 0.691 | 0.721 | 0.030 |
| Continuous | |||
| Phenotype | Observed | Estimated | Difference |
| Intelligence | 0.008 | 0.023 | 0.015 |
| BMI | 0.081 | 0.074 | −0.007 |
| Height | 0.110 | 0.156 | 0.046 |
Comparison between observed and estimated case probabilities across polygenic score quantiles for binary phenotypes.
| Phenotype | Mean Abs. Diff. | Mean Abs. Diff. of SD | Skewness | |||
|---|---|---|---|---|---|---|
| Binary | ||||||
| Depression | 1.6% | NA | 49,999 | 24,999 | 25,000 | NA |
| T2D | 3.4% | NA | 49,999 | 14,888 | 35,111 | NA |
| CAD | 4.3% | NA | 49,999 | 25,000 | 24,999 | NA |
| IBD | 38.6% | NA | 49,999 | 3461 | 46,538 | NA |
| MultiScler | 43.7% | NA | 49,999 | 1137 | 48,862 | NA |
| RheuArth | 40% | NA | 49,999 | 3408 | 46,591 | NA |
| Breast_Cancer | 12.5% | NA | 49,999 | 8512 | 41,487 | NA |
| Prostate_Cancer | 11.1% | NA | 50,000 | 2927 | 47,073 | NA |
| Continuous | ||||||
| Intelligence | 1.9% | 1.5% | 50,000 | NA | NA | 0.144 |
| Height | 0.2% | 2.5% | 49,999 | NA | NA | 0.117 |
| BMI | 0.3% | 6% | 49,999 | NA | NA | 0.592 |
Estimated values are based on the lassosum estimated AUC/R2 of the polygenic score.
Mean Abs. Diff. mean absolute difference between expected and observed case probability (binary) or trait mean (continuous), Mean Abs. Diff. of SD mean absolute difference between expected and observed trait standard deviation, N sample size, Ncas number of cases, Ncon number of controls.