| Literature DB >> 34093660 |
Chong You1, Zhenwei Zhou2, Jia Wen3, Yun Li4, Cheng Heng Pang5, Haoyang Du6, Ziwen Wang7, Xiao-Hua Zhou1, Daniel A King8, Ching-Ti Liu2, Jie Huang9,10.
Abstract
Human height is a polygenic trait, influenced by a large number of genomic loci. In the pre-genomic era, height prediction was based largely on parental height. More recent predictions of human height have made great strides by integrating genotypic data from large biobanks with improved statistical techniques. Nevertheless, recent studies have not leveraged parental height, an added feature that we hypothesized would offer complementary predictive value. In this study, we assessed the predictive power of polygenic risk scores (PRS) combined with the traditional parental height predictors. Our study analyzed genotypic data and parental height from 1,071 trios from the United Kingdom Biobank and 444 trios from the Framingham Heart Study. We explored a series of statistical models to fully evaluate the performance of several PRS constructed together with parental information and proposed a model we call PRS++ that includes gender, parental height, and PRSs of parents and proband. Our estimate of height with an R 2 of ∼0.82 is, to our knowledge, the most accurate estimate yet achieved for predicting human adult height. Without parental information, the R 2 from the best PRS-driven model is ∼0.73. In summary, using adult height prediction as an example, we demonstrated that traditional predictors still play important roles and merit integration into the current trends of intensive PRS approaches.Entities:
Keywords: adult height; model selection; parental height; polygenic score; prediction
Year: 2021 PMID: 34093660 PMCID: PMC8176283 DOI: 10.3389/fgene.2021.669441
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Characteristics of study participants.
| Cohort | Subject | Age | Height | BMI | |
| UKB | Proband-male | 418 | 42.22 (1.8) | 178.12 (6.56) | 27.61 (4.3) |
| Proband-female | 599 | 42.48 (18.6) | 165.01 (6.01) | 25.72 (5.13) | |
| Father | 1,017 | 66.8 (2.05) | 174.07 (6.3) | 28.14 (3.98) | |
| Mother | 1,017 | 65.34 (2.26) | 160.2 (5.82) | 27.92 (4.9) | |
| FHS | Proband-male | 201 | 36.99 (8.63) | 175.43 (6.03) | 27.54 (5.01) |
| Proband-female | 243 | 37.45 (8.65) | 162.14 (5.9) | 25.94 (5.66) | |
| Father | 444 | 38.33* (8.06) | 173.27 (6.55) | 24.07 (4.62) | |
| Mother | 444 | 36.25* (7.67) | 159.42 (5.84) | 23.67 (4.18) |
Characteristics of nine polygenic risk scores (PRS).
| PRS | Selection criteria | Number of SNPs | Correlation with height ( |
| PRS.0 | 697 | 0.34 | |
| PRS.1 | 3,290 | 0.38 | |
| PRS.2 | 30,615 | 0.47 | |
| PRS.3 | 18,432 | 0.45 | |
| PRS.4 | 12,657 | 0.44 | |
| PRS.5 | 9,410 | 0.43 | |
| PRS.6 | 7,349 | 0.42 | |
| PRS.7 | 5,920 | 0.41 | |
| PRS.8 | 4,932 | 0.41 |
The R2 of different models including PRS++.
| Model | In-sample | In-sample | Out-of-sample | |
| Sex + PRS.2 | 0.7034 | 0.7135 (0.09) | 0.7193 | 0.707 |
| Sex + Parental height | 0.7353 | 0.7517 (0.08) | 0.7179 | 0.7093 |
| Sex + PRS.2 + Parental height | 0.7825 | 0.7890 (0.10) | 0.7828 | 0.7678 |
| Sex + PRS.2 + Parental height + Parental PSR.2s | 0.8025 | 0.8150 (0.05) | 0.8113 | 0.7866 |
| Full Model | 0.8317 | 0.7733 (0.07) | 0.8242 | –∗ |
FIGURE 1Measured vs. predicted heights of probands. Based on 300 randomly selected individuals from the United Kingdom Biobank trio data not included in predictor training in the PRS++ model (A), the PRS++ model without parental information (B), FHS using the PRS++ model fitted from United Kingdom Biobank data (C), and FHS using the PRS++ model without parental information fitted from United Kingdom Biobank data (D). The gray lines represent a 95% prediction interval.