| Literature DB >> 32373165 |
Chenglong Yu1,2,3, Guiyan Ni1,4,5, Julius van der Werf5, S Hong Lee1,3.
Abstract
Heterogeneity in the phenotypic mean and variance across populations is often observed for complex traits. One way to understand heterogeneous phenotypes lies in uncovering heterogeneity in genetic effects. Previous studies on genetic heterogeneity across populations were typically based on discrete groups in populations stratified by different countries or cohorts, which ignored the difference of population characteristics for the individuals within each group and resulted in loss of information. Here, we introduce a novel concept of genotype-by-population (G × P) interaction where population is defined by the first and second ancestry principal components (PCs), which are less likely to be confounded with country/cohort-specific factors. We applied a reaction norm model fitting each of 70 complex traits with significant SNP-heritability and the PCs as covariates to examine G × P interactions across diverse populations including white British and other white Europeans from the UK Biobank (N = 22,229). Our results demonstrated a significant population genetic heterogeneity for behavioral traits such as age at first sexual intercourse and academic qualification. Our approach may shed light on the latent genetic architecture of complex traits that underlies the modulation of genetic effects across different populations.Entities:
Keywords: SNP-based heritability; UK Biobank; complex traits; genetic heterogeneity; genotype-phenotype relationship; selection bias
Year: 2020 PMID: 32373165 PMCID: PMC7186421 DOI: 10.3389/fgene.2020.00379
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Two-dimensional scatter plots of PC1 and PC2 with red points representing white British individuals and blue points representing other white ethnic individuals from the UKBB. The white British group named as POP1 is situated within the group of the other white Europeans (see panel A). As shown in panels (B,C), we used a geometric method by which we constructed a rectangle with maximums and minimums of PC1 and PC2 of POP1 as four sides and then group the individuals of the other white Europeans inside this rectangle, named as POP3. The rest of the other white Europeans except POP3 were named as POP2. As POP1 and POP3 are very close in terms of PCs, the individuals in the two data designs POP1 + POP2 and POP2 + POP3 have similar population structures while POP1 + POP3 was a negative control as there was little population difference among this combination (see panel D).
FIGURE 2A flowchart of the G × P analysis on the design of POP1 + POP2.
Genetic variance, interaction variance and their covariance component estimates for six phenotypes across POP1 + POP2 with the covariates PC1 and PC2.
| UKBBdata field | Phenotype | Covariate | |||||
| 78 | Heel bone mineral density | PC1 | 0.3151(0.0459) | 0.0124(0.0110) | 0.0013(0.0120) | 0.6739(0.0456) | 0.1586 |
| (BMD) T-score, automated | PC2 | 0.3187(0.0460) | −0.0008(0.0047) | −0.0037(0.0110) | 0.6838(0.0450) | Excluded | |
| 3144 | Heel Broadband ultrasound | PC1 | 0.2754(0.0454) | 0.0094(0.0110) | 0.0087(0.0120) | 0.7161(0.0454) | 0.0789 |
| attenuation, direct entry | PC2 | 0.2774(0.0454) | 0.0006(0.0048) | −0.0024(0.0111) | 0.7232(0.0450) | 0.8987 | |
| 3147 | Heel quantitative ultrasound | PC1 | 0.3151(0.0459) | 0.0124(0.0110) | 0.0013(0.0120) | 0.6739(0.0456) | 0.1597 |
| index (QUI), direct entry | PC2 | 0.3187(0.0460) | −0.0009(0.0047) | −0.0037(0.0110) | 0.6839(0.0450) | Excluded | |
| 3148 | Heel bone mineral density | PC1 | 0.3070(0.0458) | 0.0107(0.0109) | 0.0046(0.0120) | 0.6836(0.0455) | 0.1315 |
| (BMD) | PC2 | 0.3106(0.0459) | −0.0016(0.0046) | −0.0069(0.0110) | 0.6926(0.0450) | Excluded | |
| 2139 | Age first had sexual | PC1 | 0.1006(0.0266) | 0.0080(0.0078) | 0.0203(0.0087) | 0.8909(0.0290) | 5.16E−05 |
| intercourse | PC2 | 0.1012(0.0266) | 0.0110(0.0057) | −0.0015(0.0087) | 0.8880(0.0286) | 0.0097 | |
| 6138 | Qualifications | PC1 | 0.1194(0.0235) | 0.0706(0.0103) | −0.0791(0.0090) | 0.8124(0.0261) | 9.21E−18 |
| PC2 | 0.1778(0.0214) | 0.0360(0.0059) | 0.0833(0.0081) | 0.7885(0.0233) | 2.22E−24 |
Genetic variance, interaction variance and their covariance component estimates for six phenotypes across POP2 + POP3 with the covariates PC1 and PC2.
| UKBB data field | Phenotype | Covariate | |||||
| 48 | Waist circumference | PC1 | 0.1802(0.0243) | 0.0222(0.0069) | −0.0395(0.0079) | 0.7990(0.0256) | 2.92E−06 |
| PC2 | 0.1789(0.0243) | 0.0076(0.0037) | 0.0300(0.0078) | 0.8147(0.0252) | 0.0004 | ||
| 21002 | Weight | PC1 | 0.2537(0.0252) | 0.0209(0.0069) | −0.0328(0.0081) | 0.7270(0.0257) | 0.0002 |
| PC2 | 0.2529(0.0252) | 0.0077(0.0040) | 0.0219(0.0080) | 0.7408(0.0252) | 0.0252 | ||
| 2443 | Diabetes diagnosed by | PC1 | 0.1688(0.0203) | 0.0259(0.0070) | −0.0015(0.0077) | 0.7901(0.0218) | 6.65E−11 |
| doctor | PC2 | 0.1734(0.0204) | 0.0162(0.0051) | −0.0005(0.0076) | 0.7966(0.0219) | 3.73E−08 | |
| 2139 | Age first had sexual | PC1 | 0.0936(0.0258) | 0.0267(0.0086) | −0.0072(0.0087) | 0.8795(0.0283) | 7.86E−05 |
| intercourse | PC2 | 0.0933(0.0258) | 0.0153(0.0056) | 0.0112(0.0086) | 0.8918(0.0278) | 0.0071 | |
| 6138 | Qualifications | PC1 | 0.0937(0.0264) | 0.0324(0.0094) | 0.0159(0.0091) | 0.8715(0.0287) | 1.06E−15 |
| PC2 | 0.1139(0.0267) | 0.0150(0.0057) | 0.0137(0.0086) | 0.8713(0.0285) | 0.0162 |
Simulation study results for selection bias on the phenotype Y across POP1 + POP2.
| Selection scenarios in POP1 + POP2 | Type I error rate by G × P RNM with PC1 | Type I error rate by bivariate GREML | 100 estimated genetic correlations | |
| Mean | ||||
| 5% | 0% | 0.9722 | 0.0145 | |
| 55% | 2% | 0.9876 | 0.0166 | |
| 1% | 0% | 1.0245 | 0.0160 | |
| 64% | 6% | 0.9882 | 0.0202 | |
Simulation study results for collider bias on two phenotypes Y and Z across POP1 + POP2.
| Selection scenarios with collider bias in POP1 + POP2 | Type I error rate | Estimated genetic correlations of the phenotype Y between POP1 and POP2 | Estimated genetic correlations between Y and Z on selected POP1 + POP2 | ||
| Mean | SE | Mean | SE | ||
| 1% | 1.0141 | 0.0189 | −0.2516 | 0.0032 | |
| 2% | 1.0220 | 0.0165 | −0.2942 | 0.0031 | |
| 2% | 1.0091 | 0.0187 | −0.3415 | 0.0036 | |
Genetic correlation estimates between population groups (POP1, POP2, and POP3) by bivariate GREML for two phenotypes.
| Phenotype | Genetic correlation between POP1 and POP2 | Genetic correlation between POP2 and POP3 | Genetic correlation between POP1 and POP3 | ||||||
| Estimate | Estimate | Estimate | |||||||
| Qualifications | 0.2554 | 0.2223 | 8.09E−04 | 0.4795 | 0.1550 | 7.85E−04 | 0.5676 | 0.2743 | 0.1149 |
| Age first had sexual intercourse | 0.7418 | 0.3984 | 0.5169 | 0.0491 | 0.2284 | 3.14E−05 | 1.2176 | 0.3629 | 0.5488 |