| Literature DB >> 26579196 |
Daniel Shriner1, Amy R Bentley1, Ayo P Doumatey1, Guanjie Chen1, Jie Zhou1, Adebowale Adeyemo1, Charles N Rotimi1.
Abstract
We surveyed 26 quantitative traits and disease outcomes to understand the proportion of phenotypic variance explained by local ancestry in admixed African Americans. After inferring local ancestry as the number of African-ancestry chromosomes at hundreds of thousands of genotyped loci across all autosomes, we used a linear mixed effects model to estimate the variance explained by local ancestry in two large independent samples of unrelated African Americans. We found that local ancestry at major and polygenic effect genes can explain up to 20 and 8% of phenotypic variance, respectively. These findings provide evidence that most but not all additive genetic variance is explained by genetic markers undifferentiated by ancestry. These results also inform the proportion of health disparities due to genetic risk factors and the magnitude of error in association studies not controlling for local ancestry.Entities:
Keywords: ancestry; complex traits; health disparities; phenotypic variance explained; random effects
Year: 2015 PMID: 26579196 PMCID: PMC4625172 DOI: 10.3389/fgene.2015.00324
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Manhattan plot from admixture mapping for white blood cell count in ARIC. White blood cell count was regressed on local ancestry, adjusted for age, global ancestry, sex, and center. The red line indicates the genome-wide significance level.
Figure 2Simulation study of bias in the random effects model. Conditional on local ancestry at rs2814778, we simulated a continuous phenotype with a known proportion of phenotypic variance explained by a single causal locus and the remainder of the phenotypic variance being random noise. We randomly generated 100 independent data sets. We then used the fixed effects model (top left), ancestral similarity defined by identity in state (top right), centered and scaled ancestral similarity as defined by GCTA (bottom left), and centered ancestral similarity as defined by GEMMA (bottom right) to estimate the proportion of phenotypic variance explained by local ancestry using similarity estimated genome-wide.
Bias in genome partitioning using GCTA.
| 0.000 | 0.002 | 5.77 × 10−19 | 0.001 | 2.85 × 10−19 | 0.002 | 2.49 × 10−19 |
| 0.005 | 0.001 | 0.716 | 0.000 | 0.938 | 0.002 | 0.953 |
| 0.010 | 0.000 | 0.694 | 0.000 | 0.639 | 0.002 | 0.712 |
| 0.050 | −0.001 | 0.156 | −0.004 | 8.20 × 10−5 | −0.028 | 4.20 × 10−18 |
| 0.100 | −0.013 | 3.07 × 10−12 | −0.025 | 1.20 × 10−17 | −0.073 | 3.96 × 10−18 |
| 0.150 | −0.024 | 3.01 × 10−17 | −0.051 | 3.96 × 10−18 | −0.118 | 3.96 × 10−18 |
| 0.193 | −0.039 | 4.08 × 10−18 | −0.079 | 3.96 × 10−18 | −0.158 | 3.96 × 10−18 |
Bias in genome partitioning using GEMMA.
| 0.000 | 0.002 | 3.24 × 10−19 | 0.001 | 5.19 × 10−19 | 0.002 | 3.89 × 10−20 |
| 0.005 | 0.001 | 0.706 | 0.001 | 0.551 | 0.008 | 2.68 × 10−9 |
| 0.010 | 0.002 | 0.172 | 0.001 | 0.756 | 0.024 | 2.27 × 10−10 |
| 0.050 | 0.001 | 0.985 | 0.002 | 0.391 | 0.028 | 1.05 × 10−7 |
| 0.100 | −0.001 | 0.224 | 0.006 | 7.82 × 10−3 | −0.006 | 6.11 × 10−3 |
| 0.150 | −0.003 | 0.136 | 0.008 | 1.43 × 10−3 | −0.045 | 7.67 × 10−17 |
| 0.193 | 0.001 | 0.493 | 0.011 | 2.92 × 10−4 | −0.071 | 2.11 × 10−17 |
Genome-wide proportion of phenotypic variance explained by local ancestry.
| Height | 0.0306 | 0.0128 | 0.0265 | 0.0119 | 0.0540 | 0.0303 | 0.0532 | 0.0326 |
| Weight | 0.0165 | 0.0113 | 0.0143 | 0.0110 | 0.0157 | 0.0249 | 0.0074 | 0.0238 |
| Body mass index | 0.0283 | 0.0127 | 0.0256 | 0.0123 | 0.0220 | 0.0256 | 0.0148 | 0.0247 |
| Waist circumference | 0.0114 | 0.0106 | 0.0116 | 0.0112 | 0.0370 | 0.0281 | 0.0249 | 0.0262 |
| Hip circumference | 0.0263 | 0.0125 | 0.0221 | 0.0118 | 0.0080 | 0.0231 | 0.0046 | 0.0229 |
| Waist-hip ratio | 0.0037 | 0.0097 | 0.0023 | 0.0103 | 0.0593 | 0.0313 | 0.0672 | 0.0362 |
| Systolic blood pressure | 0.0035 | 0.0089 | 0.0041 | 0.0092 | 0.0038 | 0.0229 | 0.0032 | 0.0234 |
| Diastolic blood pressure | 0.0081 | 0.0098 | 0.0082 | 0.0100 | 0.0000 | 0.0233 | 0.0000 | 0.0191 |
| Hypertension (observed scale) | 0.0203 | 0.0119 | 0.0207 | 0.0126 | 0.0084 | 0.0230 | 0.0127 | 0.0250 |
| Hypertension (liability scale) | 0.0322 | 0.0188 | NA | NA | 0.0131 | 0.0359 | NA | NA |
| Fasting glucose | 0.0283 | 0.0131 | 0.0212 | 0.0117 | 0.0293 | 0.0279 | 0.0250 | 0.0290 |
| Fasting insulin | 0.0049 | 0.0092 | 0.0039 | 0.0087 | 0.0057 | 0.0231 | 0.0096 | 0.0247 |
| Type 2 diabetes (observed scale) | 0.0173 | 0.0115 | 0.0146 | 0.0112 | 0.0324 | 0.0277 | 0.0273 | 0.0280 |
| Type 2 diabetes (liability scale) | 0.0247 | 0.0164 | NA | NA | 0.0809 | 0.0692 | NA | NA |
| Triglycerides | 0.0140 | 0.0112 | 0.0117 | 0.0111 | 0.0000 | 0.0254 | 0.0000 | 0.0411 |
| High density lipoprotein | 0.0292 | 0.0130 | 0.0295 | 0.0133 | 0.0115 | 0.0226 | 0.0171 | 0.0236 |
| Low density lipoprotein | 0.0380 | 0.0149 | 0.0328 | 0.0150 | 0.0073 | 0.0230 | 0.0073 | 0.0233 |
| Total cholesterol | 0.0249 | 0.0128 | 0.0236 | 0.0133 | 0.0347 | 0.0265 | 0.0400 | 0.0286 |
| Sodium | 0.0000 | 0.0097 | 0.0000 | 0.0105 | 0.0000 | 0.0215 | 0.0000 | 0.0219 |
| Potassium | 0.0197 | 0.0120 | 0.0156 | 0.0113 | 0.0000 | 0.0234 | 0.0000 | 0.0246 |
| Calcium | 0.0096 | 0.0098 | 0.0069 | 0.0087 | 0.0000 | 0.0231 | 0.0034 | 0.0253 |
| Uric Acid | 0.0059 | 0.0095 | 0.0056 | 0.0095 | 0.0000 | 0.0211 | 0.0001 | 0.0203 |
| C-reactive protein | 0.0091 | 0.0130 | 0.0096 | 0.0126 | 0.0000 | 0.0318 | 0.0000 | 0.0387 |
| Albumin | 0.0063 | 0.0098 | 0.0062 | 0.0102 | 0.0028 | 0.0223 | 0.0050 | 0.0231 |
| Total protein | 0.0180 | 0.0112 | 0.0154 | 0.0106 | 0.0604 | 0.0367 | 0.0698 | 0.0405 |
| Creatinine | 0.0161 | 0.0105 | 0.0159 | 0.0100 | 0.0000 | 0.0218 | 0.0000 | 0.0224 |
| Estimated glomerular filtration rate | 0.0181 | 0.0109 | 0.0173 | 0.0103 | 0.0012 | 0.0217 | 0.0044 | 0.0229 |
For hypertension and type 2 diabetes, we report phenotypic variance explained on the observed binary scale and on the unobserved liability scale, assuming a prevalence of hypertension of 0.44 (Centers for Disease Control and Prevention, 2014) and a prevalence of type 2 diabetes of 0.187 (Centers for Disease Control and Prevention, 2011).
Genetic distance assuming diploidy and three codominant alleles.
| 0 | 1 | 1 | 2 | 2 | 2 | |
| 1 | 0 | 1 | 1 | 1 | 2 | |
| 1 | 1 | 0 | 2 | 1 | 1 | |
| 2 | 1 | 0 | 1 | 2 | ||
| 1 | 1 | 1 | 0 | 1 | ||
| 2 | 1 | 2 | 1 | 0 |
The Euclidean distance-based model of Smouse and Peakall (1999) is below the diagonal. The Hamming distance-based model of Kosman and Leonard (2005) is above the diagonal. In both models, AB, AC, and BC are assumed to be identical to BA, CA, and CB, respectively.