| Literature DB >> 22532805 |
Barbara E Stranger1, Stephen B Montgomery, Antigone S Dimas, Leopold Parts, Oliver Stegle, Catherine E Ingle, Magda Sekowska, George Davey Smith, David Evans, Maria Gutierrez-Arcelus, Alkes Price, Towfique Raj, James Nisbett, Alexandra C Nica, Claude Beazley, Richard Durbin, Panos Deloukas, Emmanouil T Dermitzakis.
Abstract
The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.Entities:
Mesh:
Year: 2012 PMID: 22532805 PMCID: PMC3330104 DOI: 10.1371/journal.pgen.1002639
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Cis- associations detected with Spearman Rank Correlation analysis of normalized and PCA-corrected expression data.
| permutation threshold | ||||
| 0.01 | 0.001 | |||
| significant genes | FDR | significant genes | FDR | |
|
| 657 | 0.28 | 313 | 0.06 |
|
| 774 | 0.24 | 378 | 0.05 |
|
| 698 | 0.26 | 300 | 0.06 |
|
| 795 | 0.23 | 386 | 0.05 |
|
| 773 | 0.24 | 311 | 0.06 |
|
| 472 | 0.39 | 165 | 0.11 |
|
| 947 | 0.19 | 411 | 0.04 |
|
| 799 | 0.23 | 328 | 0.06 |
|
| 3130 | 1132 | ||
|
| 1074 | 547 | ||
|
| 63 | 28 | ||
Cis- associations detected with Spearman Rank Correlation analysis of “REDUCED” data.
| permutation threshold | ||||
| 0.01 | 0.001 | |||
| significant genes | FDR | significant genes | FDR | |
|
| 1966 | 0.09 | 1253 | 0.015 |
|
| 1950 | 0.09 | 1218 | 0.015 |
|
| 1984 | 0.09 | 1219 | 0.015 |
|
| 2131 | 0.09 | 1327 | 0.014 |
|
| 1794 | 0.10 | 962 | 0.019 |
|
| 1131 | 0.16 | 528 | 0.035 |
|
| 2562 | 0.07 | 1528 | 0.012 |
|
| 2415 | 0.08 | 1439 | 0.013 |
|
| 5691 | 3231 | ||
|
| 3240 | 2023 | ||
|
| 331 | 179 | ||
Figure 1Spearman's rho for each significant SNP-probe cis- association shared by at least two populations.
Shown are plots of rho for significant associations (permutation threshold 0.01) for each pairwise combination of populations. Within a panel, dots shown in upper left and lower right quadrants indicate significant SNP-probe associations where the allelic direction of the association is in opposite directions in the two populations being compared.
Figure 2Expression level fold-change for significant SNP-probe cis- associations shared by pairs of populations.
Shown are plots of the absolute value of expression level fold-change between median expression levels of homozygote classes for significant associations (permutation threshold 0.01) for each pairwise combination of populations. Within a panel, deviating from the 1 to 1 line (lower left to upper right) indicates differences in expression level fold-change (effect size) on log2 scale in the two populations being compared.
Figure 3Distribution of cis- associations in each population relative to the transcription start site (TSS).
−log10 of the p-value is plotted against distance measured in base pairs from the associated SNP to the TSS. Each dot represents the most significant SNP for a significant gene (permutation threshold 0.01) in a population. Each panel represents a different population.
Figure 4Distribution of cis- associations relative to the transcription start site (TSS) and in relation to population sharing.
−log10 of the p-value is plotted against distance measured in base pairs from the associated SNP to the TSS. Each dot represents the most significant SNP for a significant gene (permutation threshold 0.01) in a population. Panels separate associations that were significant in one population, two populations, etc. All populations are lumped together.