| Literature DB >> 22144904 |
Maxime Rotival1, Tanja Zeller, Philipp S Wild, Seraya Maouche, Silke Szymczak, Arne Schillert, Raphaele Castagné, Arne Deiseroth, Carole Proust, Jessy Brocheton, Tiphaine Godefroy, Claire Perret, Marine Germain, Medea Eleftheriadis, Christoph R Sinning, Renate B Schnabel, Edith Lubos, Karl J Lackner, Heidi Rossmann, Thomas Münzel, Augusto Rendon, Jeanette Erdmann, Panos Deloukas, Christian Hengstenberg, Patrick Diemert, Gilles Montalescot, Willem H Ouwehand, Nilesh J Samani, Heribert Schunkert, David-Alexandre Tregouet, Andreas Ziegler, Alison H Goodall, François Cambien, Laurence Tiret, Stefan Blankenberg.
Abstract
One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs) have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns-independent component analysis-to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739), previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1) is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178), which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644) was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the mechanisms linking genome-wide association loci to disease.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22144904 PMCID: PMC3228821 DOI: 10.1371/journal.pgen.1002367
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1An example of the ICA method with K = 2 (K, number of independent components).
Data are represented using a heat color map, from dark blue (minimum) to dark red (maximum). ICA splits the gene expression matrix X into a matrix product X = SA, introducing two new components (“signatures”, contained in the columns of S) with minimal statistical dependencies between them. These components may be viewed as reflecting hidden underlying processes influencing gene expressions (P1 and P2). In the example, P1 influences 3 genes and P2 influences 4 genes. Gene G3 is influenced by both processes, which is reflected by the dark red and dark blue colors in the row corresponding to G3 in matrix S. The rows of matrix A represent the levels of the two components in individuals (“patterns”). The same data are shown as continuous profiles below. Individuals have been ordered to show that when levels of pattern 1 increase, levels of pattern 2 decrease, resulting in a negative correlation between the two patterns.
Figure 2Analysis workflow.
The graph shows the workflows used in parallel for expression data and genotype data. MDS: multidimensional scaling; SVD: singular value decomposition; ICA: independent component analysis; GO: gene ontology; MAF: minor allele frequency; HWE: Hardy-Weinberg equilibrium.
Genome-wide association of SNPs with patterns.
| Pattern | Lead SNP associated to the pattern at the locus | Chr | Position (bp) | Genes nearby (genes |
| Variance of the pattern explained by the SNP (R2) | Number of genes within the module | Number of expression traits associated to the SNP within the module |
| SNP-pattern association potentially due to contamination (type of cell incriminated) |
| 33 | rs2300573 | 1 | 166560874 |
| 3.15 E-08 | 0.023 | 176 | 18 | 1.25 E-34 | No |
| 21 | rs13023213 | 2 | 86875454 |
| 7.52 E-08 | 0.022 | 292 | 36 | 3.08 E-59 | Yes (T cells) |
| 12 | rs12485738 | 3 | 56840816 |
| 8.76 E-24 | 0.069 | 379 | 288 | <1.0 E 250 | Yes (platelets) |
| 93 | rs1344142 | 3 | 56832473 |
| 1.48 E-18 | 0.054 | 135 | 61 | 1.05 E-60 | Yes (platelets) |
| 48 | rs13196564 | 6 | 91563760 |
| 5.24 E-08 | 0.022 | 311 | 7 | 5.24 E-10 | Yes (B cells) |
| 66 | rs2842892 | 6 | 132856076 |
| 9.40 E-08 | 0.019 | 137 | 5 | 1.30 E-10 | No |
| 35 | rs12705417 | 7 | 77856777 |
| 1.23 E-08 | 0.024 | 395 | 10 | 6.96 E-16 | Yes (erythrocytes) |
| 7 | rs1058348 | 10 | 11342351 |
| 3.49 E-08 | 0.020 | 189 | 49 | 1.45 E-46 | No |
| 62 | rs653178 | 12 | 110492139 |
| 2.36 E-09 | 0.026 | 62 | 5 | 5.49 E-10 | No |
| 98 | rs11177644 | 12 | 68072015 |
| 1.14 E-92 | 0.248 | 45 | 36 | 1.22 E-86 | No |
| 102 | rs11171739 | 12 | 54756892 |
| 2.89 E-70 | 0.194 | 14 | 7 | 2.45 E-21 | No |
P-values<10−7 were considered for SNP-pattern associations;
P-values<10−5 were considered for associations between the SNP and expression traits within the module;
P-values<1.15×10−9 were considered for the enrichment in expression traits associated to the SNP within the module.
Figure 3Box plots showing the association of rs653178 at locus 12q24 (SH2B3) with CRIP1 expression in GHS and Cardiogenics.
Figure 4Heatmap of absolute Pearson correlation coefficients between expression patterns obtained by ICA and module eigengenes (ME) obtained by WGCNA.
ICA patterns (rows) are ordered by decreasing explained variance.
Figure 5Quantile–quantile plot comparing the associations of 675,350 SNPs with patterns obtained by ICA (64 patterns – red) and module eigengenes (MEs) obtained by WGCNA with default (26 MEs – black ) or tuned parameters (71 MEs – blue).
For each SNP, the best P-value over the 26 or 71 MEs and the 64 ICA patterns, respectively, is shown. A Sidak correction was applied to correct for the number of MEs (patterns, resp.) tested.