| Literature DB >> 20041197 |
Gregory Hannum1, Rohith Srivas, Aude Guénolé, Haico van Attikum, Nevan J Krogan, Richard M Karp, Trey Ideker.
Abstract
This work demonstrates how gene association studies can be analyzed to map a global landscape of genetic interactions among protein complexes and pathways. Despite the immense potential of gene association studies, they have been challenging to analyze because most traits are complex, involving the combined effect of mutations at many different genes. Due to lack of statistical power, only the strongest single markers are typically identified. Here, we present an integrative approach that greatly increases power through marker clustering and projection of marker interactions within and across protein complexes. Applied to a recent gene association study in yeast, this approach identifies 2,023 genetic interactions which map to 208 functional interactions among protein complexes. We show that such interactions are analogous to interactions derived through reverse genetic screens and that they provide coverage in areas not yet tested by reverse genetic analysis. This work has the potential to transform gene association studies, by elevating the analysis from the level of individual markers to global maps of genetic interactions. As proof of principle, we use synthetic genetic screens to confirm numerous novel genetic interactions for the INO80 chromatin remodeling complex.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20041197 PMCID: PMC2788232 DOI: 10.1371/journal.pgen.1000782
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Using genome-wide association data to identify natural genetic interactions.
(A) Two interacting interval pairs (green and blue) which represent significantly dense groups of marker-marker interactions are shown. (B) A matrix view of the same genomic regions. The blue and green interval pairs appear as two rectangles. (C) The entire set of marker pairs was bi-clustered to form a set of high-confidence interval pairs (blue rectangles).
Correspondence of interval and marker pairs with complexes and functions.
| Between | Within | ||||||
| Nodes | Edges | Complexes | Terms | Complexes | Terms | ||
|
| |||||||
| Bi-clustering | 1,977 | 2,023 | 208 | 17,714 | 0 | 12 | |
| Raw Marker Pairs | 1,157 | 4,687 | 38 | 3,546 | 0 | 3 | |
|
| |||||||
| Bi-clustering | 1,387 | 964 | 0 | 19 | 0 | 0 | |
| Raw Marker Pairs | 1,141 | 4,687 | 0 | 0 | 0 | 0 | |
|
| 2,117 | 29,275 | 140 | 1,833 | 13 | 33 | |
†: Node definition: For Storey et al. and Full 2D ANOVA, nodes represent genomic intervals. For the synthetic network, nodes represent genes.
‡: All cases report the number of distinct interactions in the network, removing redundancies due to marker pairs that associate with multiple traits (Storey et al., Full 2D ANOVA) or gene pairs scoring positive in multiple data sets (Synthetic Genetic Analysis).
*: These bi-clustered interval pairs were used to define the “Natural Network” explored in this work.
**: We also considered an exhaustive scan of all marker pairs using two-way analysis of variance (ANOVA). The most significant 4,687 marker-marker interactions (Table S7) were taken to match the number of interactions from Storey et al. (Text S1). Both the raw marker-pairs and the bi-clustered interval network identified substantially fewer enrichments than the Storey et al. method.
Figure 2Natural genetic networks elucidate pathway architecture.
(A) A global map of the top 50 complex–complex interactions found using the natural network. Each node represents a protein complex and each interaction represents a significant number of genetic interactions (False Discovery Rate<5%) [49]. We analyzed the set of gene expression traits associated with each complex-complex interaction for functional enrichment using the hypergeometric test. Nodes and edges are colored according to the functional enrichment of gene expression traits underlying the natural interactions (Bonferroni P′<0.05). Node sizes are proportional to the number of proteins in the complex. When available, nodes have been labeled with the common name of the complex. (B,C) Two specific examples of complexes spanned by dense bundles of natural genetic interactions.
Figure 3Comparison of the natural and synthetic networks.
(A) The overlap between the natural network and four previously-published synthetic genetic networks (Tong [4], Pan [3], Collins [2], Wilmes [1]) is shown as a percentage of the synthetic network size. An asterisk indicates significance at P<0.05. (B) A map of the functions and functional relationships supported by either the natural or synthetic networks. Each node represents a broad GO term, with colors (green, orange, blue) indicating terms that contain many within-term interactions (Text S1). Edges show the top 30 between-term interactions for each of the natural and synthetic networks. Two broad GO terms (regulation of nucleotide metabolism and DNA repair) contained many within-term interactions in both the natural and synthetic networks.
Figure 4Guiding synthetic genetic screens using natural genetic networks.
(A) Complex-complex interactions common to both the natural and synthetic networks at a relaxed threshold of P<0.05. Many of these complexes, including INO80 (orange), have more coverage in the natural network (node height) than in the synthetic network (node width). (B) Each point in the scatter plot represents the significance of support for a possible complex-complex interaction with INO80 from the natural (y-axis) versus synthetic (x-axis) networks. Due to low coverage, comparatively few complex pairs have support in the synthetic network. New E-MAP data for INO80 support nine new complex-complex interactions predicted by the natural network (blue arrows). (C) A network of natural genetic interactions for INO80 validated by the new E-MAP. Functional enrichment for traits is shown as in Figure 2. The thickness of each link is proportional to its support in the new genetic interaction screen.