| Literature DB >> 24586303 |
Marc Harper1, Luisa Gronenberg2, James Liao3, Christopher Lee4.
Abstract
Discovering all the genetic causes of a phenotype is an important goal in functional genomics. We combine an experimental design for detecting independent genetic causes of a phenotype with a high-throughput sequencing analysis that maximizes sensitivity for comprehensively identifying them. Testing this approach on a set of 24 mutant strains generated for a metabolic phenotype with many known genetic causes, we show that this pathway-based phenotype sequencing analysis greatly improves sensitivity of detection compared with previous methods, and reveals a wide range of pathways that can cause this phenotype. We demonstrate our approach on a metabolic re-engineering phenotype, the PEP/OAA metabolic node in E. coli, which is crucial to a substantial number of metabolic pathways and under renewed interest for biofuel research. Out of 2157 mutations in these strains, pathway-phenoseq discriminated just five gene groups (12 genes) as statistically significant causes of the phenotype. Experimentally, these five gene groups, and the next two high-scoring pathway-phenoseq groups, either have a clear connection to the PEP metabolite level or offer an alternative path of producing oxaloacetate (OAA), and thus clearly explain the phenotype. These high-scoring gene groups also show strong evidence of positive selection pressure, compared with strictly neutral selection in the rest of the genome.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24586303 PMCID: PMC3935835 DOI: 10.1371/journal.pone.0088072
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 11000 Simulations of the Wright-Fisher Selective Dynamics [58] of a Randomly Mutagenized Population.
A. (Top) a simulation of 26 strains of various fitnesses that grow exponentially from a founder population of individual mutants to a carrying capacity under Wright-Fisher selection dynamics. The results of a single simulation show that one mutant dominates the population after a small number of generations. Note diversity is lost due not only to selection, but also genetic drift. B. (Middle) As reproduction and selection proceeds, the mean number of distinct strains decreases very quickly. On average half of the strains are lost after just 6–7 generations. C. (Bottom) Similarly, the mean Shannon entropy [59] of the population distribution also decreases quickly. This differs from (B) in that the population proportions are also taken into account.
Figure 2Schematic of metabolic pathways affected by Ppc knockout (dotted line).
The strain requires additional oxaloacetate to grow. Growth is achieved through direct synthesis of oxaloacetate by alternative pathways such as the glyoxylate shunt or pck, or through an increase of PEP levels, which drives flux through these pathways. The top seven mutated pathways identified by pathway phenotype sequencing are shown in red. It has been shown that Ppc knockouts cause increased flux through the glyoxylate shunt [60], consistent with our observed mutations in AceK and IclR. Mutations in PtsI have previously been observed in response to a growth-based selection for increased succinate production, in a scenario where Pck overexpression was also observed [27]. Similarly, deletion of ptsH, which also deactivates the PTS system and increases the intracellular PEP pool, has also been shown to increase succinate yields [48].
Top 10 gene groups ranked by pathway-phenoseq p-value (Bonferroni corrected for 536 tests).
| Group | Genes | p-value (phenoseq) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Top 20 hits ranked by Bonferroni corrected gene-phenoseq p-value computed on non-synonymous SNPs.
| Gene | p-value |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Positive Selection evidence for Top 10 gene groups.
| Pathway | cumulative p-value | excluding |
|
| 0.0037 | N/A |
|
| 0.0044 | 0.28 |
|
| 0.0027 | 0.28 |
|
| 0.0027 | 0.29 |
|
| 0.0020 | 0.19 |
|
| 0.00043 | 0.056 |
|
| 0.000068 | 0.011 |
|
| 0.000043 | 0.0063 |
|
| 0.000043 | 0.0063 |
|
| 0.000034 | 0.0051 |
Top 10 gene groups ranked by hypergeometric p-value (Bonferroni corrected for 28 tests).
| Group | Genes | Genes in top 20 | p-value (hypergeometric) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Estimated Causal Mutations in the Top 10 gene groups.
| Group | Synonymous Mutations | Non-synonymous Mutations | Causal Mutations |
|
| 5 | 34 | 24 |
|
| 6 | 18 | 6 |
|
| 3 | 11 | 5 |
|
| 3 | 10 | 4 |
|
| 0 | 7 | 7 |
|
| 1 | 11 | 9 |
|
| 3 | 12 | 6 |