| Literature DB >> 27128993 |
Ben O Oyserman1, Francisco Moya1, Christopher E Lawson1, Antonio L Garcia1, Mark Vogt1, Mitchell Heffernen1, Daniel R Noguera1, Katherine D McMahon1,2.
Abstract
The evolution of complex traits is hypothesized to occur incrementally. Identifying the transitions that lead to extant complex traits may provide a better understanding of the genetic nature of the observed phenotype. A keystone functional group in wastewater treatment processes are polyphosphate accumulating organisms (PAOs), however the evolution of the PAO phenotype has yet to be explicitly investigated and the specific metabolic traits that discriminate non-PAO from PAO are currently unknown. Here we perform the first comprehensive investigation on the evolution of the PAO phenotype using the model uncultured organism Candidatus Accumulibacter phosphatis (Accumulibacter) through ancestral genome reconstruction, identification of horizontal gene transfer, and a kinetic/stoichiometric characterization of Accumulibacter Clade IIA. The analysis of Accumulibacter's last common ancestor identified 135 laterally derived genes, including genes involved in glycogen, polyhydroxyalkanoate, pyruvate and NADH/NADPH metabolisms, as well as inorganic ion transport and regulatory mechanisms. In contrast, pathways such as the TCA cycle and polyphosphate metabolism displayed minimal horizontal gene transfer. We show that the transition from non-PAO to PAO coincided with horizontal gene transfer within Accumulibacter's core metabolism; likely alleviating key kinetic and stoichiometric bottlenecks, such as anaerobically linking glycogen degradation to polyhydroxyalkanoate synthesis. These results demonstrate the utility of investigating the derived genome of a lineage to identify key transitions leading to an extant complex phenotype.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27128993 PMCID: PMC5148189 DOI: 10.1038/ismej.2016.67
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Figure 1A defining feature of many biological wastewater treatment systems is the recycling of microbial biomass, commonly called activated sludge (AS). Recycling AS provides two features: (1) a mechanism for achieving high densities of microorganisms and (2) a mechanism for the ecological selection of organisms based on their growth characteristics and physiology. A common design is to have an anaerobic basin preceding an aerobic basin. Under these conditions, polyphosphate accumulating organisms (PAO) are selected for, enhancing the phosphorus removal capabilities of the system. This configuration is commonly referred to as Enhanced Biological Phosphorus Removal (EBPR). Anaerobic zone: in the absence of a terminal electron acceptor, volatile fatty acids (VFA) are transported into the cell and stored as polyhydroxyalkanoates (PHA) with a concomitant release of P and degradation of glycogen. Aerobic zone: carbon stored as PHA is used to drive growth, cell division, P-uptake and glycogen synthesis. At the end of the Aerobic zone, the activated sludge is settled in a clarifier and removed from the system to be recycled, further processes or disposed. Figure adapted from McMahon and Read (2013).
(A) The estimated completeness for the 10 Accumulibacter genomes in this study. (B) The expected probability of observing pattern of presence and absence across the 10 Accumulibacter genome set
| Completeness | 0.92 | 0.92 | 0.87 | 0.91 | 0.89 | 0.88 | 1 | 0.85 | 0.89 | 0.88 |
Given the completeness estimates, it is possible to calculate the expected probability of observing pattern of presence and absence across the 10 Accumulibacter genome set. For example, here we present 11 patterns of presence and absences and demonstrate how the probability of each pattern was calculated. The first pattern represents a gene that is present in all genomes. The 10 patterns below represent the possibilities for a single absence. Presence is indicated by a 'P', and absence is indicated by an 'A' or in bold for the calculation. For each pattern, if a gene family was present in a genome, the product of the completeness estimates for those genome was calculated. This was then multiplied by the product of 1 minus the completeness estimate of genomes in which the gene family was absent. The sum of these probabilities within a particular number of genomes may then be calculated. Presence and absence is binomial, therefore, there are 210 (1024) possible patterns.
Figure 2(a) The expected percentage of core gene families identified for each pattern of presence and absence was calculated using the genome completeness estimates. Using these probabilities, a cutoff of seven genomes is expected to identify 99% of all core genes. This cutoff was used in conjunction with ancestral state reconstructions to determine the core genome of the Accumulibacter lineage. Only gene families that were inferred at the LCA of Accumulibacter and all internal nodes (for example, not lost until a terminal node) and were present in seven or more genomes were considered core in this analysis. (b) The observed number of core and derived core gene families using variable cutoffs. Each potential core gene family was sorted based on the number of genomes they were present in and then on the expected frequency of the pattern. Next, the cumulative sum of each additional pattern was calculated as patterns of increasing likelihood were added. The cutoff at seven genes is demarcated with a dotted line.
Figure 3Gene gain (blue triangle), loss (red triangle) and presence (circle) at each node in the Accumulibacter lineage with other Rhodocyclaceae branches collapsed. The gains and losses were inferred using Count implementing Wagner parsimony with a cost of 2 for gain and 1 for loss.
Figure 4Five-way Venn diagram depicting the number of ancestral, derived, flexible and lineage-specific genes within the CAP2UW1 Accumulibacter genome. Although many comparative genomic studies use similar plots, they generally do not highlight the shared derived genome, which we have shown to be important in understanding both the ecology and evolution of the lineage.
Figure 5A simplified biochemical model and the measured kinetic and stoichiometric parameters for phosphorus, magnesium, potassium, acetate and polyhydroxybutyrate (PHB) of Accumulibacter Clade IIA. Calcium and polyhydroxyvalerate (PHV) were measured but showed negligible changes over an anaerobic/aerobic cycle.
Figure 6(a) The contribution of ancestral and derived genes to broad KEGG maps and the COG categories involved in Inorganic ion transport and metabolism. (b) The contribution of ancestral and derived genes to specific KEGG pathways and COG categories involved in specific inorganic ion transporters.
Figure 7An evolutionary model of CAP2UW1 depicting ancestral, laterally derived, flexible and lineage-specific genes. Ac, acetate; AcAc-CoA, acetoacetyl-CoA; Ac-CoA, acyl-CoA; Ac-AMP, acetyl AMP; Ac-P, acetyl-P; ADP-Glu, adenosine 5-diphosphoglucose; CDPD, cytidine diphosphate diacylglycerol; C.I, complex I oxidative phosphorylation; C.II, complex II oxidative phosphorylation; C.III, complex III oxidative phosphorylation; C.IV, complex IV oxidative phosphorylation; E4-P, erythrose 4-phosphate; FNR, NADPH-ferredoxin reductase; Fru-1-6P, fructose 1,6-bisphosphate; Fru-6-P, fructose 6-phosphate; G3P, glyceraldehyde 3-phosphate; Glu, glucose; Glu-1-p, glucose 1-phosphate; Glu-6-P, glucose 6-phosphate; Gly, glycogen; GlyA, glycogen amylose; Glyc-P, glycerone-P; Long Chain FA, long chain fatty acid; PE, phosphatidylethanolamine; PEP, phosphoenolpyruvate; PGP, 1,2-diacyl-sn-glycerol-3p; pntAB, proton-translocating transhydrogenase; PolyP, polyphosphate; PPP, pyrophosphate-energized proton pump; Ptd-L-Ser, phosphatidylserine; Pyr, pyruvate; 1,3-bPG, 1,3-bisphosphoglyceric acid; Ri15P2, ribulose 1,5P2; Ri5-P, ribose 5-phosphate; Ru5P, ribulose 5-phosphate; S7-P, sedoheptulose-7-phosphate; SBP, sedoheptulose 1,7-bisphosphate; X5P, xylulose 5-phosphate; 3HB-CoA, (R)-3-hydroxy-butanoyl-CoA; 2-PG, 2-phosphoglycerate; 3-PG, 3-phosphoglyceric acid.