| Literature DB >> 26173980 |
Dennis H-J Wong1, Robert G Beiko2.
Abstract
BACKGROUND: Lateral gene transfer (LGT) is an important evolutionary process in microbial evolution. In sewage treatment plants, LGT of antibiotic resistance and xenobiotic degradation-related proteins has been suggested, but the role of LGT outside these processes is unknown. Microbial communities involved in Enhanced Biological Phosphorus Removal (EBPR) have been used to treat wastewater in the last 50 years and may provide insights into adaptation to an engineered environment. We introduce two different types of analysis to identify LGT in EBPR sewage communities, based on identifying assembled sequences with more than one strong taxonomic match, and on unusual phylogenetic patterns. We applied these methods to investigate the role of LGT in six energy-related metabolic pathways.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26173980 PMCID: PMC4502571 DOI: 10.1186/s12864-015-1752-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of enzymes by Enzyme Commission number, and common name in text
| Enzyme Commission Number | Name |
|---|---|
| 1.1.1.1 | alcohol dehydrogenase |
| 1.2.1.12 | glyceraldehyde-3-phosphate dehydrogenase |
| 1.6.5.3 | NADH:ubiquinone reductase |
| 1.9.3.1 | cytochrome-c oxidase |
| 2.3.1.9 | acetyl-CoA C-acetyltransferase |
| 2.7.1.11 | 6-phosphofructokinase |
| 2.7.1.2 | glucokinase |
| 2.7.1.63 | polyphosphate-glucose phosphotransferase |
| 2.7.2.3 | phosphoglycerate kinase |
| 4.2.1.11 | phosphopyruvate hydratase |
| 4.2.1.17 | enoyl-CoA hydratase |
| 5.4.2.1 | phosphoglycerate mutase |
| 6.3.5.4 | asparagine synthase |
Summary of sequences used in analyses from all communities
| USA | AU | DK | |
|---|---|---|---|
| # (%) of contigs retained | 7,610 (47.96 %) | 7,331 (65.52 %) | 18,024 (6.69 %) |
| # (%) of ORFs retained | 22,894 (66.06 %) | 25,003 (81.15 %) | 30,516 (10.14 %) |
| # enzymes in energy pathways (%) of annotated enzymes | 645 (22.13 %) | 714 (22.60 %) | 524 (22.40 %) |
Number of retained contigs, open reading frames from retained contigs, and energy pathway related enzymes (butanoate metabolism, citric acid cycle, glycolysis and gluconeogenesis, nitrogen metabolism, pentose phosphate pathway, and propanoate metabolism) from open reading frames annotated as enzymes. Contigs at least 1000 nucleotides in length were retained
Fig. 1Directed network of gene transfer for Butanoate metabolism and Propanoate metabolism. Directed transfer of enzymes involved in KEGG a) Butanoate metabolism and b) Propanoate metabolism for the Denmark (DK), Australia (AU) and United States (USA) EBPR communities. Taxonomic groups are nodes, and direction of transfer from donor to recipient is indicated by arrows. See Additional file 18 for taxonomic abbreviation guide
Fig. 2Directed network of gene transfer for the Glycolysis/Gluconeogenesis metabolic pathway. KEGG Glycolysis/Gluconeogenesis metabolic pathway and directed LGT for the Denmark (DK), Australia (AU) and United States (USA) EBPR communities. Dashed boxes indicate LGT, with solid symbols indicating LGT predicted within a community, and hollow symbols indicating enzymes not inferred to be present in a community. EC numbers in gray correspond to enzymes not found in any community. See Additional file 18 for taxonomic abbreviation guide
Fig. 3Sample contigs from classification discordance. AU contigs classified as having gammaproteobacterial origin but with an inferred transfer of Enoyl-CoA hydratase (EC 4.2.1.17), an enzyme involved in butanoate metabolism and propanoate metabolism, originating from the Betaproteobacteria or Alphaproteobacteria. Transposases are present on two contigs. Colours represent the taxonomic origin of different genes on each contig according to RITA’s naïve Bayes compositional classifier
Fig. 4Three-way Venn diagram between classification discordance, phylogenetic incongruence and MGE homology filtering for all sequences from all KEGG pathways. Intersections for circles are the number of transferred genes shared between analyses. Remaining genes not in intersections are unique potential LGT events identified by each analysis. All sequences have homologs with known MGEs. Venn diagrams were generated using Venny (http://bioinfogp.cnb.csic.es/tools/venny/)
Length statistics for contigs with putative LGT events
| Community | Average Contig Length | Average Number of Genes |
|---|---|---|
| DK all | 1808.84 | 1.26 |
| DK LGT six pathways | 2885.32 | 1.91 |
| DK LGT transposases & integrases 6 pathways | __ | __ |
| AU all | 2883.52 | 2.70 |
| AU LGT six pathways | 11793.48 | 9.80 |
| AU LGT transposases & integrases 6 pathways | 25755.10 | 23.20 |
| USA all | 2477.62 | 2.37 |
| USA LGT six pathways | 23000.37 | 17.21 |
| USA LGT transposases & integrases 6 pathways | 68048.67 | 53.0 |
Predicted gene counts and length for each contig from each community that are at least 1000 nucleotides in length, that have a detected LGT event, and those that have an annotated transposase and integrase