| Literature DB >> 23407308 |
Vladimir Sentchilo1, Antonia P Mayer, Lionel Guy, Ryo Miyazaki, Susannah Green Tringe, Kerrie Barry, Stephanie Malfatti, Alexander Goessmann, Marc Robinson-Rechavi, Jan R van der Meer.
Abstract
Plasmids have long been recognized as an important driver of DNA exchange and genetic innovation in prokaryotes. The success of plasmids has been attributed to their independent replication from the host's chromosome and their frequent self-transfer. It is thought that plasmids accumulate, rearrange and distribute nonessential genes, which may provide an advantage for host proliferation under selective conditions. In order to test this hypothesis independently of biases from culture selection, we study the plasmid metagenome from microbial communities in two activated sludge systems, one of which receives mostly household and the other chemical industry wastewater. We find that plasmids from activated sludge microbial communities carry among the largest proportion of unknown gene pools so far detected in metagenomic DNA, confirming their presumed role of DNA innovators. At a system level both plasmid metagenomes were dominated by functions associated with replication and transposition, and contained a wide variety of antibiotic and heavy metal resistances. Plasmid families were very different in the two metagenomes and grouped in deep-branching new families compared with known plasmid replicons. A number of abundant plasmid replicons could be completely assembled directly from the metagenome, providing insight in plasmid composition without culturing bias. Functionally, the two metagenomes strongly differed in several ways, including a greater abundance of genes for carbohydrate metabolism in the industrial and of general defense factors in the household activated sludge plasmid metagenome. This suggests that plasmids not only contribute to the adaptation of single individual prokaryotic species, but of the prokaryotic community as a whole under local selective conditions.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23407308 PMCID: PMC3660673 DOI: 10.1038/ismej.2013.13
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Figure 1Analysis and comparison of the WWTP plasmid metagenomes. (a), Agarose gel separated recovered CCSD pools from WWTP Morges (M) in 2007 and 2009, and WWTP Visp (V) in 2009, compared with regular DNA size standards (L) and the TOL plasmid (L2). (b), Percent protein coding genes with function prediction of total nonredundant contigs plus singletons annotated uniformly using IMG/M (Markowitz ), for a variety of prokaryotic community metagenomes, viral and plasmid metagenomes (indicated with a black arrow). For metagenome accession numbers and for a complete ranking of 291 metagenomes see , Supplementary Table S3. (c), COG category predictions (percentages of total) for protein coding genes uniformly annotated using IMG/M in the Morges 2007 and Visp 2009 plasmid metagenomes compared with collective plasmids in IMG/M and the full WWTP metagenome from Aalborg (DK) (Albertsen ).
Figure 2Community level functional bias of protein coding genes in the communal (Morges 2007) versus industrial WWTP (Visp) plasmid metagenomes. (a), Percent SEED categorizable protein coding genes and pairwise proportional differences calculated using STAMP (Statistical Analysis of Metagenomic Profiles). (b), Functional overrepresentation of SEED subcategories in Morges versus Visp plasmid metagenomes, as percent of total functional assignments. Whiskers denote calculated 95% confidence intervals.
Figure 3Subset of manually curated assembled plasmids and one phage replicon from Morges 2007 using MIRA on contigs covered more than five times. Replicon closures verified through paired-end reads and by PCR. Locations of protein-encoding open reading frames predicted using the GenDB expert annotation system are indicated (Meyer ). Gene names were assigned to open reading frames in case of a functional prediction above 80% identities. Colors depict predicted COG categorization, as indicated. Replicon numbers accessible in GenBank through numbers as specified in Supplementary Table S6.
Representation of different replication initiation protein families in the WWTP CCSD metagenomes.
| PF04796 | RepA_C | pCAR3, pIPO2T (IncP9), pXAC33, pAP12875, pNL1 | 45.4 (769) | 2.0 (205) | |
| PF01051 | Rep_3 | pRC18, pCL300, pSCFS1, R46 (IncN) | 15.7 (266) | 16.2 (1167) | |
| PF10134 | RPA | pSD20, pMTH1, pHLHK8 | Mostly | 7.8 (133) | 69.5 (7145) |
| PF01446 | Rep_1 | pGI1 (RCR group III), pAE78, pLNU5, pLTK2 | Mostly | 6.8 (116) | 6.3 (648) |
| PF08707 | PriCT_2 | pUH24 | Diverse | 6.4 (109) | 2.1 (216) |
| PF07042 | TrfA | pB4, pJP4 (IncP1b), pUO1, R751, pIJB1 | Mostly | 6.4 (109) | 0.8 (80) |
| PF08708 | PriCT_1 | pLIM (ColE2), pRE25, pEI1, pMG1 | 3.7 (62) | 0.9 (97) | |
| PF03090 | Replicase | pNC500, pAP2, pA501, pLO2 | 2.7 (46) | 0.5 (55) | |
| PF09250 | Prim-Pol | pHEN7, pDL10, pCG2 (pBL1 fam) | 2.7 (45) | 0.3 (29) | |
| PF05144 | Phage_CRI | pNAD1, pVT1, pMM1 | Mostly | 0.8 (14) | 0.0 (2) |
| PF02486 | Rep_trans | pBMB2062, pRE25, pSTK1, pMLU1, pC223 | 0.6 (11) | 0.2 (18) | |
| PF05732 | RepL | pSMQ1, pKH12, pSN2 | Mostly | 0.6 (11) | 0.3 (29) |
| PF03428 | RP-C | pHCG3, Ti | Mostly | 0.2 (4) | 0.9 (94) |
| 100 (1695) | 100 (10258) | ||||
Abbreviation: Pfam, protein family.
Non-exhaustive list of example plasmid replicons occurring in the ACLAME database. For a full list, see Supplementary Information-File 1.
Pfam counts (in parentheses) are the estimated number of gene copies from integrated contig coverage data.
Figure 4PhyML (Guindon and Gascuel, 2003) tree based on a PRANK alignment (Loytynoja and Goldman, 2005) of full-length RepA_C family protein sequences from WWTP Morges (names beginning with M) and Visp (names beginning with V), highlighted within gray shaded areas, with their most similar orthologs (GenBank entries with existing plasmid names in black; others in gray). Topology search using best of nearest neighbor interchanges and subtree pruning and regraftings, the LG model of amino acid substitution and discrete gamma model with four categories. Numbers at the nodes indicate the minimum of SH-like and χ2-based branch supports (Anisimova and Gascuel, 2006) (closer to one indicates stronger support). Outgroup is the RepA_C|EIJ25216.1| from Bifidobacterium longum subsp. longum 2-2B. A complete list of all assigned full names is available in Supplementary Table S11.
Figure 5Comparison of proportional abundances of genes for heavy metal (a) and antibiotic resistance (b) in the Morges and Visp CCSD metagenomes. Proportions are calculated as the number of predicted proteins in each category compared with the total number of annotated proteins in the respective metagenome using MG-RAST subsystems classification (Meyer ). The distribution of resistance proteins differs significantly between samples (χ2 test P<2·10−16). Letters indicate strong deficit (c, C), large excess (a, A) or little difference (b, B) compared with the χ2 expectation. The dotted lines indicate the computed average proportion corrected for the sample size.