| Literature DB >> 34728656 |
María Florencia Perez1, Luis Alberto Saona1, María Eugenia Farías1, Anja Poehlein2, Friedhelm Meinhardt3, Rolf Daniel4, Julián Rafael Dib5,6.
Abstract
Diamante Lake located at 4589 m.a.s.l. in the Andean Puna constitutes an extreme environment. It is exposed to multiple extreme conditions such as an unusually high concentration of arsenic (over 300 mg L-1) and low oxygen pressure. Microorganisms thriving in the lake display specific genotypes that facilitate survival, which include at least a multitude of plasmid-encoded resistance traits. Hence, the genetic information provided by the plasmids essentially contributes to understand adaptation to different stressors. Though plasmids from cultivable organisms have already been analyzed to the sequence level, the impact of the entire plasmid-borne genetic information on such microbial ecosystem is not known. This study aims at assessing the plasmidome from Diamante Lake, which facilitates the identification of potential hosts and prediction of gene functions as well as the ecological impact of mobile genetic elements. The deep-sequencing analysis revealed a large fraction of previously unknown DNA sequences of which the majority encoded putative proteins of unknown function. Remarkably, functions related to the oxidative stress response, DNA repair, as well as arsenic- and antibiotic resistances were annotated. Additionally, all necessary capacities related to plasmid replication, mobilization and maintenance were detected. Sequences characteristic for megaplasmids and other already known plasmid-associated genes were identified as well. The study highlights the potential of the deep-sequencing approach specifically targeting plasmid populations as it allows to evaluate the ecological impact of plasmids from (cultivable and non-cultivable) microorganisms, thereby contributing to the understanding of the distribution of resistance factors within an extremophilic microbial community.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34728656 PMCID: PMC8563766 DOI: 10.1038/s41598-021-00753-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) Panoramic photography of the Diamante Lake, located in the Catamarca province, Argentina. (B-C) Submerged microbialites. (D) Red biofilms attached to gaylussite crystals at the bottom of the submerged microbialite.
Figure 2Predicted functional profile of the Diamante Lake plasmidome. (A) Number of annotated proteins of each of the major SEED subsystems by using a maximum E-value of 10–5. SEED subsystems with less than 50 hits were grouped into “Others”. (B) On top, “DNA Metabolism” subsystem level 2 classification of the SEED database. On the bottom, “DNA Repair” subsystem level 3 classification of the SEED database (E-value ≤ 10–5).
Figure 3Predicted functional profile of the Diamante Lake plasmidome. On top, “Virulence, Disease and Defense” subsystem level 2 classification of the SEED database. In the middle, “Resistance to antibiotics and toxic compounds” subsystem level 3 classification of the SEED database. On the bottom, “Arsenic resistance” subsystem level 4 classification of the SEED database (E-value ≤ 10–5).
Figure 4Predicted functional profiles derived from the Diamante Lake plasmidome (red bars) and the Puquio de Campo Naranja plasmidome (orange bars). Percent SEED categorizable protein-encoding genes and pairwise proportional differences calculated using STAMP. Fisher’s exact test was used and corrected P-values were calculated using Storey’s FDR. Only the statistically significant SEED subsystems are shown (q < 0.05).
Figure 5Predicted functional profiles derived from the Diamante Lake plasmidome (red bars), the Puquio de Campo Naranja plasmidome (orange bars) and the one derived from the wastewater treatment plant plasmidome in Visp, Switzerland (green bars). Percent “Resistance to antibiotics and toxic compounds” categorizable protein coding genes belonging to the “Virulence, Disease and Defense” SEED subsystem, and pairwise proportional differences calculated using STAMP. Fisher’s exact test was used and corrected P-values were calculated using Benjamini–Hochberg’s FDR. Only the statistically significant SEED subsystems are shown (q < 0.05).
Figure 6Predicted functional profiles derived from the plasmidome (red bars) and the metagenome (blue bars) of Diamante Lake red biofilm. Percent SEED categorizable protein-encoding genes and pairwise proportional differences calculated using STAMP. G-test (w/Yates’) was used and corrected P-values were calculated using Benjamini–Hochberg’s FDR. Only the statistically significant SEED subsystems are shown (q < 0.05).
Plasmid replication-related Pfam in the Diamante Lake plasmidome.
| Pfam name | Pfam code | Description | Hits |
|---|---|---|---|
| Rep_1 | PF01446.17 | Replication protein | 45 |
| Rep_2 | PF01719.17 | Plasmid replication protein | 1 |
| Rep_3 | PF01051.21 | Initiator Replication protein | 5 |
| RepL | PF05732.11 | Firmicute plasmid replication protein (RepL) | 85 |
| TrfA | PF07042.11 | TrfA protein | 3 |
| RepA_C | PF04796.12 | Plasmid encoded RepA protein | 2 |
| Rep_trans | PF02486.19 | Replication initiation factor | 0 |
| RHH_1 | PF01402.21 | Ribbon-helix-helix protein, copG family | 344 |
| Rop | PF01815.16 | Rop protein | 0 |
| RP-C | PF03428.13 | Replication protein C N-terminal domain | 32 |
| RPA | PF10134.9 | Replication initiator protein A | 2 |
| RepA_N | PF06970.11 | Replication initiator protein A (RepA) N-terminus | 5 |
| RepC | PF06504.11 | Replication protein C (RepC) | 4 |
| Replicase | PF03090.17 | Replicase family | 5 |
| IncFII_repA | PF02387.15 | IncFII RepA protein family | 0 |
| PriCT_1 | PF08708.11 | Primase C terminal 1 (PriCT-1) | 4 |
| DUF1424 | PF07232.11 | Putative rep protein (DUF1424) | 179 |
| Phage_CRI | PF05144.14 | Phage replication protein CRI | 0 |
| PriCT_2 | PF08707.11 | Primase C terminal 2 (PriCT-2) | 1 |
| Prim-Pol | PF09250.11 | Bifunctional DNA primase/polymerase, N-terminal | 12 |
| SSB | PF00436.25 | Single-strand binding protein family | 0 |
| RepA1_leader | PF08048.12 | Tap RepA1 leader peptide | 0 |
| DUF1738 | PF08401.11 | Domain of unknown function (DUF1738) | 0 |
| UvrD-helicase | PF00580.21 | UvrD/REP helicase N-terminal domain | 12 |
| RepB-RCR_reg | PF10723.9 | Replication regulatory protein RepB | 30 |
| Replic_Relax | PF13814.6 | Replication-relaxation | 73 |
| KorB_C | PF06613.11 | KorB C-terminal beta-barrel domain | 0 |
| KorB | PF08535.10 | KorB domain | 6 |
| Activator-TraM | PF11657.8 | Transcriptional activator TraM | 1 |
| pRN1_helical | PF13010.6 | Primase helical domain | 0 |
| KORA | PF16509.5 | TrfB plasmid transcriptional repressor | 17 |
| Rol_Rep_N | PF18106.1 | Rolling Circle replication initiation protein N-terminal domain | 0 |
Relaxase MOB families in the Diamante Lake plasmidome.
| Relaxase MOB family | Profile HMM | Hits |
|---|---|---|
| MOBB | T4SS_MOBB | 0 |
| MOBC | T4SS_MOBC | 41 |
| MOBF | Profile_MOBF | 0 |
| MOBH | T4SS_MOBH | 0 |
| MOBP | T4SS_MOBP1 | 18 |
| T4SS_MOBP2 | 0 | |
| T4SS_MOBP3 | 0 | |
| MOBQ | T4SS_MOBQ | 0 |
| MOBT | Profile_MOBT | 3 |
| MOBV | T4SS_MOBV | 2 |
| MOBM | Profile_MOBM | 1 |
Plasmid matches of the Diamante Lake plasmidome with entries derived from the NCBI database.
| Plasmid | Host | Size (kb) | Isolation source | Matches | NCBI accession number | References |
|---|---|---|---|---|---|---|
| pNRC100 | 191 | NE* | 6 | NC_001869.1 | Ng et al. (2000)[ | |
| pNRC200 | 365 | 8 | NC_002608.1 | |||
| pNG500 | 132 | Dead Sea | 1 | NC_006393.1 | Baliga et al. (2004)[ | |
| pNG600 | 155 | 7 | NC_006394.1 | |||
| PHS1 | 147 | NE | 3 | NC_010366.1 | Pfeiffer et al. (2008)[ | |
| PHS4 | 41 | 1 | NC_010367.1 | |||
| PHS3 | 284 | 4 | NC_010368.1 | |||
| PHS2 | 195 | 2 | NC_010369.1 | |||
| pHLAC01 | 431 | Deep Lake, Antarctica | 14 | NC_012030.1 | NA** | |
| pHTUR01 | 698 | Saline soil in Turkmenistan | 1 | NC_013744.1 | Saunders et al. (2010)[ | |
| pHV3 | 438 | Bottom sediment of the Dead Sea | 3 | NC_013964.1 | Hartman et al. (2010)[ | |
| pHV4 | 635 | 4 | NC_013966.1 | |||
| pHV1 | 85 | 1 | NC_013968.1 | |||
| p1 | 406 | Salt-fermented seafood from South Korea | 2 | NC_014298.1 | Roh et al. (2010)[ | |
| p2 | 363 | 1 | NC_014299.1 | |||
| pHBOR02 | 339 | Solar salterns of Cabo Rojo, Puerto Rico | 1 | NC_014731.1 | Malfatti et al. (2009)[ | |
| pHBOR01 | 362 | 1 | NC_014735.1 | |||
| pHBOR03 | 210 | 1 | NC_014736.1 | |||
| pHALXA01 | 436 | Saline sediment of Lake Shangmatala, China | 2 | NC_015658.1 | Anderson et al. (2012)[ | |
| pHALXA02 | 181 | 1 | NC_015667.1 | |||
| pHTIA | 330 | Shaban deep-sea hypersaline anoxic lake in the Red Sea | 13 | NC_021913.1 | Werner et al. (2014)[ | |
| pHH126 | 125 | Solar saltern in Alicante, Spain | 1 | NC_023011.1 | Ding et al. (2014)[ | |
| pXH-48 | 131 | Saline sediment of Lake Xilinhot, China | 1 | NZ_CP007057.1 | Castillo et al. (2006)[ | |
| NA | 315 | Fresh water, Ponds | 23 | NZ_CP007061.1 | NA |
*NE not specified, **NA not available.
Figure 7Red biofilm taxonomic analysis from Diamante Lake. Red bars show the relative abundance of each archaeal (A) or bacterial (B) phylum in the plasmidome by MG-RAST analysis using similarity to the RefSeq database (E-value ≤ 10–5). Blue bars show the relative abundance of each archaeal (A) or bacterial (B) phylum by metagenomic DNA analysis using 16S rRNA gene amplicon sequencing. Phyla with abundance less than 1% in both datasets were not included.