| Literature DB >> 32143559 |
Agnieszka Klonowska1, Lionel Moulin1, Julie Kaye Ardley2, Florence Braun3, Margaret Mary Gollagher4, Jaco Daniel Zandberg2, Dora Vasileva Marinova4, Marcel Huntemann5, T B K Reddy5, Neha Jacob Varghese5, Tanja Woyke5, Natalia Ivanova5, Rekha Seshadri5, Nikos Kyrpides5, Wayne Gerald Reeve6.
Abstract
BACKGROUND: Cupriavidus strain STM 6070 was isolated from nickel-rich soil collected near Koniambo massif, New Caledonia, using the invasive legume trap host Mimosa pudica. STM 6070 is a heavy metal-tolerant strain that is highly effective at fixing nitrogen with M. pudica. Here we have provided an updated taxonomy for STM 6070 and described salient features of the annotated genome, focusing on heavy metal resistance (HMR) loci and heavy metal efflux (HME) systems.Entities:
Keywords: Cupriavidus; HGT; Heavy metal efflux; Heavy metal resistance; Mimosa; Nickel tolerance; Rhizobia; Rhizobial biogeography
Year: 2020 PMID: 32143559 PMCID: PMC7060636 DOI: 10.1186/s12864-020-6623-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Genome Statistics for Cupriavidus strain STM 6070
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 6,771,773 | 100.00 |
| DNA coding region (bp) | 5,928,188 | 87.54 |
| DNA G + C content (bp) | 4,551,463 | 67.21 |
| Number of scaffolds | 107 | |
| Total gene | 6182 | 100.00 |
| RNA genes | 64 | 1.04 |
| rRNA operons* | 1 | 0.02 |
| Protein-coding genes | 6118 | 98.96 |
| Genes with function prediction | 5050 | 81.69 |
| Genes assigned to COGs | 4500 | 72.79 |
| Genes assigned Pfam domains | 5305 | 85.81 |
| Genes with signal peptides | 677 | 10.95 |
| Genes with transmembrane helices | 1402 | 22.68 |
| CRISPR repeats | 1 |
*1 copy of 16S rRNA and 4 copies of 5S rRNA
Fig. 1Genome alignments using progressive Mauve software [44]. a: scaffolds of the draft genomes of Cupriavidus neocaledonicus STM 6070 (STM 6070) and C. taiwanensis STM 6018 aligned to the replicons of the finished genome of Cupriavidus taiwanensis LMG 19424T (LMG 19424). b: scaffolds of the draft genomes of Cupriavidus sp. strains AMP6 and UYPR2.512 aligned to the replicons of the finished genome of Cupriavidus taiwanensis LMG 19424T (LMG 19424). The blocks in the alignment represent the common local colinear blocks (LCBs) among the compared genomes, and homologous blocks in each genome are shown as identical coloured regions. The vertical red lines represent replicon boundaries for LMG 19424T, whereas they represent contig boundaries for the draft genomes. The shaded red region represents a putative genomic rearrangement between CHR2 and CHR1. Circles with numbers represent the location of heavy metal resistance regions identified in this paper found in LMG 19424T (white circles containing letters) and in STM 6070 (yellow circles containing letters). See Fig. 3 for the heavy metal resistance regions. Dashed arrows show the location of the LMG 19424T heavy metal resistance regions in STM 6070
Fig. 3Cupriavidus neocaledonicus STM 6070 HMR gene clusters containing annotated putative genes encoding proteins involved in heavy metal efflux (HME). A to L: HMR loci (see also Table S6). Colour coding: light blue, HME-RND systems composed of canonical CBA genes [45]; dark blue, czcD encoding a CDF type protein; turquoise, nre genes; dark and light grey, putative corresponding regulatory genes; green, cop genes; purple, chr genes; red, ars genes; yellow, P-ATPase encoding genes; white, genes encoding putative proteins of unknown function; black, transposases; cep: conserved exported protein; ep, exported protein; hk, histidine kinase. Truncated genes are identified with a delta (Δ) symbol. Thick lines identify genes encoding the transmembrane proteins. Gene coordinates for STM 6070 (CT6070v1_XXXXXX-XX) correspond to the annotation in the MaGe Microscope platform (https://www.genoscope.cns.fr/agc/microscope/mage/viewer.php) (see Table S6 for the corresponding IMG locus tags)
Fig. 2Gene content analysis of the STM 6070 genome. a: Venn diagram of gene number counts of symbiotic Cupriavidus strains; b: functional COG categories of STM 6070 specific genes (107 assigned genes out of 483). STM 6070, Cupriavidus neocaledonicus STM 6070; STM 6018, C. taiwanensis STM 6018; LMG 19424T, C. taiwanensis LMG 19424T; AMP6, Cupriavidus sp. AMP6; UYPR2.512, Cupriavidus sp. UYPR2.512. Numbers under the strain names describe the total number of genes for each corresponding genome. The analysis was performed using the “Gene Phyloprofile” tool in the Microscope MaGe platform [48], https://www.genoscope.cns.fr/agc/microscope/mage). The orthologous counterparts in the genomes were detected by applying a minimum of 30% for protein sequences identity over a minimum of 80% of the protein length (> 30% protein MinLrap 0.8)
HME determinants in STM 6070 genome and their comparison with those detected in other Cupriavidus species
| Locus | Proposed gene annotation | IMG Locus tag | HM transporter | Putative substrate | Orthologous genes | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| LMG 19424 | STM 6018 | UYPR2.512 | AMP6 | CH34 | N-1 | H16 | |||||
| A | 00876–00879 | HME3b | monovalent cations | + | + | + | + | + | + | + | |
| B | 00906–00910 | HME5 | nd | + | + | + | + | – | + | + | |
| 00912–00911 | CHR | CrO42− | + | + | + | – | + | – | – | ||
| C | 00978 (zntA) | P-ATPase | Zn2+ | + | + | + | + | + | + | + | |
| D | 00999 | P-ATPase | Ag | – | – | – | + | – | – | + | |
| 01005 | P-ATPase | Cu2+ | – | – | + | + | + | – | + | ||
| E | 02911–02906 | – | nd | + | + | + | + | + | + | + | |
| F | 03877–03882 | HME1 | Co2+, Zn2+, Cd2+ | + | + | + | + | + | + | + | |
| G | 04395–04397 | HME3a | divalent cations | + | + | + | + | + | + | + | |
| H | 04504 | P-ATPase | Cu2+ | + | + | + | + | + | + | + | |
| I | 04975–04973 | HME1 | nd | + | + | + | + | + | + | + | |
| 04978–04977 | MFS | Ni2+ | – | – | + | + | + | – | – | ||
| 04985–04980 | CHR | CrO42− | + | + | + | + | + | + | + | ||
| 04993–04989 | HME3a | nd | + | + | + | + | + | + | + | ||
| J | 05405 | P-ATPase | Cu2+ | + | + | + | + | + | + | + | |
| 05406–05409 | HME4 | Cu2+, Ag | + | + | + | + | + | + | + | ||
| K | 05655–05651 | CDF + HME1 | Co2+, Zn2+, Cd2+ | + | + | + | + | + | + | + | |
| 05663–05656 | ARC3 | nd | + | + | – | + | + | + | + | ||
| +MFS | + | + | – | + | + | + | + | ||||
| L | 05853–05856 | ARC3 | nd | + | + | – | + | + | + | + | |
| Not in HMR clusters | 04892 | CDF + HME1 | nd | + | + | – | + | + | + | + | |
| 03199 | CDF + HME1 | nd | + | + | – | + | + | + | + | ||
| 06164 | CDF + HME1 | nd | + | + | – | + | + | + | + | ||
Footnote: STM 6070, Cupriavidus sp. STM 6070; LMG 19424T, C. taiwanensis LMG 19424T; STM 6018, C. taiwanensis STM 6018; UYPR2.512, Cupriavidus sp. UYPR2.512; AMP6, Cupriavidus sp. AMP6; CH34T, C. metallidurans CH34T; N-1, C. necator N-1 T
a, HM loci names from Fig. 2 and Table S6
b, IMG Locus tag (JGI);
c, classification of the annotated HM proteins from Table S5
Fig. 4Phylogenetic tree of the RND transmembrane proteins (component A of the CBA transport system) from Cupriavidus neocaledonicus STM 6070 (in bold) and from sequenced genomes of reference Cupriavidus (C.), Escherichia (E.), Legionella (L.), Microbacterium (M.), Pseudomonas (P.), Ralstonia (R.) and Xanthomonas (X.) and other named reference strains. The HME class of the protein is designated according to the current classification scheme [45, 59, 60]. HME1 to 5 represent five classes of HME systems, HAE represents here an RND protein group involved in in export of hydrophobic and amphiphilic compounds. The evolutionary history was inferred by the Neighbor-Joining method with a bootstrap consensus tree inferred from 500 replicates. The evolutionary distances were computed using the Poisson correction method and are presented as the number of amino acid substitutions per site. The rate variation among sites was modelled with a gamma distribution (shape parameter = 1). Evolutionary analyses were conducted in MEGA6. STM 6070 HME locus tags are displayed in Fig. 2 and Table S6. GenBank accession numbers or locus tags are given in brackets. For C. metallidurans CH34T only the gene numbers of the annotated genome (NC_007973) are given