Literature DB >> 26203327

High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Cupriavidus sp. strain UYPR2.512.

Sofie E De Meyer1, Elena Fabiano2, Rui Tian1, Peter Van Berkum3, Rekha Seshadri4, Tbk Reddy4, Victor Markowitz5, Natalia N Ivanova4, Amrita Pati4, Tanja Woyke4, John Howieson1, Nikos C Kyrpides6, Wayne Reeve1.   

Abstract

Cupriavidus sp. strain UYPR2.512 is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from a root nodule of Parapiptadenia rigida grown in soils from a native forest of Uruguay. Here we describe the features of Cupriavidus sp. strain UYPR2.512, together with sequence and annotation. The 7,858,949 bp high-quality permanent draft genome is arranged in 365 scaffolds of 369 contigs, contains 7,411 protein-coding genes and 76 RNA-only encoding genes, and is part of the GEBA-RNB project proposal.

Entities:  

Keywords:  Betaproteobacteria; GEBA-RNB; Nitrogen fixation; Rhizobia; Root-nodule bacteria

Year:  2015        PMID: 26203327      PMCID: PMC4511410          DOI: 10.1186/1944-3277-10-13

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Legumes establish symbiotic associations with a group of soil bacteria, rhizobia, able to fix atmospheric nitrogen (N2). Rhizobia elicit the formation of a symbiotic organ called a nodule comprising differentiated plant and bacterial cells. Differentiated rhizobia within nodules are termed bacteroids, and acquire the ability to fix nitrogen. Rhizobia are phylogenetically diverse including genera from the (, , , , , , etc.) as well as from the (, ) [1,2]. The biological nitrogen fixation process significantly contributes to the development of sustainable agriculture reducing the use of supplies dependent on fuel and alleviating environmental impacts produced by the addition of chemical fertilizer [3]. Moreover, forestation with leguminous trees associated with rhizobia, “nitrogen-fixing trees”, has been successfully used for recovering degraded soils [4]. Parapiptadenia rigida (Benth.) Brenan, is a “nitrogen-fixing tree” belonging to the Piptadenia group from the Mimosoideae subfamily [5]. It is a multipurpose tree, very appreciated because of its timber and therefore used in high quality furniture and construction. It is also used for gums, tannins and essential oil extraction, has medicinal properties and is included in agroforestry and reforestation programs [4,6,7]. Taulé et al.[8] demonstrated that this species could be nodulated either by Alpha-rhizobia () or by Beta-rhizobia ( and ) with being the preferred natural symbiont of this legume. In the case of Cupriavidus sp. UYPR2.512, this strain was isolated from a nodule of a P. rigida plant grown in soils collected from Mandiyú native forest in Artigas, Uruguay. Isolated bacterial colonies of Cupriavidus sp. UYPR2.512 were able to nodulate and to promote the growth of P. rigida, as well as Mimosa pudica plants [8]. To our knowledge, the only published sequenced genome of a Beta-rhizobia belonging to the genus so far is that of LMG 19424T[9]. Interestingly, the closest relative of Cupriavidus sp. UYPR2.512 is ATCC 43291T[8]. Here, we present the description of the Cupriavidus sp. UYPR2.512 high-quality permanent draft genome sequence and its annotation.

Organism information

Classification and features

Cupriavidus sp. strain UYPR2.512 is a motile, Gram-negative, non-spore-forming rod (Figure 1 Left, Center) in the order of the class . The rod-shaped form varies in size with dimensions of 0.5-0.7 μm in width and 0.9-1.2 μm in length (Figure 1 Left). It is fast growing, forming 0.5-0.8 mm diameter colonies after 24 h when grown on TY [10] at 28°C. Colonies on TY are white-opaque, slightly domed, moderately mucoid with smooth margins (Figure 1 Right).
Figure 1

Images of sp. strain UYPR2.512 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right).

Images of sp. strain UYPR2.512 using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on solid media (Right). Figure 2 shows the phylogenetic relationship of Cupriavidus sp. strain UYPR2.512 in a 16S rRNA gene sequence based tree. This strain is the most similar to ATCC 43291T, DSM 1105T and LMG 19424T based on the 16S rRNA gene alignment with sequence identities of 99.32%, 98.49% and 98.42%, respectively, as determined using the EzTaxon-e server [11]. ATCC 43291T has been isolated from soil and is a non-obligate predator causing lysis of various Gram-positive and Gram-negative bacteria in the soil [12]. LMG 19424T is a plant symbiont and was isolated from root nodules of Mimosa pudica collected from three fields at Ping-Tung Country in the southern part of Taiwan [1]. Minimum Information about the Genome Sequence (MIGS) is provided in Table 1 and Additional file 1: Table S1.
Figure 2

Phylogenetic tree highlighting the position of sp. strain UYPR2.512 (shown in blue print) relative to other type and non-type strains in the genus using a 1,034 bp internal region of the 16S rRNA gene. Several Alpha-rhizobia sequences were used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [13]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [14] are shown in bold and have the GOLD ID mentioned after the strain number, otherwise the NCBI accession number has been provided. Finished genomes are designated with an asterisk.

Table 1

Classification and general features of sp. strain UYPR2.512 in accordance with the MIGS recommendations[15]published by the Genome Standards Consortium[16]

MIGS IDPropertyTermEvidence code
 Current classificationDomain BacteriaTAS [17]
Phylum ProteobacteriaTAS [18,19]
Class BetaproteobacteriaTAS [20]
Order BurkholderialesTAS [21]
Family BurkholderiaceaeTAS [22]
Genus CupriavidusTAS [23]
Species Cupriavidus sp.IDA
 Gram stainNegativeIDA [23]
 Cell shapeRodIDA
 MotilityMotileIDA
 SporulationNon-sporulatingIDA [23]
 Temperature rangemesophileIDA [23]
 Optimum temperature28°CIDA
 pH range; OptimumNot reported 
 Carbon sourceNot reported 
MIGS-6HabitatSoil, root nodule on hostIDA
MIGS-6.3SalinityNot reported 
MIGS-22Oxygen requirementaerobicIDA
MIGS-15Biotic relationshipSymbioticIDA
MIGS-14PathogenicityNon-pathogenicNAS
MIGS-4Geographic locationUruguayIDA
MIGS-5Nodule collection date2006IDA
MIGS-4.1Latitude-30.507IDA
MIGS-4.2Longitude-57.71IDA
MIGS-4.4Altitude58 mIDA

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author. Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author. Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [24].

Phylogenetic tree highlighting the position of sp. strain UYPR2.512 (shown in blue print) relative to other type and non-type strains in the genus using a 1,034 bp internal region of the 16S rRNA gene. Several Alpha-rhizobia sequences were used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [13]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [14] are shown in bold and have the GOLD ID mentioned after the strain number, otherwise the NCBI accession number has been provided. Finished genomes are designated with an asterisk. Classification and general features of sp. strain UYPR2.512 in accordance with the MIGS recommendations[15]published by the Genome Standards Consortium[16] Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author. Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author. Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [24].

Symbiotaxonomy

Cupriavidus sp. strain UYPR2.512 was isolated from Parapiptadenia rigida, a Mimosoideae legume native to Uruguay [8]. This tree is native to South America, including south Brazil, Argentina, Paraguay, and Uruguay, and used by locals for timber and as a source of gums, tannins and essential oils [8]. sp. strain UYPR2.512 is able to renodulate its original host and is highly efficient in fixing nitrogen with this host [8]. A selection of other host plants, including Trifolium repens, Medicago sativa, Peltophorum dubium and Mimosa pudica were tested for their ability to nodulate with UYPR2.512. Of these plants, strain UYPR2.512 was only able to nodulate and fix nitrogen effectively with M. pudica[8].

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and , The Root Nodulating Bacteria chapter (GEBA-RNB) project at the U.S. Department of Energy, Joint Genome Institute [25]. The genome project is deposited in the Genomes OnLine Database [14] and the high-quality permanent draft genome sequence in IMG [26]. Sequencing, finishing and annotation were performed by the JGI using state of the art sequencing technology [27]. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information for sp. strain UYPR2.512

MIGS IDPropertyTerm
MIGS-31Finishing qualityPermanent-draft
MIGS-28Libraries usedIllumina Std PE
MIGS-29Sequencing platformsIllumina HiSeq 2000
MIGS-31.2Fold coverage106.8 X Illumina
MIGS-30AssemblersVelvet 1.1.04, ALLPATHS-LG V.r41043
MIGS-32Gene calling methodsProdigal 1.4
 Locus TagA3A5
 Genbank IDARBE00000000
 Genbank Date of ReleaseSeptember 16, 2013
 GOLD IDGp0009663 [28]
 BIOPROJECTPRJNA165301
MIGS-13Source Material IdentifierUYPR2.512
 Project relevanceSymbiotic N2fixation, agriculture
Genome sequencing project information for sp. strain UYPR2.512

Growth conditions and DNA isolation

Cupriavidus sp. strain UYPR2.512 was grown to mid logarithmic phase in TY rich media [10] on a gyratory shaker at 28°C. DNA was isolated from 60 mL of cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [29].

Genome sequencing and assembly

The draft genome of Cupriavidus sp. UYPR2.512 was generated at the DOE Joint Genome Institute [27]. An Illumina Std shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 29,312,424 reads totaling 4,396.9 Mbp [30]. All general aspects of library construction and sequencing performed at the JGI can be found at the JGI web site [31]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J. unpublished). Artifact filtered sequence data was then screened and trimmed according to the k–mers present in the dataset. High–depth k–mers, presumably derived from MDA amplification bias, cause problems in the assembly, especially if the k–mer depth varies in orders of magnitude for different regions of the genome. Reads with high k–mer coverage (>30x average k–mer depth) were normalized to an average depth of 30x. Reads with an average kmer depth of less than 2x were removed. Following steps were then performed for assembly: (1) normalized Illumina reads were assembled using Velvet version 1.1.04 [32] (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [33] (3) normalized Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r41043)[34]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: -very clean yes –exportFiltered yes –min contig lgth 500 –scaffolding no –cov cutoff 10) 2) wgsim (-e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED 64 = 1 PLOIDY = 1 FRAG COVERAGE = 125 JUMP COVERAGE = 25 LONG JUMP COV = 50, RunAllpathsLG: THREADS = 8 RUN = std_shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 369 contigs in 365 scaffolds. The total size of the genome is 7.9 Mbp and the final assembly is based on 839.6 Mbp of Illumina data, which provides an average of 106.8x coverage.

Genome annotation

Genes were identified using Prodigal [35], as part of the DOE-JGI genome annotation pipeline [36,37] followed by a round of manual curation using GenePRIMP [38] for finished genomes and Draft genomes in fewer than 10 scaffolds. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [39] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [40]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [41]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) system [42] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 7,858,949 nucleotides with 65.25% GC content (Table 3) and comprised of 365 scaffolds and 369 contigs (Figure 3). From a total of 7,487 genes, 7,411 were protein encoding and 76 RNA only encoding genes. The majority of genes (75.64%) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COG functional categories is presented in Table 4.
Table 3

Genome statistics for sp. strain UYPR2.512

AttributeValue% of total
Genome size (bp)7,858,949100
DNA coding (bp)6,709,33285.37
DNA G + C (bp)5,128,15865.25
DNA scaffolds365 
Total genes7,487100
Protein coding genes7,41198.98
RNA genes761.02
Pseudo genes00
Genes in internal clusters4195.6
Genes with function prediction5,66375.64
Genes assigned to COGs4,80764.20
Genes with Pfam domains5,95979.59
Genes with signal peptides6969.30
Genes with transmembrane helices1,54520.64
CRISPR repeats1 
Figure 3

Graphical map of the four largest scaffolds of the genome of sp. strain UYPR2.512. From the bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.

Table 4

Number of protein coding genes of sp. strain UYPR2.512 associated with the general COG functional categories

CodeValue% of total (5,426)COG Category
J1833.37Translation, ribosomal structure and biogenesis
A10.02RNA processing and modification
K5269.69Transcription
L1923.54Replication, recombination and repair
B30.06Chromatin structure and dynamics
D350.65Cell cycle control, Cell division, chromosome partitioning
Y00.00Nuclear structure
V561.03Defense mechanisms
T2103.87Signal transduction mechanisms
M2775.11Cell wall/membrane/envelope biogenesis
N991.82Cell motility
Z00.00Cytoskeleton
W00.00Extracellular structures
U1402.58Intracellular trafficking, secretion, and vesicular transport
O1813.34Posttranslational modification, protein turnover, chaperones
C4348.00Energy production and conversion
G2684.94Carbohydrate transport and metabolism
E4878.98Amino acid transport and metabolism
F891.64Nucleotide transport and metabolism
H1943.58Coenzyme transport and metabolism
I3376.21Lipid transport and metabolism
P2725.01Inorganic ion transport and metabolism
Q2364.35Secondary metabolite biosynthesis, transport and catabolism
R66112.18General function prediction only
S54510.04Function unknown
-268035.80Not in COGS
Genome statistics for sp. strain UYPR2.512 Graphical map of the four largest scaffolds of the genome of sp. strain UYPR2.512. From the bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew. Number of protein coding genes of sp. strain UYPR2.512 associated with the general COG functional categories

Conclusion

Cupriavidus sp. UYPR2.512 belongs to a group of Beta-rhizobia isolated from Parapiptadenia rigida, a native tree from Uruguay belonging to the Mimosoideae legume group [8]. This tree is also native to the south of Brazil, Argentina and Paraguay [8]. Greenhouse experiments from previous studies have shown that Cupriavidus sp. UYPR2.512 is also able to nodulate and fix nitrogen with Mimosa pudica, an invasive species in many regions around the world [8]. Phylogenetic analysis revealed that UYPR2.512 is the most closely related to ATCC 43291T, DSM 1105T and LMG 19424T. In contrast to the other two strains, LMG 19424T is a microsymbiont that is able to nodulate and fix nitrogen in association with Mimosa species [43]. In total five strains (AMP6, LMG 19424T, STM6018, STM6070 and UYPR2.512), which can form a symbiotic association have now been sequenced. A comparison of these strains reveals that UYPR2.512 has the largest genome (7.9 Mbp), with the highest KOG count (1398), the lowest G + C (65.25%) and signal peptide (9.3%) percentages in this group. All of these genomes share the nitrogenase-RXN MetaCyc pathway catalyzed by a multiprotein nitrogenase complex. Out of five strains (AMP6, LMG 19424T, STM6018, STM6070 and UYPR2.512), which contain the N-fixation pathway, only Cupriavidus sp. UYPR2.512 has been shown to nodulate and fix effectively with Parapiptadenia rigida. The genome attributes of Cupriavidus sp. UYPR2.512 will therefore be important for ongoing molecular analysis of the plant microbe interactions required for the establishment of leguminous tree symbioses with this host.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

EF supplied the strain and background information for this project, PVB supplied DNA to JGI, TR performed all imaging, SDM and WR drafted the paper, JH provided financial support and all other authors were involved in sequencing the genome and editing the final manuscript. All authors read and approved the final manuscript.

Additional file 1

Associated MIGS Record. Click here for file
  29 in total

1.  Nodulation of legumes by members of the beta-subclass of Proteobacteria.

Authors:  L Moulin; A Munive; B Dreyfus; C Boivin-Masson
Journal:  Nature       Date:  2001-06-21       Impact factor: 49.962

2.  Legume symbiotic nitrogen fixation by beta-proteobacteria is widespread in nature.

Authors:  Wen-Ming Chen; Lionel Moulin; Cyril Bontemps; Peter Vandamme; Gilles Béna; Catherine Boivin-Masson
Journal:  J Bacteriol       Date:  2003-12       Impact factor: 3.490

3.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

4.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

5.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

6.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

7.  Ralstonia taiwanensis sp. nov., isolated from root nodules of Mimosa species and sputum of a cystic fibrosis patient.

Authors:  W M Chen; S Laevens; T M Lee; T Coenye; P De Vos; M Mergeay; P Vandamme
Journal:  Int J Syst Evol Microbiol       Date:  2001-09       Impact factor: 2.747

8.  Genome sequence of the beta-rhizobium Cupriavidus taiwanensis and comparative genomics of rhizobia.

Authors:  Claire Amadou; Géraldine Pascal; Sophie Mangenot; Michelle Glew; Cyril Bontemps; Delphine Capela; Sébastien Carrère; Stéphane Cruveiller; Carole Dossat; Aurélie Lajus; Marta Marchetti; Véréna Poinsot; Zoé Rouy; Bertrand Servin; Maged Saad; Chantal Schenowitz; Valérie Barbe; Jacques Batut; Claudine Médigue; Catherine Masson-Boivin
Journal:  Genome Res       Date:  2008-05-19       Impact factor: 9.043

9.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more
  2 in total

1.  Novel Cupriavidus Strains Isolated from Root Nodules of Native Uruguayan Mimosa Species.

Authors:  Raúl Platero; Euan K James; Cecilia Rios; Andrés Iriarte; Laura Sandes; María Zabaleta; Federico Battistoni; Elena Fabiano
Journal:  Appl Environ Microbiol       Date:  2016-05-16       Impact factor: 4.792

2.  High-quality permanent draft genome sequence of the Parapiptadenia rigida-nodulating Burkholderia sp. strain UYPR1.413.

Authors:  Sofie E De Meyer; Elena Fabiano; Rui Tian; Peter Van Berkum; Rekha Seshadri; Tbk Reddy; Victor Markowitz; Natalia Ivanova; Amrita Pati; Tanja Woyke; John Howieson; Nikos Kyrpides; Wayne Reeve
Journal:  Stand Genomic Sci       Date:  2015-06-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.