Literature DB >> 26380037

High-quality permanent draft genome sequence of Bradyrhizobium sp. Ai1a-2; a microsymbiont of Andira inermis discovered in Costa Rica.

Rui Tian1, Matthew Parker2, Rekha Seshadri3, Tbk Reddy3, Victor Markowitz4, Natalia Ivanova3, Amrita Pati3, Tanja Woyke3, Mohammed Baeshen5, Nabih Baeshen6, Nikos Kyrpides7, Wayne Reeve1.   

Abstract

Bradyrhizobium sp. Ai1a-2 is is an aerobic, motile, Gram-negative, non-spore-forming rod that was isolated from an effective nitrogen fixing root nodule of Andira inermis collected from Tres Piedras in Costa Rica. In this report we describe, for the first time, the genome sequence information and annotation of this legume microsymbiont. The 9,029,266 bp genome has a GC content of 62.56% with 247 contigs arranged into 246 scaffolds. The assembled genome contains 8,482 protein-coding genes and 102 RNA-only encoding genes. This rhizobial genome was sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project proposal.

Entities:  

Keywords:  Alphaproteobacteria; GEBA-RNB; Nitrogen fixation; Root-nodule bacteria; Symbiosis

Year:  2015        PMID: 26380037      PMCID: PMC4571121          DOI: 10.1186/s40793-015-0007-z

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

sp. strain Ai1a.2 is a representative of a distinctive lineage affiliated with the superclade [1]. The superclade is one of the three main branches of the genus, together with the / superclade [2,3], and the group encompassing photosynthetic Aeschynomene symbionts [4]. Members of the lineage represented by strain Ai1a.2 are readily diagnosed because they share a distinctive length variant in helix 9 within the 5′ intervening sequence region of the 23S rRNA gene [5]. Strain Ai1a.2 and its relatives have an insertion of 16 nucleotides in this region in comparison to USDA76, which can be identified by a straightforward PCR assay [6]. In a survey of 420 strains from 25 countries [1], only 2% of the strains had this 23S rRNA length variant. These strains all clustered together into a strongly supported clade based on concatenated data for 23S rRNA and five protein-coding genes [1]. This clade was placed as the most basally diverging lineage within the superclade, and it included strains from three locations: Central America, the Caribbean, and South Africa. Strain Ai1a.2 was sampled in Costa Rica from the tree [6], and highly similar strains are also known to occur as symbionts of the same host legume in Panama [7]. Parker and Rousteau [8] also detected strains from this group in nodule samples from the beach legume in two Caribbean locations (Guadeloupe and Puerto Rico). Two strains from distantly related legume hosts (Leobordea spp.) in South Africa (WSM2632, WSM2783) also belong to this clade [9]. , the host of strain Ai1a.2, is a large tree (up to 35 m height) commonly found in riparian habitats from southern Mexico through northern South America [10]. Andira was traditionally considered to be an early-diverging lineage within the Tribe Dalbergieae [11], but more recent phylogenetic analyses have suggested that it forms a separate lineage with unclear relationship to dalbergioid legumes [12]. Here we provide an analysis of the high-quality permanent draft genome sequence of strain Ai1a.1. The fact that the genome of its close relative WSM2783 has also been sequenced as part of the Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project [13] will enable detailed comparative analysis of this group.

Organism information

Classification and features

sp. Ai1a-2 is a motile, non-sporulating, non-encapsulated, Gram-negative strain in the order of the class . The rod shaped form has dimensions of approximately 0.5 μm in width and 1.5-2.0 μm in length (Figure 1 Left and Center). It is relatively slow growing, forming colonies after 6–7 days when grown on half strength Lupin Agar (½LA) [14], tryptone-yeast extract agar (TY) [15] or a modified yeast-mannitol agar (YMA) [16] at 28°C. Colonies on ½LA are opaque, slightly domed and moderately mucoid with smooth margins (Figure 1 Right).
Figure 1

Images of Bradyrhizobium sp. Ai1a-2 using scanning (Left) and transmission (Center) electron microscopy as well as light microscopy to visualize colony morphology on solid media (Right).

Images of Bradyrhizobium sp. Ai1a-2 using scanning (Left) and transmission (Center) electron microscopy as well as light microscopy to visualize colony morphology on solid media (Right). Figure 2 shows the phylogenetic relationship of sp. Ai1a-2 in a 16S rRNA gene sequence based tree. The 16S rRNA gene sequence of Aia1-2 (using a 1,370 bp intragenic sequence) is identical to that of sp. WSM2783. sp. Ai1a-2 is also closely related to sp. Cp5.3 and sp. Th.b2 with 16S rRNA gene sequence identities of 99.77% and 99.23%, respectively, as determined using NCBI BLAST analysis [17]. The highest identity (99.16%) of the 16S rRNA gene sequence of strain Ai1a-2 to type strain sequences occurs with LMTR 13T and LMTR 21T based on alignment using the EzTaxon-e server [18,19].
Figure 2

Phylogenetic tree showing the relationship of Bradyrhizobium sp. Ai1a-2 (shown in blue print) relative to other type and non-type strains in the Bradyrhizobium genus using a 1,310 bp intragenic sequence of the 16S rRNA gene. Azorhizobium caulinodans ORS 571T sequence was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [37]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [20] have the GOLD ID mentioned after the strain number and are represented in bold, otherwise the NCBI accession number is provided.

Phylogenetic tree showing the relationship of Bradyrhizobium sp. Ai1a-2 (shown in blue print) relative to other type and non-type strains in the Bradyrhizobium genus using a 1,310 bp intragenic sequence of the 16S rRNA gene. Azorhizobium caulinodans ORS 571T sequence was used as an outgroup. All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5.05 [37]. The tree was built using the maximum likelihood method with the General Time Reversible model. Bootstrap analysis with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Strains with a genome sequencing project registered in GOLD [20] have the GOLD ID mentioned after the strain number and are represented in bold, otherwise the NCBI accession number is provided. Minimum Information about the Genome Sequence (MIGS) is provided in Table 1 and Additional file 1: Table S1.
Table 1

Classification and general features of sp. Ai1a-2 in accordance with the MIGS recommendations [38] published by the Genome Standards Consortium [39]

MIGS ID Property Term Evidence code
ClassificationDomain Bacteria TAS [40]
Phylum Proteobacteria TAS [41,42]
Class Alphaproteobacteria TAS [42,43]
Order Rhizobiales TAS [44]
Family Bradyrhizobiaceae TAS [45]
Genus Bradyrhizobium TAS [46]
Species Bradyrhizobium sp.IDA
Gram stainNegativeIDA
Cell shapeRodIDA
MotilityMotileIDA
SporulationNon-sporulatingNAS
Temperature rangeUnknownNAS
Optimum temperature28°CNAS
pH range; OptimumUnknownNAS
Carbon sourceVariedNAS
Energy sourceChemoorganotrophNAS
MIGS-6HabitatSoil, root nodule, hostTAS [6]
MIGS-6.3SalinityNon-halophileNAS
MIGS-22Oxygen requirementAerobicNAS
MIGS-15Biotic relationshipFree living, symbioticTAS [6]
MIGS-14PathogenicityNon-pathogenicNAS
Biosafety level1TAS [47]
IsolationRoot nodule of Andira inermis TAS [6]
MIGS-4Geographic locationTres Piedras, Costa RicaTAS [6]
MIGS-5Sample collectionJuly 14, 2000IDA
MIGS-4.1Latitude9.2835IDA
MIGS-4.2Longitude−83.8533IDA
MIGS-4.3Depth5 cmIDA
MIGS-4.4Altitude50 mIDA

Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [48,49].

Classification and general features of sp. Ai1a-2 in accordance with the MIGS recommendations [38] published by the Genome Standards Consortium [39] Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [48,49].

Symbiotaxonomy

Strain Ai1a.2 was isolated from the tree , Costa Rica [6]. The authentication of the symbiotic ability could not be performed using this host because seeds could not be accessed. The symbiotic capability of strain Ai1a.2 was tested on and this strain was able to nodulate this host. Acetylene reduction assays showed established nodules contained active nitrogenase, indicating an effective symbiosis with this host [6].

Genome sequencing information

Genome project history

This organism was selected for sequencing on the basis of its environmental and agricultural relevance to issues in global carbon cycling, alternative energy production, and biogeochemical importance, and is part of the Genomic Encyclopedia of Bacteria and Archaea, Root Nodulating Bacteria (GEBA-RNB) project at the U.S. Department of Energy, Joint Genome Institute (JGI). The genome project is deposited in the Genomes OnLine Database [20] and a high-quality permanent draft genome sequence in IMG [21]. Sequencing, finishing and annotation were performed by the JGI using state of the art sequencing technology [22]. A summary of the project information is shown in Table 2.
Table 2

Project information

MIGS ID Property Term
MIGS-31Finishing qualityHigh-quality permanent draft
MIGS-28Libraries usedIllumina Standard PE
MIGS-29Sequencing platformsIllumina HiSeq2000
MIGS-31.2Fold coverageIllumina, 119.7x
MIGS-30AssemblersVelvet version 1.1.04; Allpaths-LG version r42328
MIGS-32Gene calling methodProdigal 1.4
Locus TagK288
GenBank IDAUEZ00000000
GenBank release dateJune 12, 2014
GOLD IDGp0009887 [50]
BIOPROJECT195749
MIGS-13Source Material IdentifierAi1a-2
Project relevanceSymbiotic nitrogen fixation, agriculture
Project information

Growth conditions and genomic DNA preparation

sp. Ai1a-2 was cultured to mid logarithmic phase in 60 ml of TY rich media on a gyratory shaker at 28°C [23]. DNA was isolated from the cells using a CTAB (Cetyl trimethyl ammonium bromide) bacterial genomic DNA isolation method [24].

Genome sequencing and assembly

The draft genome of sp. Ai1a–2 was generated at the DOE Joint Genome Institute (JGI) using the Illumina technology [25]. An Illumina standard shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform which generated 21,669,974 reads totaling 3,250.5 Mbp. All general aspects of library construction and sequencing were performed at the JGI and details can be found on the JGI website [26]. All raw Illumina sequence data was passed through DUK, a filtering program developed at JGI, which removes known Illumina sequencing and library preparation artifacts (Mingkun L, Copeland A, Han J, Unpublished). Following steps were then performed for assembly: (1) filtered Illumina reads were assembled using Velvet (version 1.1.04) [27], (2) 1–3 Kbp simulated paired end reads were created from Velvet contigs using wgsim [28], (3) Illumina reads were assembled with simulated read pairs using Allpaths–LG (version r42328) [29]. Parameters for assembly steps were: 1) Velvet (velveth: 63 –shortPaired and velvetg: −very_clean yes –exportFiltered yes –min_contig_lgth 500 –scaffolding no –cov_cutoff 10) 2) wgsim (−e 0 –1 100 –2 100 –r 0 –R 0 –X 0) 3) Allpaths–LG (PrepareAllpathsInputs: PHRED_64 = 1 PLOIDY = 1 FRAG_COVERAGE = 125 JUMP_COVERAGE = 25 LONG_JUMP_COV = 50, RunAllpathsLG: THREADS = 8 RUN = std_shredpairs TARGETS = standard VAPI_WARN_ONLY = True OVERWRITE = True). The final draft assembly contained 247 contigs in 246 scaffolds. The total size of the genome is 9.0 Mbp and the final assembly is based on 1,081.2 Mbp of Illumina data, which provides an average 119.7X coverage of the genome.

Genome annotation

Genes were identified using Prodigal [30], as part of the DOE-JGI genome annotation pipeline [31,32]. The predicted CDSs were translated and used to search the National Center for Biotechnology Information (NCBI) non-redundant database, UniProt, TIGRFam, Pfam, KEGG, COG, and InterPro databases. The tRNAScanSE tool [33] was used to find tRNA genes, whereas ribosomal RNA genes were found by searches against models of the ribosomal RNA genes built from SILVA [34]. Other non–coding RNAs such as the RNA components of the protein secretion complex and the RNase P were identified by searching the genome for the corresponding Rfam profiles using INFERNAL [35]. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes-Expert Review (IMG-ER) system [36] developed by the Joint Genome Institute, Walnut Creek, CA, USA.

Genome properties

The genome is 9,029,266 nucleotides with 62.56% GC content (Table 3) and comprised of 246 scaffolds. From a total of 8,584 genes, 8,482 were protein encoding and 102 RNA only encoding genes. The majority of genes (75.10%) were assigned a putative function whilst the remaining genes were annotated as hypothetical. The distribution of genes into COGs functional categories is presented in Table 4.
Table 3

Genome statistics for sp. Ai1a-2

Attribute Value % of total
Genome size (bp)9,029,266100.00
DNA coding (bp)7,683,92285.10
DNA G + C (bp)5,648,84962.56
DNA scaffolds246100
Total genes8,584100.00
Protein coding genes8,48298.81
RNA genes1021.19
Pseudo genes00.00
Genes in internal clusters8379.75
Genes with function prediction6,44775.10
Genes assigned to COGs5,11159.54
Genes with Pfam domains6,59076.77
Genes with signal peptides8379.75
Genes with transmembrane helices1,91422.30
CRISPR repeats00.00
Table 4

Number of genes associated with the general COG functional categories

Code Value % of total (5,698) COG category
J1853.25Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K4127.23Transcription
L2233.91Replication, recombination and repair
B20.04Chromatin structure and dynamics
D330.58Cell cycle control, cell division, chromosome partitioning
V891.56Defense mechanisms
T2344.11Signal transduction mechanisms
M2774.86Cell wall/membrane/envelope biogenesis
N941.65Cell motility
U1282.25Intracellular trafficking, secretion, and vesicular transport
O1913.35Posttranslational modification, protein turnover, chaperones
C4357.63Energy production and conversion
G3405.97Carbohydrate transport and metabolism
E58710.30Amino acid transport and metabolism
F771.35Nucleotide transport and metabolism
H1983.47Coenzyme transport and metabolism
I3115.46Lipid transport and metabolism
P3646.39Inorganic ion transport and metabolism
Q2564.49Secondary metabolite biosynthesis, transport and catabolism
R69612.21General function prediction only
S5669.93Function unknown
-3,47340.46Not in COGS
Genome statistics for sp. Ai1a-2 Number of genes associated with the general COG functional categories

Conclusions

sp. Ai1a-2 is a member of a widely distributed lineage, isolated from diverse legume hosts in North, Central and South America and South Africa. Little is currently known of the symbiotic associations of its host , apart from the discovery that the Puerto Rican isolate sp. EC3.3 can also establish a symbiosis with this host [8]. The Costa Rican isolate Aia1-2 16S rRNA gene sequence is distinct to that of EC3.3 but identical to the 16S rRNA sequence of South African isolate sp. WSM2783. Phylogentically, Ai1a-2 is closely related to sp. Cp5.3 and sp. Th.b2 from Panama and USA, respectively. The genome of 1a-2 and Ai sp.WSM2783 were sequenced along with 23 other genomes as a part of the GEBA-RNB project. Of these 25 sequenced strains, the spp. Ai1a-2, WSM2783, Cp5.3, Th.b2 and USDA76T are affiliated with the superclade. The Ai1a-2 genome has the 2nd lowest genome size (9 Mbp), gene count (8,584) and signal peptide percentage (9.75%) among these five strains. Comparing the genome attributes of sp. Ai1a-2 along with other sequenced genomes will be important for the understanding of the biogeography of spp. interactions required for the successful establishments of effective symbioses with their diverse hosts.
  31 in total

1.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2005-05       Impact factor: 2.747

2.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

3.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

4.  The spread of Bradyrhizobium lineages across host legume clades: from Abarema to Zygia.

Authors:  Matthew A Parker
Journal:  Microb Ecol       Date:  2014-10-10       Impact factor: 4.552

5.  Constructs for insertional mutagenesis, transcriptional signal localization and gene regulation studies in root nodule and other bacteria.

Authors:  Wayne G Reeve; Ravi P Tiwari; Penelope S Worsley; Michael J Dilworth; Andrew R Glenn; John G Howieson
Journal:  Microbiology       Date:  1999-06       Impact factor: 2.777

6.  RNase III processing of intervening sequences found in helix 9 of 23S rRNA in the alpha subclass of Proteobacteria.

Authors:  E Evguenieva-Hackenberg; G Klug
Journal:  J Bacteriol       Date:  2000-09       Impact factor: 3.490

7.  Nodule morphology, symbiotic specificity and association with unusual rhizobia are distinguishing features of the genus Listia within the Southern African crotalarioid clade Lotononis s.l.

Authors:  Julie K Ardley; Wayne G Reeve; Graham W O'Hara; Ron J Yates; Michael J Dilworth; John G Howieson
Journal:  Ann Bot       Date:  2013-05-26       Impact factor: 4.357

8.  The Genomic Standards Consortium.

Authors:  Dawn Field; Linda Amaral-Zettler; Guy Cochrane; James R Cole; Peter Dawyndt; George M Garrity; Jack Gilbert; Frank Oliver Glöckner; Lynette Hirschman; Ilene Karsch-Mizrachi; Hans-Peter Klenk; Rob Knight; Renzo Kottmann; Nikos Kyrpides; Folker Meyer; Inigo San Gil; Susanna-Assunta Sansone; Lynn M Schriml; Peter Sterk; Tatiana Tatusova; David W Ussery; Owen White; John Wooley
Journal:  PLoS Biol       Date:  2011-06-21       Impact factor: 8.029

9.  A Genomic Encyclopedia of the Root Nodule Bacteria: assessing genetic diversity through a systematic biogeographic survey.

Authors:  Wayne Reeve; Julie Ardley; Rui Tian; Leila Eshragi; Je Won Yoon; Pinyaruk Ngamwisetkun; Rekha Seshadri; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Stand Genomic Sci       Date:  2015-02-09

10.  Improving microbial genome annotations in an integrated database context.

Authors:  I-Min A Chen; Victor M Markowitz; Ken Chu; Iain Anderson; Konstantinos Mavromatis; Nikos C Kyrpides; Natalia N Ivanova
Journal:  PLoS One       Date:  2013-02-12       Impact factor: 3.240

View more
  1 in total

Review 1.  Ancient Evolution and Recent Evolution Converge for the Biodegradation of Cyanuric Acid and Related Triazines.

Authors:  Jennifer L Seffernick; Lawrence P Wackett
Journal:  Appl Environ Microbiol       Date:  2016-01-04       Impact factor: 4.792

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.