| Literature DB >> 25197433 |
Wayne Reeve1, Rui Tian1, Lambert Bräu2, Lynne Goodwin3, Christine Munk3, Chris Detter3, Roxanne Tapia3, Cliff Han3, Konstantinos Liolios4, Marcel Huntemann4, Amrita Pati4, Tanja Woyke4, Konstantinos Mavrommatis5, Victor Markowitz5, Natalia Ivanova4, Nikos Kyrpides4, Anne Willems6.
Abstract
Ensifer arboris LMG 14919(T) is an aerobic, motile, Gram-negative, non-spore-forming rod that can exist as a soil saprophyte or as a legume microsymbiont of several species of legume trees. LMG 14919(T) was isolated in 1987 from a nodule recovered from the roots of the tree Prosopis chilensis growing in Kosti, Sudan. LMG 14919(T) is highly effective at fixing nitrogen with P. chilensis (Chilean mesquite) and Acacia senegal (gum Arabic tree or gum acacia). LMG 14919(T) does not nodulate the tree Leucena leucocephala, nor the herbaceous species Macroptilium atropurpureum, Trifolium pratense, Medicago sativa, Lotus corniculatus and Galega orientalis. Here we describe the features of E. arboris LMG 14919(T), together with genome sequence information and its annotation. The 6,850,303 bp high-quality-draft genome is arranged into 7 scaffolds of 12 contigs containing 6,461 protein-coding genes and 84 RNA-only encoding genes, and is one of 100 rhizobial genomes sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.Entities:
Keywords: Alphaproteobacteria; nitrogen fixation; rhizobia; root-nodule bacteria
Year: 2013 PMID: 25197433 PMCID: PMC4148966 DOI: 10.4056/sigs.4828625
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of LMG 14919T according to the MIGS recommendations [7]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain LMG 14919T | |||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | Mesophile | NAS | |
| Optimum temperature | 28°C | NAS | |
| Salinity | Non-halophile | NAS | |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| Carbon source | Varied | TAS [ | |
| Energy source | Chemoorganotroph | NAS | |
| MIGS-6 | Habitat | Soil, root nodule, on host | TAS [ |
| MIGS-15 | Biotic relationship | Free living, symbiotic | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogenic | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | Root nodule | TAS [ | |
| MIGS-4 | Geographic location | Kosti, Sudan | TAS [ |
| MIGS-5 | Soil collection date | 1987 | IDA |
| MIGS-4.1 | Longitude | 32.66342 | TAS [ |
| MIGS-4.3 | Depth | Not reported | NAS |
| MIGS-4.4 | Altitude | Not reported | NAS |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [18].
Figure 1Images of LMG 14919T using scanning (Left) and transmission (Center) electron microscopy and the appearance of colony morphology on a solid medium (Right).
Figure 2Phylogenetic tree showing the relationship of LMG 14919T (shown in bold print) to other spp. in the order based on aligned sequences of the 16S rRNA gene (1,290 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA, version 5 [22]. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [23]. Bootstrap analysis [24] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [25]. Published genomes are indicated with an asterisk.
Genome sequencing project information for LMG 14919T.
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high-quality draft |
| MIGS-28 | Libraries used | Illumina Standard (short PE) and Illumina CLIP (long PE) library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Sequencing coverage | Illumina: 448x |
| MIGS-30 | Assemblers | Velvet version 1.1.05; Allpaths-LG version r38445 |
| MIGS-32 | Gene calling methods | Prodigal 1.4, GenePRIMP |
| GenBank | ATYB00000000 | |
| GenBank release date | July 15, 2013 | |
| GOLD ID | Gi08822 | |
| NCBI project ID | 74465 | |
| Database: IMG | 2512047086 | |
| Project relevance | Symbiotic N2 fixation, agriculture |
Genome Statistics for LMG 14919T
| | | |
|---|---|---|
| Genome size (bp) | 6,850,303 | 100.00 |
| DNA coding region (bp) | 5,921,899 | 86.45 |
| DNA G+C content (bp) | 4,248,771 | 62.02 |
| Number of scaffolds | 7 | |
| Number of contigs | 12 | |
| Total gene | 6,545 | 100.00 |
| RNA genes | 84 | 1.28 |
| rRNA operons | 3 | 0.05 |
| Protein-coding genes | 6,461 | 98.72 |
| Genes with function prediction | 5,287 | 80.78 |
| Genes assigned to COGs | 5,233 | 79.95 |
| Genes assigned Pfam domains | 5,438 | 83.09 |
| Genes with signal peptides | 588 | 8.98 |
| Genes with transmembrane helices | 1,456 | 22.25 |
| CRISPR repeats | 0 |
Figure 3Graphical map of the genome of LMG 14919T showing the seven largest scaffolds. From bottom to the top of each scaffold: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.
Number of protein coding genes of LMG 14919T associated with the general COG functional categories.
| | | | |
|---|---|---|---|
| J | 195 | 3.35 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 510 | 8.76 | Transcription |
| L | 212 | 3.64 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 49 | 0.84 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 60 | 1.03 | Defense mechanisms |
| T | 248 | 4.26 | Signal transduction mechanisms |
| M | 274 | 4.71 | Cell wall/membrane biogenesis |
| N | 77 | 1.32 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 122 | 2.10 | Intracellular trafficking and secretion |
| O | 185 | 3.18 | Posttranslational modification, protein turnover, chaperones |
| C | 349 | 6.00 | Energy production conversion |
| G | 598 | 10.27 | Carbohydrate transport and metabolism |
| E | 653 | 11.22 | Amino acid transport metabolism |
| F | 104 | 1.79 | Nucleotide transport and metabolism |
| H | 201 | 3.45 | Coenzyme transport and metabolism |
| I | 205 | 3.52 | Lipid transport and metabolism |
| P | 292 | 5.02 | Inorganic ion transport and metabolism |
| Q | 182 | 3.13 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 721 | 12.39 | General function prediction only |
| S | 582 | 10.00 | Function unknown |
| - | 1,312 | 20.05 | Not in COGS |