| Literature DB >> 25780500 |
Simon Kelly1, John Sullivan1, Clive Ronson1, Rui Tian2, Lambert Bräu3, Karen Davenport4, Hajnalka Daligault4, Tracy Erkkila4, Lynne Goodwin4, Wei Gu4, Christine Munk4, Hazuki Teshima4, Yan Xu4, Patrick Chain4, Tanja Woyke5, Konstantinos Liolios5, Amrita Pati5, Konstantinos Mavromatis6, Victor Markowitz6, Natalia Ivanova5, Nikos Kyrpides7, Wayne Reeve2.
Abstract
Mesorhizobium loti strain NZP2037 was isolated in 1961 in Palmerston North, New Zealand from a Lotus divaricatus root nodule. Compared to most other M. loti strains, it has a broad host range and is one of very few M. loti strains able to form effective nodules on the agriculturally important legume Lotus pedunculatus. NZP2037 is an aerobic, Gram negative, non-spore-forming rod. This report reveals that the genome of M. loti strain NZP2037 does not harbor any plasmids and contains a single scaffold of size 7,462,792 bp which encodes 7,318 protein-coding genes and 70 RNA-only encoding genes. This rhizobial genome is one of 100 sequenced as part of the DOE Joint Genome Institute 2010 Genomic Encyclopedia for Bacteria and Archaea-Root Nodule Bacteria (GEBA-RNB) project.Entities:
Keywords: Alphaproteobacteria; Nitrogen fixation; Root-nodule bacteria; Symbiosis
Year: 2014 PMID: 25780500 PMCID: PMC4334872 DOI: 10.1186/1944-3277-9-7
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Images of strain NZP2037 using scanning (left) and transmission (center) electron microscopy and the appearance of colony morphology on ½LA (right).
Classification and general features of strain NZP2037 according to the MIGS recommendations[18,19]
| | Current classification | Domain | TAS [ |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain NZP2037 | TAS [ | ||
| | Gram stain | Negative | IDA |
| | Cell shape | Rod | IDA |
| | Motility | Motile | IDA |
| | Sporulation | Non-sporulating | NAS |
| | Temperature range | Mesophile | NAS |
| | Optimum temperature | 28°C | NAS |
| | Salinity | Unknown | NAS |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| | Carbon source | Various | TAS [ |
| | Energy source | chemoorganotroph | TAS [ |
| MIGS-6 | Habitat | Soil, root nodule, host | TAS [ |
| MIGS-15 | Biotic relationship | Free living, Symbiotic | TAS [ |
| MIGS-14 | Pathogenicity | None | NAS |
| | Biosafety level | 1 | TAS [ |
| | Isolation | Root nodule of | TAS [ |
| MIGS-4 | Geographic location | Adjacent Palmerston North Airport, NZ | TAS [ |
| MIGS-5 | Nodule collection date | 1961 | TAS [ |
| MIGS-4.1 | Latitude | -40.1914 | TAS [ |
| MIGS-4.2 | Longitude | 175.3701 | TAS [ |
| MIGS-4.3 | Depth | 5 cm | IDA |
| MIGS-4.4 | Altitude | 46 meters | IDA |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [28].
Figure 2Phylogenetic tree showing the relationships of NZP2037 with other root nodule bacteria based on aligned sequences of the 16S rRNA gene (1,290 bp internal region). All sites were informative and there were no gap-containing sites. Phylogenetic analyses were performed using MEGA [29], version 5. The tree was built using the Maximum-Likelihood method with the General Time Reversible model [30]. Bootstrap analysis [31] with 500 replicates was performed to assess the support of the clusters. Type strains are indicated with a superscript T. Brackets after the strain name contain a DNA database accession number and/or a GOLD ID (beginning with the prefix G) for a sequencing project registered in GOLD [32]. Published genomes are indicated with an asterisk.
Genome sequencing project information for NZP2037
| MIGS-31 | Finishing quality | High-quality-draft |
| MIGS-28 | Libraries used | Illumina Standard (short PE) and CLIP (long PE) libraries |
| MIGS-29 | Sequencing platforms | Illumina HiSeq2000 technology |
| MIGS-31.2 | Sequencing coverage | Illumina: 509× |
| MIGS-30 | Assemblers | Velvet version 1.1.05; Allpaths-LG version r39750 phrap, version 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| | Genbank accession | AQZP00000000 |
| | Genbank Registration Date | September 16, 2013 |
| | GOLD ID | Gi08826 |
| | NCBI project ID | 81803 |
| | Database: IMG | 2517572076 |
| Project relevance | Symbiotic nitrogen fixation, agriculture |
Genome statistics for NZP2037
| Genome size (bp) | 7,462,792 | 100.00 |
| DNA coding region (bp) | 6,448,323 | 86.41 |
| DNA G + C content (bp) | 4,683,660 | 62.76 |
| Number of scaffolds | 1 | |
| Number of contigs | 5 | |
| Total genes | 7,388 | 100.00 |
| RNA genes | 70 | 0.95 |
| rRNA operons | 1* | |
| Protein-coding genes | 7,318 | 99.05 |
| Genes with function prediction | 5,982 | 80.97 |
| Genes assigned to COGs | 5,882 | 79.62 |
| Genes assigned Pfam domains | 6,121 | 82.85 |
| Genes with signal peptides | 654 | 8.85 |
| Genes coding transmembrane proteins | 1,735 | 23.48 |
*1 copy of 5S, 2 copies of 16S and 1 copy of 23S rRNA genes.
Figure 3Graphical map of the single scaffold of NZP2037. From bottom to the top: Genes on forward strand (color by COG categories as denoted by the IMG platform), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, sRNAs red, other RNAs black), GC content, GC skew.
Number of protein coding genes of NZP2037 associated with the general COG functional categories
| J | 205 | 3.14 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 603 | 9.24 | Transcription |
| L | 242 | 3.71 | Replication, recombination and repair |
| B | 7 | 0.11 | Chromatin structure and dynamics |
| D | 34 | 0.52 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 82 | 1.26 | Defense mechanisms |
| T | 262 | 4.01 | Signal transduction mechanisms |
| M | 320 | 4.90 | Cell wall/membrane biogenesis |
| N | 51 | 0.78 | Cell motility |
| Z | 2 | 0.03 | Cytoskeleton |
| W | 1 | 0.02 | Extracellular structures |
| U | 142 | 2.17 | Intracellular trafficking and secretion |
| O | 197 | 3.02 | Posttranslational modification, protein turnover, chaperones |
| C | 381 | 5.84 | Energy production conversion |
| G | 624 | 9.56 | Carbohydrate transport and metabolism |
| E | 798 | 12.22 | Amino acid transport metabolism |
| F | 95 | 1.46 | Nucleotide transport and metabolism |
| H | 242 | 3.71 | Coenzyme transport and metabolism |
| I | 275 | 4.21 | Lipid transport and metabolism |
| P | 267 | 4.09 | Inorganic ion transport and metabolism |
| Q | 213 | 3.26 | Secondary metabolite biosynthesis, transport and catabolism |
| R | 811 | 12.42 | General function prediction only |
| S | 675 | 10.34 | Function unknown |
| - | 1,506 | 20.38 | Not in COGS |