| Literature DB >> 21304699 |
Hope Tice, Shanmugam Mayilraj, David Sims, Alla Lapidus, Matt Nolan, Susan Lucas, Tijana Glavina Del Rio, Alex Copeland, Jan-Fang Cheng, Linda Meincke, David Bruce, Lynne Goodwin, Sam Pitluck, Natalia Ivanova, Konstantinos Mavromatis, Galina Ovchinnikova, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, John C Detter, Thomas Brettin, Manfred Rohde, Markus Göker, Jim Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Feng Chen.
Abstract
Nakamurella multipartita (Yoshimi et al. 1996) Tao et al. 2004 is the type species of the monospecific genus Nakamurella in the actinobacterial suborder Frankineae. The nonmotile, coccus-shaped strain was isolated from activated sludge acclimated with sugar-containing synthetic wastewater, and is capable of accumulating large amounts of polysaccharides in its cells. Here we describe the features of the organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of a member of the family Nakamurellaceae. The 6,060,298 bp long single replicon genome with its 5415 protein-coding and 56 RNA genes is part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Frankineae; GEBA; Gram-positive; MK-8 (H4); nonmotile; polysaccharide-accumulating; septa-forming; ‘Microsphaeraceae’
Year: 2010 PMID: 21304699 PMCID: PMC3035273 DOI: 10.4056/sigs.721316
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of N. multipartita Y-104T relative to the other type strains within the Frankineae. The tree was inferred from 1362 aligned characters [8,9] of the 16S rRNA gene sequence under the maximum likelihood criterion [10] and rooted with the type strain of the order. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [11] such as the GEBA organism Geodermatophilus obscurus [12] are shown in blue. Important non-type strains are shown in green [13], and published genomes in bold.
Classification and general features of N. multipartita strain Y-104T according to the MIGS recommendations [14]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Gram stain | positive | TAS [ | |
| Cell shape | coccus | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | non-sporulating | TAS [ | |
| Temperature range | 10-35°C | TAS [ | |
| Optimum temperature | 25°C | TAS [ | |
| Salinity | up to 6g NaCl/L | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic chemoorganotroph | TAS [ |
| Carbon source | sugars, alcohols, glucose, maltose, | TAS [ | |
| Energy source | starch, ethanol, propanol | TAS [ | |
| MIGS-6 | Habitat | activated sludge cultured in fed-batch reactors | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | activated sludge | TAS [ | |
| MIGS-4 | Geographic location | not reported | |
| MIGS-5 | Sample collection time | not reported | |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [18]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors, or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of N. multipartita strain Y-104T
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Two Sanger genomic libraries: 8kb |
| MIGS-29 | Sequencing platforms | ABI3730 |
| MIGS-31.2 | Sequencing coverage | 15.4× Sanger |
| MIGS-30 | Assemblers | Arachne, phrap |
| MIGS-32 | Gene calling method | Prodigal, GenePRIMP |
| INSDC ID | CP001737 | |
| Genbank Date of Release | September 18, 2009 | |
| GOLD ID | Gi02230 | |
| NCBI project ID | 29537 | |
| Database: IMG-GEBA | 2501939634 | |
| MIGS-13 | Source material identifier | DSM 44233 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 6,060,298 | 100.00% |
| DNA coding region (bp) | 5,526,464 | 91.19% |
| DNA G+C content (bp) | 4,297,749 | 70.92% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 5,471 | 100.00% |
| RNA genes | 56 | 1.02% |
| rRNA operons | 2 | |
| Protein-coding genes | 5,415 | 98.98% |
| Pseudo genes | 175 | 3.20% |
| Genes with function prediction | 3,638 | 66.50% |
| Genes in paralog clusters | 3,319 | 60.67% |
| Genes assigned to COGs | 3,673 | 67.14% |
| Genes assigned Pfam domains | 4,054 | 74.10% |
| Genes with signal peptides | 1,713 | 31.31% |
| Genes with transmembrane helices | 1,258 | 22.99% |
| CRISPR repeats | 9 |
Figure 3Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 160 | 3.9 | Translation, ribosomal structure and biogenesis |
| A | 2 | 0.0 | RNA processing and modification |
| K | 400 | 9.7 | Transcription |
| L | 324 | 7.8 | Replication, recombination and repair |
| D | 31 | 0.8 | Cell cycle control, mitosis and meiosis |
| V | 81 | 2.0 | Defense mechanisms |
| T | 238 | 5.8 | Signal transduction mechanisms |
| M | 173 | 4.2 | Cell wall/membrane biogenesis |
| Z | 1 | 0.0 | Cytoskeleton |
| U | 44 | 1.1 | Intracellular trafficking and secretion |
| O | 113 | 2.7 | Posttranslational modification, protein turnover, chaperones |
| C | 308 | 7.5 | Energy production and conversion |
| G | 341 | 8.3 | Carbohydrate transport and metabolism |
| E | 334 | 8.1 | Amino acid transport and metabolism |
| F | 97 | 2.4 | Nucleotide transport and metabolism |
| H | 190 | 4.6 | Coenzyme transport and metabolism |
| I | 160 | 3.9 | Lipid transport and metabolism |
| P | 182 | 4.4 | Inorganic ion transport and metabolism |
| Q | 117 | 2.8 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 506 | 12.2 | General function prediction only |
| S | 330 | 8.0 | Function unknown |
| - | 1773 | 32.4 | Not in COGs |