| Literature DB >> 21304675 |
Matt Nolan, Johannes Sikorski, Marlen Jando, Susan Lucas, Alla Lapidus, Tijana Glavina Del Rio, Feng Chen, Hope Tice, Sam Pitluck, Jan-Fang Cheng, Olga Chertkov, David Sims, Linda Meincke, Thomas Brettin, Cliff Han, John C Detter, David Bruce, Lynne Goodwin, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Natalia Ivanova, Konstantinos Mavromatis, Natalia Mikhailova, Amy Chen, Krishna Palaniappan, Patrick Chain, Manfred Rohde, Markus Göker, Jim Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk.
Abstract
Streptosporangium roseum Crauch 1955 is the type strain of the species which is the type species of the genus Streptosporangium. The 'pinkish coiled Streptomyces-like organism with a spore case' was isolated from vegetable garden soil in 1955. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first completed genome sequence of a member of the family Streptosporangiaceae, and the second largest microbial genome sequence ever deciphered. The 10,369,518 bp long genome with its 9421 protein-coding and 80 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Gram-positive; S. cloviforme; Sporangia; Streptosporangiaceae; aerobic; non-motile; non-motile spores; vegetative and aerial mycelia
Year: 2010 PMID: 21304675 PMCID: PMC3035251 DOI: 10.4056/sigs.631049
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1aPhylogenetic tree highlighting the position of S. roseum NI 9100T relative to the type strains of the other species within the genus (1a) except for S. claviforme (see text). The tree was inferred from 1,411 and aligned characters [8,9] of the 16S rRNA gene sequence under the maximum likelihood criterion [10] and either rooted with the results of Figure 1b (Figure 1a) or rooted in accordance with the current taxonomy. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [11] are shown in blue, published genomes in bold.
Figure 1bPhylogenetic tree highlighting the position of S. roseum NI 9100T relative to the type strains of the other genera within the family Streptosporanginea . The tree was inferred from 1,369 aligned characters [8,9] of the 16S rRNA gene sequence under the maximum likelihood criterion [10] and either rooted with the results of Figure 1b (Figure 1a) or rooted in accordance with the current taxonomy. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [11] are shown in blue, published genomes in bold.
Classification and general features of S. roseum NI 9100T according to the MIGS recommendations [12]
| | |||
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Subclass | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain NI 9100 | |||
| Gram stain | not tested, probably positive | NAS [ | |
| Cell shape | produces aerial mycelium | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | non-motile spores | TAS [ | |
| Temperature range | mesophile, temperature range not determined, does not grow at 42°C | TAS [ | |
| Optimum temperature | 28°C | TAS [ | |
| Salinity | 2.5% NaCl | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | several (see text), but be aware of contradicting results | TAS [ | |
| Energy source | carbohydrates | TAS [ | |
| MIGS-6 | Habitat | soil | TAS [ |
| MIGS-15 | Biotic relationship | free living | TAS [ |
| MIGS-14 | Pathogenicity | non pathogenic | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | vegetable garden soil | TAS [ | |
| MIGS-4 | Geographic location | most probably Chapel Hill, North Carolina, USA | TAS [ |
| MIGS-5 | Sample collection time | 1955 or before | TAS [ |
| MIGS-4.1 MIGS-4.2 | Latitude, Longitude | 35.913, -79.055 | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [23]. If the evidence code is IDA, then the property was directly observed for a live isolate by one of the authors or an expert or mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of S. roseum NI 9100T
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Two Sanger libraries: 6kb pMCL200 and fosmid pcc1Fos |
| MIGS-29 | Sequencing platforms | ABI3730, 454 GS FLX |
| MIGS-31.2 | Sequencing coverage | 8.45× Sanger; 27.6× Pyrosequence |
| MIGS-30 | Assemblers | Newbler, phrap |
| MIGS-32 | Gene calling method | Prodigal, GenePrimp |
| INSDC ID | CP001814 (genome), CP001815 (plasmid) | |
| Genbank Date of Release | 12/10/2009 | |
| GOLD ID | Gc01156 | |
| NCBI project ID | 21083 | |
| Database: IMG-GEBA | 2501799901 | |
| MIGS-13 | Source material identifier | DSM 43021 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| Genome size (bp) | 10,369,518 | 100.00% |
| DNA coding region (bp) | 9,121,910 | 87.97% |
| DNA G+C content (bp) | 7,348,162 | 70.86% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 1 | |
| Total genes | 9,501 | 100.00% |
| RNA genes | 80 | 0.84% |
| rRNA operons | 6 | |
| Protein-coding genes | 9,421 | 99.16% |
| Pseudo genes | 446 | 4.49% |
| Genes with function prediction | 5,939 | 62.47% |
| Genes in paralog clusters | 2,792 | 29.37% |
| Genes assigned to COGs | 6,224 | 65.47% |
| Genes assigned Pfam domains | 6,596 | 69.38% |
| Genes with signal peptides | 2,248 | 23.65% |
| Genes with transmembrane helices | 2,235 | 23.51% |
| CRISPR repeats | 0 |
Figure 3Graphical circular map of the genome; plasmid not to scale. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | |||
|---|---|---|---|
| J | 226 | 2.4 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 966 | 10.3 | Transcription |
| L | 293 | 3.1 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 38 | 0.4 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 189 | 2.0 | Defense mechanisms |
| T | 511 | 5.4 | Signal transduction mechanisms |
| M | 298 | 3.2 | Cell wall/membrane biogenesis |
| N | 2 | 0.0 | Cell motility |
| Z | 1 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 43 | 0.5 | Intracellular trafficking and secretion |
| O | 167 | 1.8 | Posttranslational modification, protein turnover, chaperones |
| C | 424 | 4.5 | Energy production and conversion |
| G | 639 | 6.8 | Carbohydrate transport and metabolism |
| E | 600 | 6.4 | Amino acid transport and metabolism |
| F | 124 | 1.3 | Nucleotide transport and metabolism |
| H | 254 | 2.7 | Coenzyme transport and metabolism |
| I | 306 | 3.2 | Lipid transport and metabolism |
| P | 320 | 3.4 | Inorganic ion transport and metabolism |
| Q | 315 | 3.3 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 974 | 10.3 | General function prediction only |
| S | 473 | 5.0 | Function unknown |
| - | 3187 | 33.8 | Not in COGs |