| Literature DB >> 22768365 |
Hans-Peter Klenk, Brittany Held, Susan Lucas, Alla Lapidus, Alex Copeland, Nancy Hammon, Sam Pitluck, Lynne A Goodwin, Cliff Han, Roxanne Tapia, Evelyne-Marie Brambilla, Gabriele Pötter, Miriam Land, Natalia Ivanova, Manfred Rohde, Markus Göker, John C Detter, Nikos C Kyrpides, Tanja Woyke.
Abstract
Saccharomonospora azurea Runmao et al. 1987 is a member of the genus Saccharomonospora, which is in the family Pseudonocardiaceae and thus far poorly characterized genomically. Members of the genus Saccharomonospora are of interest because they originate from diverse habitats, such as leaf litter, manure, compost, the surface of peat, and moist and over-heated grain, and may play a role in the primary degradation of plant material by attacking hemicellulose. Next to S. viridis, S. azurea is only the second member in the genus Saccharomonospora for which a completely sequenced type strain genome will be published. Here we describe the features of this organism, together with the complete genome sequence with project status 'Improved high quality draft', and the annotation. The 4,763,832 bp long chromosome with its 4,472 protein-coding and 58 RNA genes was sequenced as part of the DOE funded Community Sequencing Program (CSP) 2010 at the Joint Genome Institute (JGI).Entities:
Keywords: CSP 2010; Gram-positive; Pseudonocardiaceae; aerobic; chemoheterotrophic; non-motile; soil bacterium; spore-forming; vegetative and aerial mycelia
Year: 2012 PMID: 22768365 PMCID: PMC3387790 DOI: 10.4056/sigs.2635833
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of S. azurea relative to the type strains of the other species within the family Pseudonocardiaceae. The tree was inferred from 1,386 aligned characters [20,21] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [22]. Rooting was done initially using the midpoint method [23] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 550 ML bootstrap replicates [24] (left) and from 1,000 maximum parsimony bootstrap replicates [25] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [26] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks [8,27,28]. Actinopolyspora iraqiensis Ruan et al. 1994 was ignored in the tree. The species was proposed to be a later heterotypic synonym of S. halophila [29], although the name A. iraqiensis would have had priority over S. halophila. This taxonomic problem will soon be resolved with regard to the genomes of A. iraqiensis and S. halophila, which were both part of CSP 312.
Classification and general features of S. azurea AN-128 T according to the MIGS recommendations [30].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Subclass | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type-strain AN-128 | TAS [ | ||
| Gram stain | positive | NAS | |
| Cell shape | variable | NAS | |
| Motility | non-motile | NAS | |
| Sporulation | single spores with smooth surface, mainly on aerial mycelium | TAS [ | |
| Temperature range | mesophile, 24–40°C | TAS [ | |
| Optimum temperature | 28–37°C | TAS [ | |
| Salinity | grows in up to 7% NaCl; 10% is inhibitory | TAS [ | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | mono, di- and trisaccharides | TAS [ | |
| Energy metabolism | chemoheterotrophic | NAS | |
| MIGS-6 | Habitat | soil | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| MIGS-23.1 | Isolation | soil | TAS [ |
| MIGS-4 | Geographic location | Guangyuan City, Sichuan (China) | TAS [ |
| MIGS-5 | Sample collection time | 1986 or before | NAS |
| MIGS-4.1 | Latitude | 32.45 | NAS |
| MIGS-4.2 | Longitude | 105.84 | NAS |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [41]. If the evidence code is IDA, then the property was directly observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of S. azurea AN-128T
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high quality draft |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 1,025.0 × Illumina; 8.6 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3, Velvet version 1.0.13, phrap version SPS - 4.24 |
| MIGS-32 | Gene calling method | Prodigal |
| INSDC ID | AGIU00000000, CM001466 | |
| GenBank Date of Release | March 6, 2012 | |
| GOLD ID | Gi07579 | |
| NCBI project ID | 62037 | |
| Database: IMG | 2508501044 | |
| MIGS-13 | Source material identifier | DSM 44631 |
| Project relevance | Bioenergy and phylogenetic diversity |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 4,763,852 | 100.00% |
| DNA coding region (bp) | 4,287,642 | 90.00% |
| DNA G+C content (bp) | 3,331,901 | 70.08% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 4,530 | 100.00% |
| RNA genes | 58 | 1.28% |
| rRNA operons | 3 | |
| tRNA genes | 47 | 1.04% |
| Protein-coding genes | 4,472 | 98.72% |
| Pseudo genes | 96 | 2.12% |
| Genes with function prediction (proteins) | 3,342 | 73.77% |
| Genes in paralog clusters | 2,354 | 51.96% |
| Genes assigned to COGs | 3,312 | 73.11% |
| Genes assigned Pfam domains | 3,450 | 76.16% |
| Genes with signal peptides | 1,332 | 29.40% |
| Genes with transmembrane helices | 1,070 | 23.62% |
| CRISPR repeats | 0 |
Figure 3Graphical map of the chromosome. From left to the right: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| | | | |
|---|---|---|---|
| J | 171 | 4.6 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 394 | 10.6 | Transcription |
| L | 175 | 4.7 | Replication, recombination and repair |
| B | 2 | 0.1 | Chromatin structure and dynamics |
| D | 35 | 0.9 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 58 | 1.6 | Defense mechanisms |
| T | 190 | 5.1 | Signal transduction mechanisms |
| M | 156 | 4.2 | Cell wall/membrane biogenesis |
| N | 6 | 0.2 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 36 | 1.0 | Intracellular trafficking and secretion, and vesicular transport |
| O | 134 | 3.6 | Posttranslational modification, protein turnover, chaperones |
| C | 245 | 6.6 | Energy production and conversion |
| G | 259 | 7.0 | Carbohydrate transport and metabolism |
| E | 313 | 8.4 | Amino acid transport and metabolism |
| F | 91 | 2.4 | Nucleotide transport and metabolism |
| H | 194 | 5.2 | Coenzyme transport and metabolism |
| I | 179 | 4.8 | Lipid transport and metabolism |
| P | 176 | 4.7 | Inorganic ion transport and metabolism |
| Q | 152 | 4.1 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 478 | 12.8 | General function prediction only |
| S | 282 | 7.6 | Function unknown |
| - | 1,218 | 26.9 | Not in COGs |