| Literature DB >> 21304647 |
Alex Copeland, Alla Lapidus, Tijana Glavina Del Rio, Matt Nolan, Susan Lucas, Feng Chen, Hope Tice, Jan-Fang Cheng, David Bruce, Lynne Goodwin, Sam Pitluck, Natalia Mikhailova, Amrita Pati, Natalia Ivanova, Konstantinos Mavromatis, Amy Chen, Krishna Palaniappan, Patrick Chain, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Olga Chertkov, Thomas Brettin, John C Detter, Cliff Han, Zahid Ali, Brian J Tindall, Markus Göker, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk.
Abstract
Catenulispora acidiphila Busti et al. 2006 is the type species of the genus Catenulispora, and is of interest because of the rather isolated phylogenetic location it occupies within the scarcely explored suborder Catenulisporineae of the order Actinomycetales. C. acidiphilia is known for its acidophilic, aerobic lifestyle, but can also grow scantly under anaerobic conditions. Under regular conditions, C. acidiphilia grows in long filaments of relatively short aerial hyphae with marked septation. It is a free living, non motile, Gram-positive bacterium isolated from a forest soil sample taken from a wooded area in Gerenzano, Italy. Here we describe the features of this organism, together with the complete genome sequence and annotation. This is the first complete genome sequence of the actinobacterial family Catenulisporaceae, and the 10,467,782 bp long single replicon genome with its 9056 protein-coding and 69 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: Catenulisporineae; acidophilic; aerobic; filamentous actinomycete; free-living; non-pathogenic; vegetative and aerial mycelia
Year: 2009 PMID: 21304647 PMCID: PMC3035231 DOI: 10.4056/sigs.17259
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree of C. acidiphila ID 139908T and all type strains of the genus Catenulispora, inferred from 1,421 aligned characters [3,4] of the 16S rRNA sequence under the maximum likelihood criterion [5]. The tree was rooted with the type strains of the genera within the Streptomycetaceae (Streptomycineae, Actinomycetales). Also included are the type strains from the sister family of Catenulisporaceae, Actinospicaceae. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 1,000 bootstrap replicates if larger than 60%. Strains with a genome sequencing project registered in GOLD [6] are printed in blue; published genomes in bold.
Classification and general features of C. acidiphila ID 139908T according to the MIGS recommendations [9]
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Suborder | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain ID 139908 | TAS [ | ||
| Gram stain | positive | TAS [ | |
| Cell shape | non-fragmentary vegetative mycelium | TAS [ | |
| Motility | nonmotile | TAS [ | |
| Sporulation | produces arthrospores when induced | TAS [ | |
| Temperature range | mesophilic, 11-37°C | TAS [ | |
| Optimum temperature | 22-28°C | TAS [ | |
| Salinity | 3% NaCl | TAS [ | |
| MIGS-22 | Oxygen requirement | essentially aerobic; capable of reduced and non-pigmented growth under microaerophilic and anaerobic conditions | TAS [ |
| Carbon source | glucose, arabinose, xylose, mannitol, fructose, glycerol | TAS [ | |
| Energy source | starch | NAS | |
| MIGS-6 | Habitat | soil | TAS [ |
| MIGS-15 | Biotic relationship | free living | NAS |
| MIGS-14 | Pathogenicity | none | NAS |
| Biosafety level | 1 | TAS [ | |
| Isolation | forest soil from wooden area | TAS [ | |
| MIGS-4 | Geographic location | Gerenzano, Italy | TAS [ |
| MIGS-5 | Sample collection time | before 2006 | TAS [ |
| MIGS-4.1 MIGS-4.2 | Latitude, Longitude | 45.640, 9.002 | NAS |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [14]. If the evidence code is IDA, then the property was observed for a living isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of C. acidiphila strain ID 139908T (Manfred Rohde, Helmholtz Centre for Infection Research Braunschweig)
Genome sequencing project information
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Two Sanger libraries - 8 kb pMCL200 and fosmid pcc1Fos |
| MIGS-29 | Sequencing platforms | ABI3730 |
| MIGS-31.2 | Sequencing coverage | 10× Sanger |
| MIGS-30 | Assemblers | Phred/Phrap/Consed |
| MIGS-32 | Gene calling method | Prodigal, GenePrimp |
| INSDC / Genbank ID | CP001700 | |
| Genbank Date of Release | August 26, 2009 | |
| GOLD ID | Gc01085 | |
| NCBI project ID | 21085 | |
| Database: IMG-GEBA | 2501533203 | |
| MIGS-13 | Source material identifier | DSM 44928 |
| Project relevance | Tree of Life, GEBA |
Figure 3Graphical circular map of the genome. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Genome Statistics
| Value | % of Total | |
|---|---|---|
| Genome size (bp) | 10,467,782 | 100.00% |
| DNA Coding region (bp) | 9,386,056 | 89.67% |
| DNA G+C content (bp) | 7,303,066 | 69.77% |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 9122 | 100.00% |
| RNA genes | 66 | 0.76% |
| rRNA operons | 3 | |
| Protein-coding genes | 9056 | 99.28% |
| Pseudo genes | 142 | 1.56% |
| Genes with function prediction | 6226 | 68.25% |
| Genes in paralog clusters | 2379 | 26.08% |
| Genes assigned to COGs | 5805 | 63.64% |
| Genes assigned Pfam domains | 6202 | 67.99% |
| Genes with signal peptides | 2279 | 24.98% |
| Genes with transmembrane helices | 2231 | 24.46% |
| CRISPR repeats | 4 |
Number of genes associated with the general COG functional categories
| J | 182 | 2.0 | Translation, ribosomal structure and biogenesis |
| A | 2 | 0.0 | RNA processing and modification |
| K | 607 | 6.7 | Transcription |
| L | 173 | 1.9 | Replication, recombination and repair |
| B | 2 | 0.0 | Chromatin structure and dynamics |
| D | 34 | 0.4 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 96 | 1.1 | Defense mechanisms |
| T | 389 | 4.3 | Signal transduction mechanisms |
| M | 210 | 2.3 | Cell wall/membrane biogenesis |
| N | 45 | 0.5 | Cell motility |
| Z | 1 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 46 | 0.5 | Intracellular trafficking and secretion |
| O | 149 | 1.6 | Posttranslational modification, protein turnover, chaperones |
| C | 306 | 3.4 | Energy production and conversion |
| G | 441 | 4.9 | Carbohydrate transport and metabolism |
| E | 425 | 4.7 | Amino acid transport and metabolism |
| F | 108 | 1.2 | Nucleotide transport and metabolism |
| H | 223 | 2.5 | Coenzyme transport and metabolism |
| I | 226 | 2.5 | Lipid transport and metabolism |
| P | 241 | 2.7 | Inorganic ion transport and metabolism |
| Q | 265 | 2.9 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 670 | 7.4 | General function prediction only |
| S | 328 | 3.6 | Function unknown |
| - | 3251 | 35.9 | Not in COGs |