| Literature DB >> 21304737 |
Hui Sun, Alla Lapidus, Matt Nolan, Susan Lucas, Tijana Glavina Del Rio, Hope Tice, Jan-Fang Cheng, Roxane Tapia, Cliff Han, Lynne Goodwin, Sam Pitluck, Ioanna Pagani, Natalia Ivanova, Konstantinos Mavromatis, Natalia Mikhailova, Amrita Pati, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, Yun-Juan Chang, Cynthia D Jeffries, Olivier Duplex Ngatchou Djao, Manfred Rohde, Johannes Sikorski, Markus Göker, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk.
Abstract
Nocardiopsis dassonvillei (Brocq-Rousseau 1904) Meyer 1976 is the type species of the genus Nocardiopsis, which in turn is the type genus of the family Nocardiopsaceae. This species is of interest because of its ecological versatility. Members of N. dassonvillei have been isolated from a large variety of natural habitats such as soil and marine sediments, from different plant and animal materials as well as from human patients. Moreover, representatives of the genus Nocardiopsis participate actively in biopolymer degradation. This is the first complete genome sequence in the family Nocardiopsaceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 6,543,312 bp long genome consist of a 5.77 Mbp chromosome and a 0.78 Mbp plasmid and with its 5,570 protein-coding and 77 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Gram-positive; Nocardiopsaceae; actinomycetoma; aerobic; cholangitis; conjunctivitis; mesophile; non alkaliphilic; pathogen; zig-zag-shaped mycelium
Year: 2010 PMID: 21304737 PMCID: PMC3035300 DOI: 10.4056/sigs.1363462
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of N. dassonvillei strain IMRU 509T relative to the type strains of the other species within the genus and to the type strains of the other genera within the family Nocardiopsaceae. The trees were inferred from 1,442 aligned characters [32,33] of the 16S rRNA gene sequence under the maximum likelihood criterion [34] and rooted in accordance with the current taxonomy [35]. The branches are scaled in terms of the expected number of substitutions per site. Numbers above branches are support values from 750 bootstrap replicates [36] if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [37] are shown in blue, published genomes in bold [38]. Note that the tree is more in accordance with the view of Grund and Kroppenstedt (1990) [39] to treat N. alborubida as a species of its own, rather than with the view of Yassin et al. (1997) [25] and Evtushenko et al. 2000 [10] to regard it as a subspecies of N. dassonvillei based on a 71% DDH value [10].
Classification and general features of N. dassonvillei strain IMRU 509T according to the MIGS recommendations [40]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain IMRU 509 | TAS [ | ||
| Gram stain | positive | TAS [ | |
| Cell shape | zig-zag shaped mycelium | NAS [ | |
| Motility | none | NAS | |
| Sporulation | yes | TAS [ | |
| Temperature range | mesophile, up to 42°C, but not 45°C | TAS [ | |
| Optimum temperature | about 28°C | IDA | |
| Salinity | probably 0% NaCl, no growth at 20% NaCl | NAS | |
| MIGS-22 | Oxygen requirement | aerobic | TAS [ |
| Carbon source | carbohydrates | TAS [ | |
| Energy source | L-arabinose, D-xylose, D-mannose, D-glucose, L-rhamnose, | TAS [ | |
| MIGS-6 | Habitat | soil, mildewed grain, and clinical | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | actinomycetoma, conjunctivitis, cholangitis | TAS [ |
| Biosafety level | 2 | TAS [ | |
| Isolation | not clearly reported | NAS | |
| MIGS-4 | Geographic location | probably Paris, France | NAS |
| MIGS-5 | Sample collection time | not reported | TAS |
| MIGS-4.1 | Latitude | 48.85 | NAS |
| MIGS-4.3 | Depth | not reported | NAS |
| MIGS-4.4 | Altitude | not reported | NAS |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [50]. If the evidence code is IDA, then the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of N. dassonvillei strain IMRU 509T
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished |
| MIGS-28 | Libraries used | Genomic libraries: one Sanger 8 kb pMCL200 library, |
| MIGS-29 | Sequencing platforms | ABI3730, 454 GS FLX Titanium, Illumina GAii |
| MIGS-31.2 | Sequencing coverage | 9.0 × Sanger; 19.8 × pyrosequence; 22.3 × Illumina |
| MIGS-30 | Assemblers | Newbler version 1.1.03.24, phrap |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | CP002040 chromosome | |
| Genbank Date of Release | June 4, 2010 | |
| GOLD ID | Gc01339 | |
| NCBI project ID | 19709 | |
| Database: IMG-GEBA | 646564557 | |
| MIGS-13 | Source material identifier | DSM 43111 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 6,543,312 | 100.00% |
| DNA coding region (bp) | 5,543,886 | 84.73% |
| DNA G+C content (bp) | 4,758,935 | 72.73% |
| Number of replicons | 2 | |
| Extrachromosomal elements | 1 | |
| Total genes | 5,647 | 100.00% |
| RNA genes | 77 | 1.36% |
| rRNA operons | 5 | |
| Protein-coding genes | 5,570 | 98.64% |
| Pseudo genes | 73 | 1.29%2 |
| Genes with function prediction | 3,930 | 69.59% |
| Genes in paralog clusters | 1,055 | 18.68% |
| Genes assigned to COGs | 3,793 | 67.17% |
| Genes assigned Pfam domains | 4,204 | 74.45% |
| Genes with signal peptides | 1,686 | 29.86% |
| Genes with transmembrane helices | 1,337 | 23.68% |
| CRISPR repeats | 8 |
Figure 3aGraphical circular map of the chromosome (not drawn to scale with plasmid). From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Figure 3bGraphical circular map of the plasmid (not drawn to scale with chromosome). From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| Code | value | %age | Description |
|---|---|---|---|
| J | 180 | 4.1 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.0 | RNA processing and modification |
| K | 518 | 11.9 | Transcription |
| L | 173 | 4.0 | Replication, recombination and repair |
| B | 1 | 0.0 | Chromatin structure and dynamics |
| D | 38 | 0.9 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 103 | 2.4 | Defense mechanisms |
| T | 279 | 6.4 | Signal transduction mechanisms |
| M | 190 | 4.4 | Cell wall/membrane/envelope biogenesis |
| N | 3 | 0.1 | Cell motility |
| Z | 2 | 0.1 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 35 | 0.8 | Intracellular trafficking and secretion, and vesicular transport |
| O | 115 | 2.6 | Posttranslational modification, protein turnover, chaperones |
| C | 266 | 6.1 | Energy production and conversion |
| G | 371 | 8.5 | Carbohydrate transport and metabolism |
| E | 354 | 8.1 | Amino acid transport and metabolism |
| F | 102 | 2.3 | Nucleotide transport and metabolism |
| H | 210 | 4.8 | Coenzyme transport and metabolism |
| I | 184 | 4.2 | Lipid transport and metabolism |
| P | 212 | 4.9 | Inorganic ion transport and metabolism |
| Q | 144 | 3.3 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 578 | 13.3 | General function prediction only |
| S | 295 | 6. 8 | Function unknown |
| - | 1,854 | 32.8 | Not in COGs |