| Literature DB >> 25197488 |
Roshan Padmanabhan1, Grégory Dubourg1, Jean-Christophe Lagier1, Carine Couderc1, Caroline Michelle1, Didier Raoult2, Pierre-Edouard Fournier1.
Abstract
Corynebacterium ihumii strain GD7(T) sp. nov. is proposed as the type strain of a new species, which belongs to the family Corynebacteriaceae of the class Actinobacteria. This strain was isolated from the fecal flora of a 62 year-old male patient, as a part of the culturomics study. Corynebacterium ihumii is a Gram positive, facultativly anaerobic, nonsporulating bacillus. Here, we describe the features of this organism, together with the high quality draft genome sequence, annotation and the comparison with other member of the genus Corynebacteria. C. ihumii genome is 2,232,265 bp long (one chromosome but no plasmid) containing 2,125 protein-coding and 53 RNA genes, including 4 rRNA genes. The whole-genome shotgun sequence of Corynebacterium ihumii strain GD7(T) sp. nov has been deposited in EMBL under accession number GCA_000403725.Entities:
Keywords: Corynebacterium ihumii; culturomics; genome; taxono-genomics
Year: 2014 PMID: 25197488 PMCID: PMC4149009 DOI: 10.4056/sigs.5149006
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strain GD7T according to the MIGS recommendations [35]
| | | ||
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| Type strain GD7 | IDA | ||
| Gram stain | positive | IDA | |
| Cell shape | rod | IDA | |
| Motility | non motile | IDA | |
| Sporulation | non endospore forming | IDA | |
| Temperature range | mesophilic | IDA | |
| Optimum temperature | 37°C | IDA | |
| MIGS-6.3 | Salinity | unknown | IDA |
| MIGS-22 | Oxygen requirement | facultative anaerobic | IDA |
| Carbon source | unknown | NAS | |
| Energy source | unknown | NAS | |
| MIGS-6 | Habitat | human gut | IDA |
| MIGS-15 | Biotic relationship | free living | IDA |
| MIGS-14 | Pathogenicity | unknown | IDA |
| MIGS-4 | Geographic location | France | IDA |
| MIGS-5 | Sample collection time | January 2012 | IDA |
| MIGS-4.1 | Latitude | 43.296482 | IDA |
| MIGS-4.1 | Longitude | 5.36978 | IDA |
| MIGS-4.3 | Depth | Surface | IDA |
| MIGS-4.4 | Altitude | 0 m above sea level | IDA |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [45]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 1Phylogenetic tree highlighting the position of strain GD7T relative to other type strains within the genus. GenBank accession numbers are indicated for each strain. Sequences were aligned using CLUSTALW, and phylogenetic inferences obtained using the neighbor-joining method within the MEGA software. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree. was used as an outgroup. The scale bar represents a 2% nucleotide sequence divergence.
Figure 2Gram staining of strain GD7T
Figure 3Transmission electron microscopy of strain GD7T, using a Morgani 268D (Philips) at an operating voltage of 60kV. The scale bar represents 1 μm.
Differential characteristics of sp. nov. strain GD7T, and .
| Colony size (mm) | 0.5 | 0.5 – 2.0 | 1.0 | na | 1.0 – 1.5 |
| Oxygen requirement | facultative anaerobic | facultative anaerobic | facultative anaerobic | facultative anaerobic | facultative anaerobic |
| Gram stain | + | + | + | + | + |
| Motility | - | - | - | - | - |
| Endospore formation | - | - | - | - | - |
| Alkaline phosphatase | + | + | + | + | + |
| Acid phosphatase | + | + | + | - | + |
| Catalase | + | + | + | + | + |
| Oxidase | - | - | - | - | - |
| Nitrate reductase | - | - | - | - | - |
| Urease | - | - | - | - | - |
| α-galactosidase | - | - | - | - | - |
| β-galactosidase | - | - | - | - | - |
| β-glucuronidase | - | - | - | - | - |
| α -glucosidase | - | - | - | - | - |
| β-glucosidase | - | - | - | - | - |
| Esterase | - | - | + | - | + |
| Esterase lipase | - | - | + | + | + |
| naphthol-AS-BI-phosphohydrolase | + | + | na | + | na |
| N-acetyl-β-glucosaminidase | - | - | - | - | - |
| Pyrazinamidase | + | + | + | + | + |
| α-mannosidase | - | - | - | - | - |
| α-fucosidase | - | - | - | - | - |
| Leucine arylamidase | + | + | + | + | na |
| Valine arylamidase | + | - | - | - | - |
| Cystine arylamidase | - | - | + | - | + |
| α-chemotrypsin | - | - | - | - | - |
| Trypsin | - | - | - | - | - |
| 5-keto-gluconate | - | na | + | na | - |
| D-xylose | + | - | - | - | - |
| D-fructose | + | na | + | na | + |
| D-glucose | + | + | + | + | + |
| D-mannose | + | na | + | na | + |
| Habitat | Human gut | Human joint fluid | Human blood | Cosmetic dye | Human blood |
na = data not available
Figure 4Reference mass spectrum from strain GD7T. Spectra from 12 individual colonies were compared and a reference spectrum was generated.
Figure 5Gel view comparing sp. nov. strain GD7T (= CSUR P902 = DSM 45751) to other members of the genus. The gel view displays the raw spectra of all loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the relation between the color a peak is displayed with and the peak intensity in arbitrary units.
Project information
| | ||
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | One 454 paired end 3-kb library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 30× |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-32 | Gene calling method | Prodigal |
| BioProject ID | PRJEB646 | |
| Genbank Assembly ID | GCA_000403725.1 | |
| Genbank Accession number | CAVS000000000 | |
| Genbank Date of Release | 2013/05/29 | |
| MIGS-13 | Project relevance | Study of the human gut microbiome |
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 2,232,265 | |
| DNA Coding region (bp) | 2,041,113 | 91.43 |
| DNA G+C content (bp) | 1,453,204 | 65.1 |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 2,182 | 100 |
| RNA genes | 57 | 2.61 |
| rRNA operons | 1 | |
| Predicted tRNA pseudogenes | 1 | |
| Protein-coding genes | 2,125 | 97.38 |
| Genes with function prediction | 1,562 | 71.58 |
| Genes assigned to COGs | 1,703 | 78.04 |
| Genes with peptide signals | 189 | 8.66 |
| Genes with transmembrane helices | 553 | 25.34 |
| CRISPR repeats | 1 |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Figure 6Graphical circular map of the chromosome. From the outside in, the outer two circles show open reading frames oriented in the forward and reverse directions (colored by COG categories), respectively. The third circle marks the rRNA gene operon (red) and tRNA genes (green). The fourth circle shows the G+C% content plot. The inner-most circle shows the GC skew, purple and olive indicating negative and positive values, respectively.
Number of genes associated with the 25 general COG functional categories
| | |||
|---|---|---|---|
| J | 142 | 6.68 | Translation |
| A | 1 | 0.05 | RNA processing and modification |
| K | 131 | 6.16 | Transcription |
| L | 114 | 5.36 | Replication, recombination and repair |
| B | 0 | 0.00 | Chromatin structure and dynamics |
| D | 19 | 0.89 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 31 | 1.46 | Defense mechanisms |
| T | 60 | 2.82 | Signal transduction mechanisms |
| M | 95 | 4.47 | Cell wall/membrane biogenesis |
| N | 1 | 0.05 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 22 | 1.04 | Intracellular trafficking and secretion |
| O | 62 | 2.92 | Posttranslational modification, protein turnover, chaperones |
| C | 83 | 3.91 | Energy production and conversion |
| G | 100 | 4.71 | Carbohydrate transport and metabolism |
| E | 158 | 7.44 | Amino acid transport and metabolism |
| F | 63 | 2.96 | Nucleotide transport and metabolism |
| H | 78 | 3.67 | Coenzyme transport and metabolism |
| I | 46 | 2.16 | Lipid transport and metabolism |
| P | 117 | 5.51 | Inorganic ion transport and metabolism |
| Q | 35 | 1.64 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 204 | 9.60 | General function prediction only |
| S | 141 | 6.63 | Function unknown |
| - | 422 | 19.8 | Not in COGs |
a The total is based on the total number of protein coding genes in the annotated genome.
Main characteristics of genomes compared to that of strain GD7T.
| Species | Strain | NCIBI ID | Coding density | Length (bp) | GC% | Proteins |
|---|---|---|---|---|---|---|
| GD7T | 90.65 | 2,232,265 | 64.95 | 2,125 | ||
| ATCC 49726 | uid52361 | 86.51 | 2,465,976 | 59.23 | 2,360 | |
| DSM 20306 | uid48813 | 90.3 | 2,764,417 | 55.56 | 2,654 | |
| SK46 | uid55411 | 85.4 | 2,514,382 | 58.58 | 2,103 | |
| uid78139 | 85.95 | 3,113,786 | 55.34 | 2,700 | ||
| ATCC 700975 | uid59409 | 88.49 | 2,790,189 | 60.63 | 2,531 | |
| DSM 20582 | uid67345 | 85.72 | 2,527,982 | 72.55 | 2,339 | |
| VA01 | uid84305 | 88.36 | 2,395,441 | 53.44 | 2,191 | |
| HC01 | uid84297 | 88.03 | 2,427,149 | 53.43 | 2,248 | |
| HC02 | uid84317 | 87.7 | 2,468,612 | 53.71 | 2,230 | |
| INCA 402 | uid83605 | 87.72 | 2,449,071 | 53.65 | 2,214 | |
| NCTC 13129 | uid57691 | 87.96 | 2,488,635 | 53.48 | 2,272 | |
| 241 | uid83607 | 87.87 | 2,426,551 | 53.43 | 2,245 | |
| F0235 | uid183766 | 90.37 | 2,809,766 | 56.84 | 2,823 | |
| YS 314 | uid62905 | 91.38 | 3,147,090 | 63.14 | 2,938 | |
| ATCC 33030 | uid52785 | 90.81 | 2,349,953 | 62.73 | 2,226 | |
| ATCC 51867 | uid55397 | 85.44 | 2,809,779 | 59.09 | 2,645 | |
| R | uid58897 | 86.83 | 3,314,179 | 54.13 | 3,052 | |
| ATCC 13032 | uid57905 | 86.41 | 3,309,401 | 53.81 | 2,993 | |
| ATCC 13032 | uid61611 | 87.53 | 3,282,708 | 53.84 | 3,057 | |
| K411 | uid58399 | 89.41 | 2,462,499 | 61.4 | 2,104 | |
| DSM 44385 | uid59411 | 86.73 | 2,446,804 | 57.46 | 2,018 | |
| DSM 44291 | uid55469 | 87.87 | 2,386,544 | 64.26 | 2,371 | |
| ATCC 14266 | uid51885 | 86.43 | 2,856,058 | 57.09 | 2,619 | |
| S6 4 | uid77677 | 89.61 | 3,107,265 | 69.49 | 2,797 | |
| ATCC 33035 | uid55395 | 89.9 | 2,601,506 | 59.53 | 2,493 | |
| FRC41 | uid50585 | 87.91 | 2,337,913 | 52.19 | 2,110 | |
| 1002 | uid159677 | 85.31 | 2,337,913 | 52.19 | 2,090 | |
| 267 | uid162175 | 86.54 | 2,337,628 | 52.19 | 2,148 | |
| 42 02 A | uid159669 | 84.23 | 2,337,606 | 52.19 | 2,051 | |
| P54B96 | uid157909 | 84.93 | 2,337,657 | 52.19 | 2,084 | |
| DSM 45100 | uid50555 | 87.87 | 2,601,311 | 57.09 | 2,171 | |
| ATCC 6940 | uid55471 | 86.33 | 2,829,831 | 59.05 | 2,677 | |
| SK141 | uid55413 | 89.57 | 2,372,621 | 60.01 | 2,210 | |
| 809 | uid159659 | 87.66 | 2,502,095 | 53.3 | 2,180 | |
| 102 | uid169879 | 87.66 | 2,579,188 | 53.36 | 2,349 | |
| BR AD22 | uid68291 | 87.72 | 2,606,374 | 53.4 | 2,334 | |
| DSM 7109 | uid61639 | 89.7 | 2,369,219 | 64.19 | 2,022 | |
| DSM 7111 | uid188688 | 88.17 | 2,316,065 | 64.24 | 1,935 | |
| DSM 44702 | uid62003 | 87.56 | 3,433,007 | 67.15 | 3,039 |
Figure 7Distribution of functional classes of predicted genes of strain GD7T (colored in thick red line) along with other genomes according to the clusters of orthologous groups of proteins.