| Literature DB >> 26388967 |
Vicky Merhej1, Anne Pfleiderer1, Dhamodharan Ramasamy1, Jean-Christophe Lagier1, Caroline Michelle1, Didier Raoult1, Pierre-Edouard Fournier1.
Abstract
Clostridium ihumii strain AP5(T) sp. nov. is a new species within the genus Clostridium. This strain, whose genome is described here, was isolated from the stool sample of a 21-year-old French Caucasian female with anorexia nervosa. C. ihumii is a Gram-positive, anaerobic bacillus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,433,668 bp long genome contains 4,076 protein-coding and 85 RNA genes, including 9 rRNA genes.Entities:
Keywords: Clostridium ihumii; Culturomics; Genome; Taxono-genomics
Year: 2015 PMID: 26388967 PMCID: PMC4575456 DOI: 10.1186/s40793-015-0025-x
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Clostridium ihumii strain AP5T according to the MIGS specification [18]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain Bacteria | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | IDA [ | ||
| Species | IDA | ||
| Type strain AP5T | IDA | ||
| Gram stain | Positive | IDA | |
| Cell shape | Rod-shapped | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Sporulating | IDA | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 37 °C | IDA | |
| pH range; Optimum | Not determined | ||
| Carbon source | Not determined | ||
| MIGS-6 | Habitat | Human gut | IDA |
| MIGS-6.3 | Salinity | Not determined | |
| MIGS-22 | Oxygen requirement | Anaerobic | IDA |
| MIGS-15 | Biotic relationship | free living | IDA |
| MIGS-14 | Pathogenicity | unknown | |
| MIGS-4 | Geographic location | France | IDA |
| MIGS-5 | Sample collection time | January 2012 | IDA |
| MIGS-4.1 | Latitude | 43.296482 | IDA |
| MIGS-4.2 | Longitude | 5.36978 | IDA |
| MIGS-4.3 | Depth | Surface | IDA |
| MIGS-4.4 | Altitude | 0 m above sea level | IDA |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [30]
Fig. 1Phylogenetic tree highlighting the position of C. ihumii strain AP5T relative to other type strains within the genus Clostridium. GenBank accession numbers are indicated in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences obtained using the maximum-likelihood method within the MEGA software. Numbers at the nodes are percentages of bootstrap values obtained by repeating 500 times the analysis to generate a majority consensus tree. Oxobaxter pfennigii was used as an outgroup. The scale bar represents 2 % nucleotide sequence divergence
Fig. 2Gram staining of C. ihumii strain AP5T
Fig. 3Transmission electron microscopy of C. ihumii strain AP5T using a Morgani 268D (Philips) at an operating voltage of 60 kV. The scale bar represents 1um
Fig. 4Reference mass spectrum from C. ihumii strain AP5T. Spectra from 12 individual colonies were compared and a reference spectrum was generated
Fig. 5Gel view comparing C. ihumii strain AP5T with C. senegalense, C. perfringens, C. difficile, C. dakarense, C. carboxidivorans, C. botulinum and C. beijerinckii, respectively. The gel view displays the raw spectra of loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the relation between the color a peak is displayed with and the peak intensity in arbitrary units. The compared species are indicated on the left
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Paired-end libraries |
| MIGS-29 | Sequencing platforms | Roche 454 and MiSeq (Illumina) |
| MIGS-31.2 | Fold coverage | 32.7× |
| MIGS-30 | Assemblers | CLC denovo assembly |
| MIGS-32 | Gene calling method | Prodigal |
| Locus Tag | BN346 | |
| Genbank ID | CCAT000000000 | |
| EMBL Date of release | 03-20-2014 | |
| BIOPROJECT | PRJEB373 | |
| MIGS-13 | Source Material identifier | Human feces |
| Project relevance | Study of the human gut microbiome |
Fig. 6Graphical circular map of the chromosome. From the outside in, the outer two circles show open reading frames oriented in the forward and reverse directions (colored by COG categories), respectively. The third circle shows the rRNA gene operon (red) and tRNA genes (green). The fourth circle shows the G + C% content plot. The inner-most circle shows GC skew, purple and olive indicating negative and positive values, respectively
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 4,433,668 | 100 |
| DNA coding (bp) | 3,646,866 | 82.25 |
| DNA G + C (bp) | 1,183,789 | 26.70 |
| DNA scaffolds | 21 | |
| Total genes | 4,161 | 100 |
| Protein coding genes | 4,076 | 97.96 |
| RNA genes | 85 | 2.04 |
| Pseudo genes | ND | |
| Genes in internal clusters | ND | |
| Genes with function prediction | 2,408 | 57.83 |
| Genes assigned to COGs | 2,777 | 66.74 |
| Genes with Pfam domains | ND | |
| Genes with signal peptides | 84 | 2.02 |
| Genes with transmembrane helices | 1,107 | 26.60 |
| CRISPR repeats | ND |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome. ND for Not determined
Number of genes associated with general COG functional categories
| Code | Value | % of totala | Description |
|---|---|---|---|
| J | 208 | 4.60 | Translation |
| K | 292 | 6.50 | Transcription |
| L | 181 | 4.02 | Replication, recombination and repair |
| D | 31 | 0.69 | Cell cycle control, mitosis and meiosis |
| V | 158 | 3.51 | Defense mechanisms |
| T | 226 | 5.02 | Signal transduction mechanisms |
| M | 154 | 3.42 | Cell wall/membrane biogenesis |
| N | 92 | 2.04 | Cell motility |
| U | 43 | 0.95 | Intracellular trafficking and secretion |
| O | 86 | 1.91 | Posttranslational modification, protein turnover, chaperones |
| C | 165 | 3.66 | Energy production and conversion |
| G | 101 | 2.24 | Carbohydrate transport and metabolism |
| E | 229 | 5.09 | Amino acid transport and metabolism |
| F | 78 | 1.73 | Nucleotide transport and metabolism |
| H | 119 | 2.64 | Coenzyme transport and metabolism |
| I | 66 | 1.47 | Lipid transport and metabolism |
| P | 163 | 3.62 | Inorganic ion transport and metabolism |
| Q | 72 | 1.60 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 493 | 10.95 | General function prediction only |
| S | 245 | 5.44 | Function unknown |
| - | 1299 | 28.85 | Not in COGs |
a The total is based on the total number of protein coding genes in the annotated genome
Genomic comparison of C. ihumii with 7 other members of Clostridium species
| A | ||||||||
|---|---|---|---|---|---|---|---|---|
| Species | Strain | Genome accession number | Genome size (Mb) | G + C content | ||||
|
| AP5 | CCAT000000000 | 4.43 | 26.7 | ||||
|
| ATCC 13124 | NC_008261 | 3.26 | 28.4 | ||||
|
| FF1 | CBTZ010000000 | 3.73 | 27.9 | ||||
|
| JC122 | CAEV01000001 | 3.89 | 26.8 | ||||
|
| ATCC 3502 | NC_009495 | 3.90 | 28.2 | ||||
|
| P7 | NZ_ADEK00000000 | 4.41 | 29.7 | ||||
|
| B1 | NC_017179 | 4.46 | 28.4 | ||||
|
| NCIMB 8052 | NC_009617 | 6.00 | 29.0 | ||||
| B | ||||||||
|
|
|
|
|
|
|
|
| |
|
| 4,076 | 1185 | 1189 | 1688 | 1427 | 1186 | 1124 | 1310 |
|
| 72.10 | 2,876 | 1080 | 1173 | 1132 | 978 | 996 | 1268 |
|
| 70.40 | 70.36 | 3,818 | 1156 | 1099 | 1022 | 1307 | 1189 |
|
| 79.10 | 72.14 | 70.34 | 3,704 | 1442 | 1183 | 1095 | 1292 |
|
| 72.58 | 72.01 | 69.74 | 73.10 | 3,572 | 1342 | 1143 | 1450 |
|
| 72.17 | 71.57 | 69.40 | 72.48 | 74.09 | 4,174 | 1046 | 1342 |
|
| 69.70 | 69.55 | 77.68 | 69.53 | 69.18 | 69.08 | 3,591 | 1194 |
|
| 70.85 | 71.97 | 69.10 | 71.10 | 71.01 | 71.45 | 68.52 | 5,020 |
C.ihu = C. ihumii, C. bej = C. beijerinckii, C. bot = C. botulinum, C. car = C. carboxidivorans, C. dak = C. dakarense, C. dif = C. difficile, C. per = C. perfringens, C. sen = C. senegalense
a: Species name, Strain, EMBL and GenBank accession number, Genome size and GC content of compared genomes
b: Numbers of orthologous protein shared between genomes (upper right triangle), average percentage similarity of nucleotides corresponding to orthologous proteins shared between genomes (lower left triangle) and the numbers of proteins per genome (bold diagonal)
Fig. 7Distribution of functional classes of predicted genes in the C. ihumii AP5T and other 7 clostridium genomes according to the clusters of orthologous groups of proteins