| Literature DB >> 28603641 |
N Dione1, J Rathored1, E Tomei1, J-C Lagier1, S Khelaifia1, C Robert1, F Bretelle1,2, D Raoult1,3, P-E Fournier1, F Fenollar1.
Abstract
Strain ND3T was isolated from the genital tract of a 28-year-old woman with bacterial vaginosis. This strain exhibited a 16S rRNA gene sequence similarity of 92.4% with Sutterella wadsworthensis, the phylogenetically closest species with standing in nomenclature. Strain ND3T was a strictly anaerobic Gram-negative rod and member of the family Sutterellaceae. It exhibited a genome of 2 476 884 bp containing 2175 protein-coding and 62 RNA genes. On the basis of these data, we propose the creation of 'Dakarella massiliensis' sp. nov. with strain ND3T (= CSUR P1938 = DSM 100447) as the type strain.Entities:
Keywords: bacterial vaginosis; culturomics; taxono-genomics
Year: 2017 PMID: 28603641 PMCID: PMC5454129 DOI: 10.1016/j.nmni.2017.05.003
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1Phylogenetic tree highlighting position of ‘Dakarella massiliensis’ strain ND3T relative to other type strains within Sutterellaceae family. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using maximum-likelihood method within MEGA software. Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 500 times to generate majority consensus tree. GenBank accession numbers are indicated in tree. Lautropia mirabilis was used as outgroup. Scale bar represents 2% nucleotide sequence divergence.
Fig. 2Reference mass spectrum from ‘Dakarella massiliensis’ strain ND3T. Spectra from 12 individual colonies were compared and reference spectrum generated.
Fig. 3Transmission electron microscopy of ‘Dakarella massiliensis’ strain ND3T using Tecnai G20(Fei) at operating voltage of 60 kV. Scale bar = 500 nm.
Fig. 4Gram staining of ‘Dakarella massiliensis’ strain ND3T.
Phenotypic features that distinguish Dakarella massiliensis strain ND3T from the closely related type strains for Alicycliphilus denitrificans strain K601, Caldimonas manganoxidans strain JCM 10698, Comamonas badia strain IAM 14839, Comamonas composti strain YY287, Lautropia mirabilis strain ATCC 51599, Parasutterella excrementihominis strain YIT 11859 and Sphaerotilus natans strain DSM 6575
| Characteristic | ||||||||
|---|---|---|---|---|---|---|---|---|
| Cell diameter (μm) | 0.70 | 0.5–1.0 | 0.5–0.7 | 0.8–0.9 | 0.5 | 1.5–2 | 0.4–1.1 | 1.2–2.06 |
| Oxygen requirement | Anaerobic | Anaerobic | Aerobic | Anaerobic | Anaerobic | Facultative anaerobic | Anaerobic | Anaerobic |
| Gram stain | − | − | − | − | − | − | − | − |
| Motility | − | + | + | + | + | + | − | + |
| Endospore formation | − | NA | − | − | − | − | − | NA |
| DNA G+C content (mol%) | 57 | 68 | 66 | 66 | 63.3 | 65.6 | 48.1 | 69.9 |
| Biochemical | ||||||||
| Oxidase | − | + | NA | + | + | + | − | + |
| Catalase | − | NA | + | + | + | + | − | + |
| Indole | + | NA | − | NA | − | NA | − | − |
| Nitrate reductase | NA | NA | NA | + | + | − | + | |
| | − | − | NA | NA | − | − | − | + |
| Mannitol | − | NA | + | − | − | + | NA | − |
| | − | + | − | NA | + | + | NA | + |
| | − | + | NA | NA | − | − | NA | + |
| α-Glucosidase | − | NA | + | NA | − | NA | − | NA |
| β-Glucosidase | + | NA | − | NA | − | NA | − | NA |
| Urease | − | NA | − | − | − | + | − | NA |
| Lipase | + | NA | − | + | + | NA | w | NA |
| Isolated from: | Human gut and vagina | Sewage | Hot spring | Sludge | Food waste | Human oral cavity | Human gut | Freshwater |
+, positive result; −, negative result; w, weakly positive result; NA, data not available.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 153 | 7.04 | Translation |
| A | 2 | 0.09 | RNA processing and modification |
| K | 141 | 6.49 | Transcription |
| L | 132 | 6.07 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 23 | 1.06 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 26 | 1.20 | Defense mechanisms |
| T | 80 | 3.68 | Signal transduction mechanisms |
| M | 145 | 6.67 | Cell wall/membrane biogenesis |
| N | 1 | 0.05 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 6 | 0.28 | Extracellular structures |
| U | 53 | 2.44 | Intracellular trafficking and secretion |
| O | 87 | 4.0 | Posttranslational modification, protein turnover, chaperones |
| C | 174 | 8.0 | Energy production and conversion |
| G | 74 | 3.40 | Carbohydrate transport and metabolism |
| E | 198 | 9.11 | Amino acid transport and metabolism |
| F | 57 | 2.62 | Nucleotide transport and metabolism |
| H | 77 | 3.54 | Coenzyme transport and metabolism |
| I | 58 | 2.67 | Lipid transport and metabolism |
| P | 113 | 5.20 | Inorganic ion transport and metabolism |
| Q | 27 | 1.24 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 260 | 11.96 | General function prediction only |
| S | 118 | 5.43 | Function unknown |
| — | 171 | 7.87 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Fig. 5Graphical circular map of chromosome. From outside to centre: genes on forward strand coloured by COGs categories (only genes assigned to COGs), genes on reverse strand coloured by COGs categories (only genes assigned to COGs), RNA genes (tRNAs green, rRNAs red), GC content and GC skew. COGS, Clusters of Orthologous Groups database.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 2 476 884 | 100 |
| G+C content (%) | 1 411 823 | 57.0 |
| Coding region (bp) | 2 126 880 | 85.86 |
| Total genes | 2236 | 100 |
| RNA genes | 62 | 2.77 |
| Protein-coding genes | 2174 | 97.22 |
| Genes with function prediction | 1780 | 79.60 |
| Genes assigned to COGs | 1609 | 71.95 |
| Genes with peptide signals | 420 | 18.78 |
| No. of pseudogenes | 76 | 3.39 |
| Genes with transmembrane helices | 428 | 19.14 |
| CRISPR repeats | 01 | 0.04 |
| No. of genes with Pfam-A domains | 2001 | 89.49 |
| ORFan genes | 59 | 2.63 |
COG2, Clusters of Orthologous Groups database.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Genome comparison of closely related species to Dakarella massiliensis strain ND3T
| Microorganism | INSDC | Size (Mb) | G+C (%) | Protein- coding genes | Total genes |
|---|---|---|---|---|---|
| CVTY00000000.1 | 2.47 | 57 | 2174 | 2236 | |
| CP002657.1 | 5.0 | 68 | 4573 | 4705 | |
| ARLH00000000.1 | 3.53 | 66 | 3187 | 3385 | |
| AXVM00000000.1 | 3.68 | 66 | 3388 | 3499 | |
| AUCQ00000000.1 | 4.63 | 63.3 | 3893 | 4078 | |
| AEQP00000000.1 | 3.15 | 65.6 | 2413 | 2569 | |
| AFBP00000000.1 | 2.83 | 48.1 | 2470 | 2570 | |
| AZRA00000000.1 | 4.59 | 69.9 | 3898 | 4143 |
INSDC, International Nucleotide Sequence Database Collaboration.
Numbers of orthologous protein shared between genomes (upper right)a
| 1597 | 1710 | 1871 | 1089 | 786 | 1564 | 771 | ||
| 72.74 | 1399 | 1530 | 1020 | 756 | 1453 | 729 | ||
| 79.04 | 71.25 | 1679 | 1011 | 739 | 1360 | 724 | ||
| 77.53 | 70.89 | 74.56 | 1063 | 776 | 1523 | 755 | ||
| 69.18 | 67.93 | 67.73 | 67.53 | 701 | 999 | 680 | ||
| 60.00 | 60.65 | 60.17 | 60.59 | 60.54 | 744 | 757 | ||
| 74.24 | 74.69 | 72.47 | 71.46 | 69.40 | 60.13 | 726 | ||
| 63.15 | 62.83 | 62.95 | 62.66 | 63.25 | 63.91 | 63.84 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).
Pairwise comparison of Dakarella massiliensis strain ND3 with other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)a (upper right)
| 100% ± 00 | 18.8% ± 2.65 | 18.6% ± 2.60 | 19.6% ± 2.58 | 22.4% ± 2.53 | 19.2% ± 2.55 | 34.4% ± 2.52 | 20.1% ± 2.67 | |
| 100% ± 00 | 22.8% ± 2.93 | 22.3% ± 2.82 | 24.7% ± 2.53 | 18.6% ± 2.58 | 32.1% ± 2.52 | 20.1% ± 2.69 | ||
| 100% ± 00 | 20.8% ± 2.71 | 26.7% ± 2.53 | 17.9% ± 2.56 | 33.7% ± 2.52 | 19.1% ± 2.65 | |||
| 100% ± 00 | 26.9% ± 2.52 | 23.2% ± 2.53 | 26.8% ± 2.53 | 22.1% ± 2.53 | ||||
| 100% ± 00 | 19.2% ± 2.55 | 32.7% ± 2.52 | 19.8% ± 2.60 | |||||
| 100% ± 00 | 35.0% ± 2.52 | 18.7% ± 2.57 | ||||||
| 100% ± 00 | 31.5% ± 2.52 | |||||||
| 100% ± 00 |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring segment pairs.
Confidence intervals indicate the inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size) These results are in accordance with the 16S rRNA (Fig. 1) and phylogenomic analyses as well as the GGDC results.