| Literature DB >> 28116104 |
M Mailhe1, D Ricaboni2, V Vitton3, A Benezech3, G Dubourg1, C Michelle1, C Andrieu1, N Armstrong1, F Bittar1, P-E Fournier1, D Raoult4, M Million1.
Abstract
The strain Marseille-P2749T (= CSUR P2749 = DSM 103085) was isolated as part of culturomics study from a liquid duodenum sample from a French man. Bacterial cells were Gram-negative bacilli, fusiform shaped and non-spore forming, and they grew in microaerophilic and anaerobic atmosphere. Its genome is 1 809 169 bp long and contains 1646 protein-coding genes. The DNA G+C content was 27.33 mol%. This strain exhibited a 95.9% sequence similarity with Fusobacterium periodonticum, the phylogenetically closest species with standing in nomenclature. Strain Marseille-P2749T is suggested to be a novel species belonging to the genus Fusobacterium, for which the name Fusobacterium massiliense sp. nov. is proposed.Entities:
Keywords: Culturomics; Fusobacterium massiliense; gastrointestinal microbiome; genomics; taxonomy
Year: 2016 PMID: 28116104 PMCID: PMC5225283 DOI: 10.1016/j.nmni.2016.12.003
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1MALDI-TOF MS analysis of strain Marseille-P2749T. (a) Reference mass spectrum of strain Marseille-P2749T. (b) Gel view comparing strain Marseille-P2749T to other close species. Gel view displays raw spectra of loaded spectrum files arranged in pseudo-gel-like look. x-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity is expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour peak displayed and peak intensity in arbitrary units. Displayed species are indicated at left. MALDI-TOF MS, matrix-assisted desorption ionization–time of flight mass spectrometry.
Fig. 2Phylogenetic tree showing position of strain Marseille-P2749T relative to other phylogenetically close neighbours. Sequences were aligned using Muscle 3.8.31 with default parameters, and phylogenetic inferences were obtained using neighbour-joining method with 500 bootstrap replicates within MEGA6 software. Only bootstraps >95% are shown. Scale bar represents 1% nucleotide sequence divergence.
Percentage 16S rRNA similarities within Fusobacterium genus: 1) Strain Marseille-P2749T, 2) F. periodonticum ATCC 33693T, 3) F. nucleatum ATCC 25586T, 4) F. canifelinum ATCC BAA-689T, 5) F. simiae ATCC 33568T, 6) F. russii ATCC 25533T, 7) F. equinum JCM 11174T and 8) F. necrophorum ATCC 25286T
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | |
|---|---|---|---|---|---|---|---|---|
| 1 | 100 | 95.9 | 95.8 | 95.8 | 95.4 | 94.7 | 94.6 | 94.1 |
| 2 | 100 | 97.2 | 97.2 | 96.6 | 93.6 | 93.1 | 92.8 | |
| 3 | 100 | 99.4 | 98.6 | 93.9 | 92.9 | 92.6 | ||
| 4 | 100 | 98.1 | 93.7 | 92.7 | 92.5 | |||
| 5 | 100 | 93.7 | 92.4 | 92.1 | ||||
| 6 | 100 | 93.4 | 93.1 | |||||
| 7 | 100 | 98.2 | ||||||
| 8 | 100 |
Fig. 3Gram staining of strain Marseille-P2749T.
Fig. 4Transmission electron microscopy of strain Marseille-P2749T with Tecnai G20 Cryo microscope (FEI Company) at operating voltage of 200 keV. Scale bar = 500 nm.
Classification and general features of Fusobacterium massiliense strain Marseille-P2749T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Marseille-P2749T | |
| Gram stain | Gram-negative bacillus |
| Cell shape | Fusiform shaped |
| Motility | Nonmotile |
| Sporulation | Non–endospore forming |
| Oxygen requirement | Anaerobic and microaerophilic |
| Temperature range | 28 to 37°C |
| Optimum temperature | 37°C |
| pH | 5.5 to 8 |
| Optimum pH | 7 |
| Salinity | 0 g/L |
| Optimum salinity | 0 g/L |
| Habitat | Human gut: Duodenum |
| Pathogenicity | Unknown |
| Isolation | Duodenum |
Characteristics of Fusobacterium species: 1) strain Marseille-P2749T, 2) F. periodonticum ATCC 33693T, 3) F. nucleatum ATCC 25586T, 4) F. canifelinum ATCC BAA-689T, 5) F. simiae ATCC 33568T, 6) F. russii ATCC 25533T, 7) F. equinum JCM 11174T and 8) F. necrophorum ATCC 25286T
| Property | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
|---|---|---|---|---|---|---|---|---|
| Oxygen requirement | − | − | − | − | − | − | − | − |
| Gram stain | − | − | − | − | − | − | − | − |
| Motility | − | − | − | − | − | − | − | − |
| Spore formation | − | − | − | − | − | − | − | − |
| Production of: | ||||||||
| Catalase | − | − | − | − | − | − | − | − |
| Oxidase | − | − | − | − | − | − | − | − |
| Urease | − | − | − | − | − | − | − | + |
| Gelatin | − | − | − | − | − | − | − | − |
| Indole | + | + | + | + | + | − | + | + |
| β-Galactosidase | − | − | − | − | − | − | − | − |
| β-Glucuronidase | − | − | − | − | − | − | − | − |
| | − | − | − | − | − | − | − | − |
| Alkaline–phosphatase | + | − | − | − | − | + | − | + |
| Acid–phosphatase | + | − | − | − | − | + | + | + |
| Esterase (C4) | + | − | − | − | + | − | + | + |
| Lipase esterase (C8) | + | − | − | − | − | − | + | + |
| Utilization of: | ||||||||
| Esculin | + | − | − | − | − | − | − | − |
| | − | − | − | − | − | − | − | − |
| | − | + | − | − | + | − | − | − |
| | − | + | + | + | + | − | − | − |
| | − | − | − | − | − | − | − | − |
| | − | − | − | − | − | − | − | − |
| | + | − | − | − | − | − | − | − |
| Original source | HD | HOC | HOC | COC, DOC | MOC | HNO | EOC | HOC, COC |
+, positive result; −, negative result; COC, cat oral cavity; DOC, dog oral cavity; EOC, horse oral cavity; HD, human duodenum; HNO, human nonoral; HOC, human oral cavity; MOC, monkey oral cavity.
Cellular fatty acid composition (mean peak area percentage) of Fusobacterium species: 1) strain Marseille-P2749T, 2) F. periodonticum ATCC 33693T, 3) F. nucleatum ATCC 25586T, 4) F. canifelinum ATCC BAA-689T, 5) F. russii ATCC 25533T and 6) F. necrophorum ATCC 25286T
| Fatty acid | 1 | 2 | 3 | 4 | 5 | 6 |
|---|---|---|---|---|---|---|
| 4:0 | 2.6 ± 1.7 | NA | NA | NA | NA | NA |
| 12:0 | TR | 1.03 | 1.3 ± 0.25 | 2.38 ± 0.66 | 5.0 ± 1.14 | 6.0 ± 2.08 |
| 14:0 | 18.1 ± 4.5 | 20.17 | 26.1 ± 3.69 | NA | 11.7 ± 2.25 | 16.7 ± 3.14 |
| 14:0 3-OH | 8.9 ± 3.3 | NA | NA | NA | NA | NA |
| 15:1n5 anteiso | 1.1 ± 0.4 | NA | NA | NA | NA | NA |
| 15:0 | TR | NA | NA | NA | NA | NA |
| 16:1n7 | 9.9 ± 1.8 | 0.75 | 0.7 ± 0.17 | NA | 8.1 ± 0.63 | 5.3 ± 0.53 |
| 16:1n9 | 1.5 ± 0.4 | 12.86 | 20.3 ± 4.29 | NA | 15.7 ± 1.20 | 19.7 ± 1.53 |
| 16:0 | 40.2 ± 6.3 | 18.61 | 21.8 ± 4.50 | NA | 35.7 ± 0.93 | 16.4 ± 1.88 |
| 16:0 3-OH | 5.4 ± 0.3 | 5.43 | 4.1 ± 0.56 | 3.48 ± 0.30 | NA | 1.8 ± 1.13 |
| 18:1n9 | 5.4 ± 2.3 | 4.69 | 0.5 ± 0.22 | NA | 7.1 ± 1.37 | 9.2 ± 1.85 |
| 18:2n6 | 2.2 ± 0.7 | NA | NA | NA | NA | NA |
| 18:0 | 2.3 ± 1.9 | 3.79 | NA | NA | NA | NA |
| 18:1n7 | 1.0 ± 0.6 | NA | NA | NA | NA | NA |
16:0, hexadecanoic acid; 14:0, tetradecanoic acid; 16:1n7, 9-hexadecenoic acid; 14:0 3-OH, 3-hydroxy-tetradecanoic acid; 16:0 3-OH, 3-hydroxy-hexadecanoic acid; 18:1n9, 9-octadecenoic acid; 4:0, butanoic acid; 18:0, octadecanoic acid; 18:2n6, 9,12-octadecadienoic acid; 16:1n9, 7-hexadecenoic acid; 15:1n5 anteiso, 12-methyltetradec-9-enoic acid; 18:1n7, 11-octadecenoic acid; 12:0, dodecanoic acid; 15:0, Pentadecanoic acid; NA, not available; TR, trace amounts <1%.
Nucleotide content and gene count levels of the genome of strain Marseille-P2749T
| Attribute | Value | % of total |
|---|---|---|
| Size (bp) | 1 809 169 | 100 |
| G+C content (bp) | 494 380 | 27.33 |
| Coding region (bp) | 1 617 050 | 89.38 |
| Total of genes | 1703 | 100 |
| Protein-coding genes | 1646 | 96.65 |
| RNA genes | 57 | 3.35 |
| Genes associated with function prediction | 1274 | 77.39 |
| Genes assigned to COGs | 1123 | 68.22 |
| Genes associated with ORFans | 27 | 1.64 |
| Genes with peptide signals | 208 | 12.63 |
| Genes associated with PKS or NRPS | 4 | 0.24 |
| Genes associated with resistance genes | 0 | 0 |
| Genes associated with bacteriocin genes | 10 | 0.61 |
| Genes associated with toxin/antitoxin | 43 | 2.61 |
| Genes associated with virulence | 264 | 16.04 |
| Genes associated with mobilome | 605 | 36.76 |
| Genes with paralogues ( | 189 | 11.48 |
| Genes with paralogues ( | 105 | 6.37 |
| Genes larger than 5000 nucleotides | 6 | 0 |
COGs, Clusters of Orthologous Groups database; NRPS, nonribosomal peptide synthase; PKS, polyketide synthase.
Total is based on either size of genome in base pairs or total number of protein-coding genes in annotated genome.
Fig. 5Graphical circular map of strain of chromosome of Marseille-P2749T. From outside to center: outer two circles show ORF oriented forward (coloured by COGs categories) and backwards (coloured by COGs categories) respectively. Third circle marks tRNA genes (green). Fourth circle shows G+C% content plot. Innermost circle shows GC skew; purple indicates negative values and olive positive values. COGs, Clusters of Orthologous Groups database; ORF, open reading frame.
Genes associated with the 25 COGs functional categories of strain Marseille-P2749T
| Code | Value | % Value | Description |
|---|---|---|---|
| J | 173 | 10.51 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 50 | 3.03 | Transcription |
| L | 58 | 3.52 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 21 | 1.28 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 51 | 3.10 | Defense mechanisms |
| T | 34 | 2.07 | Signal transduction mechanisms |
| M | 86 | 5.22 | Cell wall/membrane biogenesis |
| N | 8 | 0.49 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 3 | 0.18 | Extracellular structures |
| U | 27 | 1.64 | Intracellular trafficking and secretion |
| O | 43 | 2.61 | Posttranslational modification, protein turnover, chaperones |
| X | 16 | 0.97 | Mobilome: prophages, transposons |
| C | 65 | 3.95 | Energy production and conversion |
| G | 71 | 4.31 | Carbohydrate transport and metabolism |
| E | 144 | 8.75 | Amino acid transport and metabolism |
| F | 45 | 2.73 | Nucleotide transport and metabolism |
| H | 85 | 5.16 | Coenzyme transport and metabolism |
| I | 48 | 2.92 | Lipid transport and metabolism |
| P | 61 | 3.71 | Inorganic ion transport and metabolism |
| Q | 11 | 0.67 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 90 | 5.47 | General function prediction only |
| S | 45 | 2.73 | Unknown function |
| — | 523 | 31.77 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Fig. 6Distribution of functional classes of predicted genes according to COGs of proteins from strain Marseille-P2749T. COGs, Clusters of Orthologous Groups database.
Orthologous proteins shared between genomes (upper right), average percentage similarity of nucleotides corresponding to orthologous proteins shared between genomes (lower left) and number of proteins per genome (bold)
| Strain Marseille-P2749T | |||||||
|---|---|---|---|---|---|---|---|
| Strain Marseille-P2749T | 993 | 912 | 880 | 855 | 823 | 811 | |
| 76.55 | 931 | 886 | 854 | 834 | 793 | ||
| 67.50 | 66.76 | 1267 | 775 | 1095 | 1054 | ||
| 68.48 | 67.49 | 69.27 | 750 | 1069 | 1040 | ||
| 64.13 | 62.35 | 60.12 | 59.37 | 690 | 685 | ||
| 71.57 | 70.53 | 66.96 | 68.84 | 59.89 | 823 | ||
| 57.70 | 57.46 | 58.37 | 58.99 | 59.09 | 58.05 |
Pairwise comparison of strain Marseille-P2749T with other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)a
| Strain Marseille-P2749 | ||||||||
|---|---|---|---|---|---|---|---|---|
| Strain Marseille-P2749 | 100 (100–100) | 26.10 (23.8–28.6) | 26.00 (23.7–28.5) | 21.30 (19.0–23.7) | 21.10 (18.8–23.5) | 20.90 (18.6–23.3) | 20.60 (18.4–23.1) | 18.00 (15.9–20.4) |
| 100 (100–100) | 30.30 (27.9–32.8) | 21.00 (18.8–23.4) | 23.60 (21.3–26.0) | 21.30 (19.1–23.7) | 20.10 (17.9–22.5) | 19.10 (16.9–21.4) | ||
| 100 (100–100) | 21.10 (18.9–23.5) | 22.90 (20.7–25.4) | 20.90 (18.7–23.4) | 21.30 (19.1–23.7) | 19.20 (17.0–21.6) | |||
| 100 (100–100) | 18.40 (16.3–20.8) | 19.30 (17.1–21.7) | 18.70 (16.5–21.1) | 19.70 (17.5–22.1) | ||||
| 100 (100–100) | 22.40 (20.1–24.8) | 22.30 (20.0–24.7) | 19.50 (17.3–21.9) | |||||
| 100 (100–100) | 20.40 (18.2–22.9) | 22.40 (20.1–24.8) | ||||||
| 100 (100–100) | 18.20 (16.0–20.6) | |||||||
| 100 (100–100) |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring segment pairs.
Confidence intervals indicate inherent uncertainly in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with 16S rRNA (Fig. 1(b)) and phylogenomic analyses as well as GGDC result.