| Literature DB >> 27540086 |
Mia Yang Ang1, Avirup Dutta2, Wei Yee Wee1, David Dymock3, Ian C Paterson4, Siew Woh Choo5.
Abstract
Fusobacterium nucleatum is considered to be a key oral bacterium in recruiting periodontal pathogens into subgingival dental plaque. Currently F. nucleatum can be subdivided into five subspecies. Our previous genome analysis of F. nucleatum W1481 (referred to hereafter as W1481), isolated from an 8-mm periodontal pocket in a patient with chronic periodontitis, suggested the possibility of a new subspecies. To further investigate the biology and relationships of this possible subspecies with other known subspecies, we performed comparative analysis between W1481 and 35 genome sequences represented by the five known Fusobacterium subspecies. Our analyses suggest that W1481 is most likely a new F. nucleatum subspecies, supported by evidence from phylogenetic analyses and maximal unique match indices (MUMi). Interestingly, we found a horizontally transferred W1481-specific genomic island harboring the tripartite ATP-independent (TRAP)-like transporter genes, suggesting this bacterium might have a high-affinity transport system for the C4-dicarboxylates malate, succinate, and fumarate. Moreover, we found virulence genes in the W1481 genome that may provide a strong defense mechanism which might enable it to colonize and survive within the host by evading immune surveillance. This comparative study provides better understanding of F. nucleatum and the basis for future functional work on this important pathogen.Entities:
Keywords: Fusobacterium nucleatum; F. nucleatum W1481; comparative genomics
Mesh:
Year: 2016 PMID: 27540086 PMCID: PMC5630926 DOI: 10.1093/gbe/evw199
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
F(A) Circular representation of the W1481 genome. The indication for each feature from the outermost layer: (a) Contigs (Dark Red); (b) Forward protein-coding genes (Orange); (c) Reverse protein-coding genes (Light Blue); (d) tRNAs (Light Green); (e) rRNAs (Dark Green); (f) Predicted Genomic Islands (Dark Blue); and (g) GC plot (Purple and Green). (B) Subcellular distribution as predicted by RAST.
FPhylogenetic classification of W1481. (A) Phylogenetic tree constructed with 16S rRNA gene sequences of 26 strains of Fusobacterium strains. (B) Phylogenetic tree constructed with the core-genome SNPs of 23 strains of Fusobacterium.
F(A) MUMi analysis between W1481 and other Fusobacterium spp. The initial result showed that W1481 has the closest value (0.64) when compared to F. nucleatum subsp. animalis. (B) Comparison between the MUMi values among W1481 and the groups of five existing subspecies of F. nucleatum. The MUMi results of each subspecies are summarized and compared. The upper value of the box and whisker box showed the highest MUMi value, whereas the lower value showed the lowest MUMi value while the value in middle showed the average MUMi value when we performed comparison. A clear separation of W1481 from the rest of the known F. nucleatum subspecies is able to be performed at the threshold of 0.6342. (Note: FNA denotes subsp. animalis; FNV denotes subsp. vincentii; FNN denotes subsp. nucleatum; FNP denotes subsp. polymorphum and FNF denotes subsp. fusiforme).
Overview of the genome statistics
| # | Strain name | Strain status | Genome size (Mb) | GC content (%) | Number of contigs | Contig N50 | Number of ORFs | Number of tRNAs | Number of rRNAs |
|---|---|---|---|---|---|---|---|---|---|
| 1 |
| I | 2.48 | 27.5 | 186 | 7,109 | 2,219 | 46 | 10 |
| 2 |
| I | 2.7 | 26.9 | 81 | 101,807 | 2,725 | 47 | 14 |
| 3 |
| I | 2.15 | 27 | 2 | 1,755,614 | 2,083 | 46 | 7 |
| 4 |
| I | 2.29 | 27 | 18 | 266,619 | 2,135 | 44 | 3 |
| 5 |
| C | 2.26 | 27.1 | 1 | – | 2,202 | 56 | 15 |
| 6 |
| I | 2.47 | 26.8 | 95 | 45,444 | 2,477 | 42 | 2 |
| 7 |
| I | 2.48 | 25.2 | 465 | 4,679 | 2,443 | 42 | 5 |
| 8 |
| I | 2.27 | 26.9 | 123 | 39,283 | 2,210 | 35 | 0 |
| 9 |
| I | 2.34 | 26.9 | 355 | 11,476 | 2,387 | 44 | 4 |
| 10 |
| I | 2.44 | 27 | 5 | 2,257,469 | 2,345 | 39 | 13 |
| 11 |
| I | 2.24 | 27 | 67 | 69,905 | 2,143 | 43 | 3 |
| 12 |
| C | 2.17 | 27.2 | 1 | – | 2,102 | 47 | 15 |
| 13 |
| I | 2.16 | 27 | 83 | 68,085 | 2,112 | 32 | 3 |
| 14 |
| C | 2.43 | 26.8 | 1 | – | 2,390 | 45 | 10 |
| 15 |
| I | 2.51 | 27 | 2 | 2,507,721 | 2,533 | 51 | 19 |
| 16 |
| I | 2.2 | 27 | 77 | 111,961 | 2,115 | 46 | 13 |
| 17 |
| C | 2.27 | 27.1 | 1 | – | 2,189 | 43 | 3 |
| 18 |
| I | 2.25 | 26.9 | 50 | 106,163 | 2,207 | 43 | 5 |
| 19 |
| I | 2.12 | 27.3 | 302 | 14,248 | 2,321 | 44 | 12 |
| 20 |
| I | 1.99 | 26.9 | 186 | 23,738 | 1,833 | 28 | 1 |
| 21 |
| I | 1.84 | 27.2 | 198 | 23,781 | 1,778 | 45 | 11 |
Note.—C, complete genome; I, incomplete genome.
FPan-genome and core genome size prediction for 21 F. nucleatum genomes. The blue diamonds, represent the functional pan-genome size for each combination and the blue dotted line represents the relationship between the number of genomes and functional pan-genome size. The green dots represent the core functional clusters for each combination and the green dotted line represents the relationship between the number of genomes and the number of core functional cluster.
FStrain-specific genes for the 21 Fusobacterium nucleatum genomes.
FComparative pathogenomic analysis of W1481. The threshold used in this analysis was 50% sequence identity and 50% sequence completeness.