Literature DB >> 27540086

Comparative Genome Analysis of Fusobacterium nucleatum.

Mia Yang Ang1, Avirup Dutta2, Wei Yee Wee1, David Dymock3, Ian C Paterson4, Siew Woh Choo5.   

Abstract

Fusobacterium nucleatum is considered to be a key oral bacterium in recruiting periodontal pathogens into subgingival dental plaque. Currently F. nucleatum can be subdivided into five subspecies. Our previous genome analysis of F. nucleatum W1481 (referred to hereafter as W1481), isolated from an 8-mm periodontal pocket in a patient with chronic periodontitis, suggested the possibility of a new subspecies. To further investigate the biology and relationships of this possible subspecies with other known subspecies, we performed comparative analysis between W1481 and 35 genome sequences represented by the five known Fusobacterium subspecies. Our analyses suggest that W1481 is most likely a new F. nucleatum subspecies, supported by evidence from phylogenetic analyses and maximal unique match indices (MUMi). Interestingly, we found a horizontally transferred W1481-specific genomic island harboring the tripartite ATP-independent (TRAP)-like transporter genes, suggesting this bacterium might have a high-affinity transport system for the C4-dicarboxylates malate, succinate, and fumarate. Moreover, we found virulence genes in the W1481 genome that may provide a strong defense mechanism which might enable it to colonize and survive within the host by evading immune surveillance. This comparative study provides better understanding of F. nucleatum and the basis for future functional work on this important pathogen.
© The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Fusobacterium nucleatum; F. nucleatum W1481; comparative genomics

Mesh:

Year:  2016        PMID: 27540086      PMCID: PMC5630926          DOI: 10.1093/gbe/evw199

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

The genus Fusobacterium can be distinguished by its Gram-negative, nonmotile and nonsporulating features, as described in the studies conducted by Bachmann and Gregor (1936). Cultures of Fusobacterium grow only under strictly anaerobic conditions and they are extremely susceptible to atmospheric oxygen (Spaulding and Rettger 1937b). Fusobacterium are normally present in the human oral cavity and throat (Spaulding and Rettger 1937a), however, recent studies have shown that the oropharynx, as well as the gastrointestinal and genitourinary tracts may also be populated with Fusobacterium spp. (Castellarin et al. 2012). Fusobacterium spp. can occasionally cause localized infections such as tonsillitis, para-tonsillar abscess or dental sepsis, invasive disease including septicaemia, abscesses of the brain, liver and lung, or Lemierre’s syndrome, a septic thrombophlebitis of the internal jugular vein that is often associated with oropharyngeal infection (Huggan and Murdoch 2008; Kuppalli et al. 2012). The majority of research has focused on the role of Fusobacterium spp. in gingivitis and periodontitis, where Fusobacterium nucleatum (F. nucleatum) is the most frequently isolated species from dental plaque, where it is believed to play an important role in recruiting periodontal pathogens into subgingival dental plaque (Socransky et al. 1998). Based on the polyacrylamide gel electrophoresis pattern of whole-cell proteins and DNA homology, glutamate dehydrogenase and 2-oxoglutarate reductase electrophoretic patterns, DNA–DNA hybridization comparisons and phenotypic characteristics, F. nucleatum was further classified into five subspecies (nucleatum, polymorphum, animalis, vincentii and fusiforme) (Gharbia and Shah 1990; Morris et al. 1996). In this study, we present a detailed genomic and comparative analysis of F. nucleatum W1481 (referred to hereafter as W1481), which was isolated from an 8-mm periodontal pocket in a patient with chronic periodontitis and was subsequently sequenced and reported by our group (Ang et al. 2014). An in-depth phylogenetic and comparative genomic analysis of this genome reconfirmed our initial assumption that W1481 belongs to F. nucleatum, but it is quite distinct from the other existing subspecies of F. nucleatum and is possibly a new subspecies. The distinct taxonomic status of F. nucleatum W1481 prompted us to investigate the genome more closely, especially its virulence profile and compare it with the other subspecies of F. nucleatum to provide a better insight into the biology of this unique strain. This genomic study is likely to form the basis for future functional work on this important pathogen.

Materials and Methods

Genome Annotation

To ensure the uniformity in the annotations of the genomes which is important for comparative analysis, all the genome sequences of Fusobacterium spp. used in the study were annotated by uploading their sequence files to the Rapid Annotation using Subsystem Technology (RAST) web server (Aziz et al. 2008) for genomic annotations. The RAST pipeline produced putative protein sequences and functions based on the subsystems in FIGfams (Meyer et al. 2009), a new collection of over 100,000 protein families that were the product of manual curation and close strain comparison across hundreds of bacteria and archaea genomes.

Phylogenetic Analyses

16S rRNA sequences for 25 identified Fusobacterium strains (comprising of F. nucleatum, F. periodonticum, F. ulcerans, F. mortiferum, F. varium, F. russii, F. gonidiaformans and F. necrophorum) were obtained from National Centre for Biotechnology Information (NCBI) RefSeq (Pruitt et al. 2007) and were aligned with that of W1481 using Clustal Omega (Sievers et al. 2011). To construct a core genome SNP-based tree, all genome sequences were uploaded to PanSeq (Laing et al. 2010) for alignment and the Single Nucleotide Polymorphism (SNPs) of the core/common genome regions were extracted. MEGA version 5.2 (Tamura et al. 2011) was used for DNA substitution model testing and also to generate the phylogenetic trees. The model selection was based on Bayesian information criterion (BIC). All the neighbor-joining phylogenetic trees were based on Kimura’s two parameter model, inferred with 1,000 bootstrap replications.

Genomic Distance Calculation

Calculation of genomic distance was performed based on the number of maximal unique and exact matches (MUMs) of a minimal length shared by two genomes when being compared. For each genome pairs, MUMs were generated using MUMmer 3 software package (Delcher et al. 2003) using the parameters: -mum, -b, -c and -119. MUMmer package automatically detects matches that may not be unique while a Perl script developed by Marc and his colleague (Deloger et al. 2009) was used to trim overlapping MUMs and to calculate the MUM index (MUMi) values. MUMi varies between 0 for very similar and 1 for very distant genomes.

Whole-Genome Average Nucleotide Identity Analysis

To evaluate the genetic relatedness between W1481 and the other Fusobacterium genomes the average nucleotide identity (ANI) (Konstantinidis and Tiedje 2005) was calculated based on the method proposed by Goris et al. (2007). Two-way BLAST was chosen and only forward and reversed-matched orthologs were used in the calculations. For the robustness, the BLAST match have been set at least 50% identity at the nucleotide and amino acid level and a sequence coverage of at least 70%.

Amino Acid Identity Analysis

The calculation of average amino acid identity (AAI) was performed by the method as described by Konstantinidis and Tiedje (2005). The RAST annotated protein-coding sequences of the W1481 genome was used as the reference for comparison against other Fusobacterium genomes (also annotated using RAST), using the BLAST search to determine the conserved genes. The cut-off was set at ≥30% sequence identity and ≥70% sequence coverage at the amino acid level for the BLAST search. The average of the amino acid identity of all conserved genes between a pair of genomes was computed to measure the genetic relatedness between them.

Comparative Genomic Island Analysis

All genomes sequences were submitted to IslandViewer (Langille and Brinkman 2009) for genomic island (GI) prediction. IslandViewer implements a sequence composition based approaches derived from other validated GI prediction software packages, which were SIGI-HMM (Waack et al. 2006) and IslandPath-DIMOB (Hsiao et al. 2003). Both the composition based approach were shown to have a higher than 86% overall accuracy compared with other prediction software. IslandViewer also integrated IslandPick, which was developed based on comparative genomics approach (Langille et al. 2008). To cluster these GIs for comparison across different F. nucleatum strains, the GI nucleotide sequences were clustered using BLASTClust (Altschul et al. 1990) with the thresholds of at least 50% sequence identity and 50% sequence coverage.

Comparative Pathogenomic Analyses

Homology searches were performed on the protein sequences of 36 Fusobacterium strains against the Virulence Factors Database (VFDB) (Chen et al. 2005, 2012), by applying BLASTP of the BLAST software package (Altschul et al. 1997). The BLAST hits were processed using in-house Python script filtering and accepting only orthologs at the threshold of at least 50% sequence identity and 50% sequence completeness between the query and subject sequences. Instead of using the tabular comparison for pathogenomic composition provided in VFDB release of 2012, the data collected from VFDB was used to construct a graphical heat map representation using our in-house R Scripts, allowing for the comparison of the virulence gene profiles across all Fusobacterium strains.

Pan-Genome Analyses

Functional ortholog clustering was performed using PGAP (Pan-genome Analysis Pipeline) (Zhao et al. 2012). The protein sequences of 21 F. nucleatum strains were used as the input file. The orthologs among these 21 strains were searched using the Gene Family (GF) method included in the PGAP pipeline. Protein sequences of each strain were clustered together and denoted with strain identifiers. BLASTALL program (Altschul et al. 1990) was performed among the protein sequences with the minimum score value of 50 and E-value at 1e-8. The filtered BLAST results were clustered by MCL algorithm (Enright et al. 2002). In order to group the same genes into the same cluster, the global match region must have at least 70% of the longer gene protein sequence (coverage) and 40% sequence identity.

Results and Discussion

Genome Overview and Annotation

The size of the sequenced genome of F. nucleatum W1481 was 2,477,971 bp, with a GC composition of 27.6%. About 2,163 coding sequences (CDSs) and 56 RNAs were predicted by RAST in W1481 (Ang et al. 2014). An illustration of genomic contents in the genome of W1481 is shown in figure 1. The subsystem distribution statistic of W1481 based on the genome annotation performed by RAST pipeline is presented in figure 1.
F

(A) Circular representation of the W1481 genome. The indication for each feature from the outermost layer: (a) Contigs (Dark Red); (b) Forward protein-coding genes (Orange); (c) Reverse protein-coding genes (Light Blue); (d) tRNAs (Light Green); (e) rRNAs (Dark Green); (f) Predicted Genomic Islands (Dark Blue); and (g) GC plot (Purple and Green). (B) Subcellular distribution as predicted by RAST.

(A) Circular representation of the W1481 genome. The indication for each feature from the outermost layer: (a) Contigs (Dark Red); (b) Forward protein-coding genes (Orange); (c) Reverse protein-coding genes (Light Blue); (d) tRNAs (Light Green); (e) rRNAs (Dark Green); (f) Predicted Genomic Islands (Dark Blue); and (g) GC plot (Purple and Green). (B) Subcellular distribution as predicted by RAST.

Phylogenetic Analysis Using Different Biomarker

Twenty-six (26) published Fusobacterium strains (including F. nucleatum strain W1481) were obtained from the National Center for Biotechnology Information (NCBI) database (Pruitt et al. 2007). Phylogenetic trees were reconstructed by maximum likelihood method using the Kimura 2-parameter (K2P) distance model. The 16S rRNA phylogenetic tree clearly clustered all the subspecies together, as previously shown (Manson McGuire et al. 2014). Based on the 16S rRNA phylogenetic tree (fig. 2), W1481 was clustered together with F. nucleatum strains, indicating that it was indeed a F. nucleatum species. To further confirm the taxonomic position of W1481, 271,947 Single Nucleotide Polymorphisms (SNPs) in the core genomes were concatenated and aligned. We reconstructed a more robust phylogenetic tree using the core-genome SNPs by using K2P distance model (fig. 2). The core-genome SNP-based tree showed W1481 was clearly clustered together with F. nucleatum strains, further supporting that W1481 is F. nucleatum.
F

Phylogenetic classification of W1481. (A) Phylogenetic tree constructed with 16S rRNA gene sequences of 26 strains of Fusobacterium strains. (B) Phylogenetic tree constructed with the core-genome SNPs of 23 strains of Fusobacterium.

Phylogenetic classification of W1481. (A) Phylogenetic tree constructed with 16S rRNA gene sequences of 26 strains of Fusobacterium strains. (B) Phylogenetic tree constructed with the core-genome SNPs of 23 strains of Fusobacterium. Interestingly, W1481 did not cluster with any of the known subspecies, pointing to the possibility that W1481 might be a novel subspecies of F. nucleatum.

Genomic Distance Based on Maximum Unique Matches Index (MUMi)

To further study the genetic relationship between W1481 and other subspecies of F. nucleatum at the subspecies level, we performed genomic distance-type index calculations. Genomic distance based on MUMi analysis was then performed by comparing two sets of data, in which the first set consisted of all the 36 genomes of Fusobacterium spp. and the second set consisted only of F. nucleatum genomes. The results of the MUMi analysis of the first set as summarized in figure 3 showed that W1481 was quite distant compared to other Fusobacterium strains/species with the range of values between 0.64 and 0.98. The closest strains to W1481 were all of F. nucleatum, reinforcing our view that this isolate is indeed F. nucleatum.
F

(A) MUMi analysis between W1481 and other Fusobacterium spp. The initial result showed that W1481 has the closest value (0.64) when compared to F. nucleatum subsp. animalis. (B) Comparison between the MUMi values among W1481 and the groups of five existing subspecies of F. nucleatum. The MUMi results of each subspecies are summarized and compared. The upper value of the box and whisker box showed the highest MUMi value, whereas the lower value showed the lowest MUMi value while the value in middle showed the average MUMi value when we performed comparison. A clear separation of W1481 from the rest of the known F. nucleatum subspecies is able to be performed at the threshold of 0.6342. (Note: FNA denotes subsp. animalis; FNV denotes subsp. vincentii; FNN denotes subsp. nucleatum; FNP denotes subsp. polymorphum and FNF denotes subsp. fusiforme).

(A) MUMi analysis between W1481 and other Fusobacterium spp. The initial result showed that W1481 has the closest value (0.64) when compared to F. nucleatum subsp. animalis. (B) Comparison between the MUMi values among W1481 and the groups of five existing subspecies of F. nucleatum. The MUMi results of each subspecies are summarized and compared. The upper value of the box and whisker box showed the highest MUMi value, whereas the lower value showed the lowest MUMi value while the value in middle showed the average MUMi value when we performed comparison. A clear separation of W1481 from the rest of the known F. nucleatum subspecies is able to be performed at the threshold of 0.6342. (Note: FNA denotes subsp. animalis; FNV denotes subsp. vincentii; FNN denotes subsp. nucleatum; FNP denotes subsp. polymorphum and FNF denotes subsp. fusiforme). Although W1481 had the closest value (0.64) with F. nucleatum subsp. animalis, it was still unclear if this value was distant enough to support W1481 as a novel subspecies since we did not have a specific cut-off to define a subspecies. To define a cut-off value, we calculated the MUMi distance between all pairs of known F. nucleatum subspecies. To use a very stringent criterion, we chose the highest value (most distant) to separate a pair of two known subspecies as a cut-off to discriminate subspecies. Our MUMi analysis showed that the highest value between the subsp. animalis (F. nucleatum subsp. animalis 11_3_2) and subsp. polymorphum (F. nucleatum subsp. polymorphum ATCC10953), which was 0.6342 (fig. 3). Therefore, we selected 0.6342 as a cut-off value to discriminate subspecies. Interestingly, the MUMi distances between W1481 with any strain of known subspecies were all above the defined cut-off, supporting our view that W1481 might be a novel subspecies. W1481 was distantly separated from all known subspecies compared to the distances between any pair of known subspecies (fig. 3). Supplementary table S1, Supplementary Material online, shows the full sets of MUMi values generated by comparing all 36 strains of Fusobacterium species. The MUMi results were further supported by the ANI and AAI analysis (using the same set of genomes with W1481 as the reference genome) which also showed that the closest strain to W1481, belonged to F. nucleatum subsp. animalis as shown in supplementary figure S1, Supplementary Material online.

Pan-Genome and Genome Composition Analysis

To further investigate the genomic structure and gene content of F. nucleatum W1481, we performed pan-genome analysis using the protein sequences of 21 F. nucleatum strains. All the genome sequences of Fusobacterium used in the study were also annotated using the RAST pipeline to ensure uniformity. The genome size of the strains analyzed ranged from 1.71 to 3.68 Mb with GC composition varying between 25.2% and 35.2%. An overview of all the 21 Fusobacterium genomes used in the present study is shown in table 1. The results showed that F. nucleatum pan-genome contained 6,666 gene clusters which were composed of 880 (13%) core gene clusters, 2,850 (43%) dispensable gene clusters and 2,936 (44%) strain-specific gene clusters. The accessory genes (dispensable and strain-specific genes) which cover 87% of the total F. nucleatum pan-genome suggested that this species contains a very diverse genomic structure.
Table 1

Overview of the genome statistics

#Strain nameStrain statusGenome size (Mb)GC content (%)Number of contigsContig N50Number of ORFsNumber of tRNAsNumber of rRNAs
1 Fusobacterium nucleatum W1481I2.4827.51867,1092,2194610
2 F. nucleatum subsp. animalis 11_3_2I2.726.981101,8072,7254714
3 F. nucleatum subsp. animalis 21_1AI2.152721,755,6142,083467
4 F. nucleatum subsp. animalis 3_1_33I2.292718266,6192,135443
5 F. nucleatum subsp. animalis 4_8C2.2627.112,2025615
6 F. nucleatum subsp. animalis 7_1I2.4726.89545,4442,477422
7 F. nucleatum subsp. animalis ATCC 51191I2.4825.24654,6792,443425
8 F. nucleatum subsp. animalis ChDC F324I2.2726.912339,2832,210350
9 F. nucleatum subsp. animalis D11I2.3426.935511,4762,387444
10 F. nucleatum subsp. animalis F0419I2.442752,257,4692,3453913
11 F. nucleatum subsp. nucleatum ATCC 23726I2.24276769,9052,143433
12 F. nucleatum subsp. nucleatum ATCC 25586C2.1727.212,1024715
13 F. nucleatum subsp. nucleatum ChDC F316I2.16278368,0852,112323
14 F. nucleatum subsp. polymorphum ATCC 10953C2.4326.812,3904510
15 F. nucleatum subsp. polymorphum F0401I2.512722,507,7212,5335119
16 F. nucleatum subsp. vincentii 3_1_27I2.22777111,9612,1154613
17 F. nucleatum subsp. vincentii 3_1_36A2C2.2727.112,189433
18 F. nucleatum subsp. vincentii 4_1_13I2.2526.950106,1632,207435
19 F. nucleatum subsp. vincentii ATCC 49256I2.1227.330214,2482,3214412
20 F. nucleatum subsp. vincentii ChDC F8I1.9926.918623,7381,833281
21 F. nucleatum subsp. fusiforme ATCC 51190I1.8427.219823,7811,7784511

Note.—C, complete genome; I, incomplete genome.

Overview of the genome statistics Note.—C, complete genome; I, incomplete genome. In order to know more about the genomic structure of F. nucleatum, we further predicted the pan-genome size by extrapolation through calculation of the gene clusters and core clusters for N genomes, where N is the number of F. nucleatum genomes (N = 1, 2, 3 ….19, 20, 21). To avoid bias, the pan-genome size and core genome for each of the permutations of genome comparisons was predicted. Results show that the pan-genome size of F. nucleatum increased with the increase in the number of genomes. Our mathematical functional model (Y = 1260.48511544134 *X**0.5 + 854.557552658844; where Y = pan-genome size; X = number of genomes added) (Laing et al. 2010) represents the pan-genome of F. nucleatum. From the curve (fig. 4) generated from this model, we can observe that F. nucleatum contains an open pan-genome, which suggested that F. nucleatum was able to acquire novel genes during the evolution of its genome. From the pan-genome results it was observed that 44% of the genes were strain-specific gene clusters. Further investigation revealed that, among the 21 F. nucleatum genomes, W1481 was the fourth highest in term of the number of strain-specific genes (281) (fig. 5).
F

Pan-genome and core genome size prediction for 21 F. nucleatum genomes. The blue diamonds, represent the functional pan-genome size for each combination and the blue dotted line represents the relationship between the number of genomes and functional pan-genome size. The green dots represent the core functional clusters for each combination and the green dotted line represents the relationship between the number of genomes and the number of core functional cluster.

F

Strain-specific genes for the 21 Fusobacterium nucleatum genomes.

Pan-genome and core genome size prediction for 21 F. nucleatum genomes. The blue diamonds, represent the functional pan-genome size for each combination and the blue dotted line represents the relationship between the number of genomes and functional pan-genome size. The green dots represent the core functional clusters for each combination and the green dotted line represents the relationship between the number of genomes and the number of core functional cluster. Strain-specific genes for the 21 Fusobacterium nucleatum genomes. Interestingly, except for one bacteriophage-type DNA polymerase and one phage-related protein, W1481 did not show the presence of any other phage-related proteins. This might be attributed to the presence of abortive infection proteins AbiGI and AbiGII which were present only in W1481. The abortive infection (Abi) system limits phage replication within a bacterial population by promoting cell death (O’Connor et al. 1999; Dy et al. 2014). W1481 also showed the presence of two distinct restriction modification system (RM) Type I and III (Murray 2000; Rao et al. 2014), as well as CRISPR-Cas system (Horvath and Barrangou 2010; Marraffini and Sontheimer 2010; Jore et al. 2012), conferring resistance against foreign genetic elements such as plasmids and bacteriophages. This was further supported by the fact that no prophage was detected in the W1481 genome by PHAST (PHAge Search Tool) (Zhou et al. 2011). Additionally, W1481 is equipped with a number of efflux systems. W1481 contains the macrolide efflux system (MacA and MacB) along with TolC. In Escherichia coli MacA and MacB, along with TolC, have been reported to confer resistance to erythromycin (Kobayashi et al. 2001). W1481 also carries genes encoding beta-lactamase, beta-lactamase domain protein, beta-lactamase-like protein and Zn-dependent hydrolase, thus, constituting a beta-lactamase superfamily protein in the genome, indicating that W1481 might be resistant against beta-lactam antibiotics. It also showed the presence of Spectinomycin 9-O-adenylyltransferase and a number of multi-drug resistance proteins and efflux pumps such as multidrug resistance protein 2, permease of the drug/metabolite transporter (DMT) superfamily, multi-antimicrobial extrusion protein (Na(+)/drug antiporter), MATE family of MDR efflux pumps and Na+ driven multidrug efflux pump. We also found the presence of hemolysin and putative large exoprotein involved in heme utilization or adhesion of ShlA/HecA/FhaA family in the W1481 genome. W1481 showed the presence of ATP-dependent oligopeptide transporter OppABCDF, a member of the ATP-Binding Cassette (ABC) Superfamily of transporters (Pearce et al. 1992). OppABCDF have been reported to transport oligopeptides up to five amino acids in length but not free amino acids as shown by studies based on binding affinity and competition assays (Guyer et al. 1986). The system is involved in the uptake of oligopeptides, as well as recycling of cell wall peptides (Hiles et al. 1987). OppA is the periplasmic substrate-binding component that binds oligopeptides (Tame et al. 1994), OppB and OppC are the membrane components of the ABC transporter, whereas OppD and OppF are the ATP-binding components of the ABC transporter (Pearce et al. 1992). Another ABC transport system present in W1481 is the nik operon which is responsible for uptake of nickel ions, an essential nutrient participating in various cellular processes. The first five genes of the nik operon (nikABCDE) were identified in W1481, however, nikR the DNA-binding protein repressing the transcription of nikABCDE (Navarro et al. 1993) was not detected in W1481. We also found two putative genes, ltrA (encoding low temperature requirement protein A). These ltrA genes were not present in the other F. nucleatum strains. ltrA genes are essential for bacterial growth at low temperature (Zheng and Kathariou 1995). We suggest that the presence of these genes in W1481 might allow it to survive in lower temperatures as compared to other F. nucleatum strains thus enabling it to survive in more diverse environments in term of temperature.

Genomic Island Prediction

Genomic islands were predicted for the genome of W1481 together with the other 20 F. nucleatum genomes by the IslandViewer 3 software. A total of 91 nonredundant GIs were predicted in all F. nucleatum genomes used in this study (supplementary table S2, Supplementary Material online). Of the 91 putative GIs, two GIs (GI41 and GI73) were conserved across all F. nucleatum genomes. Inside GI43, there are two genes which encode for pyruvate formate-lyase enzyme. Insertion of these GI to the F. nucleatum genomes allow catalyzation of the reversible conversion of pyruvate and coenzyme-A into formate and acetyl-CoA (Becker et al. 1999). For the W1481 genome, we found 23 GIs, of which three GIs (GI1, GI7, and GI10) were only present in W1481, and not in other F. nucleatum strains (supplementary table S2, Supplementary Material online). We found the presence of putative genes code for adenosylcobinamide amidohydrolase, which is required in the salvage pathway of cobinamide (Woodson and Escalante-Semerena 2004), and the components of cobalt chelatase (CobN, ChlI/ChlD) involved in B12 (cobalamin) biosynthesis (Rodionov et al. 2003) in the GI1. We have also found L(+)-tartrate dehydratase alpha subunit gene and L(+)-tartrate dehydratase beta subunit gene in another W1481 specific GI (GI7). These genes are involved in the catabolism of lactate and succinate (Do et al. 2015). Thus, GI1 and GI7 may allow W1481 to synthesize or catabolize cobinamide, lactate and succinate. In addition, we also observed some interesting genes and features in the GIs shared between W1481 and some of the F. nucleatum strains. For example, we found that the horizontally transferred GI3, which was shared by W1481 with the strains of subsp. animalis (11_3_2), subsp. fusiforme (ATCC51190) and subsp. nucleatum (ATCC23726), carries the tripartite ATP-independent (TRAP)-like transporter genes, encoding for TRAP-type C4-dicarboxylate transport system-periplasmic component, TRAP-type C4-dicarboxylate transport system-small permease component and TRAP-type C4-dicarboxylate transport system-large permease component. These TRAP transporters are a high-affinity transport system for the C4-dicarboxylates malate, succinate, and fumarate (Forward et al. 1997). Strikingly, the GI4 which was shared by W1481 with the strains of subsp. polymorphum (ATCC10953 & F401) and three strains of subsp. animalis (F0419, D11 & 7_1), contain a cluster of eight putative CRISPRs genes which may provide the host with acquired and heritable resistance to foreign DNA (Horvath and Barrangou 2010; Marraffini and Sontheimer 2010; Jore et al. 2012). The distribution of the GIs among the strains indicates that W1481 share a number of GIs with the strains of subsp. animalis, vincentii and polymorphum.

Comparative Pathogenomic Analysis

To better understand the pathogenic potential of W1481, we performed a comparative pathogenomic analysis of W1481 along with the strains of the five subspecies of F. nucleatum (fig. 6). W1481 showed the presence of a number of virulence genes which were shared by all or most of the other strains under study mostly related to lipopolysaccharide layer of the bacterial cell surface, such as galE (UDP-glucose 4-epimerase), galU (glucose-1-phosphate uridylyltransferase), gtaB (UTP–glucose-1-phosphate uridylyltransferase), acpXL (acyl carrier protein), kdsA (2-dehydro-3-deoxyphosphooctonate aldolase), lpxA (UDP-N-acetylglucosamine acyltransferase), gmhA/lpcA (phosphoheptose isomerase) and eno (phosphopyruvate hydratase). W1481 also shared the presence of groEL a 60-kDa chaperonin, htpB (Hsp60, 60K heat shock protein), hemL (glutamate-1-semialdehyde aminotransferase), tuf (elongation factor Tu) and plr/gapA (glyceraldehyde-3-phosphate dehydrogenase) with the other strains.
F

Comparative pathogenomic analysis of W1481. The threshold used in this analysis was 50% sequence identity and 50% sequence completeness.

Comparative pathogenomic analysis of W1481. The threshold used in this analysis was 50% sequence identity and 50% sequence completeness. One interesting observation was the presence of ddhABC genes which are related to CDP-glucose pathways. These genes were present in W1481 and only one other strain of F. nucleatum subsp. vincentii ATCC49256. ddhABC genes are involved in lipopolysaccharide O-antigen biosynthesis which have been reported to play a crucial role in effective colonization of host tissues, and evasion of the host immune system by gram negative bacteria (Lerouge and Vanderleyden 2002; Skurnik and Bengoechea 2003). None of the other strains under study showed the presence of these genes. W1481 also showed the presence of Putative O-antigen ligase WaaL which is responsible for the ligation of O-antigen onto the core region of the lipid A-core block a crucial step in the lipopolysaccharide biosynthetic pathway (Han et al. 2012). Another interesting set of genes were that of neuramic acid synthetase (neuB and neuB2) reported to be catalyzing the last step of the de novo sialic acid biosynthetic pathway, which is a major element of bacterial surface structure (Feng et al. 2012). Again these were present in only W1481 and two strains of F. nucleatum subsp. vincentii (ATCC49256 and 3_1_27) and one other strain, F. nucleatum subsp. polymorphum ATCC10953. It has been suggested that bacterial pathogens employ the strategy of “molecular mimicry” by incorporating sialic acid on their cell-surface to disguise themselves as host cells to circumvent and/or counteract the host’s immune responses (Severi et al. 2007). W1481 also possess Alpha-2-macroglobulin (A2M) which are located in the periplasm and are believed to trap external proteases through a covalent interaction with an activated thioester thus providing protection to the cell. Studies in Salmonella enterica ser. typhimurium have shown that A2M mimics the innate immune system of eukaryotes (Wong and Dessen 2014). The comparative analysis results suggest that W1481 might be slightly more effective in colonization and resisting the host immune system.

Conclusion

The results of phylogenetic and MUMi distance analyses all suggest the possibility that W1481 is likely a novel subspecies of F. nucletum. W1481 has several unique horizontally transferred GIs, indicating the HGT may play an important role in the evolution of this bacterium. Comparative analysis suggests that W1481 is well equipped to defend itself, colonize and survive within the host evading its immune system, although experimental verification is required to confirm these observations. This comparative in silico study provides novel insights into the genome of W1481 and lays the foundation for future experimental studies. Click here for additional data file.
  56 in total

1.  Structure and mechanism of the glycyl radical enzyme pyruvate formate-lyase.

Authors:  A Becker; K Fritz-Wolf; W Kabsch; J Knappe; S Schultz; A F Volker Wagner
Journal:  Nat Struct Biol       Date:  1999-10

2.  The nik operon of Escherichia coli encodes a periplasmic binding-protein-dependent transport system for nickel.

Authors:  C Navarro; L F Wu; M A Mandrand-Berthelot
Journal:  Mol Microbiol       Date:  1993-09       Impact factor: 3.501

3.  Comparative genomics of the vitamin B12 metabolism and regulation in prokaryotes.

Authors:  Dmitry A Rodionov; Alexey G Vitreschak; Andrey A Mironov; Mikhail S Gelfand
Journal:  J Biol Chem       Date:  2003-07-17       Impact factor: 5.157

4.  PHAST: a fast phage search tool.

Authors:  You Zhou; Yongjie Liang; Karlene H Lynch; Jonathan J Dennis; David S Wishart
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

5.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

6.  Transcriptomic analysis of three Veillonella spp. present in carious dentine and in the saliva of caries-free individuals.

Authors:  Thuy Do; Evelyn C Sheehy; Tonnie Mulli; Francis Hughes; David Beighton
Journal:  Front Cell Infect Microbiol       Date:  2015-03-26       Impact factor: 5.293

7.  FIGfams: yet another set of protein families.

Authors:  Folker Meyer; Ross Overbeek; Alex Rodriguez
Journal:  Nucleic Acids Res       Date:  2009-09-17       Impact factor: 16.971

8.  Genome Sequence of Fusobacterium nucleatum Strain W1481, a Possible New Subspecies Isolated from a Periodontal Pocket.

Authors:  Mia Yang Ang; David Dymock; Joon Liang Tan; Ming Hang Thong; Qin Kai Tan; Guat Jah Wong; Ian C Paterson; Siew Woh Choo
Journal:  Genome Announc       Date:  2014-02-13

9.  A widespread bacteriophage abortive infection system functions through a Type IV toxin-antitoxin mechanism.

Authors:  Ron L Dy; Rita Przybilski; Koen Semeijn; George P C Salmond; Peter C Fineran
Journal:  Nucleic Acids Res       Date:  2014-01-24       Impact factor: 16.971

Review 10.  Type III restriction-modification enzymes: a historical perspective.

Authors:  Desirazu N Rao; David T F Dryden; Shivakumara Bheemanaik
Journal:  Nucleic Acids Res       Date:  2013-07-17       Impact factor: 16.971

View more
  7 in total

1.  RNA landscape of the emerging cancer-associated microbe Fusobacterium nucleatum.

Authors:  Falk Ponath; Caroline Tawk; Yan Zhu; Lars Barquist; Franziska Faber; Jörg Vogel
Journal:  Nat Microbiol       Date:  2021-07-08       Impact factor: 17.745

2.  Comparative Analysis of Colon Cancer-Derived Fusobacterium nucleatum Subspecies: Inflammation and Colon Tumorigenesis in Murine Models.

Authors:  Jessica Queen; Jada C Domingue; James Robert White; Courtney Stevens; Barath Udayasuryan; Tam T D Nguyen; Shaoguang Wu; Hua Ding; Hongni Fan; Madison McMann; Alina Corona; Tatianna C Larman; Scott S Verbridge; Franck Housseau; Daniel J Slade; Julia L Drewes; Cynthia L Sears
Journal:  mBio       Date:  2022-02-08       Impact factor: 7.867

3.  High-resolution microbiome profiling uncovers Fusobacterium nucleatum, Lactobacillus gasseri/johnsonii, and Lactobacillus vaginalis associated to oral and oropharyngeal cancer in saliva from HPV positive and HPV negative patients treated with surgery and chemo-radiation.

Authors:  Rafael Guerrero-Preston; James Robert White; Filipa Godoy-Vitorino; Arnold Rodríguez-Hilario; Kelvin Navarro; Herminio González; Christina Michailidi; Anne Jedlicka; Sierra Canapp; Jessica Bondy; Amanda Dziedzic; Barbara Mora-Lagos; Gustavo Rivera-Alvarez; Carmen Ili-Gangas; Priscilla Brebi-Mieville; William Westra; Wayne Koch; Hyunseok Kang; Luigi Marchionni; Young Kim; David Sidransky
Journal:  Oncotarget       Date:  2017-09-07

4.  Forward Genetic Dissection of Biofilm Development by Fusobacterium nucleatum: Novel Functions of Cell Division Proteins FtsX and EnvC.

Authors:  Chenggang Wu; Abu Amar Mohamed Al Mamun; Truc Thanh Luong; Bo Hu; Jianhua Gu; Ju Huck Lee; Melissa D'Amore; Asis Das; Hung Ton-That
Journal:  MBio       Date:  2018-04-24       Impact factor: 7.867

5.  Comprehensive genome analysis of a pangolin-associated Paraburkholderia fungorum provides new insights into its secretion systems and virulence.

Authors:  Ka Yun Tan; Avirup Dutta; Tze King Tan; Ranjeev Hari; Rofina Y Othman; Siew Woh Choo
Journal:  PeerJ       Date:  2020-09-03       Impact factor: 2.984

Review 6.  Fusobacterium and Colorectal Cancer.

Authors:  Ziwei Zhou; Jiewen Chen; Herui Yao; Hai Hu
Journal:  Front Oncol       Date:  2018-10-15       Impact factor: 6.244

7.  The role of Fusobacterium nucleatum in colorectal cancer: from carcinogenesis to clinical management.

Authors:  Chun-Hui Sun; Bin-Bin Li; Bo Wang; Jing Zhao; Xiao-Ying Zhang; Ting-Ting Li; Wen-Bing Li; Di Tang; Miao-Juan Qiu; Xin-Cheng Wang; Cheng-Ming Zhu; Zhi-Rong Qian
Journal:  Chronic Dis Transl Med       Date:  2019-10-01
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.