| Literature DB >> 28793925 |
Andreas Zanzoni1, Lionel Spinelli2, Shérazade Braham2, Christine Brun2,3.
Abstract
BACKGROUND: Fusobacterium nucleatum is a gram-negative anaerobic species residing in the oral cavity and implicated in several inflammatory processes in the human body. Although F. nucleatum abundance is increased in inflammatory bowel disease subjects and is prevalent in colorectal cancer patients, the causal role of the bacterium in gastrointestinal disorders and the mechanistic details of host cell functions subversion are not fully understood.Entities:
Keywords: Bioinformatics; Colorectal cancer; Fusobacterium nucleatum; Inflammatory bowel diseases; Interaction network; Molecular mimicry; Secretome; Short linear motifs; Virulence proteins
Mesh:
Substances:
Year: 2017 PMID: 28793925 PMCID: PMC5551000 DOI: 10.1186/s40168-017-0307-1
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1Flow strategy of our computational approach
Fig. 2Disorder propensity of the FusoSecretome. SignalP-secreted proteins show a significantly higher fraction of disordered residues compared to non-secreted proteins (P value = 1.9 × 10−4, Kolgomorov–Smirnov test, two-sided)
Enrichment of Pfam domains in the FusoSecretome compared to non-secreted proteins
| Pfam domain | Pfam clana | Function | VFDBb | Humanc |
| non-secreted | Corrected |
|---|---|---|---|---|---|---|---|
| MORN repeat variant | MORN repeat | Invasion | – | – | 16 | 2 | 2.17 × 10−12 |
| Autotransporter beta-domain | Outer membrane beta-barrel protein superfamily | Adhesion | ✓ | – | 9 | 0 | 2.4 × 10−7 |
| Haemolysin secretion/activation protein ShlB/FhaC/HecB | Outer membrane beta-barrel protein superfamily | Secretion | ✓ | – | 5 | 0 | 0.002 |
| TonB-dependent Receptor Plug Domain | Ubiquitin superfamily | Transport | ✓ | – | 5 | 0 | 0.002 |
| TonB dependent receptor | Outer membrane beta-barrel protein superfamily | Transport | ✓ | – | 4 | 0 | 0.011 |
| Surface antigen variable number repeat | POTRA domain superfamily | Folding | – | – | 4 | 0 | 0.011 |
| YadA-like C-terminal region | Pilus subunit | Adhesion | ✓ | – | 4 | 0 | 0.011 |
| Haemagluttinin repeat | Pectate lyase-like beta helix | Adhesion | ✓ | – | 4 | 0 | 0.011 |
| Bacterial extracellular solute-binding proteins, family 5 Middle | Periplasmic binding protein clan | Transport | – | – | 5 | 4 | 0.082 |
| Coiled stalk of trimeric autotransporter adhesion | – | Adhesion | ✓ | – | 3 | 0 | 0.09 |
| Pyruvate flavodoxin/ferredoxin oxidoreductase, thiamine diP-bdg | Thiamin diphosphate-binding superfamily | Metabolism | – | – | 3 | 0 | 0.09 |
| TPR repeat | Tetratrico peptide repeat superfamily | Protein binding | ✓ | ✓ | 6 | 8 | 0.094 |
aA clan is defined as a collection of related Pfam entries sharing sequence or structural similarity
bPfam entry detected in at least one protein sequence stored in the database of known bacterial virulence factors
cPfam entry detected in at least one human protein sequence
dNumber of occurrences in the FusoSecretome
eNumber of occurrences in non-secreted proteins
fPfam domain matches with a corrected P value <0.1
Significant Gene Ontology and pathways annotations among FusoSecretome inferred human interactors
| Annotation source | Annotation ID | Annotation name | Corrected |
|---|---|---|---|
| Biological process | |||
| GO:0006415 | Translational termination | 7.57 × 10−41 | |
| GO:0006414 | Translational elongation | 2.65 × 10−34 | |
| GO:0006457 | Protein folding | 2.51 × 10−29 | |
| GO:0006413 | Translational initiation | 6.63 × 10−27 | |
| GO:0072376 | Protein activation cascade | 1.17 × 10−25 | |
| GO:0051604 | Protein maturation | 5.45 × 10−23 | |
| GO:0006614 | SRP-dependent cotranslational protein targeting to membrane | 1.93 × 10−22 | |
| GO:0006956 | Complement activation | 3.62 × 10−21 | |
| GO:0002697 | Regulation of immune effector process | 4.03 × 10−18 | |
| GO:0030449 | Regulation of complement activation | 4.10 × 10−18 | |
| Cellular component | |||
| GO:0005840 | Ribosome | 9.81 × 10−37 | |
| GO:0022626 | Cytosolic ribosome | 2.76 × 10−23 | |
| GO:0005912 | Adherens junction | 4.15 × 10−22 | |
| GO:0098552 | Side of membrane | 2.46 × 10−18 | |
| GO:0030055 | Cell-substrate junction | 2.85 × 10−17 | |
| GO:0072562 | Blood microparticle | 9.20 × 10−17 | |
| GO:0005761 | Mitochondrial ribosome | 4.28 × 10−12 | |
| GO:0019897 | Extrinsic component of plasma membrane | 2.35 × 10−09 | |
| GO:0005911 | Cell-cell junction | 4.54 × 10−08 | |
| GO:0031012 | Extracellular matrix | 3.91 × 10−07 | |
| KEGG | |||
| KEGG:03010 | Ribosome | 4.34 × 10−39 | |
| KEGG:04610 | Complement and coagulation cascades | 2.28 × 10−22 | |
| KEGG:04514 | Cell adhesion molecules (CAMs) | 2.88 × 10−09 | |
| KEGG:04141 | Protein processing in endoplasmic reticulum | 2.03 × 10−08 | |
| KEGG:04660 | T cell receptor signaling pathway | 1.33 × 10−07 | |
| KEGG:05150 |
| 1.78 × 10−06 | |
| KEGG:04380 | Osteoclast differentiation | 7.25 × 10−06 | |
| KEGG:05203 | Viral carcinogenesis | 8.21 × 10−06 | |
| KEGG:05169 | Epstein-Barr virus infection | 1.11 × 10−05 | |
| KEGG:05164 | Influenza A | 1.77 × 10−05 | |
| Reactome | |||
| REAC:1,592,389 | Activation of Matrix Metalloproteinases | 4.77 × 10−28 | |
| REAC:192,823 | Viral mRNA Translation | 2.14 × 10−19 | |
| REAC:156,902 | Peptide chain elongation | 2.14 × 10−19 | |
| REAC:975,956 | Nonsense Mediated Decay independent of the Exon Junction Complex | 1.83 × 10−17 | |
| REAC:977,606 | Regulation of Complement cascade | 5.13 × 10−16 | |
| REAC:202,733 | Cell surface interactions at the vascular wall | 7.22 × 10−16 | |
| REAC:5,368,287 | Mitochondrial translation | 3.61 × 10−14 | |
| REAC:3,371,453 | Regulation of HSF1-mediated heat shock response | 3.92 × 10−14 | |
| REAC:3,371,599 | Defective HLCS causes multiple carboxylase deficiency | 1.33 × 10−08 | |
| REAC:420,597 | Nectin/Necl trans heterodimerization | 1.33 × 10−08 | |
For each annotation source, the ten most significant terms are reported. The full list of annotation enrichments is available in Additional file 8: Table S6
Fig. 3Topological properties of inferred human interactors in the human interactome. a Inferred human interactors have more interaction partners and b higher values of betweenness centrality compared to non-interacting proteins in the human interactome
Network module significantly enriched in inferred human interactors
| Module | Module proteins | Interactors | Inferred interactions |
| Annotations |
|---|---|---|---|---|---|
| 9 | 74 | 13 | 188 | 64 | Immune response-regulating cell surface receptor signaling pathway (GO:0002768), cell-cell junction (GO:0005911) |
| 16 | 67 | 12 | 122 | 65 | Metal ion homeostasis (GO:0055065), cell surface (GO:0009986) |
| 19 | 80 | 15 | 260 | 55 | Cellular response to organonitrogen compound (GO:0071417), membrane raft (GO:0045121) |
| 42 | 140 | 19 | 312 | 56 | Endocytosis (GO:0006897), membrane raft (GO:0045121) |
| 74 | 113 | 26 | 78 | 26 | Extracellular structure organization (GO:0043062), cell surface (GO:0009986) |
| 78 | 292 | 54 | 752 | 100 | Immune response-regulating cell surface receptor signaling pathway (GO:0002768), cell surface (GO:0009986) |
| 89 | 45 | 10 | 169 | 48 | Immune response-activating cell surface receptor signaling pathway (GO:0002429), nucleolar ribonuclease P complex (GO:0005655) |
| 90 | 51 | 10 | 28 | 12 | I-kappaB kinase/NF-kappaB cascade (GO:0007249), inclusion body (GO:0016234) |
| 138 | 126 | 22 | 57 | 26 | I-kappaB kinase/NF-kappaB cascade (GO:0007249), perinuclear region of cytoplasm (GO:0048471) |
| 165 | 81 | 17 | 327 | 71 | Neuron projection guidance (GO:0097485), synapse (GO:0045202) |
| 194 | 38 | 12 | 119 | 48 | G1/S transition of mitotic cell cycle (GO:0000082), cyclin-dependent protein kinase holoenzyme complex (GO:0000307) |
| 216 | 47 | 11 | 261 | 80 | Blood coagulation (GO:0007596), membrane raft (GO:0045121) |
| 246 | 50 | 15 | 83 | 32 | T cell activation (GO:0042110), Golgi membrane (GO:0000139) |
| 277 | 25 | 7 | 7 | 2 | Collagen catabolic process (GO:0030574), extracellular matrix (GO:0031012) |
| 298 | 106 | 16 | 242 | 75 | Actin cytoskeleton organization (GO:0030036), Arp2/3 protein complex (GO:0005885) |
| 300 | 37 | 9 | 164 | 49 | Stress-activated MAPK cascade (GO:0051403), nuclear speck (GO:0016607) |
| 371 | 25 | 7 | 79 | 48 | Actin filament organization (GO:0007015), lamellipodium (GO:0030027) |
| 433 | 38 | 9 | 121 | 49 | Positive regulation of intracellular protein kinase cascade (GO:0010740), spindle (GO:0005819) |
| 451 | 40 | 9 | 55 | 34 | Mitotic cell cycle phase transition (GO:0044772), heterochromatin (GO:0000792) |
| 456 | 36 | 9 | 142 | 59 | Regulation of system process (GO:0044057), dendrite (GO:0030425) |
| 563 | 26 | 8 | 43 | 29 | Regulation of sequence-specific DNA binding transcription factor activity (GO:0051090), external side of plasma membrane (GO:0009897) |
| 571 | 42 | 11 | 206 | 53 | Cell cycle phase transition (GO:0044770), transcription factor complex (GO:0005667) |
| 577 | 33 | 10 | 66 | 48 | Complement activation (GO:0006956), ER membrane insertion complex (GO:0072379) |
| 587 | 17 | 6 | 109 | 46 | Axonogenesis (GO:0007409), signalosome (GO:0008180) |
| 615 | 36 | 11 | 26 | 10 | Response to unfolded protein (GO:0006986), perinuclear region of cytoplasm (GO:0048471) |
| 625 | 25 | 8 | 226 | 104 | Regulation of sequence-specific DNA binding transcription factor activity (GO:0051090), chromatin (GO:0000785) |
| 689 | 38 | 14 | 151 | 72 | Blood coagulation (GO:0007596), apical junction complex (GO:0043296) |
| 702 | 23 | 8 | 157 | 46 | Peptidyl-tyrosine phosphorylation (GO:0018108), nucleolar ribonuclease P complex (GO:0005655) |
| 745 | 18 | 6 | 113 | 46 | Axon guidance (GO:0007411), cell leading edge (GO:0031252) |
| 794 | 22 | 7 | 151 | 46 | Gamma-aminobutyric acid signaling pathway (GO:0007214), postsynaptic membrane (GO:0045211) |
| 831 | 15 | 6 | 129 | 45 | Fc-gamma receptor signaling pathway involved in phagocytosis (GO:0038096), cell leading edge (GO:0031252 |
For each module the following information is reported: identifier, number of constituent proteins, number of inferred human interactors in the module, number of inferred interactions between proteins in the module and FusoSecretome proteins, number of interacting FusoSecretome proteins, representative annotations (Biological Process and Cellular Component) selected as the most frequent and significantly enriched annotations for the given module (for the complete list of functional annotations see Additional file 11: Table S9)
List of the main candidate virulence proteins in the FusoSecretome
| UniprotKB AC | Protein name | Gene symbol | Domains | Interacting domains | Interacting SLiMs |
|---|---|---|---|---|---|
| Q8RIM1 | Fusobacterium outer membrane protein family | FN1554 | Autotransportera | – | LIG_FHA_1, LIG_FHA_2, LIG_PP1b, LIG_SH2_SRCb, LIG_SH2_STAT5, LIG_SH3_3b, LIG_SUMO_SBM_1b, MOD_N-GLC_1b, TRG_ENDOCYTIC_2b |
| Q8RGK2 | Hemolysin | FN0291 | Fil_haemagg_2a | – | LIG_FHA_1, LIG_FHA_2, LIG_Rb_pABgroove_1b, LIG_SH2_GRB2, LIG_SH2_SRCb, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_N-GLC_1b, MOD_PIKK_1, TRG_ENDOCYTIC_2b |
| Q8RGT9 | Peptide methionine sulfoxide reductase MsrA | msrA | PMSR, SelR | PMSR, SelR | CLV_PCSK_PC1ET2_1, LIG_FHA_1, LIG_SH2_GRB2, LIG_SH2_SRCb, LIG_SH2_STAT5, MOD_Cter_Amidation, MOD_PIKK_1, TRG_ENDOCYTIC_2b |
| Q8RHB9 | Hypothetical exported 24-amino acid repeat protein | FN2118 | MORN_2 | – | LIG_SH2_GRB2, LIG_SH2_SRCb |
| Q8R609 | Pyruvate-flavodoxin oxidoreductase | FN1421 | POR_N, POR, EKR, Fer4_7, TPP_enzyme_C | TPP_enzyme_C | LIG_CYCLIN_1b, LIG_SH2_GRB2, LIG_SH2_STAT5, LIG_SH3_3b, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_PIKK_1, MOD_ProDKin_1, TRG_ENDOCYTIC_2b |
| Q8RH03 | Chaperone protein DnaJ | dnaJ | DnaJ, DnaJ_CXXCXGXG, CTDII | DnaJ, DnaJ_CXXCXGXG | CLV_NDR_NDR_1, CLV_PCSK_SKI1_1, LIG_CYCLIN_1b, LIG_FHA_2, LIG_SH2_STAT5, LIG_SH3_3b, LIG_SUMO_SBM_1b, LIG_TRAF2_1b, LIG_WW_Pin1_4, MOD_CK2_1b, MOD_PLK, MOD_ProDKin_1, TRG_ENDOCYTIC_2b |
| Q8R643 | Pyruvate-flavodoxin oxidoreductase | FN1170 | POR_N, POR, EKR, Fer4_7, TPP_enzyme_C | TPP_enzyme_C | LIG_BRCT_BRCA1_1, LIG_SH2_GRB2, LIG_SH2_STAT5, LIG_SH3_3b, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_ProDKin_1, TRG_ENDOCYTIC_2b |
| Q8RDQ9 | Fusobacterium outer membrane protein family | FN1449 | – | – | CLV_PCSK_SKI1_1, LIG_FHA_1, LIG_FHA_2, LIG_PDZ_Class_2, LIG_SH2_SRCb, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, MOD_GSK3_1, MOD_N-GLC_1b, MOD_PLK |
| Q8R608 | Serine protease | FN1426 | Peptidase_S8a, Autotrns_rpta, Autotransportera | Peptidase_S8 | CLV_PCSK_SKI1_1, LIG_FHA_2, LIG_PDZ_Class_2, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_PKA_2, TRG_ENDOCYTIC_2b |
| Q8RFV3 | Hypothetical cytosolic protein | FN0579 | MG1, A2M_N, A2M_N_2, A2M | A2M_N, A2M_N_2, A2M | LIG_CYCLIN_1b, LIG_FHA_2, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, MOD_CK2_1b, MOD_PIKK_1, MOD_PKA_2 |
| Q8R5P1 | DNAse I homologous protein DHP2 | FN0891 | Exo_endo_phos | Exo_endo_phos | LIG_SH2_GRB2, TRG_ENDOCYTIC_2b |
| Q8R5Y8 | Biotin carboxyl carrier protein of glutaconyl-COA decarboxylase | FN0200 | Biotin_lipoyl | Biotin_lipoyl | LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_ProDKin_1, MOD_SUMOb |
| Q8R6D6 | Serine protease | FN1950 | Peptidase_S8a, Autotransportera | Peptidase_S8 | LIG_FHA_2, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_PKA_2 |
| Q8RE26 | Single-stranded DNA-binding protein | ssb | SSBa | SSB | LIG_BRCT_BRCA1_1, LIG_FHA_1, LIG_FHA_2, LIG_PDZ_Class_2, LIG_SUMO_SBM_1b, MOD_PKA_2, TRG_ENDOCYTIC_2b |
| Q8REJ1 | Dipeptide-binding protein | FN1111 | SBP_bac_5 | – | CLV_PCSK_SKI1_1, LIG_BRCT_BRCA1_1, LIG_CYCLIN_1b, LIG_SH2_GRB2, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK2_1b, MOD_ProDKin_1 |
| Q8RG20 | Hemin receptor | FN0499 | Pluga | – | CLV_PCSK_PC1ET2_1, LIG_CYCLIN_1b, LIG_FHA_1, LIG_MAPK_1, LIG_PDZ_Class_2, LIG_SH2_STAT5, LIG_SH3_3b, LIG_SUMO_SBM_1b, MOD_GSK3_1, MOD_N-GLC_1b |
| Q8RGC9 | Iron(III)-binding protein | FN0375 | SBP_bac_8 | – | LIG_PP1b, LIG_SH2_GRB2, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK1_1, MOD_ProDKin_1 |
| Q8RGZ4 | Hemolysin activator protein | FN0131 | POTRA_2a, ShlBa | – | CLV_PCSK_PC1ET2_1, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, LIG_TRAF2_1b |
| Q8RH05 | Chaperone protein DnaK | dnaK | HSP70 | HSP70 | CLV_PCSK_PC1ET2_1, CLV_PCSK_SKI1_1, LIG_BRCT_BRCA1_1, LIG_EVH1_1, LIG_FHA_2, LIG_SH2_STAT5, LIG_SH3_3b, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_PIKK_1, MOD_PLK, MOD_ProDKin_1, TRG_ENDOCYTIC_2b, TRG_LysEnd_APsAcLL_1b |
| Q8RHM4 | Tetratricopeptide repeat protein | FN1990 | DnaJ | DnaJ | – |
| Q8RHU4 | Hypothetical lipoprotein | FN1899 | DUF3798 | – | CLV_C14_Caspase3–7, LIG_FHA_1, LIG_FHA_2, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK2_1b, MOD_GSK3_1, MOD_PIKK_1, MOD_PLK, MOD_ProDKin_1 |
| Q8RI19 | Hemolysin | FN1817 | Fil_haemagg_2a | – | LIG_CYCLIN_1b, LIG_FHA_1, LIG_FHA_2, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK1_1, MOD_CK2_1b, MOD_GSK3_1, MOD_N-GLC_1b, MOD_PIKK_1, MOD_PLK, MOD_ProDKin_1, TRG_ENDOCYTIC_2b |
| Q8RI47 | Tetratricopeptide repeat family protein | FN1787 | TPR_11a, TPR_1 | TPR_11, TPR_1 | – |
| Q8RIF8 | 50S ribosomal protein L2 | rplB | Ribosomal_L2, Ribosomal_L2_C | Ribosomal_L2, Ribosomal_L2_C | CLV_PCSK_SKI1_1, LIG_BIR_II_1, LIG_PP1b, LIG_SH3_3b, LIG_SUMO_SBM_1b, LIG_USP7_1b, LIG_WW_Pin1_4, MOD_PLK, MOD_ProDKin_1, TRG_ENDOCYTIC_2b |
| Q8RII5 | Competence protein | FN1611 | HHH_3 | – | LIG_TRAF2_1b, MOD_CK2_1 b |
| Q8RIK0 | Hypothetical lipoprotein | FN1590 | DUF3798 | – | CLV_C14_Caspase3–7, LIG_FHA_1, LIG_FHA_2, LIG_SH2_STAT5, LIG_SUMO_SBM_1b, LIG_WW_Pin1_4, MOD_CK2_1b, MOD_GSK3_1, MOD_PIKK_1, MOD_PLK, MOD_ProDKin_1, TRG_ENDOCYTIC_2b |
For every protein, the detected Pfam domains are reported along with the list of domains and SLiMs for which at least one interaction has been inferred
aPfam entry detected in at least one protein sequence stored in the database of known bacterial virulence factors
bMotif for which it was experimentally identified at least one instance in a viral or bacterial protein
Fig. 4Interaction network between FusoSecretome candidate virulence proteins and preferentially targeted modules. a Candidate virulence proteins are depicted as green rectangular nodes labeled with respective gene symbol, whereas network modules as orange circles, whose size is proportional to the number of proteins belonging to each module and are labeled with the corresponding identifier. Edge width is proportional to the number of inferred interactions of a virulence protein with a given module. Network modules enriched in gut-related disease gene sets are labeled with symbols of different colors (i.e., light blue star: Crohn’s disease, CD; dark blue star: Inflammatory bowel disease, IBD; violet star: genes whose expression correlates with F. nucleatum abundance in colorectal cancer patients, FusoExpr; rose star: genes mutated in colorectal cancer, CRC-mutated; rose zig-zag arrow: dysregulated expression during colorectal cancer progression, CRC-dysregulated). b The protein Fap2 (FN1449) interacts with 9 proteins (nodes with a green border) of Module 9 and c the MORN2 domain containing protein (FN2118) interacts with 8 proteins in Module 89
Fig. 5Enrichment of FusoSecretome inferred human interactors and gut disease related proteins in network modules. Each column of the heatmap represents a module. The color of the cells corresponds to the log-transformed enrichment ratio. Pink circles indicate enriched sets. Modules showing a significant dysregulation in CRC progression are highlighted by an empty circle with green border. For the six modules showing an enrichment in inferred interactions and at least in one of gut disease related proteins, the most representative functions are reported. FusoExpr: genes whose expression correlates with F. nucleatum abundance in CRC patients; CRC: genes mutated in colorectal cancer samples; IBD: genes associated to inflammatory bowel disease; CD and UC: genes specifically associated to Crohn’s disease and ulcerative colitis respectively