Jianhua Li1, Mohammed Zakariah2, Abdul Malik3, Mohammad Shamsul Ola4, Rabbani Syed3, Anis Ahmad Chaudhary5, Shahanavaj Khan6,7. 1. Department of General Surgery Ⅰ, Xinxiang Central Hospital, Xinxiang City, Henan Province 453000, People's Republic of China. 2. Research Center, College of Computer and Information Science, King Saud University, Riyadh, Saudi Arabia. 3. Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia. 4. Department of Biochemistry, College of Science, King Saud University, Riyadh, Saudi Arabia. 5. Department of Biology, College of Science, Imam Mohammad Ibn Saud Islamic University, Riyadh, Saudi Arabia. 6. Bioinformatics and Biotechnology Unit, Department of Bioscience, Shri Ram Group of College (SRGC), Muzaffarnagar, UP, India. 7. Nano-Biotechnology Unit, Department of Pharmaceutics, College of Pharmacy, King Saud University, Riyadh 11451, Saudi Arabia.
Abstract
BACKGROUND: Infections of Salmonella typhimurium (S. typhimurium) are major threats to health, threats include diarrhoea, fever, acute intestinal inflammation, and cancer. Nevertheless, little information is available about the involvement of S. typhimurium in colon cancer etiology. METHODS: The present study was designed to predict nuclear targeting of S. typhimurium proteins in the host cell through computational tools, including nuclear localization signal (NLS) mapper, Balanced Subcellular Localization predictor (BaCeILo), and Hum-mPLoc using next-generation sequencing data. RESULTS: Several gene expression-associated proteins of S. typhimurium have been predicted to target the host nucleus during intracellular infections. Nuclear targeting of S. typhimurium proteins can lead to competitive interactions between the host and pathogen proteins with similar cellular substrates, and it may have a possible involvement in colon cancer growth. Our results suggested that S. typhimurium releases its proteins within compartments of the host cell, where they act as a component of the host cell proteome. Protein targeting is possibly involved in colon cancer etiology during intracellular bacterial infection. CONCLUSION: The results of current in-silico study showed the potential involvement of S. typhimurium infection with alteration in normal functioning of host cell which act as possible factor to connect with the growth and development of colon cancer.
BACKGROUND: Infections of Salmonella typhimurium (S. typhimurium) are major threats to health, threats include diarrhoea, fever, acute intestinal inflammation, and cancer. Nevertheless, little information is available about the involvement of S. typhimurium in colon cancer etiology. METHODS: The present study was designed to predict nuclear targeting of S. typhimurium proteins in the host cell through computational tools, including nuclear localization signal (NLS) mapper, Balanced Subcellular Localization predictor (BaCeILo), and Hum-mPLoc using next-generation sequencing data. RESULTS: Several gene expression-associated proteins of S. typhimurium have been predicted to target the host nucleus during intracellular infections. Nuclear targeting of S. typhimurium proteins can lead to competitive interactions between the host and pathogen proteins with similar cellular substrates, and it may have a possible involvement in colon cancer growth. Our results suggested that S. typhimurium releases its proteins within compartments of the host cell, where they act as a component of the host cell proteome. Protein targeting is possibly involved in colon cancer etiology during intracellular bacterial infection. CONCLUSION: The results of current in-silico study showed the potential involvement of S. typhimurium infection with alteration in normal functioning of host cell which act as possible factor to connect with the growth and development of colon cancer.
About 20% of the overall burden of cancer can presently be associated with infections of different types of agents, such as viruses, parasites and bacteria.12,58 A largely neglected aspect of infectious diseases is the connection of chronic bacterial infections with the growth and development of cancer,4 among very few bacterial infections associated with cancer development to date2,3,47 with respect to viral-mediated cancers.7,13 Although studies have shown that specific strains of bacteria are associated with the growth of different types of human cancers but their molecular mechanism is not well understood. Previous studies have connected specific bacterial species to the carcinogenesis of different types of cancers, however, various bacterial species have emerged as therapeutic agents in the prevention, diagnosis, or management of cancers.34Different bacterial strains may be connected with the growth and development of different types of cancers by stimulating DNA damaging processes through toxins, inflammation and alteration in metabolites, and normal cell signaling pathways of the host during infections.6,53 For example, pathogenic strains of E. coli have the ability to change normal functions of the host cell through induction of chronic inflammation and interference with the host cell cycle, recommending a potential connection between specific bacteria and cancers.18,22 Similarly, Helicobacter pylori has been connected as a possible risk factor for the growth of various types of stomach cancers.43 Bacteria has the ability to change normal functions of the host cell during infection, as demonstrated by pathogenic species of Salmonella, which manipulate signaling pathways of the host cell for intracellular survival and bacterial uptake.11If specific bacterial strains can induce cancer development by altering the cell signaling in the host,10 it is possible that infections of specific bacterial strains would increase the risk of cancer. In particular, this could occur in cases of long-term bacterial infections, where the possibility of targeting a formerly pre-transformed cell is higher. The effector proteins of Salmonella activate the AKT/ERK signalling pathway in host cells. It has been observed that the AKT/ERK pathway is also stimulated in different types of cancers.48 The effector AvrA of Salmonella stimulates β-catenin signaling in the infected host cell, which supports carcinogenesis in the colon of mice32,33 Although these studies demonstrated the association of bacterial infection in the growth of colon and colorectal cancers in human clinical specimens and experimental mouse models,32 it remains uncertain whether Salmonella infections acts as a possible cause for colon cancer in humans. The frequency of colon cancer increases over time through various unidentified potential factors.5,56To determine whether Salmonella infections represent another possible factor for colon cancer development, we predicted the nuclear targeting of Salmonella typhimurium strain Ty proteins in the host cell using next-generation sequencing data of whole proteome from the UniProt database. Moreover, we examined the implication of such nuclear targeting proteins in the etiology of colon cancer during Salmonella infection.
Methods
Selection of Salmonella typhimurium Proteome
The database of Universal Protein Resource (UniProt) was utilized for the selection of specific strains of S. typhimurium to analyze nuclear protein targeting in the current work.1 The whole S. typhimurium proteome was retrieved from UniProt and utilized to predict nuclear protein targeting in host cells and their involvement in colon cancer development.36,45
Bioinformatics Tools for Prediction of Salmonella typhimurium Nuclear Targeting Proteins in Host Cells
S. typhimurium LT2 strain whole proteome was selected for computation of nuclear targeting proteins in the host cell by employing cNLS mapper, Balanced Subcellular Localization (BaCelLo), and Hum-mPLoc 2.0 bioinformatics tools.45
cNLS Mapper for Prediction of Nuclear Localization Signals in Salmonella typhimurium Proteins
The whole proteome of S. typhimurium LT2 was utilized to predict the nuclear localization signal (NLS) using the bioinformatics tool cNLS mapper.26 The cNLS mapper generated activity-based reports for diverse categories of importin-α-dependent NLSs, which characterize the functional roles of diverse amino acids at each position within an NLS class. S. typhimurium protein sequences were predicted as follows: particularly targeted to the cytoplasm, targeted to the cytoplasm as well as the nucleus, partially targeted to the nucleus, and particularly targeted to the nucleus with a specific range of cutoff values of 1–2, 3–5, 7–8, and 8–10, respectively, as demonstrated in the previous cNLS literature.26
BaCeILo Predictor for Prediction of Salmonella typhimurium Nuclear Localization Proteins
Nuclear targeting proteins of S. typhimurium LT2 were predicted using the Balanced Subcellular Localization (BaCeILo) tool. The BaCeILo predictor is an important bioinformatics software for the prediction of protein localization in the eukaryotic cell. It is worked on diverse support vector machines (SVMs) that can predict subcellular protein targeting in five different organelles of eukaryotes, such as the nucleus, mitochondrion, cytoplasm, plasma membrane (secretory proteins) and chloroplast.42
Hum-mPLoc 3.0 Predictor for Prediction of Salmonella typhimurium Nuclear Localization Proteins
The Hum-mPLoc 2.0 predictor was employed to confirm nuclear protein targeting in humans using whole proteins from the S. typhimurium LT2 proteome. The bioinformatics tool Hum-mPLoc 2.0 operates on a top-down networking system.50 The bioinformatics Hum-mPLoc 2.0 predictor can predict protein targeting in 14 different compartments of the cell, including the cytoplasm, mitochondria, endoplasmic reticulum, centrioles, Golgi apparatus, and nucleus.
Results
The UniProt database is a widespread source for protein sequences, which was developed through the collection of Swiss-Prot, PIR and TrEMBL protein database information.1 This is a freely accessible comprehensive resource and database for data annotation and protein sequence.44 The proteome of S. typhimurium LT2 strain was chosen to analyze nuclear targeting proteins in the host cell, which contained the highest number of proteins sequences,45 with respect to all existing strains.36Different bioinformatic tools were used in the current study, such as cNLS mapper, Balanced Subcellular Localization (BaCelLo), and Hum-mPLoc 2.0 software, for the analysis of S. typhimurium nuclear targeting proteins. These predictors were selected for the prediction of nuclear targeting proteins.It was observed that increase in molecular weight constantly enhanced nuclear targeting, except for the range of 20–40 kDa. The 20–40 kDa molecular weight proteins were targeted primarily in the nucleus of the host cell. Moreover, the isoelectric point (pI) value did not demonstrate specific pattern for nuclear targeting. The patterns of S. typhimurium proteins targeting in the host cell nucleus using different parameters are described in Figure 1. The whole proteins targeting of S. typhimurium in various components of host cells with diverse parameters is illustrated in graph (Figure 2). shows details characteristics of nuclear targeted proteins of host cell.
Figure 1
Computational prediction of nuclear targeting of S. typhimurium proteins in host cells and their relationship with various parameters.
Figure 2
Computational prediction of total targeting of S. typhimurium proteins in host cells and their relationship with various parameters.
Computational prediction of nuclear targeting of S. typhimurium proteins in host cells and their relationship with various parameters.Computational prediction of total targeting of S. typhimurium proteins in host cells and their relationship with various parameters.Briefly, 141 nuclear targeted proteins of S. typhimurium were predicted with different values of monopartite and bipartite NLSs. Among those, 121, 9, 10, and 1 proteins were found to have 0–3.0, 3.0–5.0, 5.0–8.0, and >8.0 cutoff values for monopartite NLSs, respectively. Furthermore, among the 141 nuclear targeted proteins, 32, 84, 23, and 2 proteins were found to have 0–3.0, 3.0–5.0, 5.0–8.0, and >8.0 cutoff values for bipartite NLSs, respectively (Figure 1). Increasing the cutoff value of monopartite NLS is connected with declined nuclear targeting, and a similar pattern was found with bipartite NLS values, except for cutoff values in the range of 3.0–5.0 (Figure 1). The detailed results of nuclear targeting proteins after synchronization are demonstrated in .Similarly, whole protein sequences (4718) of S. typhimurium were predicted with different values of monopartite and bipartite NLSs. Among those, 4356, 173, 135, and 54 proteins were found to have 0–3.0, 3.0–5.0, 5.0–8.0, and >8.0 cutoff values for monopartite NLSs, respectively. Furthermore, among 4718 proteins, 1635, 2564, 499, and 20 proteins were found to have 0–3.0, 3.0–5.0, 5.0–8.0, and >8.0 cutoff values for bipartite NLSs, respectively (Figure 2).The outcomes of BaCeILo showed that 331 S. typhimurium proteins were targeted to the host cell nucleus. The results of prediction analysis also demonstrated that 2069 proteins were targeted in the cytoplasm, 921 in the mitochondria, and 1397 were secretory proteins (Table 1).
Table 1
Details of Possible Protein Targeting in Different Subcellular Locations of the Host Cell by BaCeILo and Hum-mPLoc 2.0, Using the Complete Salmonella typhimurium Proteome
Protein Targeting in Different Subcellular Locations
Number of Proteins Predicted by BaCeILo
Number of Proteins Predicted byHum-mPLoc 2.0
Nucleus
331
867
Mitochondria
921
654
Cytoplasm
2069
707
Secretory/Plasma membrane
1397
–
Plasma Membrane
–
289
Endoplasmic reticulum
–
215
Golgi apparatus
–
15
Peroxisome
–
64
Microsome
–
01
Lysosome
–
18
Extracellular
–
444
Centrosome
–
01
Cytoskeleton
–
0
Multiple location
–
1360
Unknown
–
83
Details of Possible Protein Targeting in Different Subcellular Locations of the Host Cell by BaCeILo and Hum-mPLoc 2.0, Using the Complete Salmonella typhimurium Proteome
Hum-mPLoc 2.0 Predictor for Prediction of Salmonella typhimurium Nuclear Localization Proteins
To increase the prediction efficiency, we used the Hum-mPLoc bioinformatics tool, which operates using a top-down strategy to predict protein targeting in different components of human cells, including the nucleus.49 The results of S. typhimurium protein targeting illustrated exclusive targeting of proteins in the following regions using the Hum-mPLoc tool: 867 in the nucleus, 707 in the cytoplasm, 654 in the mitochondria, 289 in the plasma membrane, 215 in the endoplasmic reticulum, 15 in the Golgi apparatus, 64 in peroxisomes, 1 in microsomes, 18 in the lysosome, extracellular 444, 1 in the centrosome, and 83 in unknown locations. The prediction results showed 1360 targeted proteins in multiple organelles (Table 1).
Synchronization of BaCeILo and Hum-mPLoc 2.0 Results
The outcome of synchronization showed that among BaCeILo predicted total nuclear (331), mitochondrial (921), secretory (1397), and cytoplasmic (2069) proteins. When the BaCeILo predicted proteins were compared with the Hum-mPLoc 2.0 results, only 141 proteins showed consistent results with respect to nuclear targeting proteins.Detailed results and properties of nuclear targeting proteins are illustrated in . Recent evidences showed the connection of Salmonella infection with the development of colorectal cancer in human.38 It is observed that the effector AvrA of Salmonella activates host β-catenin signaling pathways which promotes the carcinogenesis in colon of mice.32,33 Our results showed the targeting of bacterial S. typhimurium DNA mismatch repair protein MutS (Accession no. P0A1Y1) in the nucleus of host cells. DNA mismatch repair proteins (MMRPs) are ubiquitous performers in a wide range of main cellular functions.16 The rate of spontaneous mutation increased due to alteration in the MMR, which affects the growth of different types of cancer, including colon cancer.17 A previous reportshowed the possible role of Escherichia coli DNA MMRPs in the growth and development of colon cancer.22 In this study, the results illustrated nuclear targeting of various DNA binding proteins, such as DNA directed RNA polymerase subunit alpha (Accession No. P0A7Z8), DNA polymerase III subunit epsilon (Accession No. P0A1H0), replication protein RepA (Accession No. Q934T6), putative DNA-binding protein (Accession No. Q934Z5, Q8Z752), DNA-invertase (Accession No. Q8Z339), RNA polymerase-binding transcription factor DksA (Accession No. P0A1G6), RNA polymerase sigma factor RpoD (Accession No. P0A2E4), ATP-dependent RNA helicase (Accession No. Q8Z877), DNA topoisomerase III (Accession No. Q8Z6F5), topoisomerase IV subunit A (Accession No. Q8Z3P4), topoisomerase B (Accession No. Q9RHF5), putative DNA-binding protein (Accession No. Q8Z2J9), excinuclease cho (Accession No. Q8Z6G5), and RepHI1A Replication initiation protein (Accession No. Q7BRX0).Moreover, the results show that RNA chaperone ProQ (Accession No. P60318), chaperone protein SigE (Accession No. Q8Z7R2), chaperone protein DnaJ (Accession No. P0A1G8), and chaperone modulatory protein CbpM (Accession No. Q8XGQ8) were localized in the nucleus of the host cell. The localization of these proteins in the nucleus of the host cell suggests their potential implications in colon cancer growth, but this requires further research.
Discussion
Protein targeting of various microbes in specific host cells has immense impact on regulation of different pathways of host cells, which may promote normal-to-cancer cell transformation. Previously, published report has showed that M. hominis proteins targeting in the nucleus of host cells and their possible involvement in growth of prostate cancer.24 In thisstai study we have selected BaCeILo predictor, which utilizes the data set of different animals, fungi and plants. We have also selected another predictor, Hum-mPLoc, for its high accuracy in our results. Although the possible implication of S. typhimurium on the growth and development of colon cancer has been previously studied, the exact mechanism is not well understood. Also few studies showed that the attenuated strain (VP20009) of S. typhimurium emerges to have a nontoxic profile in humans, while it has been utilized in Phase I trials for the management of colon cancer and melanoma patients with less toxic effects. Mutations have been connected with the lesser toxicity of this strain, which decreases the toxicity of its lipopolysaccharide.9,39A recent study has demonstrated that severe salmonellosis has connected with the high risk of development of colon cancer in the ascending or transverse parts of the colon.38 The bacterial proteins or effectors play important role in altering normal functioning of different pathways of host cell. For instance, the AvrA protein of Salmonella is targeted to the host cells through Type Three Secretion System (TTSS), which suppresses the process of program cell death/apoptosis, specifically with reference to enteropathogenic salmonellosis for prolonging intracellular bacterial survival.31 Various bioinformatics predictors for prediction of subcellular targeting of object proteins are available, which mainly rely on prediction of specific localization motifs on proteins or search for similarity/alignment to predict possible localization of similar proteins in new systems.14,46 The analysis of NLSs in particular proteins is very important for predicting their nuclear localization.51 All proteins generally carry a specific sequence motif known as the NLS. There are 6 classes of NLSs that have been detected, which are involved in nuclear import of proteins through α/β importin pathways. Two types of NLSs have been categorized based on stretches of the basic amino acids, i.e., monopartite (1 basic stretch) and bipartite (2 basic stretches). These 2 stretches mediate the binding of NLS to the importin-α transport receptor. Such complex attaches to importin-β, stimulating localization of a specific protein to the nucleus.29 Our rationale behind using the cNLS mapper was to predict the activity of NLS instead of an NLS sequence. The cNLS mapper identifies the involvement of each residue in the NLS and analyzes NLS activity which is proposed to offer highly accurate analysis performance.25,26 However, the cNLS mapper is unable to predict any proteins that directly bind to importin-β. Moreover, the cNLS mapper predicts NLS activity as an isolated peptide, but not within the overall structural context of the protein.Another subcellular localization prediction tool, BaCeILo, was used in our study, and it relies on different SVMs to predict secretory, nuclear, cytoplasmic, chloroplast, and mitochondrial targeting of certain proteins.42 It predicts particular targeting based on alignment to ascertain evolutionary information and residue sequence information within the whole protein sequence and its N and C termini.Additionally, we have also used Hum-mPLoc, which is based on the human protein subcellular location prediction approach to predict subcellular location using the information of peptide composition, amino acid composition, and similarity.49 Hum-mPLoc also detects protein localization based on a hybrid method using all three approaches. The results of the localization also provide more reliability and accuracy in order to estimate the certainty of the prediction results. We used a hybrid approach as, according to the literature, it has been shown to give better accuracy.Our aim to obtain prediction results from diverse methods was satisfied by the use of these tools as all these tools work in different ways. The NLS mapper works using data generated from yeast and therefore its accuracy to predict NLS in humans can be evaluated. Both BaCeILo and Hum-mPLoc 2.0 predictors do not show the same pattern of protein targeting due to the use of different algorithms and data sets by both computational tools. The BaCelLo predictor is based on diverse SVMs organized in a decision tree and utilizes the information obtained from the residue sequence and from the evolutionary information contained in alignment profiles of different animals, fungi, and plants. The predictor has utilized variable numbers of sequences for nucleus (Plants: 121, Animals: 1166, Fungi: 711), cytoplasm (Plants: 58, Animals: 439, Fungi: 211), extracellular (Plants: 41, Animals: 804, Fungi: 88), mitochondria (Plants: 67, Animals: 188, Fungi: 188), and chloroplast (Plants: 204). It analyzes the whole sequence composition and the compositions of both the N- and C-termini. Although the training set is fully curated in order to avoid any possible redundancy during prediction but used information of three kingdom systems. For the first time a balancing procedure is introduced in order to diminish the cause of biased training sets.Analyzing subcellular localization in human proteins is a very challenging task due to the existence of multiplex character in query proteins. On the other hand, the Hum-mPLoc utilizes human specific protein data sets during analysis of protein sequences. The Hum-mPLoc predictor worked on learning data set includes 3681 protein sequences (3106 different proteins), classified into 14 human subcellular locations including nucleus, cytoplasm, cytoskeleton, endoplasmic reticulum, centriole, endosome, extracellular, Golgi apparatus, lysosome, microsome, mitochondrion, plasma membrane, synapse and peroxisome. The Hum-mPLoc 2.0 predictor worked on a top-down strategy using sequences information from human data set exclusively. The Hum-mPLoc 2.0 predictor utilized the sequential evolution information and functional domain information as ensemble classifier. Therefore, the results of BaCeILo were used to narrow down proteins as per their localization prediction in the animal-specific predictor, and they were further scrutinized after using Hum-mPLoc 2.0 tools, which has used the human-specific data set. It has been estimated that only 30% of nuclear proteins actually carry NLS sequences; therefore, a large proportion of protein can localize to the nucleus, even in the absence of NLS.8,26 In addition, small proteins with less than 40 kDa molecular weights can passively diffuse into the nucleus.15,23 The methods used for protein targeting predictions are different with every tool therefore, variation in results is logical.
Nuclear Targeting Proteins of S. typhimurium and Their Role in Cancer
Nuclear targeting proteins of S. typhimurium can have a variety of consequences on progression of colon cancer etiology. Additional reports discuss the potential of nuclear-targeted proteins in cervical cancer etiology. The Hum-mPLoc operates by using an experimentally annotated data set of 3780 human proteins. Although it is designed to predict subcellular localization of human proteins, its performance in predicting protein subcellular localization of several closely related mammalian genomes is also plausible. However, its role in predicting subcellular localization of bacterial proteins as the query is arguable and the results need to be experimentally validated. Furthermore, both BaCeILo and Hum-mPLoc are based on SVM with different data sets. SVMs have certain limitations, especially when a different data set is used for analysis. Therefore, the differences in results obtained from different tools can be understood.
DNA Binding Proteins
The role of DNA binding proteins is very important in cancer development. For example chromodomain helicase DNA binding protein 5 is involved in tumor suppression, and mutation in the protein synthesis can lead to loss of its function and further development of breast cancer in humans.57 Certain DNA binding proteins, like CpG binding proteins, are involved in cancer progression through methylation of target DNA.41 Nucleotide excision repair protein DDB2 (DNA damage binding protein 2) is another DNA binding protein that is suggested to cause cancer after loss of its activity.20 The existence of both S. typhimurium and human DNA binding proteins in the host nucleus provides competition for both proteins to bind their substrates. Similar substrates can interfere with the binding of normal human proteins to the target site and can affect progression of colon cancer. The role of DNA binding inhibitor proteins in cancer development has already been proven in ovarian cancer, where inhibition of DNA binding protein (ID-1) overexpression leads to ovarian cancer.35
DNA Repair Proteins
Erroneous DNA repair has been implicated as an etiological factor in several cancers. For example, the MutS protein, which is involved in DNA mismatch repair, is involved in the colon cancer etiology.22,40 In this study we found nuclear localization of DNA MMRP MutS (Accession No. P0A1Y1), Host-nuclease inhibitor protein (Accession No. Q8Z7Y2), and Putative DNA replication terminus site-binding protein (Accession No. Q9L5F9). Alteration in mismatch repair genes is already linked with cancer development, and human cells have a homolog for MutS. MMR mutation enhanced the rates of spontaneous mutation which may act as a potential factor for the development of cancer including colon cancer.17 Recent research work reported the implication of S. typhimurium in growth of colon cancer through down regulation of Wnt1 using a Salmonella-colitis colon cancer model.55 In an early report, it has shown the possible role of Escherichia coli DNA MMRPs including Muts in growth and development of colon cancer.22 Therefore, it can be supposed that if two homologous proteins with the same enzymatic function are present in a particular cell, the relative enzymatic function of proteins will be different, and both of them compete with each other to bind with their substrate. As MutS is a DNA repair associated protein, aberration in DNA repair can lead to the development of cancer. It has already been proposed that it induces DNA damage in host cells and suppresses DNA repair activity, but this suppressed DNA repair involves double-stranded breaks.38 Future research on the involvement of S. typhimurium nuclear targeted proteins in suppression of mismatch DNA repair activity can provide new insights on the role of S. typhimurium in the development of colon cancer.
DNA Damage Proteins
In addition, certain DNA damaging endonucleases are involved in the development of cancer. For example, a variant LINE-1 endonuclease overexpression is involved in the development of primary gastric cancer and lymph node metastasis.54 Our prediction results demonstrated nuclear localization of S. typhimurium ribonuclease R (Accession No. Q8XF68) and endoribonuclease SymE (Accession No. Q8Z0W8). In addition to DNA damaging activity, ribonuclease R is predicted to localize to the host cell nucleus and its role in colon cancer etiology needs to be investigated further.
Transcriptional and Translational Regulators
It has been found that RNA polymerase subunits share structural similarities and antigenicity among eukaryotes. This indicates that RNA polymerase is somewhat conserved. Yeast RNA polymerase II has subunit RBP, which binds to nucleotides, and shares similarities with a β subunit of E. coli RNA polymerase.52 Various transcription-associated proteins have the ability to bind DNA; therefore, they can also inhibit binding of host gene regulators and modulate host gene expression.19 Many transcription factors are involved in cancer development.30The results of the current study indicate that translation initiation factor IF-1 (Accession No. P69225), tRNA (Met) cytidine acetyltransferase TmcA (Accession No. Q8Z4S1), ATP-dependent RNA helicase (Accession No. Q8Z3I1), ATP-dependent RNA helicase RhlB (Accession No. P0A2P1), dual-specificity RNA methyltransferase (Accession No. Q8Z4P2), 23S rRNA pseudouridine synthase (Accession No. Q8Z1V3), putative transcriptional regulator (Accession No. Q8Z6E4), putative reverse transcriptase (Accession No. Q8Z4H7), transcriptional regulatory protein BtsR (Accession No. Q8Z5C1), LysR-family transcriptional regulator (Accession No. Q8Z3S5), ribosome-recycling (RRF) factor/ribosome-releasing factor (Accession No. P66739), transcriptional activator CaiF (Accession No. Q8XFZ2), transcriptional activator RamA (Q8Z8M2), met repressor (Accession No. Q8Z2Z4), ribosomal protein S12 methylthiotransferase RimO (Accession No. Q8Z861), putative RNA methyltransferase (Accession No. Q8XFT1), ribosomal large subunit pseudouridine synthase B (Accession No. Q8Z7D5), and transcriptional regulatory protein OmpR (Accession No. P0AA20) localize to the host nucleus. The nuclear localization of transcription associated S. typhimurium proteins should be investigated further for their potential involvement in the regulation of human gene expression. The role of bacterial RNA polymerase in human gene transcription has already been investigated in many cases. The RNA polymerase II of E. coli has been shown to transcribe human DNA, known as the arrest site, which indicates that bacterial transcriptional regulators can also act on human DNA sequences.37In addition, the variation in human and S. typhimurium promoter sequences raises criticism for practical application of S. typhimurium transcriptional proteins in human gene transcription. Therefore, it can be anticipated that computational prediction has its own limitations, and the data obtained needs to be validated experimentally before the role of S. typhimurium transcription-associated proteins in human genes transcription can be concluded. Presently, it can be assumed that the localization of such proteins can cause several alterations in human gene expression and may possibly contribute to S. typhimurium-related colon cancer etiology.
Lateral Gene Transfer and RNA Chaperone
Recently, it has been discovered that lateral gene transfer is a common event during chronic infections, where bacteriato-human gene transfer is possible. Lateral gene transfer can be a potential factor for cancer development.28 Previously, it was thought that bacteria-to-human gene transfer was a rare event, but it is now recognized as a category of gene coding for approximately 223 genes that have been identified during the human genome project, which has similarity with bacteria, but no comparable similarity with worm, yeast, fly, or any other non-vertebrate eukaryotes. Therefore, it suggests lateral gene transfer of such genes. Obligate and invasive nature of S. typhimurium coupled with detection of strong inter-strain lateral gene transfer frequency suggests potential involvement of S. typhimurium lateral gene transfer in colon cancer etiology.21,32 The chaperones contain cancer antagonist properties by working as genetic buffers, which stabilizes the usual phenotype.27 Therefore, the alteration in chaperones can act as a potential factor for cancer development. Although this study provides a background for S. typhimurium protein targeting in the host cell, the validation of these results is extremely important before any conclusion can be drawn.
Conclusion
Although there is considerable, well-justified speculation that S. typhimurium infection might play an important role in colon cancer etiology, this study provides, to the best of our knowledge, the first demonstration of a nuclear targeting of S. typhimurium proteins in host cells, which may be significantly linked with colon cancer development. The mechanism of S. typhimurium in the development of cancer is still unclear, and the studies proposing and opposing this relation are enormous. The current in-silico work showed that various S. typhimurium proteins targeted to host cell nucleus which may have a profound impact on colon cancer etiology. These proteins can potentially affect the normal functioning of different pathways of the host cell. The current understanding of S. typhimurium infection and colon cancer is very limited, and this study makes a valuable contribution to help us understand the relation between S. typhimurium infection and colon cancer. Therefore, this study provides a detailed perspective that can help narrow down the targets for studying the important role of different proteins of S. typhimurium in development of colon cancer in the future. Our work may open new avenues of research and aid in the management of colon cancer.
Authors: E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki Journal: Nature Date: 2001-02-15 Impact factor: 49.962
Authors: Vassilis Samaras; Petros I Rafailidis; Eleni G Mourtzoukou; George Peppas; Matthew E Falagas Journal: J Infect Dev Ctries Date: 2010-06-03 Impact factor: 0.968
Authors: Tiziana Scanu; Robbert M Spaapen; Jeroen M Bakker; Chandra Bhan Pratap; Lin-en Wu; Ingrid Hofland; Annegien Broeks; Vijay Kumar Shukla; Mohan Kumar; Hans Janssen; Ji-Ying Song; E Andra Neefjes-Borst; Hein te Riele; David W Holden; Gopal Nath; Jacques Neefjes Journal: Cell Host Microbe Date: 2015-05-28 Impact factor: 21.023