Literature DB >> 27478803

A Bioinformatics Approach to Prioritize Single Nucleotide Polymorphisms in TLRs Signaling Pathway Genes.

Behnam Alipoor1, Hamid Ghaedi2, Mir Davood Omrani2, Milad Bastami3, Reza Meshkani1, Taghi Golmohammadi1.   

Abstract

It has been suggested that single nucleotide polymorphisms (SNPs) in genes involved in Toll-like receptors (TLRs) pathway may exhibit broad effects on function of this network and might contribute to a range of human diseases. However, the extent to which these variations affect TLR signaling is not well understood. In this study, we adopted a bioinformatics approach to predict the consequences of SNPs in TLRs network. The consequences of non-synonymous coding SNPs (nsSNPs) were predicted by SIFT, PolyPhen, PANTHER, SNPs&GO, I-Mutant, ConSurf and NetSurf tools. Structural visualization of wild type and mutant protein was performed using the project HOPE and Swiss PDB viewer. The influence of 5'-UTR and 3'- UTR SNPs were analyzed by appropriate computational approaches. Nineteen nsSNPs in TLRs pathway genes were found to have deleterious consequences as predicted by the combination of different algorithms. Moreover, our results suggested that SNPs located at UTRs of TLRs pathway genes may potentially influence binding of transcription factors or microRNAs. By applying a pathway-based bioinformatics analysis of genetic variations, we provided a prioritized list of potentially deleterious variants. These findings may facilitate the selection of proper variants for future functional and/or association studies.

Entities:  

Keywords:  Bioinformatics; in-silico analysis; single nucleotide polymorphisms; toll- like receptors

Year:  2016        PMID: 27478803      PMCID: PMC4947211     

Source DB:  PubMed          Journal:  Int J Mol Cell Med        ISSN: 2251-9637


Toll-like receptors (TLRs) are a major class of the pattern- recognition receptors of the innate immune system involved in the identification of pathogen-associated molecular patterns (PAMPs) from infectious pathogens (1-2). These trans-membrane proteins engage with PAMPs and trigger activation of intracellular signaling cascades, leading to the induction of genes that regulate the expression of pro- inflammatory cytokines and chemokines (3-4). Due to the critical roles of TLRs signaling network in the initiation of innate immune responses, malfunction of genes involved in this pathway may predispose individuals to numerous human diseases ranging from infectious and chronic inflammatory to cancers and autoimmune diseases (5-6). Accumulating evidence now suggests that genetic variations in TLRs pathway genes may exhibit deleterious effects on gene function, leading to the dysregulation of this signaling pathways (7-8). Single nucleotide polymorphisms (SNPs) are the shortest and the most frequent variations in the human genome. Among these, the functional consequences of untranslated regions (UTRs) and non-synonymous (nsSNPs) SNPs are of special interest, as they can either modulate gene expression or influence protein structure and function (9-10). Although the contribution of SNPs in TLR signaling to human pathological states was addressed by several studies, a comprehensive and prioritized list of SNPs potentially affecting the function and regulation of this pathway is still lacking. Therefore, this study aimed to systematically identify the UTR-SNPs and nsSNPs in genes involved in TLRs signaling network by employing a bioinformatics approach and predicting their deleterious functional and structural consequences.

Materials and methods

Retrieving SNPs in TLRs pathway genes Data on the human TLRs pathway genes were collected from national center for biological information (http://www.ncbi.nlm.nih.gov/) (acce-ssed May 2015) (Table 1). Genes implicated in TLRs pathway and their functional connections were retrieved by querying Kyoto encyclopedia of genes and genomes (KEGG) (http:// www. genome.jp/ kegg/) (accessed May 2015) (Figure 1). SNPs located in TLRs network genes were retrieved from dbSNP (http:// www. ncbi. nlm. nih. gov/SNP/) (accessed June 2015). For each SNP, the following information was recorded: SNP ID, genomic coordinate, and variation type. Protein information of TLR network genes was retrieved from UniProt (http: // www. uniprot.org/) (accessed
Table 1

TLR signaling pathway genes list.

Name Gene ID Location MIM Number of SNPs
1TLR17096Chr 4601194321
2TLR27097Chr 4603028537
3TLR37098Chr 4603029400
4TLR421898Chr 9603030606
5TLR57100Chr 1603031790
6TLR610333Chr 4605403854
7TLR751284Chr X300365544
8TLR851311Chr X300366270
9TLR954106Chr 3605474509
10MYD884615Chr 3602170123
11TIRAP114609Chr 11606252267
12IRAK13654Chr X300283235
13IRAK451135Chr 12606883601
14TRAF67189Chr 11602355579
15TRAF37187Chr 146018962570
16TAB110454Chr 226026151989
17TAB223118Chr 66051013967
18MAP3K76885Chr 66026141267
19IKBKG8517Chr X300248222
20IKBKB3551Chr 86032581376
21CHUK1147Chr 10600664750
22NFKBIA4792Chr 14164008143
23NFKB14790Chr 41640112060
24MAP2K15604Chr 151768722124
25MAPK15594Chr 221769482335
26MAP2K35606Chr 176023151329
27MAP2K75609Chr 19603014317
28MAPK141432Chr 66002891778
29MAPK85599Chr 106011582450
30FOS2353Chr 14164810101
31TICAM1148022Chr 19607601438
32RIPK18737Chr 66034531322
33IKBKE9641Chr 1605048696
34TBK129110Chr 12604834895
35IRF33661Chr 19603734199
36IRF53663Chr 7607218284
37IRF73665Chr 11605047173
Fig. 1

Schematic presentation of gene network implicated in TLR signaling pathway. Direction of signal transduction is exhibited by arrows.

June 2015). Predicting UTR-SNPs consequences To evaluate the conservation score, we used genomic evolutionary rate profiling (GERP) track implemented in UCSC (https://genome.ucsc.edu/) to calculate the GERP++conservation score for each SNPs. Genomic Evolutionary Rate Profiling (GERP) is a method for producing position-specific estimates of evolutionary constraint using maximum likelihood evolutionary rate estimation. Constraint intensity at each individual alignment position is quantified in terms of a "rejected substitutions" (RS) score, defined as the number of substitutions expected under neutrality minus the number of substitutions "observed" at the position. Positive scores represent a substitution deficit (i.e., fewer substitutions than the average neutral site) and thus indicate that a site may be under the evolutionary constraint. Negative scores indicate that a site is probably evolving neutrally; negative scores should not be interpreted as evidence of accelerated rates of evolution because of too many strong confounders, such as alignment uncertainty or rate variance. The effects of UTR-SNPs on local RNA secondary structure were predicted using mode 1 of RNAsnp program (v 1.1). The software requires RNA sequence and SNP as inputs and uses a window of 400 nucleotides, ±200 nucleotide on either side of the SNP position to obtain subsequences and generate the base-pairing probability matrix for the corresponding wild type and mutant alleles. Then, RNAsnp computes the Euclidian distance (d) and Pearson correlation coefficient (r) for all sequence intervals with a minimum length of 50 that have self-contained base pairs to assess structural difference between the wild type and mutant alleles and reports the interval with the maximum base pairing distance (dmax) or minimum correlation coefficient (rmin) along with the corresponding empirical p-value (11). Here, we used both measures independently and defined structure disruptive UTR-SNPs as those with significant dmax or rmin (significance threshold is p< 0.2 as defined by RNAsnp). RegulomeDB Version 1.1 (12) was used to annotate UTR-SNPs with known and predicted regulatory elements of the genome including the regions of DNase hypersensitivity, binding sites and motifs of transcription factors, chromatin state and the expression of quantitative trait loci. To have further annotations, we identified 3'-UTR SNPs residing in microRNAs target sites. A comprehensive dataset of experimentally supported miRNAs target sites, including CLIP-Seq supported interactions from starBase version 2 (http:// starbase.sysu.edu.cn/) (13) and CLASH verified interactions extracted from PolymiRTS database, were compiled (http://compbio.uthsc.edu/miRSNP/) (14). TLR signaling pathway genes list. Schematic presentation of gene network implicated in TLR signaling pathway. Direction of signal transduction is exhibited by arrows. Analyzing the functional and structural conse-quences of non- synonymous SNPs Phenotypic effects of amino acid substitution on protein function were predicted by Sorting intolerant from tolerant (SIFT) (http://sift.jcvi.org/). In this study, a list of nsSNPs (rsIDs) from NCBI's dbSNP database was submitted as a query sequence to SIFT to predict tolerated and deleterious substitutions for every position of sequence. nsSNPs with SIFT score0.05 were classified as deleterious and those>0.05 were classified as tolerated (15). Polymorphism Phenotyping-2 (PolyPhen-2) (http://genetics.bwh.harvard.edu/ pph2/) predicts possible impact of an amino acid substitution on the structure and function of a human protein using straightforward physical and comparative conside-rations. Input options for this tool are comprised of protein sequence, database ID/ accession number and details of amino acids substitution. For a given substitution, prediction outcome can be one of possibly damaging, probably damaging, and benign (16). Protein analysis through evolutionary relati-onships (PANTHER) (http:// www.pantherdb. org/) estimates the likelihood of a particular nsSNPs to cause a functional impact on the protein. This tool calculates the substitution position-specific evolutionary conservation (subPSEC) score based on an alignment of evolutionarily related proteins. The subPSEC scores are continuous values from 0 (neutral) to about -10 (most likely to be deleterious). A cutoff of -3 corresponds to a 50% probability that a score is deleterious. From this, the probability that a given variant will cause a deleterious effect on protein function is estimated by Pdeleterious, such that a subPSEC score of -3 corresponds to a Pdeleterious of 0.5 (17). SNPs database and gene ontology (GO) (http://snps.biofold.org/snps-and-go/snps-and-go.html) have been optimized to predict if a given single point protein variation can be classified as disease associated or neutral. A probability > 0.5 indicates that the mutation at the protein is disease-related (18). ConSurf web-server (http://consurf.tau.ac.il/) is a bioinformatics tool for estimating the evolutionary conservation of amino acid positions in a protein molecule based on the phylogenetic relations between homologous sequences. The continuous conservation scores are divided into a discrete scale of nine grades for visualization, from the most variable positions (grade 1) colored turqu-oise, through intermediately conserved positions (grade 5) colored white, to the most conserved positions (grade 9) colored maroon. I-Mutant (http:// folding. uib.es/ i-mutant/ i-mutant 2.0.html) is a neural network based web server for the automatic prediction of protein stability changes upon amino acid substitution. This tool provides the scores for free energy alterations, DDG<0 and DDG> 0 indicate reduction and elevation of the stability, respectively (19). NetSurfp (http: //www. cbs.dtu. dk/services /NetSurfP/) predicts the relative and absolute surface accessibility and secondary structure of residues in amino acid sequences. The reliability of the surface accessibility prediction is stated in the form of a Z-score, which cannot predict secondary structures of proteins (20). Project Have your Protein Explained (ProjectHOPE) (http://www.cmbi.ru.nl/hope/home) has been used to study the insight structural features of native protein and the variant models (21). This web server provides three dimensional structural visualization of mutated proteins, and gives the results by using UniProt and DAS prediction servers.

Results

SNP analysis Mining the dbSNP-NCBI and UniProt databases revealed a total of 35802 SNPs in thirty-seven candidate genes in TLRs pathway (Table 2). Among these, 819 and 2502 were located in 5′-UTR and 3′-UTR respectively, and 2172 were identified as nsSNPs.
Table 2

Summary results of SNPs mining of candidate genes in TLRs signaling pathway

Categories Number of SNPs
IntragenicexonSynonymous1382
Non-synonymous2172
Intron28654
Unknown273
Intergenic3′-UTR2502
5′-UTR819
Total35802
Summary results of SNPs mining of candidate genes in TLRs signaling pathway Density plot of GERP++ conservation score (RS score). The figure shows that 5'UTR SNPs have higher (more positive) score than 3'UTR SNPs. Structure disruptive UTR SNPs in TLR genes. SNPs positioned above dashed line are those with dmax p-value< 0.2, and hence, designated to be structure disruptive. Common 3’UTR SNPs resided in miRNA target sites Target miRNA SNP MAF dmax p-value Conservation score of UTR SNPs We computed GERP++scores for SNPs in UTRs, which represent an evolutionary conservation extent based on alignment of 35 mammals to hg19. Generally, 5′-UTR SNPs were found to be more conserved than 3′-UTR SNPs (Figure 2). With a cut off RS score of ≥ 2, a total of 480 constrained SNPs (including 85 5′-UTR-SNPs and 395 3′-UTR-SNPs) were identified. Moreover, 1200 SNPs (including 141 5′-UTR-SNPs and 1059 3′-UTR-SNPs) were classified as neutrally evolving, which represents a RS score of ≤0. The most conserved SNPs were found in 3′-UTR of TAB2 (rs138687718, RS score= 6.17), MAPK14 (rs377447706, RS score= 6.17) and FOS (rs45480193, RS score= 6.16).
Fig. 2

Density plot of GERP++ conservation score (RS score). The figure shows that 5'UTR SNPs have higher (more positive) score than 3'UTR SNPs.

Influence of UTR-SNPs on RNA secondary structures Our analysis showed that 313 UTR-SNPs were structure disruptive as defined by dmax p- value P<0.2 (Figure 3). Considering both dmax and rmin, there were 232 unique structure disruptive UTR-SNPs. The top five genes enriched for structure disruptive SNPs were MAPK14 (n= 23), TLR7 (n= 12), TLR4 (n= 10), MAPK1 (n= 10), and TRAF3 (n= 8).
Fig. 3

Structure disruptive UTR SNPs in TLR genes. SNPs positioned above dashed line are those with dmax p-value< 0.2, and hence, designated to be structure disruptive.

Annotation of SNPs with regulatory elements Disease associated variants are enriched in regulatory elements of the genome. Using RegulomeDB, we annotated UTR-SNPs within regulatory elements. 11 UTR-SNPs were associated with transcription factor binding sites (i.e eQTL). These SNPs were found within 3’UTR of TAB1 (rs1010169, rs1010170, rs5757650, rs5750822), RIPK1 (rs9503383, rs9405606), IRF5 (rs752637, rs3807306), IRAK4 (rs4251425) and TLR9 (rs187084) genes. Identification of SNPs residing in miRNA target sites Intersecting 3′-UTR-SNPs with the experimentally validated miRNAs target site datasets, we found 314 SNPs resided in microRNAs target sites. Since miRNA target sites are under selective pressure, we refined SNPs in miRNA target sites by minor allele frequency (MAF) threshold of 0.01 (Table 3).
Table 3

Common 3’UTR SNPs resided in miRNA target sites

NFKBIAhsa-miR-208a-3prs6960.46 0.07
MYD88hsa-miR-520f-3prs77440.140.86
TAB2hsa-miR-4500rs78960.200.27
MAPK14hsa-miR-4306rs85100.180.45
MAPK1hsa-miR-210-3prs93400.330.21
MAPK1hsa-miR-186-5prs130580.04 0.01
MAP3K7hsa-miR-212-3prs21319060.040.38
MAPK14hsa-miR-381-3prs38044510.130.35
IRAK4hsa-miR-340-5prs42515620.040.90
MAP3K7hsa-miR-212-3prs94514410.010.43
TAB2hsa-miR-33a-5prs358599180.010.47
MAPK1hsa-miR-217rs412826070.01 0.08
TAB2hsa-miR-539-5prs412884310.010.82
MAPK1hsa-miR-488-3prs617579760.010.76
TRAF3hsa-miR-4500rs727047370.29 0.12

Target miRNA SNP MAF dmax p-value

List of nsSNPs that predicted to be deleterious by both PolyPhen-2 and SIFT tools Abbreviations: P.D; probablydamaging Distribution of SIFT and PolyPhen score of SNPs in coding region. Horizontal and vertical dashed red line correspond to the thresholds for predicting deleterious variants by PolyPhen and SIFT, respectively. Prediction of tolerated and deleterious non-synonymous SNPs by SIFT SIFT analysis predicted that a total of 785 nsSNPs were damaging (score 0.05) and 1322 nsSNPs had tolerated effects on the candidate genes involved in TLR pathway network (score> 0.05) (Figure 4).
Fig. 4

Distribution of SIFT and PolyPhen score of SNPs in coding region. Horizontal and vertical dashed red line correspond to the thresholds for predicting deleterious variants by PolyPhen and SIFT, respectively.

Prediction of damaging non-synonymous SNPs by PolyPhen-2 According to our Polyphen-2 results, 610 nsSNPs were predicted “probably damaging”, 353 nsSNPs were predicted “possibly damaging” and 1068 were classified as benign (Figure 4). To increase the accuracy of predictions, results of SIFT and PolyPhen-2 were joined and SNPs with PolyPhen score> 0.95 and SIFT< 0.05 were selected. Accordingly, 29 nsSNPs passed both criteria and were classified as deleterious/damaging (Table 4).
Table 4

List of nsSNPs that predicted to be deleterious by both PolyPhen-2 and SIFT tools

Gene Symbol SNP Allele AA substitution PolyPhen Score PolyPhenPerediction SIFT Score SIFT prediction
1CHUKrs56948661G>AP623L1P.D0.01Damaging
2CHUKrs61732515C>GQ277H0.999P.D0.00Damaging
3CHUKrs112432667T>CE492G0.954P.D0.00Damaging
4FOSrs74685695T>GV77G0.999P.D0.01Damaging
5IRF5rs112815033T>CL450P1P.D0.01Damaging
6IRAK4rs55944915G>AR391H0.999P.D0.01Damaging
7IRAK4rs114820168C>TR391C1P.D0.00Damaging
8MAP3K7rs77759048A>TW55R1P.D0.00Damaging
9TBK1rs34774243A>GK291E0.997P.D0.00Damaging
10TBK1rs55824172C>TS151F0.997P.D0.00Damaging
11TIRAPrs74937157T>CC134R1P.D0.00Damaging
12TLR1rs5743621G>AP733L0.995P.D0.00Damaging
13TLR1rs41311402A>GL697S1P.D0.00Damaging
14TLR1rs56205407A>GI679T0.999P.D0.00Damaging
15TLR1rs117033348A>GL144P1P.D0.04Damaging
16TLR2rs5743706T>AY715N1P.D0.01Damaging
17TLR2rs56303479T>CL81P1P.D0.00Damaging
18TLR2rs121917864C>TR677W1P.D0.00Damaging
19TLR3rs5743316A>TN284I1P.D0.00Damaging
20TLR3rs112666655T>CL545P1P.D0.00Damaging
21TLR3rs111488413C>AP880Q1P.D0.00Damaging
22TLR4rs77214890G>TD181Y1P.D0.00Damaging
23TLR4rs80197996G>TL470F1P.D0.03Damaging
24TLR4rs55905951C>GA676G1P.D0.00Damaging
25TLR4rs55786277C>TR804W0.999P.D0.01Damaging
26TLR5rs5744176T>CD694G1P.D0.01Damaging
27TLR5rs78098893T>CR752G0.997P.D0.01Damaging
28TLR6rs13102250A>CL105W1P.D0.01Damaging
29TLR9rs55881257G>AR962C1P.D0.01Damaging

Abbreviations: P.D; probablydamaging

Prediction of functional impact of non-synonymous SNPs on protein by PANTHER and SNPs & GO. According to the PANTHER results, all 29 SNPs possessed the subPSEC score more than −3 and were therefore classified as deleterious (Table 5). As shown in table 5, these SNPs were found to be as disease-associated with the probability >0.5 after analyzing by SNPs & GO.
Table 5

PANTHER and SNPs&GO results for prediction of SNPs as disease associated.

PANTHER
SNPs&GO
SNPs Substitution subPSEC Pdeleterious Prediction RI Probability
1 rs56948661P623L-4.928550.87309Disease50.742
2 rs61732515Q277H-4.615890.83423Disease30.527
3 rs112432667E492G-3.991820.72945Disease40.711
4 rs74685695V77G-4.068620.74433Disease10.545
5 rs112815033L450P-4.366010.79674Disease00.523
6 rs55944915R391H-3.649240.65684Disease00.525
7 rs114820168R391C-4.670970.84171Disease30.643
8 rs77759048W55R-3.30070.57461Disease40.717
9 rs34774243K291E-3.565330.63768Disease50.772
10 rs55824172S151F-4.71190.84708Disease60.804
11 rs74937157C134R-3.471780.6158Disease20.619
12 rs5743621P733L-4.516660.82005Disease20.623
13 rs41311402L697S-4.238450.77529Disease40.712
14 rs56205407I679T-5.358550.91361Disease70.870
15 rs117033348L144P-8.178340.99439Disease50.750
16 rs5743706Y715N-4.343310.79303Disease40.707
17 rs56303479L81P-6.49360.97051Disease70.855
18 rs121917864R677W-6.46880.96979Disease60.819
19 rs5743316N284I-3.914480.71392Disease50.748
20 rs112666655L545P-4.256410.77841Disease60.823
21 rs111488413P880Q-8.508810.99597Disease60.811
22 rs77214890D181Y-4.480680.81467Disease00.511
23 rs80197996L470F-3.941060.71931Disease40.639
24 rs55905951A676G-3.162080.54043Disease00.503
25 rs55786277R804W-5.102630.89116Disease50.748
26 rs5744176D694G-3.429670.6058Disease40.716
27 rs78098893R752G-3.169190.5422Disease20.614
28 rs13102250L105W-5.093830.8903Disease20.583
29 rs55881257R962C-4.480940.81471Disease10.547
Prediction of protein stability analysis by I-Mutant According to I- Mutant results, all mutations expect N284I (rs5743316 in TLR3), S151F (rs55824172 in TBK1) and L105W (rs13102250 in TLR6) were predicted to decrease protein stability, with a free energy change value <0.0 (Table 6).
Table 6

Summary results of nsSNPs analysis by I-mutant and ConSurf.


I-mutant
ConSurf
Gene Symbol SNP AA substitution DDG  ( Kcal/mol) Stability conservation scale Functional or structural residue
1 CHUKrs56948661P623L-0.97Decrease9F
2 CHUKrs61732515Q277H-1.58Decrease7-
3 CHUKrs112432667E492G-1.06Decrease4-
4 FOSrs74685695V77G-5.25Decrease9S
5 IRF5rs112815033L450P-1.74Decrease8-
6 IRAK4rs55944915R391H-1.32Decrease8F
7 IRAK4rs114820168R391C-0.86Decrease8F
8 MAP3K7rs77759048W55R-1.71Decrease8-
9 TBK1rs34774243K291E-0.82Decrease6-
10 TBK1rs55824172S151F0.01Increase9F
11 TIRAPrs74937157C134R-1.55Decrease8-
12 TLR1rs5743621P733L-1.33Decrease8F
13 TLR1rs41311402L697S-1.51Decrease9S
14 TLR1rs56205407I679T-1.91Decrease8-
15 TLR1rs117033348L144P-0.79Decrease9S
16 TLR2rs5743706Y715N-1.65Decrease9S
17 TLR2rs56303479L81P-1.24Decrease9S
18 TLR2rs121917864R677W-0.83Decrease9F
19 TLR3rs5743316N284I1.23Increase9F
20 TLR3rs112666655L545P-1.10Decrease7-
21 TLR3rs111488413P880Q-1.26Decrease9F
22 TLR4rs77214890D181Y-0.98Decrease8F
23 TLR4rs80197996L470F-0.86Decrease9S
24 TLR4rs55905951A676G-1.19Decrease9S
25 TLR4rs55786277R804W-0.54Decrease6-
26 TLR5rs5744176D694G-1.31Decrease9F
27 TLR5rs78098893R752G-1.49Decrease7-
28 TLR6rs13102250L105W0.91Increase9S
29 TLR9rs55881257R962C-2.62Decrease8F

Abbreviations: DDG; free energy change value (DDG<0: Decrease Stability, DDG>0: Increase Stability). The pH and the temperature were set to7 and 25˚C for all submissions, respectively. F: functional residue; S: structural residue.

Prediction of evolutionary conservation of amino acid position by ConSurf Our ConSurf analysis revealed that all 29 expected SNPs including the Q277H (CHUK), E492G (CHUK), L450P (IRF5), W55R (MAP3K7), K291E (TBK1), C134R (TIRAP), I679T (TLR1), L545P (TLR3), R804W (TLR4) and R752G (TLR5) were located in highly conserved regions and predicted to have functional and structural impacts on TLRs pathway proteins (Table 6). solvent accessibility and three-dimensional analyzes of native and mutant protein structures By combining the results of SIFT, Poly-phen-2, PANTHER, SNPs & GO, I-Mutant 2.0, and ConSurf servers, 19 mutations were found to be more deleterious in candidate genes. Subsequently, these mutations were analyzed for solvent accessibility and stability, and the results were represented in the following paragraphs (see also Table 7). Visualization of structural features of wild type and mutant protein containing the mentioned deleterious variants was performed using the project HOPE and Swiss PDB viewer.
Supplementary Table 1

Surface accessibility of wild-type and mutant variants in TLRs network intermediate molecules.

Gene Symbol SNP AA substitution Class assignment AA AA position RSA ASA Z-fit score
1CHUKrs56948661P623LExposedExposedPL6236230.5440.53777.17998.3061.1901.088
2FOSrs74685695V77GBuriedBuriedVG77770.0820.15912.5812.55-0.799-0.920
3IRAK4rs55944915R391HExposedExposedRH3913910.5000.520114.4094.58-0.611-0.727
4IRAK4rs114820168R391CExposedExposedRC3913910.5000.477114.4067.04-0.611-0.891
5TBK1rs55824172S151FBuriedBuriedSF1511510.1320.11615.5223.300.068-0.048
6TLR1rs5743621P733LExposedExposedPL7337330.5750.56981.57104.230.6870.717
7TLR1rs41311402L697SBuriedBuriedLS6976970.0280.0305.053.490.9510.649
8TLR1rs117033348L144PBuriedBuriedLP1441440.0380.0356.925.0230.5030.657
9TLR2rs5743706Y715NBuriedBuriedYN7157150.1520.15332.4622.390.1930.253
10TLR2rs56303479L81PBuriedBuriedLP81810.0380.0296.934.100.3620.758
11TLR2rs121917864R677WBuriedBuriedRW6776770.2430.25555.6061.32-0.079-0.088
12TLR3rs5743316N284IBuriedBuriedNI2842840.0830.08812.1616.33-1.686-1.081
13TLR3rs111488413P880QExposedExposedPQ8808800.4010.44656.8879.650.1500.108
14TLR4rs77214890D181YBuriedBuriedDY1811810.2400.25834.5255.240.5280.277
15TLR4rs80197996L470FBuriedBuriedLF4704700.0900.08916.4017.880.0800.247
16TLR4rs55905951A676GBuriedBuriedAG6766760.0330.0343.622.71-0.046-0.158
17TLR5rs5744176D694GBuriedBuriedDG6946940.1640.17323.5713.64-0.270-0.384
18TLR6rs13102250L105WBuriedBuriedLW1051050.0300.0315.517.400.8430.799
19TLR9rs55881257R962CExposedExposedRC9629620.4190.46495.9565.200.0660.045

Abbreviations: RSA: Relative Surface Accessibility; ASA: Absolute Surface Accessibility. Values for wild type and mutant variants are presented by red and green color respectively

PANTHER and SNPs&GO results for prediction of SNPs as disease associated. Summary results of nsSNPs analysis by I-mutant and ConSurf. Abbreviations: DDG; free energy change value (DDG<0: Decrease Stability, DDG>0: Increase Stability). The pH and the temperature were set to7 and 25˚C for all submissions, respectively. F: functional residue; S: structural residue. The rs56948661 in CHUK gene leads to P623L. The residue is located on the surface of the protein and mutation of this residue can disturb the interactions with other molecules or other parts of the protein. Moreover, the mutation can disturb the special backbone conformation induced by proline. Conversion of V77G (rs74685695 in FOS) causes some structural changes in protein. Glycine residue is smaller than valine and this may lead to loss of the interactions. Furthermore, the mutant residue is more hydrophobic and flexible and can disturb the required rigidity of the protein on this position. For rs114820168 in IRAK4, the wild-type (arginine) and mutant (cysteine) amino acids differ in size, hydrophobicity and charge. The difference in charge will disturb the ionic interactions of the wild type residue with D388, E389 and D398. R391H is annotated with rs55944915 in dbSNP database. According to the PISA-database, the mutated residue is involved in a multimer contact. The new residue might be too small to make multimer contacts. In S151F variant, rs55824172 of TBK1 gene, the mutant residue (phenylalanine) is bigger and more hydrophobic than the wild-type (serine). This conversion will cause the loss of hydrogen bonds in the core of the protein resulting in the disruption of correct folding. We found that three SNPs in TLR1, including P733L (rs5743621), L697S (rs41311402) and L144P (rs117033348), were located in highly conserved regions and predicted to have functional and structural impacts on proteins. For P733L, the mutant residue (leucine) is bigger than the wild-type (proline) and is located on surface of the protein, potentially disturbing its interactions. For L697S and L144P, the mutant residues are smaller than the wild-type residues and will cause an empty space in the core of the protein. In addition, all three mutations are predicted to have functional and structural influences on TLR2 protein (Figure 5).
Fig 5

Deep view of superimposed structure of wild and mutant TLR2. A: L81P; B: R677W and C: Y715N. The protein and the side chains of the wild-type and the mutant residue are shown and colored grey, green and red, respectively.

Deep view of superimposed structure of wild and mutant TLR2. A: L81P; B: R677W and C: Y715N. The protein and the side chains of the wild-type and the mutant residue are shown and colored grey, green and red, respectively. Hydrogen bonding interactions and clashes of wild type and mutant TLR4 at position 181. A: the wild-type residue (D) forms hydrogen bonds (green discontinuous line) with L155, V157, A158, L182 and S183; B: substitution of this amino acid with tyrosine will cause loss of hydrogen bonds with A158, L182 and S183. Moreover, the mutation showed a network of clashes (pink discontinuous line) with A158 and S183 residues. Surface accessibility of wild-type and mutant variants in TLRs network intermediate molecules. Abbreviations: RSA: Relative Surface Accessibility; ASA: Absolute Surface Accessibility. Values for wild type and mutant variants are presented by red and green color respectively For L81P (rs56303479),because this residue is part of some interpro domains like leucine-rich repeat, typical subtype, the interaction between these domains could be disturbed by the mutation. The R677W (rs121917864) mutation leads to substitution of arginine by a bigger and more hydrophobic residue named tryptophan. The difference in charge will disturb the ionic interaction made by the arginine with E649 and 656. The third mutation of TLR2 occurs at position 715 (rs5743706). The hydrophobicity of the wild-type (tyrosine) and mutant residue (asparagine) differs and the mutation will cause the loss of hydrophobic interactions in the core of the protein. Finally, the size difference between residues makes that the new residue is not in the correct position to make the same hydrogen bond with S646, as the wild-type residue does. For N284I (rs5743316, in TLR3), due to the difference in hydrophobicity index of residues, the mutation will cause the loss of hydrogen bonds in the core of the protein and may lead to incorrect folding of protein. The second mutation of TLR3 (rs111488413) causes P880Q. This mutant residue is bigger than the wild-type residue and can disturb the protein interactions. Additionally, the hydrophobicity of the residue differs; hence, the mutation may cause the loss of hydrophobic interactions. Concerning D181Y mutation in TLR4 (rs77214890), the difference in charge will disturb the ionic interaction made by the original residue with R234. Moreover, the hydrophobicity of the native and mutant residue differs. Therefore, this mutation causes the loss of hydrogen bonds in the core of the protein leading to disruption of the correct folding (Figure 6). For rs80197996 (L470F) in TLR4, the mutant residue (phenylalanine) is bigger and probably will not fit to bury in the core of the protein. In A676G (rs55905951), the mutant residue is smaller than the wild-type residue. This will cause a possible loss of external interactions. Furthermore, the mutation may cause the loss of hydrophobic interactions with other molecules on the surface of the protein.
Fig 6

Hydrogen bonding interactions and clashes of wild type and mutant TLR4 at position 181. A: the wild-type residue (D) forms hydrogen bonds (green discontinuous line) with L155, V157, A158, L182 and S183; B: substitution of this amino acid with tyrosine will cause loss of hydrogen bonds with A158, L182 and S183. Moreover, the mutation showed a network of clashes (pink discontinuous line) with A158 and S183 residues.

Concerning rs5744176 (D694G) of TLR5, the wild-type residue forms a salt bridge with K692, R752 and K753. The difference in charge will disturb these ionic interactions. Moreover, the aspartic acid forms a hydrogen bond with N726, but due to difference in hydrophobicity, the mutation causes the loss of hydrogen bond. For the L105W (rs13102250) in TLR6, the wild-type (leucine) and mutant (tryptophan) amino acids differ in size. The wild-type residue was buried in the core of the protein, but the mutant residue is bigger and probably will not fit. For rs55881257 (R962C in TLR9) the charge of the wild-type residue will be lost; this can cause the loss of interactions with other molecules or residues. Furthermore, this mutation introduces a more hydrophobic residue at this position, probably resulting to loss of hydrogen bonds.

Discussion

TLRs signaling pathway plays a key role in the host innate immune response. Increasing evidence has suggested that functional SNPs of genes related to TLRs pathway may contribute to diseases ranging from chronic inflammatory to cancers. Since SNPs are the most common genetic variations in human genome, it is expected that genes involved in TLRs pathway contains numerous SNPs. Nevertheless, discriminating deleterious SNPs with potential effects on disease susceptibility from tolerated variants is a major challenge. Therefore, a comprehensive study that systematically analyzes the effects of such SNPs can cost-effectively prioritized SNPs for further analyzes. In-silico analysis of the deleterious effects of SNPs may help to improve our understanding on the biological pathways (22). In this study, we systematically analyzed the SNPs in different parts of genes (5′-UTR, 3′-UTR and coding) in TLRs pathway. A report has suggested that mutation effect prediction algorithms have their own strengths and weaknesses, and therefore, implementing a combination of these tools may help to enhance the accuracy of effect predictions (23). In the present study, we combined the results of the SIFT, PolyPhen, PANTHER, SNPs & GO, I-Mutant and ConSurf algorithms to prioritize the damaging nsSNPs and increase the analysis accuracy. Accordingly, we were able to identify several potentially deleterious nsSNPs in TLRs pathway genes. These SNPs, to the best of our knowledge, have not yet been investigated and therefore may be considered as candidates for association with diseases. These results may pave the ground for future functional and/or association studies and facilitate the process of choosing functional variant for further analyses. UTR-SNPs play important roles in gene regulation and accumulating evidence has indicated their contribution to different diseases. Sequence alteration in these regulatory elements has been shown to interfere with transcription factors or microRNA binding, leading to gene dysregulation (24-25). By applying a bioinformatics approach, we evaluated such effects of UTR-SNPs on TLRs pathway genes and identified numerous disease-associated variants that potentially confer the disease risk through affecting transcription factors or miRNAs binding. TLR9 rs187084, a UTR-SNP which probably interferes with transcription factors binding, has been shown to modify susceptibility to diseases specially renal transplant recipients and cancers (26-27). Several genes of TLRs pathway are regulated post-transcriptionally by miRNAs (28). Our analysis revealed that several SNPs of TLRs network resided in microRNA target sites (Table 3) that may potentially modify miRNA-mediated regulation of these genes. For instance, rs7744 in 3′-UTR of MYD88 and rs696 in 3′-UTR of NFKBIA genes could disrupt the binding of miR-520f-3p and miR-208a-3p, respectively. Matsunaga et al. showed that homozygous minor allele of rs7744 is associated with the severity of ulcerative colitis (29). Moreover, it has been shown that rs696 G>A is associated with the susceptibility to different diseases including coronary artery disease and Behçet's disease (30-31). In conclusion, the current study reports the first pathway-based bioinformatics analysis of SNPs in TLRs pathway genes and provides a prioritized list of functional SNPs potentially affecting regulation and function of the pathway. However, we noticed that the complexities of biological pathways merit the need for more experimentation to validate the true effect of these SNPs on TLRs network. Although the functional significance of the candidate SNPs was not experimentally assessed in this study, we believe that our results will help researchers interested in the roles of SNPs in TLRs pathways genes to focus on proper candidate variants.
  31 in total

Review 1.  The role of pattern-recognition receptors in innate immunity: update on Toll-like receptors.

Authors:  Taro Kawai; Shizuo Akira
Journal:  Nat Immunol       Date:  2010-04-20       Impact factor: 25.606

2.  The *1244 A>G polymorphism of MyD88 (rs7744) is closely associated with susceptibility to ulcerative colitis.

Authors:  Kazuhiro Matsunaga; Tomomitsu Tahara; Hisakazu Shiroeda; Toshimi Otsuka; Masakatsu Nakamura; Takeo Shimasaki; Nobuyuki Toshikuni; Natsuko Kawada; Tomoyuki Shibata; Tomiyasu Arisawa
Journal:  Mol Med Rep       Date:  2013-11-01       Impact factor: 2.952

Review 3.  Toll-like receptors and their crosstalk with other innate receptors in infection and immunity.

Authors:  Taro Kawai; Shizuo Akira
Journal:  Immunity       Date:  2011-05-27       Impact factor: 31.745

Review 4.  Toll-Like Receptor Pathways in Autoimmune Diseases.

Authors:  Ji-Qing Chen; Peter Szodoray; Margit Zeher
Journal:  Clin Rev Allergy Immunol       Date:  2016-02       Impact factor: 8.667

Review 5.  Genetic variation in Toll-like receptors and disease susceptibility.

Authors:  Mihai G Netea; Cisca Wijmenga; Luke A J O'Neill
Journal:  Nat Immunol       Date:  2012-05-18       Impact factor: 25.606

Review 6.  Prediction of deleterious nonsynonymous single-nucleotide polymorphism for human diseases.

Authors:  Jiaxin Wu; Rui Jiang
Journal:  ScientificWorldJournal       Date:  2013-01-30

7.  Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces.

Authors:  Hanka Venselaar; Tim A H Te Beek; Remko K P Kuipers; Maarten L Hekkelman; Gert Vriend
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

8.  Toll-like receptors and human disease: lessons from single nucleotide polymorphisms.

Authors:  Yi-Tzu Lin; Amanda Verma; Conrad P Hodgkinson
Journal:  Curr Genomics       Date:  2012-12       Impact factor: 2.236

9.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.

Authors:  Jun-Hao Li; Shun Liu; Hui Zhou; Liang-Hu Qu; Jian-Hua Yang
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

10.  PolymiRTS Database 3.0: linking polymorphisms in microRNAs and their target sites with human diseases and biological pathways.

Authors:  Anindya Bhattacharya; Jesse D Ziebarth; Yan Cui
Journal:  Nucleic Acids Res       Date:  2013-10-24       Impact factor: 16.971

View more
  2 in total

1.  Associations of Polymorphisms Localized in the 3'UTR Regions of the KRAS, NRAS, MAPK1 Genes with Laryngeal Squamous Cell Carcinoma.

Authors:  Ruta Insodaite; Alina Smalinskiene; Vykintas Liutkevicius; Virgilijus Ulozas; Roberta Poceviciute; Arunas Bielevicius; Laimutis Kucinskas
Journal:  Genes (Basel)       Date:  2021-10-23       Impact factor: 4.096

2.  Comprehensive Computational Analysis of Protein Phenotype Changes Due to Plausible Deleterious Variants of Human SPTLC1 Gene.

Authors:  Tayyaba Sadaf; Peter John; Attya Bhatti
Journal:  Int J Mol Cell Med       Date:  2019-04-23
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.