Literature DB >> 27331020

In silico analysis of deleterious single nucleotide polymorphisms in human BUB1 mitotic checkpoint serine/threonine kinase B gene.

Fatemeh Akhoundi1, Nikpour Parvaneh1, Emadi-Baygi Modjtaba1.   

Abstract

One of the major challenges in the analysis of human genetic variation is to distinguish mutations that are functionally neutral from those that contribute to disease. BubR1 is a key protein mediating spindle-checkpoint activation that plays a role in the inhibition of the anaphase-promoting complex/cyclosome (APC/C), delaying the onset of anaphase and ensuring proper chromosome segregation. Owing to the importance of BUB1B gene in mitotic checkpoint a functional analysis using different in silico approaches was undertaken to explore the possible associations between genetic mutations and phenotypic variation. In this work we found that 3 nsSNPs I82N, P334L and R814H have a functional effect on protein function and stability. A literature search revealed that R814H was already implicated in human diseases. Additionally, 2 SNPs in the 5' UTR region was predicted to exhibit a pattern change in the internal ribosome entry site (IRES), and eight MicroRNA binding sites were found to be highly affected due to 3' UTR SNPs. These in silico predictions will provide useful information in selecting the target SNPs that are likely to have functional impact on the BUB1B gene.

Entities:  

Keywords:  BUB1B gene; Pathogenic variants; Single nucleotide polymorphism (SNP); Spindle assembly checkpoint

Year:  2016        PMID: 27331020      PMCID: PMC4913181          DOI: 10.1016/j.mgene.2016.05.002

Source DB:  PubMed          Journal:  Meta Gene        ISSN: 2214-5400


Introduction

The spindle assembly checkpoint (SAC) is a cell-cycle surveillance mechanism that prevents premature anaphase entry until all chromosomes have completely aligned at the metaphase plate. SAC is composed of the checkpoint proteins BubR1, Bub3, and Mad2, associated with the APC/C coactivator Cdc20. The checkpoint system acts to inhibit the activity of the large multi-protein E3 ubiquitin ligase known as the anaphase promoting complex/cyclosome (APC/C), by binding to the co-activating subunit Cdc20. BubR1 is a key protein mediating spindle-checkpoint activation That directly binds to Cdc20 and inhibits APC/C activity (Kaisari et al., 2016). The corresponding BUB1B gene is located on chromosome 15q15 and is composed of 23 exons that encodes 1050 amino acids (Davenport et al., 1999, Hanks et al., 2012). BUB1B mutated in several cancers including colorectal, lung, breast, hematopoietic malignancies and in a rare human hereditary condition called premature chromatid separation syndrome (mosaic variegated aneuploidy) (Kapanidou et al., 2015, Kops et al., 2005, Hanks et al., 2004, Hanks et al., 2006, Matsuura et al., 2006, Suijkerbuijk et al., 2010, Ohshima et al., 2000). Single nucleotide polymorphisms (SNPs) found in any position throughout the genome in exons, introns, intergenic regions, promoters and enhancers (Drazen et al., 1999). Many SNPs are phenotypically neutral. However, others could predispose human to disease or influence their response to a drug. Nonsynonymous SNPs (nsSNPs) that lead to an amino acid substitution in the corresponding protein product are of particular interest as they are responsible for nearly half of the known gene lesions responsible for human inherited disease (Krawczak et al., 2000). Computational analyses of BUB1B gene for harmful nsSNPs have not been carried out until now; therefore, we applied different publicly available computational tools according to Fig. 1. The value and novelty of this study is to prioritize SNPs with functional significance from an enormous number of neutral non-risk alleles of BUB1B and provides new insights for further genetic association studies.
Fig. 1

Schematic representation of computational tools for in silico analysis of BUB1B gene.

Material and method

Dataset

The NCBI database of SNPs (Sherry et al., 2001), dbSNP available at http://www.ncbi.nlm.nih.gov/SNP and SWISSProt databases (Bairoch and Apweiler, 1996) were used to obtain the SNP information [SNP ID, amino acid position, mRNA accession number NM_001211, and Protein accession number NP_001202.4] of the human BUB1B gene for our computational analyses.

Predicting functional context of missense mutation

The functional context of nsSNPs was predicted using SIFT (Sorting Intolerant from Tolerant), PolyPhen 2.0, I-Mutant 3.0 and PROVEAN (Protein Variation Effect Analyzer) (Table 1).
Table 1

In silico approaches available as online tools.

ServerFeatureURLReference
SIFTFocuses more on the sequence preservation over the evolutionary time in predicting the effect of residue substitutions on function.http://sift.bii.a-star.edu.sg/index.htmlKumar et al. (2009), Magesh and Doss (2014)
PolyPhen 2.0Sequence and structure based method that predicts the possible impact of an amino acid substitution on the structure and function of a protein.http://genetics.bwh.harvard.edu/pph2Adzhubei et al. (2010)
I-Mutant 3.0Support vector machine (SVM) based predictors of protein stability changes upon single amino acid substitution.http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgiCapriotti et al. (2008)
PROVEANSequence based predictor that estimates whether a protein sequence variation affects protein function.http://provean.jcvi.orgChoi et al. (2012)
SNP&GOSupport vector machine (SVM) based web server that combine protein structural/functional parameters and sequence analysis derived information.http://snps.biofold.org/snps-and-go/snps-and-go.htmlMagesh and Doss (2014)
PhD-SNPSVM based on evolutionary information.http://gpcr.biocomp.unibo.it/cgi/predictors/PhD-SNP/PhD-SNP.cgiMagesh and Doss (2014)
PANTHERProtein family and subfamily database that predicts the frequency of occurrence of amino acid at a particular position in evolutionary related protein sequences.http://pantherdb.org/tools/csnpScoreForm.jspMi et al. (2005)
UTRscanAnalyzing functional impacts of UTR SNPs.http://www/.ba.itb.cnr.it/BIG/UTRScanPesole et al. (1999)
NetSurfPAnalysis of SNP effects on surface and solvent accessibility of protein.http://www.cbs.dtu.dk/services/NetSurfPPetersen et al. (2009)
I-TASSERProtein structure prediction server.http://zhanglab.ccmb.med.umich.edu/I-TASSERZhang (2008)
HOPE ProjectAn automatic mutant analysis server for studying the structural features of native protein and the variant models.http://www.cmbi.ru.nl/hope/homeVenselaar et al. (2010)
STRINGDatabase of known and predicted protein-protein interactions.http://string-db.orgVon Mering (et al. 2005)
PolymiRTSThe polymorphic microRNA target sites are classifies into four classes: ‘D’ (the derived allele disrupts a conserved microRNA site), ‘N’ (the derived allele disrupts a nonconserved microRNA site), ‘C’ (the derived allele creates a new microRNA site) and ‘O’ (other cases when the ancestral allele cannot be determined unambiguously).http://compbio.uthsc.edu/miRSNP/Bhattacharya et al. (2013)
1000 Genomes ProjectA resource about human genetic variation that will be used in many studies of particular phenotypes, such as complex diseases or drug response.http://www.1000genomes.orgVia et al. (2010)
SIFT predict whether an amino acid substitution in a protein would be tolerated or damaging. The amino acid substitution is predicted damaging when the score is below or equal to 0.05, and tolerated if the score is greater than 0.05 (Ng and Henikoff, 2003). PolyPhen input is the amino acid sequence of protein or SNP identifier with the nsSNP. The output levels of probably damaging and possibly damaging were classified as functionally significant (≤ 0.5) and the benign level being classified as tolerated (≥ 0.5) (Ramensky et al., 2002). I-Mutant 3.0 performed analyses based on the protein sequence combined with mutational position and correlated new residue and the output result of the predicted free energy change (DDG) classifies the prediction into one of three classes: largely unstable (DDG < − 0.5 kcal/mol), largely stable (DDG > − 0.5 kcal/mol), or neutral (− 0.5 ≤ DDG ≥ 0.5 kcal/mol) (Capriotti et al., 2008). PROVEAN is able to provide predictions for any type of protein sequence variations including amino acid substitutions, and in-frame insertions and deletions (Choi et al., 2012). The PROVEAN predict a protein variant to be neutral if the score is above the threshold. The cutoff score − 2.5 indicates a deleterious substitution (Manickam et al., 2014). Furthermore, we used SNP&GO, PHD-SNP Predictor of human deleterious single nucleotide polymorphisms and PANTHER (Protein Analysis Through Evolutionary Relationships) tools to filter the disease-associated nsSNPs (Table 1). SNP&GO predict SNPs are or are not disease-associated with including the protein FASTA sequence and Gene Ontology terms. The probability score higher than 0.5 indicates the disease related effect of mutation on the parent protein function (Calabrese et al., 2009). PhD-SNP predicts whether the given amino acid substitution leads to disease associated or neutral along with the reliability index score (Capriotti et al., 2006). PANTHER comprehensive software system predicts the likelihood of a particular nsSNP to cause a functional impact on a protein. The cutoff subPSEC score − 3 indicates a deleterious substitution (Thomas et al., 2003).

Biophysical validation of nsSNPs

NetSurfP predicts the surface and, solvent accessibility of amino acids, using the amino acid FASTA sequence. The solvent accessibility has been predicted in two classes as either buried or exposed, based on the accessibility of the amino acid residues to the solvent, respectively. The reliability of this prediction method is in the form of Z-score. The Z-score highlights the surface prediction reliability, but is not associated with the secondary structure (Petersen et al., 2009). Finding 3D structure of proteins is helpful in predicting the impact of SNPs on the structural level and in showing the degrees of alteration. I-TASSER generates a full length model of proteins by excising continuous fragments from threading alignments and then reassembling them using replica-exchanged Monte Carlo simulations. Low temperature replicas (decoys) generated during the simulation are clustered by SPICKER and the top five cluster centroids are selected for generating full atomic models. The quality of prediction models was reflected in the form of c-scores (− 5 to 2) (Roy et al., 2010, Roy et al., 2012). The native structure was mutated with the most deleterious amino acid substitution predicted in this study, using Swiss PDB viewer and Chimera (Kaplan and Littlejohn, 2001, Pettersen et al., 2004). In addition, we used HOPE Project that provides the 3D structural visualization of mutated proteins, and gives the results by using UniProtKB and predictions from DAS-servers. FASTA sequence of whole protein and selection of mutant variants is considered to be an input option, the output is based on the structural variation between the mutant and the wild-type residues (Venselaar et al., 2010).

Predictions of protein-protein interactions

STRING (Search Tool for the Retrieval of Interacting proteins) is a database and web resource dedicated to protein–protein interactions, including direct (physical) and indirect (functional) interactions (Jensen et al., 2009); the database contains information from: genomic context, experimental repositories, co-expression and public text collections (Szklarczyk et al., 2011).

Functional SNPs in UTR found by the UTRscan

The UTRscan program allows one to search the user-submitted sequences for any of the patterns collected in the UTR site (Grillo et al., 2010). If different sequences for each UTR SNP are found to have different functional patterns, that the particular UTR SNP is predicted to have functional significance.

PolymiRTS database (version 3.0) for polymorphism in microRNA target site

PolymiRTS database was designed specifically for the analysis of non-coding SNPs namely 3′ UTR. The polymorphic microRNA target sites are classified into four classes according to Table 1 (Bhattacharya et al., 2013). PolymiRTS of ‘D’ may cause loss of normal repression; PolymiRTS of class ‘C’ may cause abnormal gene repression control. Therefore, these two classes of PolymiRTS are most likely to have functional impacts.

1000 Genomes Project

The 1000 Genomes Project (Consortium, 2010) is sequencing the entire genome of approximately 2500 individuals from different worldwide populations. The aim of the1000 Genomes Project is to determine most of the genetic variation that occurs at a population frequency greater than 1%.

Results

SNP dataset from dbSNP

The BUB1B gene investigated in this work was retrieved from dbSNP database (Table 4). It contained a total of 827 SNPs: 90 were non-synonymous SNPs (nsSNPs), 57 were in non-coding regions, which comprises of 21 SNPs in 5′ UTR region and 36 SNPs in 3′ UTR region. The rest were in the intron region. We selected non-synonymous coding SNPs, 5′ and 3′ UTR region SNPs for our investigation.
Table 4

Surface accessibility of wild-type and mutant variants in BUB1B.

Amino acidClass assignmentPositionRelative surface accessibilityAbsolute surface accessibilityZ-fit score for RSA prediction
IBuried820.023.8291.418
NBuried0.0223.1771.383
PExposed3340.54677.520− 1.915
L0.34963.975− 0.779
RExposed8140.43699.798− 0.340
HExposed0.42376.998− 0.280

Prediction of functional mutations

Of the 90 nsSNPs used in our analysis, 18 nsSNPs were identified to be deleterious with SIFT and the results were listed in Table 2.
Table 2

List of nsSNP analysis by SIFT, PolyPhen-2, I-Mutant 3.0 PROVEAN respectively.

rsIDAmino acid changeSIFTScorePolyPhen-2ScoreI-Mutant 3.0ScorePROVEANScore
rs38678332G37VTolerated0.17Benign0.181Large decrease− 0.46Neutral− 1.187
rs52798733E641VDamaging0.03Probably damaging0.981Large decrease− 0.04Neutral− 2.256
rs53178613R256KTolerated0.35Probably damaging0.992Large decrease− 0.52Neutral− 0.978
rs53231959K170EDamaging0.05Possibly damaging0.897Neutral− 0.53Deleterious− 3.155
rs53396744I272NDamaging0.02Probably damaging0.980Large decrease 1.75Deleterious 3.292
rs53429711R36QDamaging0Probably damaging1.000Neutral− 0.65Deleterious− 3.718
rs54188126I625MTolerated0.11Probably damaging0.980Large decrease− 1.71Neutral− 0.454
rs54578440A348VTolerated0.25Probably damaging1.000Large decrease0.01Neutral− 1.909
rs54653854I156VTolerated0.93Probably damaging0.966Large decrease− 1.01Neutral− 0.489
rs54660763H836YTolerated1Benign0.004Neutral0.44Neutral− 2.169
rs54865001M353TTolerated0.48Probably damaging0.992Large decrease− 0.62Neutral− 2.259
rs55238070A173VTolerated0.27Benign0.034Large decrease− 0.45Neutral− 0.601
rs55342059K539QTolerated0.24Possibly damaging0.682Large decrease− 0.79Neutral− 0.111
rs55355571I82NDamaging0Probably damaging1.000Large decrease 2.12Deleterious 6.284
rs55478232G8ATolerated0.09Benign0.018Large increase− 0.11Neutral− 0.595
rs55619315R616HTolerated0.52Benign0.003Large decrease− 1.29Neutral1.972
rs55752197R677HTolerated0.12Benign0.002Large decrease− 1.39Neutral− 0.882
rs55935830D576ETolerated1Benign0.003Neutral− 0.14Neutral0.056
rs56791614R194QTolerated0.2Probably damaging1.000Neutral− 0. 45Neutral− 0.742
rs57105655T100MTolerated0.06Probably damaging0.998Large increase− 0.16Neutral− 1.801
rs57153880G316DTolerated0.37Benign0.024Large decrease− 0.76Deleterious− 3.017
rs57759191R727CTolerated0.09Probably damaging1.000Large decrease− 0.69Neutral− 2.061
rs28989181L844FDamaging0Probably damaging0.998Large decrease− 0.82Neutral2.454
rs28989182R814HDamaging0Probably damaging1.000Large decrease 1.43Deleterious 2.880
rs28989187R550QTolerated0.86Benign0.001Large decrease− 0.78Neutral0.332
rs56079734T40MDamaging0.05Probably damaging1.000Neutral0.01Neutral− 1.811
rs1017842E390DTolerated0.46Benign0.102Large decrease− 0.31Neutral− 0.530
rs1801528V618ATolerated1Benign0.000Large decrease− 1.43Neutral1.441
rs17851677P378SDamaging0.04Possibly damaging0.804Large decrease 1.10Deleterious 3.228
rs28989188E409DTolerated0.24Probably damaging1.000Large decrease− 0.49Neutral− 1.355
rs35923791N133STolerated1Benign0.248Large decrease− 0.35Neutral− 0.664
rs56158360R244HDamaging0.02Probably damaging1.000Large decrease 1.08Deleterious 4.461
rs75763304Q460KTolerated0.72Benign0.072Neutral0.13Neutral− 1.236
rs76546181F531STolerated0.56Probably damaging1.000Large decrease− 1.68Neutral− 2.074
rs77520855Y162HDamaging0.04Probably damaging0.960Large decrease− 1.09Neutral− 2.414
rs117485407T471MTolerated0.1Possibly damaging0.579Neutral0.32Neutral− 0.615
rs138332995P544STolerated0.74Benign0.181Large decrease− 1.59Neutral− 1.109
rs139226455P800STolerated0.07Possibly damaging0.839Large decrease− 1.85Deleterious− 3.780
rs140368608K779RTolerated0.64Benign0.073Neutral− 0.30Neutral− 1.176
rs141953425P334LDamaging0.01Possibly damaging0.453Large decrease 0.80Deleterious 5.244
rs142705245A784VTolerated0.38Benign0.019Large decrease− 0.20Neutral0.604
rs143346774H850RTolerated0.61Probably damaging0.998Neutral0.16Neutral− 1.165
rs143559902D675ETolerated1Benign0.003Neutral− 0.37Neutral− 0.008
rs145026343C825FTolerated0.35Probably damaging0.966Large decrease− 0.13Deleterious− 4.601
rs145028054E184QDamaging0.01Benign0.362Neutral0.07Neutral− 2.086
rs145184714A335TTolerated0.84Benign0.005Large decrease− 0.78Neutral0.308
rs145578529I567VTolerated0.3Possibly damaging0.512Large decrease− 1.10Neutral− 0.081
rs146387899L258FTolerated0.81Benign0.074Large decrease− 0.64Neutral− 1.786
rs146795655T493ITolerated0.16Benign0.001Neutral− 0.26Neutral− 0.913
rs146821149R886SDamaging0.03Benign0.002Large decrease− 1.25Neutral− 0.328
rs147150527G376VTolerated0.17Benign0.181Large decrease− 0.46Neutral− 1.187
rs147549987V4MTolerated0.23Benign0.000Neutral− 0.68Neutral− 0.282
rs147832586S83GDamaging0.01Benign0.061Large decrease− 0.86Neutral− 1.807
rs148159407N26DTolerated0.52Possibly damaging0.913Neutral− 0.43Deleterious− 2.827
rs148348158T648ITolerated0.19Benign0.000Large decrease0.25Neutral− 1.427
rs149628229D579GTolerated0.1Benign0.328Large decrease− 1.20Neutral− 1.894
rs149955447E813ATolerated0.07Probably damaging0.997Large decrease− 0.69Neutral− 2.258
rs150707631S797ATolerated0.13Possibly damaging0.495Large decrease− 0.79Neutral− 1.043
rs150983783R421QTolerated0.15Benign0.178Neutral− 0.79Neutral− 1.197
rs181352808H836QTolerated0.29Probably damaging0.959Neutral− 0.22Deleterious− 2.528
rs184449375M626VTolerated0.61Benign0.000Large decrease− 0.84Neutral0.072
rs190909040Y343FTolerated0.14Possibly damaging0.925Large decrease− 0.13Neutral− 1.814
rs199509124P222LDamaging0.01Possibly damaging0.774Large increase 0.20Deleterious 4.125
rs199743655V274ADamaging0Possibly damaging0.866Large decrease 1.00Deleterious 3.319
rs200060772S691LTolerated0.08Probably damaging0.999Neutral− 0.32Neutral− 2.173
rs200788206Q350KTolerated0.79Probably damaging0.984Large decrease0.10Neutral− 1.593
rs200997833K542RTolerated0.35Benign0.055Neutral− 0.28Neutral− 0.375
rs201251790R421WTolerated0.09Benign0.021Neutral− 0.38Neutral− 1.629
rs201360106E21KTolerated0.09Probably damaging1.000Large decrease− 0.63Deleterious− 2.504
rs202114756S384GTolerated0.31Possibly damaging0.860Large decrease− 0.57Neutral− 1.503
rs202132335A739GDamaging0.0Benign0.072Large decrease− 1.08Neutral− 1.113
rs367543489Q829ETolerated0.11Probably damaging0.999Neutral− 0.10Neutral− 1.197
rs368023159K488NTolerated0.15Benign0.004Neutral− 0.60Neutral− 0.119
rs368079817Q42RTolerated0.25Probably damaging0.999Neutral− 0.06Neutral− 1.985
rs368996088F781LTolerated0.16Benign0.075Large decrease− 1.32Neutral− 1.964
rs370388424P640LTolerated0.29Benign0.020Large decrease− 0.60Neutral− 0.985
rs370655726C356STolerated0.74Probably damaging1.000Large decrease− 0.69Deleterious− 3.683
rs371124423C700RTolerated0.38Benign0.068Large decrease− 0.26Neutral− 1.228
rs371305662T291KTolerated0.84Benign0.100Large decrease− 0.46Neutral− 1.934
rs372003254D846ETolerated0.1Possibly damaging0.626Neutral0.24Neutral− 1.695
rs372569297I755TTolerated0.75Benign0.000Large decrease− 1.59Neutral− 0.814
rs373256667K454RTolerated0.21Probably damaging1.000Neutral0.12Neutral− 0.935
rs373789523T658ITolerated0.16Benign0.000Large decrease0.25Neutral− 1.951
rs373830262A108TTolerated0.49Probably damaging0.982Large decrease− 0.71Neutral− 0.949
rs374682772V333ITolerated0.35Benign0.002Large decrease− 0.84Neutral− 0.423
rs375105548I854VTolerated0.44Benign0.007Large decrease− 0.65Neutral− 0.137
rs375388175I703TTolerated0.76Probably damaging0.985Large decrease− 1.94Neutral− 1.952
rs375798678Q181RTolerated1Benign0.000Neutral0.06Neutral0.416
rs375885859C51RTolerated0.11Benign0.000Neutral0.07Neutral− 1.379
rs376072541P632LTolerated0.1Benign0.001Large decrease− 0.40Neutral− 2.406
A total of 46 nsSNPs was predicted to be damaging and the remaining 44 nsSNPs were categorized as benign with Polyphen 2.0 and the results were listed inTable 2. Out of 90 nsSNPs, 27 nsSNP were predicted to be neutral mutation (− 0.5 ≤ DDG ≤ 0.5 kcal/mol), 60 nsSNP were predicted to be “large decrease” (≤−0.5 kcal/mol) and 3 nsSNP were predicted to be “large increase” (> 0.5 kcal/mol). I-Mutant 3.0 predicted 63 of SNPs to affect the stability of protein structure (Table 2). All the nsSNPs submitted to SIFT and PolyPhen 2.0 and I-Mutant 3.0 were submitted as input to PROVEAN. Out of 90 nsSNPs, 17 nsSNP were predicted to be deleterious and 73 were found to be neutral (Table 2). The accuracy of the in silico techniques for prioritizing deleterious SNPs can be increased by combining different computational methods. Out of 90 nsSNPs, SIFT, PolyPhen, I-Mutant 3.0 and PROVEAN predicted 8 nsSNPs as deleterious (Fig. 2).
Fig. 2

List of functionally significant mutations.

SNPs&GO, PhD-SNP and PANTHER were performed to validate the results obtained from four tools. Out of 8 nsSNPs that predicted to be deleterious with SIFT, Polyphen, I-Mutant and PROVEAN; SNP&GO predicted 3 nsSNP, PhD-SNP predicted 4 nsSNP and PANTHER predicted 5 nsSNP to be associated with disease (Table 3).
Table 3

List of nsSNP predicted as disease associated byPHD-SNP,SNP&GO and PANTHER server.

rsIDAmino acid changePHD-SNPSNP&GOPANTHERsubPSEC score
rs527987333I82NDiseaseDiseaseDeleterious− 5.89613
rs53396744I272NDiseaseNeutralTolerated− 2.75316
rs199743655V274ANeutralNeutralTolerated− 2.50206
rs199509124P222LNeutralNeutralDeleterious− 3.07979
rs141953425P334LDiseaseDiseaseDeleterious− 3.73233
rs56158360R244HNeutralNeutralDeleterious− 4.34981
rs17851677P378SNeutralNeutralDeleterious− 3.97828
rs28989182R814HDiseaseDiseaseDeleterious− 7.86508
Finally out of 90 nsSNP, we found 3 nsSNPs namely rs55355571 (I82N), rs141953425 (P334L) and rs28989182 (R814H) that are common in all (SIFT, Polyphen, I-Mutant, PROVEAN, PHD-SNP, SNP&GO, PANTHER) prediction.

In silico biophysical validation of nsSNPs

Based on the in silico analyses performed, 3 nsSNPs was selected for further analyses. The location and the type of a mutated residue affect the stability of the protein. In particular, as the solvent accessibility of a residue decreases, the stability of the protein due to mutation decreases. NetSurfP Z-score allows the identification of the most reliable/unreliable predictions for both buried and exposed amino acids. A huge drift in the Z-score was not observed for 3 nsSNPs as given in Table 5. For any of 3 nsSNPs, the class assignment does not change.
Table 5

Top 10 templates used by I-TASSER to create the high quality models for human BUB1B secondary structure.

RankPDB hitIden1Iden2Cov.Norm. Z-score
13e7eA0.220.080.311.71
21vw1A0.080.200.872.36
33e7eA0.220.080.311.40
44jspB0.090.190.921.44
53e7eA0.230.080.302.68
63cm9S0.070.110.551.19
73e7eA0.220.080.312.84
84jspB0.080.190.892.06
93e7eA0.220.080.311.51
104kf7A0.100.190.852.31
The I-TASSER tool created the 5 full-length models for BubR1 protein (with C-scores: − 0.24, − 1.01, − 2.83, − 2.95 and − 3.00) by excising top 10 structures with C-scores after targeting the PDB library hits (Table 5).Top 10 proteins in the PDB which are structurally closest to the predicted models. Among the 5 predicted models, model 1 (Fig. 3) carried the high-quality confidence in the form of C-score (− 0.24), TM-score (0.68 ± 0.12), and the RMSD (9.6 ± 4.6 Å) (Table 6). We did not perform any molecular dynamics structure optimization; therefore, our 3D homology model is a preliminary model implicating the disruptive role of the SNPs.
Fig. 3

3D structure of BubR1 predicted with I-TASSER.

Table 6

I-TASSER results carrying C-score, TM-score and RMSD regarding selected secondary structure (native protein model 1).

ModelC-scoreExp. TM-scoreExp. RMSDNo. of decoysCluster density
Model 1− 0.240.68 ± 0.129.6 ± 4.6 Å3120.3039
Model2− 1.012580.1404
Model3− 2.83600.0228
Model4− 2.95570.0202
Model5− 3.00570.0191
Project Hope revealed the 3D structure of the proteins with its new residue. Furthermore, it described the reaction and physiochemical properties of these candidates. Here we present the results upon each candidate and discuss the conformational variations and interactions with the neighboring amino acids: A/G Mutation (rs28989182) caused substitution of the amino acid from Arginine into a Histidine at position 814 (R814H). For this variant the mutated residue is smaller (Fig. 1); this might lead to loss of interactions. The wild-type residue was positively charged, the mutated residue is neutral. Only this residue type was found at this position. Mutation of a 100% conserved residue is usually damaging for the protein. Additionally, the structural analysis of H814 showed some clashes for Phe822 which may contribute to the extra energy in the protein structure, and hence the decrease in stability (Fig. 4).
Fig. 4

Deep view of superimposed structure of wild and mutant residue at 814 position. The main protein core is shown in gray color while the wild type and mutated residues are shown in green and red colors respectively. SNP ID: rs28989182, protein position 814 changed from Arginine to Histidine.

A/T mutation (rs553555716) resulted in a change of the Isoleucine to Aspargine at position 82 (I82N). The wild type residue is smaller and more hydrophobic than the mutated residue (Fig. 6). The residue is buried in the core of a domain, annotated in UniProt as: “BUB1 N terminal”. The mutation will cause loss of hydrophobic interactions in the core of the protein. This residue is part of an interprotein domain named “Mitotic checkpoint serine/threonine protein kinase Bub1/Mitotic spindle checkpoint component Mad3” (IPR015661).
Fig. 6

Superimposed structure of (wild type) Ile and (mutant) Asn residues at 82 position.

C/T (rs141953425) leads to conversion of Proline into a Leucine at position 334 (P334L). The wild-type and mutant amino acid differs in sizes; the mutated residue is bigger and this might lead to displace of the mutant residue. Prolines are known to have a very rigid structure, sometimes forcing the backbone in a specific conformation. The mutation can disturb this special conformation. This residue is part of an inter protein domain named “Mitotic checkpoint serine/threonine protein kinase Bub1/Mitotic spindle checkpoint component Mad3”. Chimera (Fig. 5, Fig. 7, Fig. 8) and Swiss PDB viewer were used to visualize the structural features of amino acids in native and mutant protein chains. During structural visualization for all 3mutations, only mutant residue (Histidine) at 814 position showed a network of clashes with Phe822 (Fig. 9).
Fig. 5

SNP ID: rs28989182, protein position 814 changed from Arginine (green) to Histidine (red).

Fig. 7

SNP ID: rs553555716, protein position 82 changed from Isoleucine (left image) to Aspargine (right image).

Fig. 8

SNP ID: rs141953425, protein position 334 changed from Proline (left image) to Leucine (right image).

Fig. 9

H-bonding (green discontinuous line) interactions and clashes (pink discontinuous line) of wild type and mutant analogues with the vicinal amino acid residues. (a) At 814 position 2 H-bond is observed with Leu811and Glu817 in both native (Arg) and mutant (His814) structures, but a network of clashes appeared between His814and Phe822. (b) At 82 position, 4 H-bond is observed with Trp78, Asp79, Thr85 and Glu86 in both native (Ile) and mutant (Asn) structures. (c) At 334 position, 2 H-bond is observed with Leu330 and Pro338 in both native (Pro) and mutant (Leu) structures.

Protein-protein interactions analysis

The interaction analysis revealed that BUB1B is related to Cell Division Cycle 20 homolog (CDC20), Budding Uninhibited By Benzimidazoles 3 (BUB3), Cancer Susceptibility Candidate 5 (CASC5), MAD2 Mitotic Arrest Deficient-Like 1 (MAD2L1), Cell Division Cycle 27 homolog (CDC27), Centromere Protein E (CENPE), BUB1 Mitotic Checkpoint Serine/Threonine Kinase (BUB1), ZW10 Interacting Kinetochore Protein (ZWINT), Anaphase Promoting Complex Subunit 2 (ANAPC2), Cell Division Cycle 16 homolog (CDC16) (Fig. 10). Furthermore, our literature search demonstrated that BubR1 interacts with Bub3, Cdc20, and Mad2 (Kapanidou et al., 2015).
Fig. 10

Protein–protein interaction network of BUB1B using STRING 9.0 server.

Functional SNPs in UTR found by UTRscan server

All of the 57 UTR SNPs was analyzed using UTRscan. After comparing the functional elements for each UTR SNP, we predicted that two SNPs, namely rs375434078 and rs538302864 in 5′ UTR are related to the functional pattern change of IRES (Table 7). Internal ribosome entry site (IRES) is bound by an internal mRNA ribosome that is an alternative mechanism of translation initiation compared to the common 5′-cap dependent ribosome scanning mechanism (Pickering and Willis, 2005).
Table 7

List of mRNA UTR SNPs that were predicted to be of functional significance by UTRscan server.

SNP IDNucleotide changeUTR positionFunctional element change
rs375434078C/T5′ UTRIRES → no pattern
rs538302864A/G5′ UTRIRES → no pattern

Functional SNPs in 3′ untranslated regions (UTR) predicted by PolymiRTS database 3.0

Among 36 SNPs in 3′ UTR region of BUB1B gene, 3 functional SNPs was predicted that among them, only one SNP disrupts 8 miRNAs conserved site (ancestral allele with support ≥ 2), while all of them create 8 new miRNA site. The results are listed in Table 8.
Table 8

Prediction result of PolymiRTS database.

dbSNP IDmiR IDConservationmiRSiteFunction class
rs149437374hsa-miR 130a -3p2ATGCACTAccattD
hsa-miR-130b-3p
hsa-miR-301a-3p2ATGCACTAccattD
hsa-miR-301b
hsa-miR-36662ATGCACTAccattD
hsa-miR-4295
hsa-miR-454-3p2ATGCACTAccattD
hsa-miR-4671-3p2ATGCACTAccattD
hsa-miR-323a-5p2ATGCACTAccattD
hsa-miR-876-3p2ATGCACTAccattD
2ATGCACTAccattD
5atgcACCACCAttC
5atgcACCACCAttC
rs143807849hsa-miR-539-5p4CcATTTCTCtctaC
hsa-miR-5680
hsa-miR-6758-5p5CCATTTCTctctaC
hsa-miR-6856-5p4CcatttCTCTCTAC
4CcatttCTCTCTAC
rs1047193hsa-miR-44504atgATCCCCAtgtC
hsa-miR-6857-5p4atgATCCCCAtgtC

Discussion

The identification of SNPs responsible for specific phenotypes with molecular approaches seems to be expensive and time-consuming (Chen and Sullivan, 2003), hence computational approaches can help in narrowing down the number of missense mutations to be screened in genetic association studies and for a better understanding of the functional and structural aspects of the parent protein. Previous studies on polymorphisms screening using in silico analysis helped in predicting the functional nsSNPs associated with genes such as G6PD (Rajith, 2011), ATM (Doss and Rajith, 2012), PTEN (Doss and Rajith, 2013), BRAF (Hussain et al., 2012). Our results also revealed that implementations of different algorithms often serve as powerful tools for prioritizing candidate functional nsSNPs. Recent work by Thusberg and Vihinen (2009) compared different in silico tools, out of which SIFT and PolyPhen were reported to have better performance in identifying deleterious nsSNPs. The accuracy of SIFT and PolyPhen 2.0 was further validated by Hicks et al. (2011), which makes these tools more applicable for the prediction. I-Mutant 3.0 was used which evaluate the stability change upon single amino acid mutation that ranked as one of the most reliable predictor based on the work performed by Khan and Vihinen (2010). Based on these in silico studies, we select SIFT, PolyPhen, I-Mutant, PROVEAN, SNP&GO, PHD-SNP and PANTHER for the screening of functional mutation in BUB1B gene. By comparing the scores of all 7 methods, 3 nsSNPs with positions I82N, P334L and R814H were found to be highly significant. The 5′ and 3′ UTR SNPs was analyzed using UTRscan. Due to the importance of the translational regulation of microRNAs, we further studied whether the 3′ UTR SNPs changes the profile of microRNA binding to the BUB1B gene using PolymiRTS. Two SNPs in the 5′ UTR was predicted to influence the translation pattern of the BUB1B gene through UTRscan analysis, and three 3′ UTR SNPs may affect microRNA binding sites, as determined through PolymiRTS. Protein-protein interaction analysis showed the interaction of BUB1B with ten different genes. Therefore, any changes in the protein function would have an impact on many pathways involved in disease. In conclusion, we surveyed and compared available databases such as NCBI, dbSNP, 1000 genome project along with in silico prediction programs to assess the effects of deleterious functional variants on the protein functions. Analyzing deleterious nsSNPs by both sequence and structure level has the added advantage of being able to assess the reliability of the generated prediction results by cross-referencing the results from both approaches. One striking observation was the identification of rs28989182 (R814H), that associated with Mosaic Variegated Aneuploidy Syndrome (Bairoch and Apweiler, 1996), lie within a serine/threonine kinase domain of BubR1 protein. Only this residue type was found at this position. Mutation of a 100% conserved residue is usually damaging for the protein. Both I82N and P334L mutations occurred in the N terminal region of BubR1; Therefore, these mutations may compromise its binding to Bub1, Mad2 and cdc20 resulting plausible failure of the corresponding checkpoint. In addition rs149437374 and rs143807849 in 3′ UTR that disrupts a conserved of 8 miRNAs site are genotyped by 1000 genome project; Based on the data obtained through determining the allele frequency in 1000 genome populations, it is observed that the frequency of normal allele is more than the mutant allele. Therefore, it is concluded that rs149437374 and rs143807849 in 3′ UTR are deleterious, so that in different 1000 genome population, it has a low frequency; hence allele frequency reported in 1000 genome project confirmed our results. These results indicate that our approach successfully allowed us in selecting the deleterious SNPs that are likely to have functional impact on the BUB1B gene and contribute to an individual's susceptibility to the disease.
  48 in total

Review 1.  Pathogenic or not? And if so, then how? Studying the effects of missense mutations using bioinformatics methods.

Authors:  Janita Thusberg; Mauno Vihinen
Journal:  Hum Mutat       Date:  2009-05       Impact factor: 4.878

2.  UTRdb: a specialized database of 5' and 3' untranslated regions of eukaryotic mRNAs.

Authors:  G Pesole; S Liuni; G Grillo; M Ippedico; A Larizza; W Makalowski; C Saccone
Journal:  Nucleic Acids Res       Date:  1999-01-01       Impact factor: 16.971

3.  In silico analysis of Single Nucleotide Polymorphisms (SNPs) in human BRAF gene.

Authors:  Muhammad Ramzan Manwar Hussain; Noor Ahmad Shaik; Jumana Yousuf Al-Aama; Hani Z Asfour; Fatima Subhani Khan; Tariq Ahmad Masoodi; Muhammad Akhtar Khan; Nazia Sultana Shaik
Journal:  Gene       Date:  2012-07-21       Impact factor: 3.688

4.  Human gene mutation database-a biomedical information and research resource.

Authors:  M Krawczak; E V Ball; I Fenton; P D Stenson; S Abeysinghe; N Thomas; D N Cooper
Journal:  Hum Mutat       Date:  2000       Impact factor: 4.878

5.  The SWISS-PROT protein sequence data bank and its new supplement TREMBL.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  1996-01-01       Impact factor: 16.971

6.  Pharmacogenetic association between ALOX5 promoter genotype and the response to anti-asthma treatment.

Authors:  J M Drazen; C N Yandava; L Dubé; N Szczerback; R Hippensteel; A Pillari; E Israel; N Schork; E S Silverman; D A Katz; J Drajesk
Journal:  Nat Genet       Date:  1999-06       Impact factor: 38.330

7.  Human non-synonymous SNPs: server and survey.

Authors:  Vasily Ramensky; Peer Bork; Shamil Sunyaev
Journal:  Nucleic Acids Res       Date:  2002-09-01       Impact factor: 16.971

8.  The mouse mitotic checkpoint gene bub1b, a novel bub1 family member, is expressed in a cell cycle-dependent manner.

Authors:  J W Davenport; E R Fernandes; L D Harris; G A Neale; R Goorha
Journal:  Genomics       Date:  1999-01-01       Impact factor: 5.736

9.  I-TASSER server for protein 3D structure prediction.

Authors:  Yang Zhang
Journal:  BMC Bioinformatics       Date:  2008-01-23       Impact factor: 3.169

10.  PolymiRTS Database 3.0: linking polymorphisms in microRNAs and their target sites with human diseases and biological pathways.

Authors:  Anindya Bhattacharya; Jesse D Ziebarth; Yan Cui
Journal:  Nucleic Acids Res       Date:  2013-10-24       Impact factor: 16.971

View more
  3 in total

1.  Phenotype prediction for mucopolysaccharidosis type I by in silico analysis.

Authors:  Li Ou; Michael J Przybilla; Chester B Whitley
Journal:  Orphanet J Rare Dis       Date:  2017-07-04       Impact factor: 4.123

2.  Computational Analysis of High Risk Missense Variant in Human UTY Gene: A Candidate Gene of AZFa Sub-region.

Authors:  Mili Nailwal; Jenabhai Bhathibhai Chauhan
Journal:  J Reprod Infertil       Date:  2017 Jul-Sep

3.  Predicting the most deleterious missense nsSNPs of the protein isoforms of the human HLA-G gene and in silico evaluation of their structural and functional consequences.

Authors:  Elaheh Emadi; Fatemeh Akhoundi; Seyed Mehdi Kalantar; Modjtaba Emadi-Baygi
Journal:  BMC Genet       Date:  2020-08-31       Impact factor: 2.797

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.