| SIFT: Sorting Intolerant From Tolerant REF: [100] | S | EC | http://sift-dna.orgDownload: Yes | Calculates a normalised probability of substitution score from multiple alignments based on sequence homology using PSI-BLAST.Removes close homologous sequences to prevent over prediction of “tolerated” substitutions.Mutational effect on protein function is classified as damaging (<=0.05) or tolerated (>0.05). | Fasta sequence or aligned sequencesSNP list | Per-SNP:1) SIFT score2) Binary mutation classification3) Median sequence conservation | Predictions for submitted SNPs, as well as all possible SNPs (but without a score).Positions are weighted equally within an alignment. Alignments may be user defined.Sequence conservation score provides a useful estimate of whether the alignment contains sufficient variation to support classification. |
| PROVEAN: Protein Variation Effect AnalyzerREF: [104] | S | EC | http://provean.jcvi.org/seq_submit.phpDownload: Yes | Related sequences are collected with BLAST (using CD-HIT) and clustered based on 75% global sequence identity. The top 30 clusters of closely related sequences form the supporting sequence set, used to generate the prediction.Delta alignment scores are computed for each supporting sequence and averaged within and across clusters to generate the final PROVEAN score.Predicted mutation effects are classified as either deleterious or neutral based on a predefined threshold (-2.5).Available as:PROVEAN ProteinPROVEAN Protein Batch*PROVEAN Genome Variants**Human and Mouse only | Fasta sequenceMutation list (SNPs and INDELs) | Per-mutation:1) PROVEAN score2) Binary mutation classification | Predictions for submitted mutations only.Predict effects for both SNPs and INDELs, but not frameshift mutations.Batch processing of multiple organisms.The classification threshold is fixed in the online version.Stand-alone package only available for PROVEAN Protein. |
| SNAP2:Screening for Non-Acceptable Polymorphisms, v2REF: [105] | S | NN | https://www.rostlab.org/services/SNAP/Download: Yes | Combines evolutionary information with an expanded list of original SNAP features (amino acid properties) including features such as AA index, predicted binding residues and disordered regions, residue annotations from Pfam and PROSITE, etc.Mutations are classified as either neutral or effect based on predicted scores, between (-100 to 100) respectively.The prediction algorithm is based on a NN consisting of a feed-forward multi-layer perceptron. 10-fold cross-validation is used to create 10 models, each providing a single score for each output class (neutral/effect). The final score is calculated as the difference between the average scores for each output class. | Fasta sequence | For all possible substitutions:1) Heatmap representing the predicted effect2) Multi column table with Predicted Effect, Score and Accuracy. | Predictions for all possible substitutions.Prediction scores are accompanied by an “accuracy metric” to aid interpretation.Uses predicted structural features.Heatmap generated for visualisation of the predictions.Additional method (SNAP2noali) predicts functional effects without alignments. Automatic selection of best method (SNAP2 by default, and SNAP2noali for orphans) with notification to users. |
| ConSurfREF: [106] | S(St) | EC | https://consurf.tau.ac.il/Download: No | Estimates evolutionary conservation rate of amino/nucleic acid positions based on the phylogenetic relations between homologous sequences.Homologous sequences are searched using CSI-BLAST, PSI-BLAST or BLAST, with closely related sequences removed using CD-HIT with multiple sequence alignments (MSA) generated by MAFFT by default.MSA is used to construct phylogenetic relationships using the neighbour joining method. Position specific evolutionary rates are calculated using the empirical Bayesian or Maximum Likelihood methods.Scores graded 1 (variable) to 9 (conserved) for visualisation. | Amino acid/nucleotide sequenceStructure (if available)MSA (if available)Advanced options:- homologue database- MSA methods- Phylogenetic tree- structural data- calculation method- evolutionary substitution modelOptional: user defined MSA and phylogenetic tree. | Detailed output containing conservation scores, MSA and BLAST results.Estimates mapped onto sequence and structure. | Analysis at amino acid and nucleotide levels.Improved HMMER algorithm to search for homologous proteins. Results are accompanied by confidence intervals.Robust statistical approach to differentiate between apparent conservation (short evolutionary time) and genuine conservation (purifying selection).‘ConSeq’ mode used in the absence of a structure. Site-specific predictions of the buried/exposed status of each position. |
| MAPP: Multivariate Analysis of Protein PolymorphismREF: [110] | SA | EC | Download only:http://mendel.stanford.edu/SidowLab/downloads/MAPP/index.html | Combines MSA with 6 physicochemical properties for amino acids.Calculates a MAPP impact score for each position within the MSA.Sequences in the MSA are weighted to account for phylogenetic correlation.Physicochemical property scores for each column along with their mean and variances are calculated. The deviation of each property is calculated for every possible variant and converted to a single score. | Fasta format MSAPhylogenetic tree | Multicolumn table giving the physico-chemical characteristics of each position, MAPP impact score, and a listing of “good” and “bad” amino acids. | Predictions for every possible substitution, and median MAPP scores calculated for each position.Constructs a physiochemical profile rather than an amino acid profile.Demonstrates value of using only orthologous protein in creating a conservation profile.Scores are continuous and interpreted in a relative manner with higher MAPP scores indicating more conserved areas.Can be optimised for individual genes including MAPP impact score threshold for classification.Requires user defined MSA and phylogenetic tree. |
| PANTHER-PSEP: Protein ANalysis Through Evolutionary Relationships-Position Specific Evolutionary PreservationREF: [149] | S | EC | http://www.pantherdb.org/tools/csnpScoreForm.jspDownload: Yes | Uses Hidden Markov Model (HMM) to align sequence to protein families and subfamilies in its database to calculate the evolutionary preservation metric.Uses variation over each alignment position to estimate the likelihood of a coding SNP to cause a functional impact on the protein.Score represents the time (in millions of years [my]) a given amino acid has been preserved in the lineage, directly corresponding to the likelihood of a functional impact. Score classified into: Probably damaging, Possibly damaging, Probably benign. | Fasta sequenceSNP listOther parameters:Organism | Per-SNP:1) Preservation Time: PANTHER PSEP score2) Message: Classification of the PSEP score | Positions are weighted equally at all positions within an alignment.Profiles are subfamily specific if they substantially differ from entire family.User defined alignments are not possible since scores are derived from HMMs (PANTHER protein library) together with an ontology of protein function (PANTHER/X – a simplified form of GO) to make predictions. |
| FoldX suiteREF: [113] | St | Empirical force field | Download only:http://foldxsuite.crg.eu/ | Empirical force-field used for calculating mutational effects of stability, folding, and dynamics on proteins and nucleic acidsΔG (free energy of unfolding) is calculated using a combination of empirical terms. Empirical data (derived from protein engineering experiments) is used for weighting energy terms for stability calculations.Foldx BuildModel command calculates stability changes upon mutation based on a full atomic description of the protein structure.Classification of ΔΔG:ΔΔG > 0: DestabilisingΔΔG < 0: Stabilising | PDB fileSNP list (including chain ID) | Multiple output files where requested.Main output is present in ‘Dif_’’ files, containing ΔG of wt and mt residues along with ΔΔG of mutation.Output also contains changes in the associated energy terms. | Command line interface.Creates mutant structure models.Can be used to analyse protein–protein and protein-DNA interactions.Calculates actual stabilities of wt and mt structures, as well as change in stability upon mutation (ΔΔG). Easily integrated into custom workflows.Optimised energy function for faster calculations.Requires registration to download. |
| PoPMuSic (v2.1):Prediction Of Proteins MUtations StabIlity ChangesREF: [115] | St | Physics-based and NN | https://soft.dezyme.com/Download: No | Stability change upon mutation calculated using a linear combination of statistical energy potentials, accounting for variation in volume of the mutant residue.Predictive models include an optimised set of 52 parameters, whose values are estimated and optimised using a neural network. ΔΔG of point mutation is calculated by a linear combination of 16 terms: 13 statistical potentials, 2 terms for volume of wt and mut residues, and 1 independent term.Classification of ΔΔG:ΔΔG > 0: DestabilisingΔΔG < 0: StabilisingAdditional “optimality” score is assigned for each position in the protein sequence. It indicates poorly optimised positions with potential functional consequences. | Only accepts currently available entries in the PDBSNP list in three input modes:1) Systematic: all possible point mutations2) Manual: single mutation3) File: SNP list | Multi-column table containing secondary structure, solvent accessibility (%) and predicted ΔΔG of mutations. | Optimised to rapidly calculate stability changes of all possible mutations in mid-size proteins.Graphical output of sequence optimality scores.No option to upload user-defined PDB files.Requires registration to download. |
| I-Mutant (2.0)REF: [116] | S(St) | SVM | http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgiDownload: No | Predicts stability effect of a point mutation, as a classification or regression task. The classification task predicts the direction of change, while the regression estimator predicts the ΔΔG. Can be applied to both sequence and structure.RI value (Reliability Index) is computed from the output of the SVM model.Binary classification ΔΔG:ΔΔG < 0: Decrease StabilityΔΔG > 0: Increase StabilityTernary Classification ΔΔG:Large Decrease of Stability: ΔΔG < -0.5Large Increase of Stability: ΔΔG > 0.5Neutral Stability: 0.5<=ΔΔG<=0.5 | Fasta sequence or PDB code/fileChain IDSingle SNPTemperaturePHPrediction request: Binary/Ternary classification | Table containing:1) RSA (%) of mt residue2) RI (Reliability Index)3) Predicted ΔΔG3) Classification of ΔΔG | Predicts both the direction and the estimate of stability.Experimental conditions of pH and Temperature (Celsius) are considered in the stability calculations.Analyses a single mutation at a time only.Output on the web server is better than output requested via email.Use of two different SVM models can lead to discordance between the ΔΔG sign and classification, but is stated to occur only in cases of low RI value. |
| STRUM: STRucture-based prediction of protein stability changes Upon single-point MutationREF: [117] | S(St) | ML | https://zhanglab.ccmb.med.umich.edu/STRUM/Download: Yes | Calculates ΔΔG of mutation using gradient boosting regression algorithm trained on 120 features divided into three groups (sequence, threading and structure).Classification of ΔΔG:ΔΔG < 0: DestabilisingΔΔG > 0: Stabilising | Fasta sequence or PDB fileSNP list in two modes:1) Single or multiple SNPs2) Systematic: All possible SNPs for user defined amino acid segments. | Results available via e-mail only.Multi-column table containing ΔΔG for SNPs. | Combines sequence profiles and 3D features3D Structure modelling of query protein sequence by iterative threading assembly refinement simulationsComputationally expensive with relatively long runtime. |
| MAESTRO:Multi AgEnt STability pRedictiOnREF: [150] | St | Multi agent: ML methods and statistical scoring functions | https://pbwww.che.sbg.ac.at/maestro/webDownload: Yes | Multi-agent method where 3 ML methods i.e Artificial NN, SVM and Multiple Linear Regression. are combined to generate a consensus prediction.Each agent (ML method) uses 9 input values divided into two categories: SSF functions and protein properties (size, mutational environment, etc.).Classification of ΔΔG:ΔΔG > 0: DestabilisingΔΔG < 0: Stabilising | PDB code/fileInput mode:1) Specific mutations2) Sensitivity profile: all possible mutations3) Scan for destabilising mutations4) Stability of Disulphide bonds | Input modes 1 & 2ΔΔG predictions and confidence intervals.Graphical display. | Ability to analyse mutations independently or in combinationΔΔG predictions are accompanied by confidence intervals.High throughput scanning for all possible point mutations.Specific mode for prediction of stabilising disulphide bonds. |
| mCSM suite: mutational Cut-Off Scanning MatrixREF: [122] | St | Graph-based and ML | Protein Stability (PS),Protein-Protein (PP),Protein-DNA(P-NA)http://biosig.unimelb.edu.au/mcsm/Download: No | Uses graph-based methods to calculate atomic pairwise distance surrounding the wt amino acid. Mutational impact is captured based on a change in the atomic pharmocophore count resulting from the point mutation. Together, this forms the mCSM-signature, and is used to train predictive models for analysing mutational impact on structure stability. Predicted ΔΔG < 0 relates to destabilising, and ΔΔG > 0 relates to stabilising mutational effects.Ternary Classification of Destabilising effect:Mild: −1 < ΔΔG < 0Moderate: −2 < ΔΔG < -1High: ΔΔG < -2Ternary Classification of Stabilising effect:Mild: 0 < ΔΔG < 1Moderate: 1 < ΔΔG < 2High: ΔΔG > 2 | PDB code/fileSNP listChain IDInput modes:1) Single mutation2) Mutation list3) Systematic: all possible mutation for a single residue | Input mode 1:1) Predicted ΔΔG2) Classification of mutational stability changeInput modes (2) & (3):Multi-column table with predicted ΔΔG and RSA. | Predicts both the direction and the estimate of stability.Mutant structure is notrequired.webGL structural visualisationfor input mode 1.Works at an atomic level.Demonstrates correlation between atomic-distance pattern of the wild-type residue environment and mutational impact.Calculates overall stability of protein and interactions. |
| mCSM-lig: mutational Cut-Off Scanning Matrix on ligand affinityREF:[88] | St | Graph-based and ML | Protein-ligand affinity(mCSM-lig):http://biosig.unimelb.edu.au/mcsm_lig/predictionDownload: No | Based on the mCSM graph-based signatures (as above) with the addition of small-molecule chemical features and ligandphysicochemical properties to capture mutational changes.Predictive models trained on a representative set of protein–ligand complexes.Mutational impact on affinity is calculated as the log (ln) affinity fold change as below:ln(Kwt) - ln(Kmt) = ln (fold-change)Classification of ln (fold-change):ln (fold-change) < 0: Destabilisingln (fold-change) > 0: Stabilising | PDB code/fileSingle SNPChain ID3-letter ligand IDwt-affinity (nano Molar (nM)) | Log affinity fold changeDistance to ligand (Angstroms)DUET stability change (Kcal/mol)Binary classification of affinity and stability changes. | Predicts both the direction and the estimate of stability.Returns both DUET and ligand affinity changes, along with ligand distance to site.Measures both global and local stability effects.Analyses single mutation at a time.Returns a change in affinity value.Less reliable results for sites > 10 Å from ligand. |
| Rosetta Flex_ddGREF:[151] | St | All-atom energy function | Download only: https://www.rosettacommons.org/software/license-and-download | Based on a mixed physics and knowledge-based approach. Uses all-atom energy function, parameterized from small molecule and X-ray crystal structure.The Flex_ddG protocol models changes in the ΔΔG upon mutation at a protein–protein or protein–ligand interface using the ‘backrub’ algorithm. This algorithm is used to sample conformational space and produce an ensemble of wt and mt models to estimate the interface ΔΔG values.Ternary Classification of ΔΔG:Destabilising: ΔΔG >=1Neutral: −1 < ΔΔG < 1Stabilising: ΔΔG<= −1 | Customized PDB fileLigand parameter fileCustomized XML protocol fileMutation listChain ID | Each run outputs db3 file containing the changes in the main components of the energy function, ΔG wt, ΔG mt, and the ΔΔG upon mutation. | For a reliable prediction, at least 35 runs per mutation are required, with each run taking between 2 and 4 h.Access to HPC may be required for large number of mutations.Protocols are written in XML format.Requires license to download. |
| INPS-MDImpact of Non-synonymous mutations on Protein Stability-Multi DimensionREF: [141] | S/St | SVM regression | https://inpsmd.biocomp.unibo.it/inpsSuiteDownload: No | Calculates ΔΔG of mutation on sequence and structure.The sequence-based predictions are derived from seven descriptors to account for evolutionary information (INPS), while two additional structural features (RSA and energy difference between wt and mt structures) are included for the structure-based predictions (INPS-3D).SVM regression is used to map the sequence descriptors to the ΔΔG values.Classification of ΔΔG:ΔΔG < 0: DestabilisingΔΔG > 0: Stabilising | Fasta sequence/PDB fileSNP listChain ID (INPS-3D only) | Per SNP in list:Predicted ΔΔG | Predicts both the direction and the estimate of stability.Can operate on both sequence (INPS) and structure (INPS-3D)Accounts for anti-symmetric property of variation i.e ΔΔG (A->B) = - ΔΔG (B->A). |
| DeepDDG/iDeepDDGREF: [142] | St | NN/Ensemble method | http://protein.org.cn/ddg.htmlDownload: No | Calculates ΔΔG of mutation using NN trained on nine categories of sequence and structural features.Operates independently as ‘DeepDDG’, and in an integrated manner as ‘iDeepDDG’. In the latter, predictions from three methods: mCSM, SDM and DUET are fed into the concatenation layer of the NN to generate the consensus prediction.Classification of ΔΔG:ΔΔG < 0: DestabilisingΔΔG > 0: Stabilising | PDB code/fileNetwork model:-DeepDDG-iDeepDDGSNP list in two modes:1) Single or multiple SNPs2) All possible mutations | Per SNP/all possible SNPs:Predicted ΔΔG | Predicts both the direction and the estimate of stability.Accounts for anti-symmetric property of variation i.e ΔΔG (A->B) = - ΔΔG (B->A).Runs in independent or integrated modes.‘DeepDDG’ allows high throughput scanning for all possible point mutations with relatively fast computation time.For running ‘iDeepDDG’, user must provide predictions for each mutation from the mCSM DUET server. |
| DUETREF: [102] | St | Ensemble method: SVM | http://biosig.unimelb.edu.au/duet/Download: No | Predicts stability effects upon mutation on proteins.Combines predictions from two complementary methods: mCSM and Site Directed Mutator (SDM) in an optimised predictor to generate the DUET prediction.The optimised predictor is generated using SVM trained with Sequential Minimal Optimisation.Classification of ΔΔG:ΔΔG < 0: DestabilisingΔΔG > 0: Stabilising | PDB code/fileSNP listChain IDInput mode1:Single mutationInput mode 2:Systematic: all possible mutation for a single residue | Input mode 1:1) Predicted ΔΔG from mCSM, SDM and DUET.Input mode 2:Multi-column table with predicted ΔΔG from mCSM, SDM, DUET and RSA. | Predicts both the direction and the estimate of stability.Mutant structure is notrequired.webGL structural visualisation for input mode 1. |
| ELASPIC: Ensemble Learning Approach for Stability Prediction of Interface and Core mutationsREF:[124] | (St) | Ensemble method: ML | http://elaspic.kimlab.org/Download: No | Predicts stability effects upon mutation in both, domain cores and domain-domain interfaces.Combination of semi-empirical energy terms, sequence conservation, and a wide variety of molecular details with a Stochastic Gradient Boosting of Decision Trees (SGB-DT) algorithm.Uses a combination of sequence, molecular and energy features including prediction scores from other tools. | Uniprot Protein ID or PDB structureSNP list | Multi-column table, with the main output being ΔG of wt and mt structures, and ΔΔG of mutation.Results are downloadable. FoldX generated mutant structures in pdb formatJmol applet showing superimposed wt mt structures. | Can be run as a single or multiple mutations and Protein-protein interactionsOption to filter results based on additional criteria.Non-human proteins may take longer to run.An interactive connectivity network showing the affected protein–protein interaction mutations. |
| DynaMutREF:[118] | St | Ensemble method: NMA | http://biosig.unimelb.edu.au/dynamut/Download: No | Predicts stability effects based on protein dynamics resulting from vibrational entropy changes.Integrates mCSM signatures and normal model analysis. Combines mutational effect from 3 structure-based prediction tools to generate a consensus prediction.Classification of ΔΔG:ΔΔG < 0 DestabilisingΔΔG > 0: Stabilising | PDB code/fileSingle SNP/SNP listChain ID | NMA based predictionsOther structure-based predictions included. | Accounts for protein molecular motion and flexibility.Easy and detailed visualisation of results including interatomic interactions, deformation and fluctuation analysis.Returns a change in stability.Computationally expensive with relatively long runtime. |