Literature DB >> 34953462

Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.

Fang Ge1, Ying Zhang1, Jian Xu1, Arif Muhammad1, Jiangning Song2,3, Dong-Jun Yu1.   

Abstract

More than 6000 human diseases have been recorded to be caused by non-synonymous single nucleotide polymorphisms (nsSNPs). Rapid and accurate prediction of pathogenic nsSNPs can improve our understanding of the principle and design of new drugs, which remains an unresolved challenge. In the present work, a new computational approach, termed MSRes-MutP, is proposed based on ResNet blocks with multi-scale kernel size to predict disease-associated nsSNPs. By feeding the serial concatenation of the extracted four types of features, the performance of MSRes-MutP does not obviously improve. To address this, a second model FFMSRes-MutP is developed, which utilizes deep feature fusion strategy and multi-scale 2D-ResNet and 1D-ResNet blocks to extract relevant two-dimensional features and physicochemical properties. FFMSRes-MutP with the concatenated features achieves a better performance than that with individual features. The performance of FFMSRes-MutP is benchmarked on five different datasets. It achieves the Matthew's correlation coefficient (MCC) of 0.593 and 0.618 on the PredictSNP and MMP datasets, which are 0.101 and 0.210 higher than that of the existing best method PredictSNP1. When tested on the HumDiv and HumVar datasets, it achieves MCC of 0.9605 and 0.9507, and area under curve (AUC) of 0.9796 and 0.9748, which are 0.1747 and 0.2669, 0.0853 and 0.1335, respectively, higher than the existing best methods PolyPhen-2 and FATHMM (weighted). In addition, on blind test using a third-party dataset, FFMSRes-MutP performs as the second-best predictor (with MCC and AUC of 0.5215 and 0.7633, respectively), when compared with the other four predictors. Extensive benchmarking experiments demonstrate that FFMSRes-MutP achieves effective feature fusion and can be explored as a useful approach for predicting disease-associated nsSNPs. The webserver is freely available at http://csbio.njust.edu.cn/bioinf/ffmsresmutp/ for academic use.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  1D-ResNet; 2D-ResNet; deep feature fusion; deep learning; disease-associated nsSNP prediction; microenvironment of mutant site

Mesh:

Substances:

Year:  2022        PMID: 34953462      PMCID: PMC8769912          DOI: 10.1093/bib/bbab530

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  71 in total

1.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

2.  Performance of protein stability predictors.

Authors:  Sofia Khan; Mauno Vihinen
Journal:  Hum Mutat       Date:  2010-06       Impact factor: 4.878

Review 3.  The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactions.

Authors:  Christopher M Yates; Michael J E Sternberg
Journal:  J Mol Biol       Date:  2013-07-15       Impact factor: 5.469

4.  STRUM: structure-based prediction of protein stability changes upon single-point mutation.

Authors:  Lijun Quan; Qiang Lv; Yang Zhang
Journal:  Bioinformatics       Date:  2016-06-17       Impact factor: 6.937

5.  DAMpred: Recognizing Disease-Associated nsSNPs through Bayes-Guided Neural-Network Model Built on Low-Resolution Structure Prediction of Proteins and Protein-Protein Interactions.

Authors:  Lijun Quan; Hongjie Wu; Qiang Lyu; Yang Zhang
Journal:  J Mol Biol       Date:  2019-02-21       Impact factor: 5.469

6.  A method and server for predicting damaging missense mutations.

Authors:  Ivan A Adzhubei; Steffen Schmidt; Leonid Peshkin; Vasily E Ramensky; Anna Gerasimova; Peer Bork; Alexey S Kondrashov; Shamil R Sunyaev
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

7.  Assessing the performance of computational predictors for estimating protein stability changes upon missense mutations.

Authors:  Shahid Iqbal; Fuyi Li; Tatsuya Akutsu; David B Ascher; Geoffrey I Webb; Jiangning Song
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

8.  AAindex: amino acid index database, progress report 2008.

Authors:  Shuichi Kawashima; Piotr Pokarowski; Maria Pokarowska; Andrzej Kolinski; Toshiaki Katayama; Minoru Kanehisa
Journal:  Nucleic Acids Res       Date:  2007-11-12       Impact factor: 16.971

9.  Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble.

Authors:  Dong-Jun Yu; Jun Hu; Hui Yan; Xi-Bei Yang; Jing-Yu Yang; Hong-Bin Shen
Journal:  BMC Bioinformatics       Date:  2014-09-05       Impact factor: 3.169

10.  mCSM: predicting the effects of mutations in proteins using graph-based signatures.

Authors:  Douglas E V Pires; David B Ascher; Tom L Blundell
Journal:  Bioinformatics       Date:  2013-11-26       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.