Literature DB >> 14988129

Why neural networks should not be used for HIV-1 protease cleavage site prediction.

Thorsteinn Rögnvaldsson1, Liwen You.   

Abstract

UNLABELLED: Several papers have been published where nonlinear machine learning algorithms, e.g. artificial neural networks, support vector machines and decision trees, have been used to model the specificity of the HIV-1 protease and extract specificity rules. We show that the dataset used in these studies is linearly separable and that it is a misuse of nonlinear classifiers to apply them to this problem. The best solution on this dataset is achieved using a linear classifier like the simple perceptron or the linear support vector machine, and it is straightforward to extract rules from these linear models. We identify key residues in peptides that are efficiently cleaved by the HIV-1 protease and list the most prominent rules, relating them to experimental results for the HIV-1 protease.
MOTIVATION: Understanding HIV-1 protease specificity is important when designing HIV inhibitors and several different machine learning algorithms have been applied to the problem. However, little progress has been made in understanding the specificity because nonlinear and overly complex models have been used.
RESULTS: We show that the problem is much easier than what has previously been reported and that linear classifiers like the simple perceptron or linear support vector machines are at least as good predictors as nonlinear algorithms. We also show how sets of specificity rules can be generated from the resulting linear classifiers. AVAILABILITY: The datasets used are available at http://www.hh.se/staff/bioinf/

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 14988129     DOI: 10.1093/bioinformatics/bth144

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  Predicting human immunodeficiency virus protease cleavage sites in nonlinear projection space.

Authors:  Xuehua Li; Hongli Hu; Lan Shu
Journal:  Mol Cell Biochem       Date:  2010-01-07       Impact factor: 3.396

2.  OETMAP: a new feature encoding scheme for MHC class I binding prediction.

Authors:  Murat Gök; Ahmet Turan Özcerit
Journal:  Mol Cell Biochem       Date:  2011-07-30       Impact factor: 3.396

3.  The importance of physicochemical characteristics and nonlinear classifiers in determining HIV-1 protease specificity.

Authors:  Timmy Manning; Paul Walsh
Journal:  Bioengineered       Date:  2016-04-02       Impact factor: 3.269

4.  Comprehensive bioinformatic analysis of the specificity of human immunodeficiency virus type 1 protease.

Authors:  Liwen You; Daniel Garwicz; Thorsteinn Rögnvaldsson
Journal:  J Virol       Date:  2005-10       Impact factor: 5.103

5.  Feature Selection Combined with Neural Network Structure Optimization for HIV-1 Protease Cleavage Site Prediction.

Authors:  Hui Liu; Xiaomiao Shi; Dongmei Guo; Zuowei Zhao
Journal:  Biomed Res Int       Date:  2015-04-15       Impact factor: 3.411

6.  A consistency-based feature selection method allied with linear SVMs for HIV-1 protease cleavage site prediction.

Authors:  Orkun Oztürk; Alper Aksaç; Abdallah Elsheikh; Tansel Ozyer; Reda Alhajj
Journal:  PLoS One       Date:  2013-08-23       Impact factor: 3.240

7.  How to find simple and accurate rules for viral protease cleavage specificities.

Authors:  Thorsteinn Rögnvaldsson; Terence A Etchells; Liwen You; Daniel Garwicz; Ian Jarman; Paulo J G Lisboa
Journal:  BMC Bioinformatics       Date:  2009-05-16       Impact factor: 3.169

8.  A genetic approach for building different alphabets for peptide and protein classification.

Authors:  Loris Nanni; Alessandra Lumini
Journal:  BMC Bioinformatics       Date:  2008-01-24       Impact factor: 3.169

9.  Prediction of the burial status of transmembrane residues of helical membrane proteins.

Authors:  Yungki Park; Sikander Hayat; Volkhard Helms
Journal:  BMC Bioinformatics       Date:  2007-08-20       Impact factor: 3.169

10.  SVM-based prediction of propeptide cleavage sites in spider toxins identifies toxin innovation in an Australian tarantula.

Authors:  Emily S W Wong; Margaret C Hardy; David Wood; Timothy Bailey; Glenn F King
Journal:  PLoS One       Date:  2013-07-22       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.