Literature DB >> 16570873

Rule generation for protein secondary structure prediction with support vector machines and decision tree.

Jieyue He1, Hae-Jin Hu, Robert Harrison, Phang C Tai, Yi Pan.   

Abstract

Support vector machines (SVMs) have shown strong generalization ability in a number of application areas, including protein structure prediction. However, the poor comprehensibility hinders the success of the SVM for protein structure prediction. The explanation of how a decision made is important for accepting the machine learning technology, especially for applications such as bioinformatics. The reasonable interpretation is not only useful to guide the "wet experiments," but also the extracted rules are helpful to integrate computational intelligence with symbolic AI systems for advanced deduction. On the other hand, a decision tree has good comprehensibility. In this paper, a novel approach to rule generation for protein secondary structure prediction by integrating merits of both the SVM and decision tree is presented. This approach combines the SVM with decision tree into a new algorithm called SVM_ DT, which proceeds in three steps. This algorithm first trains an SVM. Then, a new training set is generated through careful selection from the output of the SVM. Finally, the obtained training set is used to train a decision tree learning system and to extract the corresponding rule sets. The results of the experiments of protein secondary structure prediction on RS126 data set show that the comprehensibility of SVM_DT is much better than that of the SVM. Moreover, the generalization ability of SVM_DT is better than that of C4.5 decision trees and is similar to that of the SVM. Hence, SVM_DT can be used not only for prediction, but also for guiding biological experiments.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16570873     DOI: 10.1109/tnb.2005.864021

Source DB:  PubMed          Journal:  IEEE Trans Nanobioscience        ISSN: 1536-1241            Impact factor:   2.935


  6 in total

1.  Development and comparative assessment of Raman spectroscopic classification algorithms for lesion discrimination in stereotactic breast biopsies with microcalcifications.

Authors:  Narahara Chari Dingari; Ishan Barman; Anushree Saha; Sasha McGee; Luis H Galindo; Wendy Liu; Donna Plecha; Nina Klein; Ramachandra Rao Dasari; Maryann Fitzmaurice
Journal:  J Biophotonics       Date:  2012-07-20       Impact factor: 3.207

2.  Interpreting SVM for medical images using Quadtree.

Authors:  Prashant Shukla; Abhishek Verma; Shekhar Verma; Manish Kumar
Journal:  Multimed Tools Appl       Date:  2020-08-11       Impact factor: 2.757

3.  A novel ensemble deep learning model for stock prediction based on stock prices and news.

Authors:  Yang Li; Yi Pan
Journal:  Int J Data Sci Anal       Date:  2021-09-17

4.  Application of Raman spectroscopy to identify microcalcifications and underlying breast lesions at stereotactic core needle biopsy.

Authors:  Ishan Barman; Narahara Chari Dingari; Anushree Saha; Sasha McGee; Luis H Galindo; Wendy Liu; Donna Plecha; Nina Klein; Ramachandra Rao Dasari; Maryann Fitzmaurice
Journal:  Cancer Res       Date:  2013-06-01       Impact factor: 12.701

5.  Logic minimization and rule extraction for identification of functional sites in molecular sequences.

Authors:  Raul Cruz-Cano; Mei-Ling Ting Lee; Ming-Ying Leung
Journal:  BioData Min       Date:  2012-08-16       Impact factor: 2.522

6.  A prostate cancer model build by a novel SVM-ID3 hybrid feature selection method using both genotyping and phenotype data from dbGaP.

Authors:  Sait Can Yücebaş; Yeşim Aydın Son
Journal:  PLoS One       Date:  2014-03-20       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.