A novel method of protein sequence classification based on oligopeptide frequency analysis and its application to search for functional sites and to domain localization.

V V Solovyev, K S Makarova.

Abstract

A new method for distinguishing among protein families, based on the oligopeptide composition of their amino acid sequences, is presented. It is assumed that any protein family can be characterized by a set of essential oligopeptides (an oligopeptide vocabulary), and a simple approach to finding such a vocabulary is suggested. Comparison of vocabularies is shown to distinguish different families from one another and from random sequences. This comparison can be made successfully using the frequencies of a small set of 25 dipeptides (or tripeptides), and no preliminary alignment is necessary. Characteristic peptides are found to lie in functionally important regions, as shown for the GTP-binding domains of translation elongation factors. The method is also reasonably efficient at localizing functional domains in amino acid sequences: for several functional domains, the average prediction error does not exceed three or four amino acid residues.
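
The abstract does not say how the vocabulary is selected or how sequences are scored against it, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes that a vocabulary is the set of dipeptides most over-represented in a family relative to a background set, and that a sequence is scored by the fraction of its dipeptides found in that vocabulary. The function names, the `pseudo` smoothing constant, and the ratio-based ranking are all hypothetical choices made for illustration.

```python
from collections import Counter
from typing import Dict, Iterable, List


def dipeptide_frequencies(sequences: Iterable[str]) -> Dict[str, float]:
    """Relative frequency of each overlapping dipeptide across all sequences."""
    counts: Counter = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: n / total for dp, n in counts.items()}


def vocabulary(family: Iterable[str], background: Iterable[str],
               size: int = 25, pseudo: float = 1e-6) -> List[str]:
    """Assumed selection rule: the `size` dipeptides with the highest
    family/background frequency ratio (ties broken alphabetically)."""
    fam = dipeptide_frequencies(family)
    bg = dipeptide_frequencies(background)
    ranked = sorted(
        fam,
        key=lambda dp: (-(fam[dp] / (bg.get(dp, 0.0) + pseudo)), dp),
    )
    return ranked[:size]


def vocabulary_score(seq: str, vocab: Iterable[str]) -> float:
    """Assumed scoring rule: fraction of the sequence's overlapping
    dipeptides that belong to the vocabulary. No alignment is needed."""
    vocab_set = set(vocab)
    seq = seq.upper()
    n = len(seq) - 1
    if n <= 0:
        return 0.0
    hits = sum(1 for i in range(n) if seq[i:i + 2] in vocab_set)
    return hits / n
```

Under these assumptions, a sequence could be assigned to the family whose vocabulary gives it the highest score, and applying `vocabulary_score` to a sliding window along a sequence would give a rough profile of where characteristic peptides cluster.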

Year:  1993        PMID: 8435763     DOI: 10.1093/bioinformatics/9.1.17

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  10 in total

1.  Metagenomic Classification Using an Abstraction Augmented Markov Model.

Authors:  Xiujun Sylvia Zhu; Monnie McGee
Journal:  J Comput Biol       Date:  2015-11-30       Impact factor: 1.479

2.  A novel missense-mutation-related feature extraction scheme for 'driver' mutation identification.

Authors:  Hua Tan; Jiguang Bao; Xiaobo Zhou
Journal:  Bioinformatics       Date:  2012-10-07       Impact factor: 6.937

3.  A hybrid neural network system for prediction and recognition of promoter regions in human genome.

Authors:  Chuan-Bo Chen; Tao Li
Journal:  J Zhejiang Univ Sci B       Date:  2005-05       Impact factor: 3.066

4.  Objective classification system for sagittal craniosynostosis based on suture segmentation.

Authors:  Xiaohua Qian; Hua Tan; Jian Zhang; Xiahai Zhuang; Leslie Branch; Chaire Sanger; Allison Thompson; Weiling Zhao; King Chuen Li; Lisa David; Xiaobo Zhou
Journal:  Med Phys       Date:  2015-09       Impact factor: 4.071

5.  N-gram analysis of 970 microbial organisms reveals presence of biological language models.

Authors:  Hatice Ulku Osmanbeyoglu; Madhavi K Ganapathiraju
Journal:  BMC Bioinformatics       Date:  2011-01-10       Impact factor: 3.169

6.  Identification and analysis of driver missense mutations using rotation forest with feature selection.

Authors:  Xiuquan Du; Jiaxing Cheng
Journal:  Biomed Res Int       Date:  2014-08-27       Impact factor: 3.411

7.  A novel feature extraction scheme with ensemble coding for protein-protein interaction prediction.

Authors:  Xiuquan Du; Jiaxing Cheng; Tingting Zheng; Zheng Duan; Fulan Qian
Journal:  Int J Mol Sci       Date:  2014-07-18       Impact factor: 5.923

8.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

9.  LAF: Logic Alignment Free and its application to bacterial genomes classification.

Authors:  Emanuel Weitschek; Fabio Cunial; Giovanni Felici
Journal:  BioData Min       Date:  2015-12-08       Impact factor: 2.522

10.  n-Gram characterization of genomic islands in bacterial genomes.

Authors:  Gordana M Pavlović-Lazetić; Nenad S Mitić; Milos V Beljanski
Journal:  Comput Methods Programs Biomed       Date:  2008-12-19       Impact factor: 5.428
