Literature DB >> 21192951

Non-linear models based on simple topological indices to identify RNase III protein members.

Guillermin Agüero-Chapin1, Gustavo A de la Riva, Reinaldo Molina-Ruiz, Aminael Sánchez-Rodríguez, Gisselle Pérez-Machado, Vítor Vasconcelos, Agostinho Antunes.   

Abstract

Alignment-free classifiers are especially useful in the functional classification of protein classes with variable homology and different domain structures. Thus, the Topological Indices to BioPolymers (TI2BioP) methodology (Agüero-Chapin et al., 2010) inspired in both the TOPS-MODE and the MARCH-INSIDE methodologies allows the calculation of simple topological indices (TIs) as alignment-free classifiers. These indices were derived from the clustering of the amino acids into four classes of hydrophobicity and polarity revealing higher sequence-order information beyond the amino acid composition level. The predictability power of such TIs was evaluated for the first time on the RNase III family, due to the high diversity of its members (primary sequence and domain organization). Three non-linear models were developed for RNase III class prediction: Decision Tree Model (DTM), Artificial Neural Networks (ANN)-model and Hidden Markov Model (HMM). The first two are alignment-free approaches, using TIs as input predictors. Their performances were compared with a non-classical HMM, modified according to our amino acid clustering strategy. The alignment-free models showed similar performances on the training and the test sets reaching values above 90% in the overall classification. The non-classical HMM showed the highest rate in the classification with values above 95% in training and 100% in test. Although the higher accuracy of the HMM, the DTM showed simplicity for the RNase III classification with low computational cost. Such simplicity was evaluated in respect to HMM and ANN models for the functional annotation of a new bacterial RNase III class member, isolated and annotated by our group.
Copyright © 2010 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21192951     DOI: 10.1016/j.jtbi.2010.12.019

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  An alignment-free approach for eukaryotic ITS2 annotation and phylogenetic inference.

Authors:  Guillermin Agüero-Chapin; Aminael Sánchez-Rodríguez; Pedro I Hidalgo-Yanes; Yunierkis Pérez-Castillo; Reinaldo Molina-Ruiz; Kathleen Marchal; Vítor Vasconcelos; Agostinho Antunes
Journal:  PLoS One       Date:  2011-10-26       Impact factor: 3.240

2.  Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.

Authors:  Guillermin Agüero-Chapin; Deborah Galpert; Reinaldo Molina-Ruiz; Evys Ancede-Gallardo; Gisselle Pérez-Machado; Gustavo A de la Riva; Agostinho Antunes
Journal:  Biomolecules       Date:  2019-12-23

3.  Exploring the adenylation domain repertoire of nonribosomal peptide synthetases using an ensemble of sequence-search methods.

Authors:  Guillermin Agüero-Chapin; Reinaldo Molina-Ruiz; Emanuel Maldonado; Gustavo de la Riva; Aminael Sánchez-Rodríguez; Vitor Vasconcelos; Agostinho Antunes
Journal:  PLoS One       Date:  2013-07-16       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.