Literature DB >> 28916136

Mathematical basis of improved protein subfamily classification by a HMM-based sequence filter.

Siddhartha Kundu1.   

Abstract

Informative phylogenetic analysis is dependent on the presence of curated and annotated sequences. This may be complemented by the simultaneous availability of empirical data pertaining to their in vivo function. Confounding sequences, with their similarity to more than one functional cluster, can therefore, render any categorization ambiguous, subjective, and imprecise. Here, I analyze and discuss the development of a mathematical expression that can characterize a potential confounding protein sequence. Specifically, statistical descriptors of combinatorially arranged profile HMM scores are computed and evaluated. The resultant data is then incorporated into an index of sequence suitability. The sequence may then be recommended as either suitable for inclusion or be excluded all together. The index is independent of experimental data and, can, be computed from the primary structure of the protein sequence. This can be utilized to trim previously grouped sequences and can either finalize the composition of training set or reduce the search space of sequences to be tested.
Copyright © 2017 Elsevier Inc. All rights reserved.

Keywords:  Hidden Markov Model; Phylogenetics; Protein subfamily; Statistical filter

Mesh:

Substances:

Year:  2017        PMID: 28916136     DOI: 10.1016/j.mbs.2017.09.001

Source DB:  PubMed          Journal:  Math Biosci        ISSN: 0025-5564            Impact factor:   2.144


  2 in total

1.  Mathematical Basis of Predicting Dominant Function in Protein Sequences by a Generic HMM-ANN Algorithm.

Authors:  Siddhartha Kundu
Journal:  Acta Biotheor       Date:  2018-04-26       Impact factor: 1.774

2.  Fe(2)OG: an integrated HMM profile-based web server to predict and analyze putative non-haem iron(II)- and 2-oxoglutarate-dependent dioxygenase function in protein sequences.

Authors:  Siddhartha Kundu
Journal:  BMC Res Notes       Date:  2021-03-01
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.