Literature DB >> 25791288

Discovering short linear protein motif based on selective training of profile hidden Markov models.

Tao Song1, Hong Gu2.   

Abstract

Short linear motifs (SLiMs) in proteins are relatively conservative sequence patterns within disordered regions of proteins, typically 3-10 amino acids in length. They play an important role in mediating protein-protein interactions. Discovering SLiMs by computational methods has attracted more and more attention, most of which were based on regular expressions and profiles. In this paper, a de novo motif discovery method was proposed based on profile hidden Markov models (HMMs), which can not only provide the emission probabilities of amino acids in the defined positions of SLiMs, but also model the undefined positions. We adopted the ordered region masking and the relative local conservation (RLC) masking to improve the signal to noise ratio of the query sequences while applying evolutionary weighting to make the important sequences in evolutionary process get more attention by the selective training of profile HMMs. The experimental results show that our method and the profile-based method returned different subsets within a SLiMs dataset, and the performance of the two approaches are equivalent on a more realistic discovery dataset. Profile HMM-based motif discovery methods complement the existing methods and provide another way for SLiMs analysis.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Evolutionary weighting; Intrinsic disorder prediction; Masked residues processing; Relative local conservation; Statistical significance

Mesh:

Substances:

Year:  2015        PMID: 25791288     DOI: 10.1016/j.jtbi.2015.03.010

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  HH-MOTiF: de novo detection of short linear motifs in proteins by Hidden Markov Model comparisons.

Authors:  Roman Prytuliak; Michael Volkmer; Markus Meier; Bianca H Habermann
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

2.  GLTM: A Global-Local Attention LSTM Model to Locate Dimer Motif of Single-Pass Membrane Proteins.

Authors:  Quanchao Ma; Kai Zou; Zhihai Zhang; Fan Yang
Journal:  Front Genet       Date:  2022-03-15       Impact factor: 4.599

3.  SLALOM, a flexible method for the identification and statistical analysis of overlapping continuous sequence elements in sequence- and time-series data.

Authors:  Roman Prytuliak; Friedhelm Pfeiffer; Bianca Hermine Habermann
Journal:  BMC Bioinformatics       Date:  2018-01-26       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.