Literature DB >> 9283754

Meta-MEME: motif-based hidden Markov models of protein families.

W N Grundy1, T L Bailey, C P Elkan, M E Baker.   

Abstract

MOTIVATION: Modeling families of related biological sequences using Hidden Markov models (HMMs), although increasingly widespread, faces at least one major problem: because of the complexity of these mathematical models, they require a relatively large training set in order to accurately recognize a given family. For families in which there are few known sequences, a standard linear HMM contains too many parameters to be trained adequately.
RESULTS: This work attempts to solve that problem by generating smaller HMMs which precisely model only the conserved regions of the family. These HMMs are constructed from motif models generated by the EM algorithm using the MEME software. Because motif-based HMMs have relatively few parameters, they can be trained using smaller data sets. Studies of short chain alcohol dehydrogenases and 4Fe-4S ferredoxins support the claim that motif-based HMMs exhibit increased sensitivity and selectivity in database searches, especially when training sets contain few sequences.

Mesh:

Substances:

Year:  1997        PMID: 9283754     DOI: 10.1093/bioinformatics/13.4.397

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  53 in total

1.  Discovering regulatory elements in non-coding sequences by analysis of spaced dyads.

Authors:  J van Helden; A F Rios; J Collado-Vides
Journal:  Nucleic Acids Res       Date:  2000-04-15       Impact factor: 16.971

2.  Electrotonically mediated oscillatory patterns in neuronal ensembles: an in vitro voltage-dependent dye-imaging study in the inferior olive.

Authors:  Elena Leznik; Vladimir Makarenko; Rodolfo Llinás
Journal:  J Neurosci       Date:  2002-04-01       Impact factor: 6.167

3.  Bioinformatic design of A-kinase anchoring protein-in silico: a potent and selective peptide antagonist of type II protein kinase A anchoring.

Authors:  Neal M Alto; Scott H Soderling; Naoto Hoshi; Lorene K Langeberg; Rosa Fayos; Patricia A Jennings; John D Scott
Journal:  Proc Natl Acad Sci U S A       Date:  2003-04-02       Impact factor: 11.205

4.  Enhanced protein domain discovery by using language modeling techniques from speech recognition.

Authors:  Lachlan Coin; Alex Bateman; Richard Durbin
Journal:  Proc Natl Acad Sci U S A       Date:  2003-03-31       Impact factor: 11.205

5.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

Authors:  Albin Sandelin; Wynand Alkema; Pär Engström; Wyeth W Wasserman; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

6.  LEON: multiple aLignment Evaluation Of Neighbours.

Authors:  Julie D Thompson; Véronique Prigent; Olivier Poch
Journal:  Nucleic Acids Res       Date:  2004-02-24       Impact factor: 16.971

7.  DNA Motif Detection Using Particle Swarm Optimization and Expectation-Maximization.

Authors:  C T Hardin; Eric C Rouchka
Journal:  Proc IEEE Swarm Intell Symp       Date:  2005-06-08

8.  In vitro selection of RNA aptamers directed against protein E: a Haemophilus influenzae adhesin.

Authors:  Anders Barfod; Birendra Singh; Urban Johanson; Kristian Riesbeck; Per Kjellbom
Journal:  Mol Biotechnol       Date:  2014-08       Impact factor: 2.695

9.  Bayesian variable selection for gene expression modeling with regulatory motif binding sites in neuroinflammatory events.

Authors:  Kuang-Yu Liu; Xiaobo Zhou; Kinhong Kan; Stephen T C Wong
Journal:  Neuroinformatics       Date:  2006

10.  Organization of the Caenorhabditis elegans small non-coding transcriptome: genomic features, biogenesis, and expression.

Authors:  Wei Deng; Xiaopeng Zhu; Geir Skogerbø; Yi Zhao; Zhuo Fu; Yudong Wang; Housheng He; Lun Cai; Hong Sun; Changning Liu; Biao Li; Baoyan Bai; Jie Wang; Dong Jia; Shiwei Sun; Hang He; Yan Cui; Yu Wang; Dongbo Bu; Runsheng Chen
Journal:  Genome Res       Date:  2005-12-12       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.