Literature DB >> 8302831

Hidden Markov models of biological primary sequence information.

P Baldi1, Y Chauvin, T Hunkapiller, M A McClure.   

Abstract

Hidden Markov model (HMM) techniques are used to model families of biological sequences. A smooth and convergent algorithm is introduced to iteratively adapt the transition and emission parameters of the models from the examples in a given family. The HMM approach is applied to three protein families: globins, immunoglobulins, and kinases. In all cases, the models derived capture the important statistical characteristics of the family and can be used for a number of tasks, including multiple alignments, motif detection, and classification. For K sequences of average length N, this approach yields an effective multiple-alignment algorithm which requires O(KN2) operations, linear in the number of sequences.

Mesh:

Substances:

Year:  1994        PMID: 8302831      PMCID: PMC521453          DOI: 10.1073/pnas.91.3.1059

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  18 in total

Review 1.  Dual-specificity protein kinases: will any hydroxyl do?

Authors:  R A Lindberg; A M Quinn; T Hunter
Journal:  Trends Biochem Sci       Date:  1992-03       Impact factor: 13.807

2.  Efficient methods for multiple sequence alignment with guaranteed error bounds.

Authors:  D Gusfield
Journal:  Bull Math Biol       Date:  1993-01       Impact factor: 1.758

3.  An expectation maximization (EM) algorithm for the identification and characterization of common sites in unaligned biopolymer sequences.

Authors:  C E Lawrence; A A Reilly
Journal:  Proteins       Date:  1990

4.  Evolutionary processes and evolutionary noise at the molecular level. I. Functional density in proteins.

Authors:  E Zuckerkandl
Journal:  J Mol Evol       Date:  1976-04-09       Impact factor: 2.395

Review 5.  A thousand and one protein kinases.

Authors:  T Hunter
Journal:  Cell       Date:  1987-09-11       Impact factor: 41.582

6.  Stochastic models for heterogeneous DNA sequences.

Authors:  G A Churchill
Journal:  Bull Math Biol       Date:  1989       Impact factor: 1.758

7.  Determinants of a protein fold. Unique features of the globin amino acid sequences.

Authors:  D Bashford; C Chothia; A M Lesk
Journal:  J Mol Biol       Date:  1987-07-05       Impact factor: 5.469

8.  A general method applicable to the search for similarities in the amino acid sequence of two proteins.

Authors:  S B Needleman; C D Wunsch
Journal:  J Mol Biol       Date:  1970-03       Impact factor: 5.469

9.  Establishing homologies in protein sequences.

Authors:  M O Dayhoff; W C Barker; L T Hunt
Journal:  Methods Enzymol       Date:  1983       Impact factor: 1.600

10.  Similar amino acid sequences: chance or common ancestry?

Authors:  R F Doolittle
Journal:  Science       Date:  1981-10-09       Impact factor: 47.728

View more
  54 in total

1.  Testing computational prediction of missense mutation phenotypes: functional characterization of 204 mutations of human cystathionine beta synthase.

Authors:  Qiong Wei; Liqun Wang; Qiang Wang; Warren D Kruger; Roland L Dunbrack
Journal:  Proteins       Date:  2010-07

2.  mRNA degradation by the virion host shutoff (Vhs) protein of herpes simplex virus: genetic and biochemical evidence that Vhs is a nuclease.

Authors:  David N Everly; Pinghui Feng; I Saira Mian; G Sullivan Read
Journal:  J Virol       Date:  2002-09       Impact factor: 5.103

3.  Hidden Markov models from molecular dynamics simulations on DNA.

Authors:  Kelly M Thayer; D L Beveridge
Journal:  Proc Natl Acad Sci U S A       Date:  2002-06-18       Impact factor: 11.205

4.  Genome-level evolution of resistance genes in Arabidopsis thaliana.

Authors:  Andrew Baumgarten; Steven Cannon; Russ Spangler; Georgiana May
Journal:  Genetics       Date:  2003-09       Impact factor: 4.562

5.  Phylogenetic profiles reveal evolutionary relationships within the "twilight zone" of sequence similarity.

Authors:  Gue Su Chang; Yoojin Hong; Kyung Dae Ko; Gaurav Bhardwaj; Edward C Holmes; Randen L Patterson; Damian B van Rossum
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-02       Impact factor: 11.205

6.  A hidden Markov model that finds genes in E. coli DNA.

Authors:  A Krogh; I S Mian; D Haussler
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

7.  GeneMark.hmm: new solutions for gene finding.

Authors:  A V Lukashin; M Borodovsky
Journal:  Nucleic Acids Res       Date:  1998-02-15       Impact factor: 16.971

8.  Statistical modeling and analysis of the LAGLIDADG family of site-specific endonucleases and identification of an intein that encodes a site-specific endonuclease of the HNH family.

Authors:  J Z Dalgaard; A J Klar; M J Moser; W R Holley; A Chatterjee; I S Mian
Journal:  Nucleic Acids Res       Date:  1997-11-15       Impact factor: 16.971

9.  Predicting conserved protein motifs with Sub-HMMs.

Authors:  Kevin Horan; Christian R Shelton; Thomas Girke
Journal:  BMC Bioinformatics       Date:  2010-04-26       Impact factor: 3.169

10.  Transposon identification using profile HMMs.

Authors:  Paul T Edlefsen; Jun S Liu
Journal:  BMC Genomics       Date:  2010-02-10       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.