Literature DB >> 7991589

Detection of conserved segments in proteins: iterative scanning of sequence databases with alignment blocks.

R L Tatusov1, S F Altschul, E V Koonin.   

Abstract

We describe an approach to analyzing protein sequence databases that, starting from a single uncharacterized sequence or group of related sequences, generates blocks of conserved segments. The procedure involves iterative database scans with an evolving position-dependent weight matrix constructed from a coevolving set of aligned conserved segments. For each iteration, the expected distribution of matrix scores under a random model is used to set a cutoff score for the inclusion of a segment in the next iteration. This cutoff may be calculated to allow the chance inclusion of either a fixed number or a fixed proportion of false positive segments. With sufficiently high cutoff scores, the procedure converged for all alignment blocks studied, with varying numbers of iterations required. Different methods for calculating weight matrices from alignment blocks were compared. The most effective of those tested was a logarithm-of-odds, Bayesian-based approach that used prior residue probabilities calculated from a mixture of Dirichlet distributions. The procedure described was used to detect novel conserved motifs of potential biological importance.

Mesh:

Substances:

Year:  1994        PMID: 7991589      PMCID: PMC45382          DOI: 10.1073/pnas.91.25.12091

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  40 in total

1.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

2.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

3.  Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters.

Authors:  O G Berg; P H von Hippel
Journal:  J Mol Biol       Date:  1987-02-20       Impact factor: 5.469

Review 4.  Evolution and taxonomy of positive-strand RNA viruses: implications of comparative analysis of amino acid sequences.

Authors:  E V Koonin; V V Dolja
Journal:  Crit Rev Biochem Mol Biol       Date:  1993       Impact factor: 8.250

5.  Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment.

Authors:  C E Lawrence; S F Altschul; M S Boguski; J S Liu; A F Neuwald; J C Wootton
Journal:  Science       Date:  1993-10-08       Impact factor: 47.728

6.  Reverse gyrase: a helicase-like domain and a type I topoisomerase in the same polypeptide.

Authors:  F Confalonieri; C Elie; M Nadal; C de La Tour; P Forterre; M Duguet
Journal:  Proc Natl Acad Sci U S A       Date:  1993-05-15       Impact factor: 11.205

7.  The SWISS-PROT protein sequence data bank, recent developments.

Authors:  A Bairoch; B Boeckmann
Journal:  Nucleic Acids Res       Date:  1993-07-01       Impact factor: 16.971

8.  The PROSITE dictionary of sites and patterns in proteins, its current status.

Authors:  A Bairoch
Journal:  Nucleic Acids Res       Date:  1993-07-01       Impact factor: 16.971

9.  Performance evaluation of amino acid substitution matrices.

Authors:  S Henikoff; J G Henikoff
Journal:  Proteins       Date:  1993-09

10.  Analysis of gene duplication repeats in the myosin rod.

Authors:  A D McLachlan
Journal:  J Mol Biol       Date:  1983-09-05       Impact factor: 5.469

View more
  79 in total

1.  Increased coverage of protein families with the blocks database servers.

Authors:  J G Henikoff; E A Greene; S Pietrokovski; S Henikoff
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Sialidase-like Asp-boxes: sequence-similar structures within different protein folds.

Authors:  R R Copley; R B Russell; C P Ponting
Journal:  Protein Sci       Date:  2001-02       Impact factor: 6.725

3.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

4.  Evolutionary relationship between K(+) channels and symporters.

Authors:  S R Durell; Y Hao; T Nakamura; E P Bakker; H R Guy
Journal:  Biophys J       Date:  1999-08       Impact factor: 4.033

5.  Proteins of the endoplasmic-reticulum-associated degradation pathway: domain detection and function prediction.

Authors:  C P Ponting
Journal:  Biochem J       Date:  2000-10-15       Impact factor: 3.857

6.  Cascaded multiple classifiers for secondary structure prediction.

Authors:  M Ouali; R D King
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

7.  Sulfolobus solfataricus P2 DNA polymerase IV (Dpo4): an archaeal DinB-like DNA polymerase with lesion-bypass properties akin to eukaryotic poleta.

Authors:  F Boudsocq; S Iwai; F Hanaoka; R Woodgate
Journal:  Nucleic Acids Res       Date:  2001-11-15       Impact factor: 16.971

8.  Genome sequence of a baculovirus pathogenic for Culex nigripalpus.

Authors:  C L Afonso; E R Tulman; Z Lu; C A Balinsky; B A Moser; J J Becnel; D L Rock; G F Kutish
Journal:  J Virol       Date:  2001-11       Impact factor: 5.103

9.  Suppression subtractive hybridization identifies distinctive expression markers for coronary and internal mammary arteries.

Authors:  Minghui Qin; Zhaohui Zeng; Jie Zheng; Prediman K Shah; Stephen M Schwartz; Lawrence D Adams; Behrooz G Sharifi
Journal:  Arterioscler Thromb Vasc Biol       Date:  2003-01-30       Impact factor: 8.311

10.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.