Literature DB >> 8390686

Applications and statistics for multiple high-scoring segments in molecular sequences.

S Karlin1, S F Altschul.   

Abstract

Score-based measures of molecular-sequence features provide versatile aids for the study of proteins and DNA. They are used by many sequence data base search programs, as well as for identifying distinctive properties of single sequences. For any such measure, it is important to know what can be expected to occur purely by chance. The statistical distribution of high-scoring segments has been described elsewhere. However, molecular sequences will frequently yield several high-scoring segments for which some combined assessment is in order. This paper describes the statistical distribution for the sum of the scores of multiple high-scoring segments and illustrates its application to the identification of possible transmembrane segments and the evaluation of sequence similarity.

Mesh:

Substances:

Year:  1993        PMID: 8390686      PMCID: PMC46825          DOI: 10.1073/pnas.90.12.5873

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  28 in total

1.  The PIR-International Protein Sequence Database.

Authors:  W C Barker; D G George; H W Mewes; A Tsugita
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

2.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

3.  Chance and statistical significance in protein and DNA sequence analysis.

Authors:  S Karlin; V Brendel
Journal:  Science       Date:  1992-07-03       Impact factor: 47.728

4.  The SWISS-PROT protein sequence data bank.

Authors:  A Bairoch; B Boeckmann
Journal:  Nucleic Acids Res       Date:  1992-05-11       Impact factor: 16.971

5.  The rapid generation of mutation data matrices from protein sequences.

Authors:  D T Jones; W R Taylor; J M Thornton
Journal:  Comput Appl Biosci       Date:  1992-06

6.  Protein database searches for multiple alignments.

Authors:  S F Altschul; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1990-07       Impact factor: 11.205

7.  Identification of protein coding regions by database similarity search.

Authors:  W Gish; D J States
Journal:  Nat Genet       Date:  1993-03       Impact factor: 38.330

8.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

9.  The ovalbumin gene family: structure of the X gene and evolution of duplicated split genes.

Authors:  R Heilig; F Perrin; F Gannon; J L Mandel; P Chambon
Journal:  Cell       Date:  1980-07       Impact factor: 41.582

10.  Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 .

Authors:  A D McLachlan
Journal:  J Mol Biol       Date:  1971-10-28       Impact factor: 5.469

View more
  91 in total

1.  ParAlign: a parallel sequence alignment algorithm for rapid and sensitive database searches.

Authors:  T Rognes
Journal:  Nucleic Acids Res       Date:  2001-04-01       Impact factor: 16.971

2.  Identification of senescence-associated genes from daylily petals.

Authors:  T Panavas; A Pikula; P D Reid; B Rubinstein; E L Walker
Journal:  Plant Mol Biol       Date:  1999-05       Impact factor: 4.076

3.  Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae.

Authors:  Colleen T Webb; Svetlana A Shabalina; Aleksey Yu Ogurtsov; Alexey S Kondrashov
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

4.  BALSA: Bayesian algorithm for local sequence alignment.

Authors:  Bobbie-Jo M Webb; Jun S Liu; Charles E Lawrence
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

Review 5.  The application of molecular markers in the study of diversity in acarology: a review.

Authors:  M Navajas; B Fenton
Journal:  Exp Appl Acarol       Date:  2000       Impact factor: 2.132

6.  Localization of denaturation bubbles in random DNA sequences.

Authors:  Terence Hwa; Enzo Marinari; Kim Sneppen; Lei-han Tang
Journal:  Proc Natl Acad Sci U S A       Date:  2003-04-02       Impact factor: 11.205

Review 7.  Biochemistry and comparative genomics of SxxK superfamily acyltransferases offer a clue to the mycobacterial paradox: presence of penicillin-susceptible target proteins versus lack of efficiency of penicillin as therapeutic agent.

Authors:  Colette Goffin; Jean-Marie Ghuysen
Journal:  Microbiol Mol Biol Rev       Date:  2002-12       Impact factor: 11.056

8.  Patterns in interspecies similarity correlate with nucleotide composition in mammalian 3'UTRs.

Authors:  Svetlana A Shabalina; Aleksey Y Ogurtsov; David J Lipman; Alexey S Kondrashov
Journal:  Nucleic Acids Res       Date:  2003-09-15       Impact factor: 16.971

9.  Rab5-mediated endosome-endosome fusion regulates hemoglobin endocytosis in Leishmania donovani.

Authors:  Sudha B Singh; Ruchi Tandon; Ganga Krishnamurthy; Rajagopal Vikram; Nimisha Sharma; Sandip K Basu; Amitabha Mukhopadhyay
Journal:  EMBO J       Date:  2003-11-03       Impact factor: 11.598

10.  Reply.

Authors:  M. Schindler
Journal:  Plant Cell       Date:  1993-09       Impact factor: 11.277

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.