Literature DB >> 8169961

Similarity landscapes: a way to detect many structural and sequence motifs in both introns and exons.

M Hultner1, D W Smith, C Wills.   

Abstract

When investigators undertake searches of DNA databases, they normally discard large numbers of alignments that demonstrate very weak resemblances to each other, retaining only those that show statistically significant levels of resemblance. We show here that a great deal of information can be extracted from these weak alignments by examining them en masse. This is done by building three-dimensional similarity landscapes from the alignments, landscapes that reveal whether an unusual number of individually nonsignificant alignments tend to match up to a particular region of the query sequence being searched. The power of the search is increased by the use of libraries consisting entirely of introns or of exons. We show that (1) similarity landscapes with a variety of features can be generated from both intron and exon libraries, using introns or exons as query sequences; (2) the landscape features are real and not a statistical artifact; (3) well-known protein motifs used as query sequences can generate various landscape features; and (4) there is some evidence for resemblances between short regions of sequence carried by introns and exons. One possible interpretation of these results is that both introns and exons may have been built up during their evolution from short regions of sequence that as a result are now widely distributed throughout eukaryotic genomes. Such an interpretation would imply that these short regions have common ancestry. Alternatively, the wide sharing of short pieces of DNA may reflect regions with particular structural properties that have arisen through convergent evolution. The similarity-landscape approach can be used to detect such widespread structural motifs and sequence motifs in the genome that might be missed by less-global searches. It can also be used in conjunction with algorithms developed for detecting significant multiple alignments by isolating promising subsets of the databases that can be examined in more detail.

Mesh:

Year:  1994        PMID: 8169961     DOI: 10.1007/bf00166165

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  20 in total

1.  A search for common patterns in many sequences.

Authors:  M A Roytberg
Journal:  Comput Appl Biosci       Date:  1992-02

2.  Exhaustive matching of the entire protein sequence database.

Authors:  G H Gonnet; M A Cohen; S A Benner
Journal:  Science       Date:  1992-06-05       Impact factor: 47.728

3.  Protein database searches for multiple alignments.

Authors:  S F Altschul; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1990-07       Impact factor: 11.205

4.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

5.  Profile analysis: detection of distantly related proteins.

Authors:  M Gribskov; A D McLachlan; D Eisenberg
Journal:  Proc Natl Acad Sci U S A       Date:  1987-07       Impact factor: 11.205

6.  Multiple sequence alignment by consensus.

Authors:  M S Waterman
Journal:  Nucleic Acids Res       Date:  1986-11-25       Impact factor: 16.971

7.  The leucine zipper: a hypothetical structure common to a new class of DNA binding proteins.

Authors:  W H Landschulz; P F Johnson; S L McKnight
Journal:  Science       Date:  1988-06-24       Impact factor: 47.728

8.  Adaptive evolution in the stomach lysozymes of foregut fermenters.

Authors:  C B Stewart; J W Schilling; A C Wilson
Journal:  Nature       Date:  1987 Nov 26-Dec 2       Impact factor: 49.962

9.  Genomic and cDNA cloning of the human C1 inhibitor. Intron-exon junctions and comparison with other serpins.

Authors:  P E Carter; B Dunbar; J E Fothergill
Journal:  Eur J Biochem       Date:  1988-04-05

10.  There appear to be conserved constraints on the distribution of nucleotide sequences in cellular genomes.

Authors:  A C Rogerson
Journal:  J Mol Evol       Date:  1991-01       Impact factor: 2.395

View more
  1 in total

1.  A model of the nicotinic receptor extracellular domain based on sequence identity and residue location.

Authors:  I Tsigelny; N Sugiyama; S M Sine; P Taylor
Journal:  Biophys J       Date:  1997-07       Impact factor: 4.033

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.