Literature DB >> 1568127

A search for common patterns in many sequences.

M A Roytberg1.   

Abstract

A new approach to search for common patterns in many sequences is presented. The idea is that one sequence from the set of sequences to be compared is considered as a 'basic' one and all its similarities with other sequences are found. Multiple similarities are then reconstructed using these data. This approach allows one to search for similar segments which can differ in both substitutions and deletions/insertions. These segments can be situated at different positions in various sequences. No regions of complete or strong similarity within the segments are required. The other parts of the sequences can have no similarity at all. The only requirement is that the similar segments can be found in all the sequences (or in the majority of them, given the common segments are present in the basic sequence). Working time of an algorithm presented is proportional to n.L2 when n sequences of length L are analyzed. The algorithm proposed is implemented as programs for the IBM-PC and IBM/370. Its applications to the analysis of biopolymer primary structures as well as the dependence of the results on the choice of basic sequence are discussed.

Mesh:

Year:  1992        PMID: 1568127     DOI: 10.1093/bioinformatics/8.1.57

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  3 in total

1.  Similarity landscapes: a way to detect many structural and sequence motifs in both introns and exons.

Authors:  M Hultner; D W Smith; C Wills
Journal:  J Mol Evol       Date:  1994-02       Impact factor: 2.395

2.  Finding flexible patterns in unaligned protein sequences.

Authors:  I Jonassen; J F Collins; D G Higgins
Journal:  Protein Sci       Date:  1995-08       Impact factor: 6.725

3.  Discovering active motifs in sets of related protein sequences and using them for classification.

Authors:  J T Wang; T G Marr; D Shasha; B A Shapiro; G W Chirn
Journal:  Nucleic Acids Res       Date:  1994-07-25       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.