Literature DB >> 7984436

A method for fast database search for all k-nucleotide repeats.

G Benson1, M S Waterman.   

Abstract

A significant portion of DNA consists of repeating patterns of various sizes, from very small (one, two and three nucleotides) to very large (over 300 nucleotides). Although the functions of these repeating regions are not well understood, they appear important for understanding the expression, regulation and evolution of DNA. For example, increases in the number of trinucleotide repeats have been associated with human genetic disease, including Fragile-X mental retardation and Huntington's disease. Repeats are also useful as a tool in mapping and identifying DNA; the number of copies of a particular pattern at a site is often variable among individuals (polymorphic) and is therefore helpful in locating genes via linkage studies and also in providing DNA fingerprints of individuals. The number of repeating regions is unknown as is the distribution of pattern sizes. It would be useful to search for such regions in the DNA database in order that they may be studied more fully. The DNA database currently consists of approximately 150 million basepairs and is growing exponentially. Therefore, any program to look for repeats must be efficient and fast. In this paper, we present some new techniques that are useful in recognizing repeating patterns and describe a new program for rapidly detecting repeat regions in the DNA database where the basic unit of the repeat has size up to 32 nucleotides. It is our hope that the examples in this paper will illustrate the unrealized diversity of repeats in DNA and that the program we have developed will be a useful tool for locating new and interesting repeats.

Entities:  

Mesh:

Substances:

Year:  1994        PMID: 7984436      PMCID: PMC308537          DOI: 10.1093/nar/22.22.4828

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  20 in total

1.  Evolution of repeated DNA sequences by unequal crossover.

Authors:  G P Smith
Journal:  Science       Date:  1976-02-13       Impact factor: 47.728

2.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

3.  Satellite DNA sequences in Drosophila virilis.

Authors:  J G Gall; D D Atherton
Journal:  J Mol Biol       Date:  1974-01-05       Impact factor: 5.469

4.  Frameshift mutations and the genetic code. This paper is dedicated to Professor Theodosius Dobzhansky on the occasion of his 66th birthday.

Authors:  G Streisinger; Y Okada; J Emrich; J Newton; A Tsugita; E Terzaghi; M Inouye
Journal:  Cold Spring Harb Symp Quant Biol       Date:  1966

5.  A rapidly evolving region in the immunoglobulin heavy chain loci of rat and mouse: postulated role of (dC-dA)n.(dG-dT)n sequences.

Authors:  L Hellman; M L Steen; M Sundvall; U Pettersson
Journal:  Gene       Date:  1988-08-15       Impact factor: 3.688

6.  Discovering simple DNA sequences by the algorithmic significance method.

Authors:  A Milosavljević; J Jurka
Journal:  Comput Appl Biosci       Date:  1993-08

7.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

8.  The short arm of chromosome 11 is a "hot spot" for hypermethylation in human neoplasia.

Authors:  A de Bustros; B D Nelkin; A Silverman; G Ehrlich; B Poiesz; S B Baylin
Journal:  Proc Natl Acad Sci U S A       Date:  1988-08       Impact factor: 11.205

9.  Enhanced gene expression by the poly(dT-dG).poly(dC-dA) sequence.

Authors:  H Hamada; M Seidman; B H Howard; C M Gorman
Journal:  Mol Cell Biol       Date:  1984-12       Impact factor: 4.272

10.  (dC-dA)n.(dG-dT)n sequences have evolutionarily conserved chromosomal locations in Drosophila with implications for roles in chromosome structure and function.

Authors:  M L Pardue; K Lowenhaupt; A Rich; A Nordheim
Journal:  EMBO J       Date:  1987-06       Impact factor: 11.598

View more
  11 in total

1.  mreps: Efficient and flexible detection of tandem repeats in DNA.

Authors:  Roman Kolpakov; Ghizlane Bana; Gregory Kucherov
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  In silico comparison of bacterial strains using mutual information.

Authors:  D Swati
Journal:  J Biosci       Date:  2007-09       Impact factor: 1.826

3.  Computerized polymorphic marker identification: experimental validation and a predicted human polymorphism catalog.

Authors:  J W Fondon; G M Mele; R I Brezinschek; D Cummings; A Pande; J Wren; K M O'Brien; K C Kupfer; M H Wei; M Lerman; J D Minna; H R Garner
Journal:  Proc Natl Acad Sci U S A       Date:  1998-06-23       Impact factor: 11.205

4.  Searching microsatellites in DNA sequences: approaches used and tools developed.

Authors:  Atul Grover; Veenu Aishwarya; P C Sharma
Journal:  Physiol Mol Biol Plants       Date:  2011-12-23

5.  Detection and characterization of megasatellites in orthologous and nonorthologous genes of 21 fungal genomes.

Authors:  Fredj Tekaia; Bernard Dujon; Guy-Franck Richard
Journal:  Eukaryot Cell       Date:  2013-03-29

6.  Unusual composition of a yeast chromosome arm is associated with its delayed replication.

Authors:  Célia Payen; Gilles Fischer; Christian Marck; Caroline Proux; David James Sherman; Jean-Yves Coppée; Mark Johnston; Bernard Dujon; Cécile Neuvéglise
Journal:  Genome Res       Date:  2009-07-10       Impact factor: 9.043

7.  Analysis of microsatellites in 13 hemiascomycetous yeast species: mechanisms involved in genome dynamics.

Authors:  Alain Malpertuy; Bernard Dujon; Guy-Franck Richard
Journal:  J Mol Evol       Date:  2003-06       Impact factor: 2.395

8.  Triplet repeat length bias and variation in the human transcriptome.

Authors:  Michael Molla; Arthur Delcher; Shamil Sunyaev; Charles Cantor; Simon Kasif
Journal:  Proc Natl Acad Sci U S A       Date:  2009-09-17       Impact factor: 11.205

9.  Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm.

Authors:  Matko Glunčić; Vladimir Paar
Journal:  Nucleic Acids Res       Date:  2012-09-12       Impact factor: 16.971

10.  Modeling and comparing the organization of circular genomes.

Authors:  Grace S Shieh; Shurong Zheng; Richard A Johnson; Yi-Feng Chang; Kunio Shimizu; Chia-Chang Wang; Sen-Lin Tang
Journal:  Bioinformatics       Date:  2011-01-28       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.