Literature DB >> 17713591

A novel signal processing measure to identify exact and inexact tandem repeat patterns in DNA sequences.

Ravi Gupta1, Divya Sarthi, Ankush Mittal, Kuldip Singh.   

Abstract

The identification and analysis of repetitive patterns are active areas of biological and computational research. Tandem repeats in telomeres play a role in cancer and hypervariable trinucleotide tandem repeats are linked to over a dozen major neurodegenerative genetic disorders. In this paper, we present an algorithm to identify the exact and inexact repeat patterns in DNA sequences based on orthogonal exactly periodic subspace decomposition technique. Using the new measure our algorithm resolves the problems like whether the repeat pattern is of period P or its multiple (i.e., 2P, 3P, etc.), and several other problems that were present in previous signal-processing-based algorithms. We present an efficient algorithm of O(NL(w) log L(w)), where N is the length of DNA sequence and L(w) is the window length, for identifying repeats. The algorithm operates in two stages. In the first stage, each nucleotide is analyzed separately for periodicity, and in the second stage, the periodic information of each nucleotide is combined together to identify the tandem repeats. Datasets having exact and inexact repeats were taken up for the experimental purpose. The experimental result shows the effectiveness of the approach.

Entities:  

Year:  2007        PMID: 17713591      PMCID: PMC3171338          DOI: 10.1155/2007/43596

Source DB:  PubMed          Journal:  EURASIP J Bioinform Syst Biol        ISSN: 1687-4145


  13 in total

1.  REPuter: the manifold applications of repeat analysis on a genomic scale.

Authors:  S Kurtz; J V Choudhuri; E Ohlebusch; C Schleiermacher; J Stoye; R Giegerich
Journal:  Nucleic Acids Res       Date:  2001-11-15       Impact factor: 16.971

2.  An efficient algorithm for finding short approximate non-tandem repeats.

Authors:  E F Adebiyi; T Jiang; M Kaufmann
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

3.  Telomerase and cancer: where and when?

Authors:  W C Hahn
Journal:  Clin Cancer Res       Date:  2001-10       Impact factor: 12.531

4.  An algorithm for approximate tandem repeats.

Authors:  G M Landau; J P Schmidt; D Sokol
Journal:  J Comput Biol       Date:  2001       Impact factor: 1.479

5.  Beyond tandem repeats: complex pattern structures and distant regions of similarity.

Authors:  Amy M Hauth; Deborah A Joseph
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

6.  Spectral Repeat Finder (SRF): identification of repetitive sequences using Fourier transformation.

Authors:  Deepak Sharma; Biju Issac; G P S Raghava; R Ramaswamy
Journal:  Bioinformatics       Date:  2004-02-19       Impact factor: 6.937

7.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

Review 8.  Human tandem repeat sequences in forensic DNA typing.

Authors:  Keiji Tamaki; Alec J Jeffreys
Journal:  Leg Med (Tokyo)       Date:  2005-07       Impact factor: 1.376

9.  Triplet repeat DNA structures and human genetic disease: dynamic mutations from dynamic DNA.

Authors:  Richard R Sinden; Vladimir N Potaman; Elena A Oussatcheva; Christopher E Pearson; Yuri L Lyubchenko; Luda S Shlyakhtenko
Journal:  J Biosci       Date:  2002-02       Impact factor: 1.826

10.  The nucleotide sequence of chromosome I from Saccharomyces cerevisiae.

Authors:  H Bussey; D B Kaback; W Zhong; D T Vo; M W Clark; N Fortin; J Hall; B F Ouellette; T Keng; A B Barton
Journal:  Proc Natl Acad Sci U S A       Date:  1995-04-25       Impact factor: 11.205

View more
  8 in total

1.  Searching microsatellites in DNA sequences: approaches used and tools developed.

Authors:  Atul Grover; Veenu Aishwarya; P C Sharma
Journal:  Physiol Mol Biol Plants       Date:  2011-12-23

2.  Periodic power spectrum with applications in detection of latent periodicities in DNA sequences.

Authors:  Changchuan Yin; Jiasong Wang
Journal:  J Math Biol       Date:  2016-03-04       Impact factor: 2.259

3.  TRStalker: an efficient heuristic for finding fuzzy tandem repeats.

Authors:  Marco Pellegrini; M Elena Renda; Alessio Vecchio
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

4.  Direct mapping of symbolic DNA sequence into frequency domain in global repeat map algorithm.

Authors:  Matko Glunčić; Vladimir Paar
Journal:  Nucleic Acids Res       Date:  2012-09-12       Impact factor: 16.971

5.  Genome-scale computational analysis of DNA curvature and repeats in Arabidopsis and rice uncovers plant-specific genomic properties.

Authors:  Ali Masoudi-Nejad; Sara Movahedi; Ruy Jáuregui
Journal:  BMC Genomics       Date:  2011-05-06       Impact factor: 3.969

6.  Improved algorithm for analysis of DNA sequences using multiresolution transformation.

Authors:  T M Inbamalar; R Sivakumar
Journal:  ScientificWorldJournal       Date:  2015-04-27

7.  Finding long tandem repeats in long noisy reads.

Authors:  Shinichi Morishita; Kazuki Ichikawa; Eugene W Myers
Journal:  Bioinformatics       Date:  2021-05-05       Impact factor: 6.937

8.  Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats.

Authors:  Vladimir Paar; Nenad Pavin; Ivan Basar; Marija Rosandić; Matko Gluncić; Nils Paar
Journal:  BMC Bioinformatics       Date:  2008-11-03       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.