Literature DB >> 20090173

Breaking the computational barrier: a divide-conquer and aggregate based approach for Alu insertion site characterisation.

Kun Zhang1, Wei Fan, Prescott Deininger, Andrea Edwards, Zujia Xu, Dongxiao Zhu.   

Abstract

Insertion site characterisation of Alu elements is an important problem in primate-specific bioinformatics research. Key characteristics of this challenging problem include: data are not in the pre-defined feature vectors for predictive model construction; without any prior knowledge, can we discover the general patterns that could exist and also make biological insights?; how to obtain the compact yet discriminative patterns given a search space of 4(200)? This paper provides an integrated algorithmic framework for fulfilling the above mining tasks. Compared to the benchmark biological study, our results provide a further refined analysis of the patterns involved in Alu insertion. In particular, we acquire a 200nt predictive profile around the primary insertion site which not only contains the widely accepted consensus, but also suggests a longer pattern (T(7)AA[G'A]AATAA. This pattern provides more insight into the favourable sequence variations allowed for preferred binding and cleavage by the L1 ORF2 endonuclease. The proposed method is general enough that can be also applied to other sequence detection problems, such as microRNA target prediction.

Entities:  

Mesh:

Year:  2009        PMID: 20090173      PMCID: PMC2922064          DOI: 10.1504/IJCBDD.2009.030763

Source DB:  PubMed          Journal:  Int J Comput Biol Drug Des        ISSN: 1756-0756


  14 in total

1.  L1 (LINE-1) retrotransposon evolution and amplification in recent human history.

Authors:  S Boissinot; P Chevret; A V Furano
Journal:  Mol Biol Evol       Date:  2000-06       Impact factor: 16.240

2.  WebLogo: a sequence logo generator.

Authors:  Gavin E Crooks; Gary Hon; John-Marc Chandonia; Steven E Brenner
Journal:  Genome Res       Date:  2004-06       Impact factor: 9.043

3.  A generic motif discovery algorithm for sequential data.

Authors:  Kyle L Jensen; Mark P Styczynski; Isidore Rigoutsos; Gregory N Stephanopoulos
Journal:  Bioinformatics       Date:  2005-10-27       Impact factor: 6.937

4.  An efficient, versatile and scalable pattern growth approach to mine frequent patterns in unaligned protein sequences.

Authors:  Kai Ye; Walter A Kosters; Adriaan P Ijzerman
Journal:  Bioinformatics       Date:  2007-01-19       Impact factor: 6.937

5.  Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons.

Authors:  J Jurka
Journal:  Proc Natl Acad Sci U S A       Date:  1997-03-04       Impact factor: 11.205

6.  Information content of binding sites on nucleotide sequences.

Authors:  T D Schneider; G D Stormo; L Gold; A Ehrenfeucht
Journal:  J Mol Biol       Date:  1986-04-05       Impact factor: 5.469

Review 7.  Alu repeats and human disease.

Authors:  P L Deininger; M A Batzer
Journal:  Mol Genet Metab       Date:  1999-07       Impact factor: 4.797

8.  Evolutionary diversity and potential recombinogenic role of integration targets of Non-LTR retrotransposons.

Authors:  Andrew J Gentles; Oleksiy Kohany; Jerzy Jurka
Journal:  Mol Biol Evol       Date:  2005-06-08       Impact factor: 16.240

9.  Structure-based prediction of insertion-site preferences of transposons into chromosomes.

Authors:  Aron M Geurts; Christopher S Hackett; Jason B Bell; Tracy L Bergemann; Lara S Collier; Corey M Carlson; David A Largaespada; Perry B Hackett
Journal:  Nucleic Acids Res       Date:  2006-05-22       Impact factor: 16.971

10.  Molecular archeology of L1 insertions in the human genome.

Authors:  Suzanne T Szak; Oxana K Pickeral; Wojciech Makalowski; Mark S Boguski; David Landsman; Jef D Boeke
Journal:  Genome Biol       Date:  2002-09-19       Impact factor: 13.583

View more
  5 in total

1.  LINE-1 activity as molecular basis for genomic instability associated with light exposure at night.

Authors:  Victoria P Belancio
Journal:  Mob Genet Elements       Date:  2015-04-07

2.  Alu distribution and mutation types of cancer genes.

Authors:  Wensheng Zhang; Andrea Edwards; Wei Fan; Prescott Deininger; Kun Zhang
Journal:  BMC Genomics       Date:  2011-03-23       Impact factor: 3.969

3.  Genome-wide analysis of mobile genetic element insertion sites.

Authors:  Kamal Rawal; Ram Ramaswamy
Journal:  Nucleic Acids Res       Date:  2011-05-23       Impact factor: 16.971

4.  Orangutan Alu quiescence reveals possible source element: support for ancient backseat drivers.

Authors:  Jerilyn A Walker; Miriam K Konkel; Brygg Ullmer; Christopher P Monceaux; Oliver A Ryder; Robert Hubley; Arian Fa Smit; Mark A Batzer
Journal:  Mob DNA       Date:  2012-04-30

5.  Inferring the expression variability of human transposable element-derived exons by linear model analysis of deep RNA sequencing data.

Authors:  Wensheng Zhang; Andrea Edwards; Wei Fan; Zhide Fang; Prescott Deininger; Kun Zhang
Journal:  BMC Genomics       Date:  2013-08-28       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.