Literature DB >> 21723298

New powerful statistics for alignment-free sequence comparison under a pattern transfer model.

Xuemei Liu1, Lin Wan, Jing Li, Gesine Reinert, Michael S Waterman, Fengzhu Sun.   

Abstract

Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D*2 and D(s)2 showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D*2 and D(s)2 by comparing local sequence pairs and then summing over all the local sequence pairs of certain length. We show that the new statistics are much more powerful than the corresponding statistics and the power tends to 1 as the sequence length tends to infinity under the pattern transfer model.
Copyright © 2011 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21723298      PMCID: PMC3146591          DOI: 10.1016/j.jtbi.2011.06.020

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  32 in total

1.  EMBOSS: the European Molecular Biology Open Software Suite.

Authors:  P Rice; I Longden; A Bleasby
Journal:  Trends Genet       Date:  2000-06       Impact factor: 11.639

Review 2.  Alignment-free sequence comparison-a review.

Authors:  Susana Vinga; Jonas Almeida
Journal:  Bioinformatics       Date:  2003-03-01       Impact factor: 6.937

3.  Distributional regimes for the number of k-word matches between two random sequences.

Authors:  Ross A Lippert; Haiyan Huang; Michael S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-10-08       Impact factor: 11.205

4.  Optimal word sizes for dissimilarity measures and estimation of the degree of dissimilarity between DNA sequences.

Authors:  Tiee-Jian Wu; Ying-Hsueh Huang; Lung-An Li
Journal:  Bioinformatics       Date:  2005-09-06       Impact factor: 6.937

5.  A measure of DNA sequence dissimilarity based on Mahalanobis distance between frequencies of words.

Authors:  T J Wu; J P Burke; D B Davison
Journal:  Biometrics       Date:  1997-12       Impact factor: 2.571

6.  A measure of the similarity of sets of sequences not requiring sequence alignment.

Authors:  B E Blaisdell
Journal:  Proc Natl Acad Sci U S A       Date:  1986-07       Impact factor: 11.205

7.  Horizontal DNA transfer from donor to host cells as an alternative mechanism of epithelial chimerism after allogeneic hematopoietic cell transplantation.

Authors:  Miguel Waterhouse; Maria Themeli; Hartmut Bertz; Nicholas Zoumbos; Jürgen Finke; Alexandros Spyridonidis
Journal:  Biol Blood Marrow Transplant       Date:  2010-09-15       Impact factor: 5.742

8.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

9.  CVTree: a phylogenetic tree reconstruction tool based on whole genomes.

Authors:  Ji Qi; Hong Luo; Bailin Hao
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

10.  Detection and characterization of horizontal transfers in prokaryotes using genomic signature.

Authors:  Christine Dufraigne; Bernard Fertil; Sylvain Lespinats; Alain Giron; Patrick Deschavanne
Journal:  Nucleic Acids Res       Date:  2005-01-13       Impact factor: 16.971

View more
  16 in total

1.  A geometric interpretation for local alignment-free sequence comparison.

Authors:  Ehsan Behnam; Michael S Waterman; Andrew D Smith
Journal:  J Comput Biol       Date:  2013-07       Impact factor: 1.479

Review 2.  Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis.

Authors:  Oliver Bonham-Carter; Joe Steele; Dhundy Bastola
Journal:  Brief Bioinform       Date:  2013-07-31       Impact factor: 11.622

3.  Multiple alignment-free sequence comparison.

Authors:  Jie Ren; Kai Song; Fengzhu Sun; Minghua Deng; Gesine Reinert
Journal:  Bioinformatics       Date:  2013-08-29       Impact factor: 6.937

Review 4.  New developments of alignment-free sequence comparison: measures, statistics and next-generation sequencing.

Authors:  Kai Song; Jie Ren; Gesine Reinert; Minghua Deng; Michael S Waterman; Fengzhu Sun
Journal:  Brief Bioinform       Date:  2013-09-23       Impact factor: 11.622

5.  Inference of Markovian properties of molecular sequences from NGS data and applications to comparative genomics.

Authors:  Jie Ren; Kai Song; Minghua Deng; Gesine Reinert; Charles H Cannon; Fengzhu Sun
Journal:  Bioinformatics       Date:  2015-06-30       Impact factor: 6.937

6.  Alignment-free sequence comparison based on next-generation sequencing reads.

Authors:  Kai Song; Jie Ren; Zhiyuan Zhai; Xuemei Liu; Minghua Deng; Fengzhu Sun
Journal:  J Comput Biol       Date:  2013-02       Impact factor: 1.479

7.  Assembly-free genome comparison based on next-generation sequencing reads and variable length patterns.

Authors:  Matteo Comin; Michele Schimd
Journal:  BMC Bioinformatics       Date:  2014-09-10       Impact factor: 3.169

8.  Determination of k-mer density in a DNA sequence and subsequent cluster formation algorithm based on the application of electronic filter.

Authors:  Bimal Kumar Sarkar; Ashish Ranjan Sharma; Manojit Bhattacharya; Garima Sharma; Sang-Soo Lee; Chiranjib Chakraborty
Journal:  Sci Rep       Date:  2021-07-01       Impact factor: 4.379

9.  Indel-tolerant read mapping with trinucleotide frequencies using cache-oblivious kd-trees.

Authors:  Md Pavel Mahmud; John Wiedenhoeft; Alexander Schliep
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

10.  DendroBLAST: approximate phylogenetic trees in the absence of multiple sequence alignments.

Authors:  Steven Kelly; Philip K Maini
Journal:  PLoS One       Date:  2013-03-15       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.