Literature DB >> 24858075

K-mer natural vector and its application to the phylogenetic analysis of genetic sequences.

Jia Wen1, Raymond H F Chan2, Shek-Chung Yau3, Rong L He4, Stephen S T Yau5.   

Abstract

Based on the well-known k-mer model, we propose a k-mer natural vector model for representing a genetic sequence based on the numbers and distributions of k-mers in the sequence. We show that there exists a one-to-one correspondence between a genetic sequence and its associated k-mer natural vector. The k-mer natural vector method can be easily and quickly used to perform phylogenetic analysis of genetic sequences without requiring evolutionary models or human intervention. Whole or partial genomes can be handled more effective with our proposed method. It is applied to the phylogenetic analysis of genetic sequences, and the obtaining results fully demonstrate that the k-mer natural vector method is a very powerful tool for analysing and annotating genetic sequences and determining evolutionary relationships both in terms of accuracy and efficiency.
Copyright © 2014 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  K-mer model; Natural vector; Phylogenetic analysis

Mesh:

Substances:

Year:  2014        PMID: 24858075      PMCID: PMC4096558          DOI: 10.1016/j.gene.2014.05.043

Source DB:  PubMed          Journal:  Gene        ISSN: 0378-1119            Impact factor:   3.688


  49 in total

1.  Optimal word sizes for dissimilarity measures and estimation of the degree of dissimilarity between DNA sequences.

Authors:  Tiee-Jian Wu; Ying-Hsueh Huang; Lung-An Li
Journal:  Bioinformatics       Date:  2005-09-06       Impact factor: 6.937

2.  Alignment-free genome comparison with feature frequency profiles (FFP) and optimal resolutions.

Authors:  Gregory E Sims; Se-Ran Jun; Guohong A Wu; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-02       Impact factor: 11.205

3.  Complete mitochondrial genome suggests diapsid affinities of turtles.

Authors:  R Zardoya; A Meyer
Journal:  Proc Natl Acad Sci U S A       Date:  1998-11-24       Impact factor: 11.205

4.  Combining data in phylogenetic analysis.

Authors:  J P Huelsenbeck; J J Bull; C W Cunningham
Journal:  Trends Ecol Evol       Date:  1996-04       Impact factor: 17.712

5.  A measure of DNA sequence dissimilarity based on Mahalanobis distance between frequencies of words.

Authors:  T J Wu; J P Burke; D B Davison
Journal:  Biometrics       Date:  1997-12       Impact factor: 2.571

6.  A novel statistical measure for sequence comparison on the basis of k-word counts.

Authors:  Xiwu Yang; Tianming Wang
Journal:  J Theor Biol       Date:  2012-11-09       Impact factor: 2.691

7.  A measure of the similarity of sets of sequences not requiring sequence alignment.

Authors:  B E Blaisdell
Journal:  Proc Natl Acad Sci U S A       Date:  1986-07       Impact factor: 11.205

Review 8.  Lens crystallins: gene recruitment and evolutionary dynamism.

Authors:  G Wistow
Journal:  Trends Biochem Sci       Date:  1993-08       Impact factor: 13.807

9.  The complete mitochondrial genome of Alligator mississippiensis and the separation between recent archosauria (birds and crocodiles).

Authors:  A Janke; U Arnason
Journal:  Mol Biol Evol       Date:  1997-12       Impact factor: 16.240

10.  Divergent evolution and evolution by the birth-and-death process in the immunoglobulin VH gene family.

Authors:  T Ota; M Nei
Journal:  Mol Biol Evol       Date:  1994-05       Impact factor: 16.240

View more
  10 in total

1.  Phenetic Comparison of Prokaryotic Genomes Using k-mers.

Authors:  Maxime Déraspe; Frédéric Raymond; Sébastien Boisvert; Alexander Culley; Paul H Roy; François Laviolette; Jacques Corbeil
Journal:  Mol Biol Evol       Date:  2017-10-01       Impact factor: 16.240

2.  Evolutionary mechanism and biological functions of 8-mers containing CG dinucleotide in yeast.

Authors:  Yan Zheng; Hong Li; Yue Wang; Hu Meng; Qiang Zhang; Xiaoqing Zhao
Journal:  Chromosome Res       Date:  2017-02-09       Impact factor: 5.239

3.  Informational laws of genome structures.

Authors:  Vincenzo Bonnici; Vincenzo Manca
Journal:  Sci Rep       Date:  2016-06-29       Impact factor: 4.379

4.  Intrinsic laws of k-mer spectra of genome sequences and evolution mechanism of genomes.

Authors:  Zhenhua Yang; Hong Li; Yun Jia; Yan Zheng; Hu Meng; Tonglaga Bao; Xiaolong Li; Liaofu Luo
Journal:  BMC Evol Biol       Date:  2020-11-23       Impact factor: 3.260

5.  A new graph-theoretic approach to determine the similarity of genome sequences based on nucleotide triplets.

Authors:  Subhram Das; Arijit Das; D K Bhattacharya; D N Tibarewala
Journal:  Genomics       Date:  2020-08-19       Impact factor: 5.736

6.  Analysis of the Genomic Distance Between Bat Coronavirus RaTG13 and SARS-CoV-2 Reveals Multiple Origins of COVID-19.

Authors:  Shaojun Pei; Stephen S-T Yau
Journal:  Acta Math Sci       Date:  2021-04-19       Impact factor: 1.258

7.  Exploring short k-mer profiles in cells and mobile elements from Archaea highlights the major influence of both the ecological niche and evolutionary history.

Authors:  Ariane Bize; Cédric Midoux; Mahendra Mariadassou; Sophie Schbath; Patrick Forterre; Violette Da Cunha
Journal:  BMC Genomics       Date:  2021-03-16       Impact factor: 3.969

8.  Full Chromosomal Relationships Between Populations and the Origin of Humans.

Authors:  Rui Dong; Shaojun Pei; Mengcen Guan; Shek-Chung Yau; Changchuan Yin; Rong L He; Stephen S-T Yau
Journal:  Front Genet       Date:  2022-02-02       Impact factor: 4.599

9.  Identification of HIV Rapid Mutations Using Differences in Nucleotide Distribution over Time.

Authors:  Nan Sun; Jie Yang; Stephen S-T Yau
Journal:  Genes (Basel)       Date:  2022-01-19       Impact factor: 4.096

10.  FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads.

Authors:  Fanny-Dhelia Pajuste; Lauris Kaplinski; Märt Möls; Tarmo Puurand; Maarja Lepamets; Maido Remm
Journal:  Sci Rep       Date:  2017-05-31       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.