Literature DB >> 22820020

Alignment-free distance measure based on return time distribution for sequence analysis: applications to clustering, molecular phylogeny and subtyping.

Pandurang Kolekar1, Mohan Kale, Urmila Kulkarni-Kale.   

Abstract

The data deluge in post-genomic era demands development of novel data mining tools. Existing molecular phylogeny analyses (MPAs) developed for individual gene/protein sequences are alignment-based. However, the size of genomic data and uncertainties associated with alignments, necessitate development of alignment-free methods for MPA. Derivation of distances between sequences is an important step in both, alignment-dependant and alignment-free methods. Various alignment-free distance measures based on oligo-nucleotide frequencies, information content, compression techniques, etc. have been proposed. However, these distance measures do not account for relative order of components viz. nucleotides or amino acids. A new distance measure, based on the concept of 'return time distribution' (RTD) of k-mers is proposed, which accounts for the sequence composition and their relative orders. Statistical parameters of RTDs are used to derive a distance function. The resultant distance matrix is used for clustering and phylogeny using Neighbor-joining. Its performance for MPA and subtyping was evaluated using simulated data generated by block-bootstrap, receiver operating characteristics and leave-one-out cross validation methods. The proposed method was successfully applied for MPA of family Flaviviridae and subtyping of Dengue viruses. It is observed that method retains resolution for classification and subtyping of viruses at varying levels of sequence similarity and taxonomic hierarchy.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 22820020     DOI: 10.1016/j.ympev.2012.07.003

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  10 in total

1.  Genetic diversity and evolutionary dynamics of dengue isolates from India.

Authors:  Johni Rexliene; Jayavel Sridhar
Journal:  Virusdisease       Date:  2019-07-24

2.  Fast alignment-free sequence comparison using spaced-word frequencies.

Authors:  Chris-Andre Leimeister; Marcus Boden; Sebastian Horwege; Sebastian Lindner; Burkhard Morgenstern
Journal:  Bioinformatics       Date:  2014-04-03       Impact factor: 6.937

3.  Analysis of genotype diversity and evolution of Dengue virus serotype 2 using complete genomes.

Authors:  Vaishali P Waman; Pandurang Kolekar; Mukund R Ramtirthkar; Mohan M Kale; Urmila Kulkarni-Kale
Journal:  PeerJ       Date:  2016-08-24       Impact factor: 2.984

4.  RV-Typer: A Web Server for Typing of Rhinoviruses Using Alignment-Free Approach.

Authors:  Pandurang S Kolekar; Vaishali P Waman; Mohan M Kale; Urmila Kulkarni-Kale
Journal:  PLoS One       Date:  2016-02-12       Impact factor: 3.240

5.  A novel fast vector method for genetic sequence comparison.

Authors:  Yongkun Li; Lily He; Rong Lucy He; Stephen S-T Yau
Journal:  Sci Rep       Date:  2017-09-22       Impact factor: 4.379

6.  An open-source k-mer based machine learning tool for fast and accurate subtyping of HIV-1 genomes.

Authors:  Stephen Solis-Reyes; Mariano Avino; Art Poon; Lila Kari
Journal:  PLoS One       Date:  2018-11-14       Impact factor: 3.240

7.  Alignment-free method for DNA sequence clustering using Fuzzy integral similarity.

Authors:  Ajay Kumar Saw; Garima Raj; Manashi Das; Narayan Chandra Talukdar; Binod Chandra Tripathy; Soumyadeep Nandi
Journal:  Sci Rep       Date:  2019-03-06       Impact factor: 4.379

8.  Benchmarking of alignment-free sequence comparison methods.

Authors:  Andrzej Zielezinski; Hani Z Girgis; Guillaume Bernard; Chris-Andre Leimeister; Kujin Tang; Thomas Dencker; Anna Katharina Lau; Sophie Röhling; Jae Jin Choi; Michael S Waterman; Matteo Comin; Sung-Hou Kim; Susana Vinga; Jonas S Almeida; Cheong Xin Chan; Benjamin T James; Fengzhu Sun; Burkhard Morgenstern; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2019-07-25       Impact factor: 13.583

9.  Genetic variability in minor capsid protein (L2 gene) of human papillomavirus type 16 among Indian women.

Authors:  Arati Mane; Sanket Limaye; Linata Patil; Urmila Kulkarni-Kale
Journal:  Med Microbiol Immunol       Date:  2022-05-13       Impact factor: 4.148

10.  Women in the European Virus Bioinformatics Center.

Authors:  Franziska Hufsky; Ana Abecasis; Patricia Agudelo-Romero; Magda Bletsa; Katherine Brown; Claudia Claus; Stefanie Deinhardt-Emmer; Li Deng; Caroline C Friedel; María Inés Gismondi; Evangelia Georgia Kostaki; Denise Kühnert; Urmila Kulkarni-Kale; Karin J Metzner; Irmtraud M Meyer; Laura Miozzi; Luca Nishimura; Sofia Paraskevopoulou; Alba Pérez-Cataluña; Janina Rahlff; Emma Thomson; Charlotte Tumescheit; Lia van der Hoek; Lore Van Espen; Anne-Mieke Vandamme; Maryam Zaheri; Neta Zuckerman; Manja Marz
Journal:  Viruses       Date:  2022-07-12       Impact factor: 5.818

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.