Literature DB >> 27538895

Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison.

Tung Hoang1, Changchuan Yin1, Stephen S-T Yau2.   

Abstract

Numerical encoding plays an important role in DNA sequence analysis via computational methods, in which numerical values are associated with corresponding symbolic characters. After numerical representation, digital signal processing methods can be exploited to analyze DNA sequences. To reflect the biological properties of the original sequence, it is vital that the representation is one-to-one. Chaos Game Representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane that allows the depiction of the DNA sequence in the form of image. Using CGR, a biological sequence can be transformed one-to-one to a numerical sequence that preserves the main features of the original sequence. In this research, we propose to encode DNA sequences by considering 2D CGR coordinates as complex numbers, and apply digital signal processing methods to analyze their evolutionary relationship. Computational experiments indicate that this approach gives comparable results to the state-of-the-art multiple sequence alignment method, Clustal Omega, and is significantly faster. The MATLAB code for our method can be accessed from: www.mathworks.com/matlabcentral/fileexchange/57152.
Copyright © 2016 Elsevier Inc. All rights reserved.

Keywords:  Chaos game representation; Clustal Omega; Digital signal processing

Mesh:

Year:  2016        PMID: 27538895     DOI: 10.1016/j.ygeno.2016.08.002

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  16 in total

1.  Identifying anticancer peptides by using a generalized chaos game representation.

Authors:  Li Ge; Jiaguo Liu; Yusen Zhang; Matthias Dehmer
Journal:  J Math Biol       Date:  2018-10-05       Impact factor: 2.259

2.  A Puzzling Anomaly in the 4-Mer Composition of the Giant Pandoravirus Genomes Reveals a Stringent New Evolutionary Selection Process.

Authors:  Olivier Poirot; Sandra Jeudy; Chantal Abergel; Jean-Michel Claverie
Journal:  J Virol       Date:  2019-11-13       Impact factor: 5.103

3.  MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.

Authors:  Robson P Bonidia; Douglas S Domingues; Danilo S Sanches; André C P L F de Carvalho
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

4.  A novel fast vector method for genetic sequence comparison.

Authors:  Yongkun Li; Lily He; Rong Lucy He; Stephen S-T Yau
Journal:  Sci Rep       Date:  2017-09-22       Impact factor: 4.379

5.  Small-Angle Scattering and Multifractal Analysis of DNA Sequences.

Authors:  lEugen Mircea Anitas
Journal:  Int J Mol Sci       Date:  2020-06-30       Impact factor: 5.923

6.  Comparative genomic signature representations of the emerging COVID-19 coronavirus and other coronaviruses: High identity and possible recombination between Bat and Pangolin coronaviruses.

Authors:  Rabeb Touati; Sondes Haddad-Boubaker; Imen Ferchichi; Imen Messaoudi; Afef Elloumi Ouesleti; Henda Triki; Zied Lachiri; Maher Kharrat
Journal:  Genomics       Date:  2020-07-06       Impact factor: 5.736

7.  Integrated entropy-based approach for analyzing exons and introns in DNA sequences.

Authors:  Junyi Li; Li Zhang; Huinian Li; Yuan Ping; Qingzhe Xu; Rongjie Wang; Renjie Tan; Zhen Wang; Bo Liu; Yadong Wang
Journal:  BMC Bioinformatics       Date:  2019-06-10       Impact factor: 3.169

8.  Chaos game representation dataset of SARS-CoV-2 genome.

Authors:  Raquel de M Barbosa; Marcelo A C Fernandes
Journal:  Data Brief       Date:  2020-04-25

9.  Genomic signal processing for DNA sequence clustering.

Authors:  Gerardo Mendizabal-Ruiz; Israel Román-Godínez; Sulema Torres-Ramos; Ricardo A Salido-Ruiz; Hugo Vélez-Pérez; J Alejandro Morales
Journal:  PeerJ       Date:  2018-01-24       Impact factor: 2.984

10.  Large-Scale Genome Comparison Based on Cumulative Fourier Power and Phase Spectra: Central Moment and Covariance Vector.

Authors:  Shaojun Pei; Rui Dong; Rong Lucy He; Stephen S-T Yau
Journal:  Comput Struct Biotechnol J       Date:  2019-07-11       Impact factor: 7.271

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.