Analyzing similarities in genome sequences.

I C Fonseca, E Nogueira, P H Figueirêdo, S Coutinho.

Abstract

This article investigates similarity between complete mitochondrial DNA sequences by determining the distribution of the relative frequencies of words of different lengths and the characteristics of their relevance throughout the sequences. The degree of similarity is obtained by comparing the distances between words contained within these sequences. Our results indicate that the best groupings among different species depend on the word lengths and their respective relative frequencies. We also observed that the longer the word, the more consistent the grouping between the sequences becomes. Applying these results, together with the prospect of analyzing DNA sequences belonging to a single biological species, may be important for constructing phylogenetic trees, which are appropriate structures for understanding the evolutionary history of species.
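
The idea of comparing sequences through the relative frequencies of words of a given length can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not specify how word distances or similarity are computed, so the sketch assumes overlapping words, an alphabet restricted to A, C, G and T, and a plain Euclidean distance between frequency profiles. The function names `word_frequencies` and `profile_distance` and the toy sequences are hypothetical, chosen only for illustration.

```python
from collections import Counter
from itertools import product
from math import sqrt


def word_frequencies(sequence, k):
    """Relative frequency of each overlapping word of length k (assumed scheme)."""
    sequence = sequence.upper()
    words = [sequence[i:i + k] for i in range(len(sequence) - k + 1)]
    # Keep only words made of unambiguous nucleotides.
    words = [w for w in words if set(w) <= set("ACGT")]
    counts = Counter(words)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def profile_distance(freq_a, freq_b, k):
    """Euclidean distance between two frequency profiles over all 4**k words.

    The Euclidean metric is an assumption made for this sketch.
    """
    all_words = ("".join(p) for p in product("ACGT", repeat=k))
    return sqrt(sum((freq_a.get(w, 0.0) - freq_b.get(w, 0.0)) ** 2
                    for w in all_words))


# Toy sequences, invented for illustration only (not mitochondrial data).
seq_a = "ACGTACGTACGT"
seq_b = "ACGTACGTTTTT"
for k in (1, 2, 3):
    d = profile_distance(word_frequencies(seq_a, k),
                         word_frequencies(seq_b, k), k)
    print(k, round(d, 4))
```

Repeating the comparison for several values of `k` mirrors the abstract's observation that groupings depend on word length. A pairwise distance matrix built this way could, under the same assumptions, be passed to a standard clustering routine to produce a tree.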

Keywords:  Living systems: Biological Matter

Year:  2018        PMID: 29349724     DOI: 10.1140/epje/i2018-11609-8

Source DB:  PubMed          Journal:  Eur Phys J E Soft Matter        ISSN: 1292-8941            Impact factor:   1.890


  13 in total

1.  Words in DNA sequences: some case studies based on their frequency statistics.

Authors:  Srabashi Basu; Debi Prosad Burma; Probal Chaudhuri
Journal:  J Math Biol       Date:  2003-06       Impact factor: 2.259

2.  Multifractal analysis of DNA walks and trails.

Authors:  Alexandre Rosas; Edvaldo Nogueira; José F Fontanari
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2002-12-18

3.  Objective method for estimating asymptotic parameters, with an application to sequence alignment.

Authors:  Sergey Sheetlin; Yonil Park; John L Spouge
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2011-09-13

4.  Statistical physics approach to categorize biologic signals: from heart rate dynamics to DNA sequences.

Authors:  C-K Peng; Albert C-C Yang; Ary L Goldberger
Journal:  Chaos       Date:  2007-03       Impact factor: 3.642

5.  Long-range correlations in nucleotide sequences.

Authors:  C K Peng; S V Buldyrev; A L Goldberger; S Havlin; F Sciortino; M Simons; H E Stanley
Journal:  Nature       Date:  1992-03-12       Impact factor: 49.962

6.  Mosaic organization of DNA nucleotides.

Authors:  C K Peng; S V Buldyrev; S Havlin; M Simons; H E Stanley; A L Goldberger
Journal:  Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics       Date:  1994-02

7.  Is DNA a language?

Authors:  A A Tsonis; J B Elsner; P A Tsonis
Journal:  J Theor Biol       Date:  1997-01-07       Impact factor: 2.691

8.  Probability distribution of intersymbol distances in random symbolic sequences: Applications to improving detection of keywords in texts and of amino acid clustering in proteins.

Authors:  Pedro Carpena; Pedro A Bernaola-Galván; Concepción Carretero-Campos; Ana V Coronado
Journal:  Phys Rev E       Date:  2016-11-04       Impact factor: 2.529

9.  A Guaranteed Similarity Metric Learning Framework for Biological Sequence Comparison.

Authors:  Keru Hua; Qin Yu; Ruiming Zhang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2015-10-26       Impact factor: 3.710

10.  Distinguishing Functional DNA Words; A Method for Measuring Clustering Levels.

Authors:  Hanieh Moghaddasi; Khosrow Khalifeh; Amir Hossein Darooneh
Journal:  Sci Rep       Date:  2017-01-27       Impact factor: 4.379
