Literature DB >> 10642879

Sequence complexity for biological sequence analysis.

L Allison1, L Stern, T Edgoose, T I Dix.   

Abstract

A new statistical model for DNA considers a sequence to be a mixture of regions with little structure and regions that are approximate repeats of other subsequences, i.e. instances of repeats do not need to match each other exactly. Both forward- and reverse-complementary repeats are allowed. The model has a small number of parameters which are fitted to the data. In general there are many explanations for a given sequence and how to compute the total probability of the data given the model is shown. Computer algorithms are described for these tasks. The model can be used to compute the information content of a sequence, either in total or base by base. This amounts to looking at sequences from a data-compression point of view and it is argued that this is a good way to tackle intelligent sequence analysis in general.

Mesh:

Substances:

Year:  2000        PMID: 10642879

Source DB:  PubMed          Journal:  Comput Chem        ISSN: 0097-8485


  5 in total

Review 1.  Three subsets of sequence complexity and their relevance to biopolymeric information.

Authors:  David L Abel; Jack T Trevors
Journal:  Theor Biol Med Model       Date:  2005-08-11       Impact factor: 2.432

2.  An Optimal Seed Based Compression Algorithm for DNA Sequences.

Authors:  Pamela Vinitha Eric; Gopakumar Gopalakrishnan; Muralikrishnan Karunakaran
Journal:  Adv Bioinformatics       Date:  2016-07-31

3.  Comparative analysis of long DNA sequences by per element information content using different contexts.

Authors:  Trevor I Dix; David R Powell; Lloyd Allison; Julie Bernal; Samira Jaeger; Linda Stern
Journal:  BMC Bioinformatics       Date:  2007-05-03       Impact factor: 3.169

4.  XS: a FASTQ read simulator.

Authors:  Diogo Pratas; Armando J Pinho; João M O S Rodrigues
Journal:  BMC Res Notes       Date:  2014-01-16

5.  DNA sequences at a glance.

Authors:  Armando J Pinho; Sara P Garcia; Diogo Pratas; Paulo J S G Ferreira
Journal:  PLoS One       Date:  2013-11-21       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.