| Literature DB >> 24267034 |
Helena Skutkova, Martin Vitek, Petr Babula, Rene Kizek, Ivo Provaznik.
Abstract
BACKGROUND: Classification methods of DNA most commonly use comparison of the differences in DNA symbolic records, which requires the global multiple sequence alignment. This solution is often inappropriate, causing a number of imprecisions and requires additional user intervention for exact alignment of the similar segments. The similar segments in DNA represented as a signal are characterized by a similar shape of the curve. The DNA alignment in genomic signals may adjust whole sections not only individual symbols. The dynamic time warping (DTW) is suitable for this purpose and can replace the multiple alignment of symbolic sequences in applications, such as phylogenetic analysis.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24267034 PMCID: PMC3750471 DOI: 10.1186/1471-2105-14-S10-S1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The specifications of ten DNA sequences from different organisms coding ACTA1
| Organism | Chromosome | Accession | Region (sequence position) | Sequence length (bp) |
|---|---|---|---|---|
| 1 | NC_000001.10 | 229566993.. ..229569844 | 2852 | |
| Pongo abelii (Sumatran orangutan, | 1 | NC_012591.1 | 20230379.. ..20233215 | 2837 |
| 1 | NC_007858.1 | 227524284.. ..227527141 | 2858 | |
| 19 | NC_013914.1 | 15737624.. ..15740450 | 2827 | |
| 8 | NC_000074.5 | 126415668.. ..126418637 | 2970 | |
| 19 | NC_005118.2 | 54081497.. ..54084509 | 3013 | |
| 14 | NC_010456.4 | 65236451.. ..65239197 | 2747 | |
| 28 | NC_007329.5 | 427530.. ..430286 | 2757 | |
| 1 | NC_009144.2 | 68408850.. ..68411788 | 2939 | |
| 3 | NC_006090.3 | 39337938.. ..39340802 | 2865 |
Figure 1Genomic signals preprocessing. a) The record of cumulated phase of the DNA sequences of tree different organisms. b) The principle of detrendization of genomic signal of human ACTA1. c) The resulting preprocessed genomic signals ready for DTW.
Figure 2Genomic signals after DTW. Upper - the alignment of human and rhesus macaque genomic signals; lower - the alignment of human and chicken genomic signals.
Figure 3The results of genomic signals classification. Similarity analysis of 10 ACTA1 genes presented as a dendrogram constructed from distance values calculated by: a) Euclidean distance between aligned genomic signals after DTW; -b) the evolutionary distance between sequences of symbols aligned by global sequence multiple alignment.
Figure 4The influence of downsampling factor of genomic signals. a) The dependence of change of pair distance on downsampling; b) The dependence of DTW processing time on downsampling.