A T M Golam Bari, Mst Rokeya Reaz, A K M Tauhidul Islam, Ho-Jin Choi, Byeong-Soo Jeong.
Abstract
Effective representation of DNA sequences is an important task in the study of genome sequences. In this paper, we propose a graphical representation of DNA sequences based on the nucleotide ring structure. In the proposed representation, a DNA sequence is mapped onto 16 dinucleotides placed on the surface of a hexagon, which preserves both the chemical properties and the positional information of the nucleotides. Our approach supports efficient and accurate similarity comparison between DNA sequences. Furthermore, the representation is unique and free of degeneracy. In the experimental study, we use phylogenetic analysis to examine the evolutionary relationships among different species. An extensive performance study shows that the proposed method captures the degree of similarity better than existing methods.
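As a minimal sketch of the first step, the snippet below splits a sequence into overlapping dinucleotides. The overlapping (stride 1) reading is an assumption inferred from the paper's example sequence ATACGATGCAG, which is tabulated as ten points P1 to P10; the function name `dinucleotides` is hypothetical.

```python
from itertools import product

# All 16 possible dinucleotides over the alphabet A, C, G, T.
ALL_DINUCLEOTIDES = ["".join(pair) for pair in product("ACGT", repeat=2)]

def dinucleotides(seq):
    """Return the overlapping dinucleotides of a DNA sequence, in order.

    Assumption: consecutive pairs overlap by one base, so a sequence of
    length n yields n - 1 dinucleotides.
    """
    seq = seq.upper()
    return [seq[i:i + 2] for i in range(len(seq) - 1)]

pairs = dinucleotides("ATACGATGCAG")
print(len(ALL_DINUCLEOTIDES), pairs)
```

For the 11-base example this yields ten dinucleotides, beginning with AT, TA, AC.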
Keywords: DNA curve; hexagon; ring structure; β-globin gene
Year: 2013 PMID: 23908584 PMCID: PMC3712558 DOI: 10.4137/EBO.S12160
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1. Heterogenic cycle of four bases.
Figure 2. Six combinations of heterogenic cycle in 2D space.
Figure 3. Cartesian coordinates of 16 dinucleotides in a hexagon.
3D coordinates of ATACGATGCAG based on the proposed method.
| Points | Dinucleotide | Cycle 1 x | Cycle 1 y | Cycle 1 z | Cycle 2 x | Cycle 2 y | Cycle 2 z | Cycle 3 x | Cycle 3 y | Cycle 3 z | Cycle 4 x | Cycle 4 y | Cycle 4 z | Cycle 5 x | Cycle 5 y | Cycle 5 z | Cycle 6 x | Cycle 6 y | Cycle 6 z |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| P1 | AT | 0.5 | 1.25 | 1 | 1 | 0 | 1 | 0.5 | −1.25 | 1 | −0.5 | −1.25 | 1 | −1 | 0 | 1 | −0.5 | 1.25 | 1 |
| P2 | TA | −0.5 | 2.25 | 2 | 1 | 1.5 | 2 | 1.5 | −0.25 | 2 | 0.5 | −2.25 | 2 | −1 | −1.5 | 2 | −1.5 | 0.25 | 2 |
| P3 | AC | −1.0 | 3.5 | 3 | 1.5 | 2.75 | 3 | 2.5 | −0.25 | 3 | 1.0 | −3.5 | 3 | −1.5 | −2.75 | 3 | −2.5 | 0.25 | 3 |
| P4 | CG | 0 | 2.5 | 4 | 1.5 | 1.25 | 4 | 1.5 | −1.25 | 4 | 0 | −2.5 | 4 | −1.5 | −1.25 | 4 | −1.5 | 1.25 | 4 |
| P5 | GA | 1.0 | 2.5 | 5 | 2 | 0 | 5 | 1.0 | −2.5 | 5 | −1.0 | −2.5 | 5 | −2 | 0 | 5 | −1.0 | 2.5 | 5 |
| P6 | AT | 1.5 | 3.75 | 6 | 3 | 0 | 6 | 1.5 | −3.75 | 6 | −1.5 | −3.75 | 6 | −3 | 0 | 6 | −1.5 | 3.75 | 6 |
| P7 | TG | 0.5 | 2.75 | 7 | 2 | 1 | 7 | 1.5 | −2.25 | 7 | −0.5 | −2.75 | 7 | −2 | −1 | 7 | −1.5 | 2.25 | 7 |
| P8 | GC | 0 | 1.5 | 8 | 1 | 1 | 8 | 1 | −1.0 | 8 | 0 | −1.5 | 8 | −1 | −1 | 8 | −1.0 | 1.0 | 8 |
| P9 | CA | 1.0 | 2.5 | 9 | 2 | 0 | 9 | 1 | −2.5 | 9 | −1 | −2.5 | 9 | −2 | 0 | 9 | −1.0 | 2.5 | 9 |
| P10 | AG | 1.0 | 4.0 | 10 | 3 | 1 | 10 | 2 | −3.5 | 10 | −1 | −4 | 10 | −3 | −1 | 10 | −2.0 | 3.5 | 10 |
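The Cycle 1 columns of the table above appear consistent with a running sum: each point is the previous point plus a fixed (x, y) step for the current dinucleotide, with z equal to the point index. This reading is inferred from the tabulated numbers and is not stated in this excerpt. The sketch below recovers the per-dinucleotide steps by differencing; the helper name `step_vectors` is hypothetical.

```python
# Cycle 1 (x, y) coordinates of P1..P10, copied from the table.
labels = ["AT", "TA", "AC", "CG", "GA", "AT", "TG", "GC", "CA", "AG"]
cycle1 = [(0.5, 1.25), (-0.5, 2.25), (-1.0, 3.5), (0.0, 2.5), (1.0, 2.5),
          (1.5, 3.75), (0.5, 2.75), (0.0, 1.5), (1.0, 2.5), (1.0, 4.0)]

def step_vectors(points):
    """Difference between consecutive points, starting from the origin."""
    prev = (0.0, 0.0)
    steps = []
    for x, y in points:
        steps.append((x - prev[0], y - prev[1]))
        prev = (x, y)
    return steps

steps = step_vectors(cycle1)
for label, step in zip(labels, steps):
    print(label, step)
```

Under this reading, the two occurrences of AT (P1 and P6) produce the same step, which is what a fixed dinucleotide-to-vector mapping would require.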
Figure 4. The graphical representation of the proposed model for the example sequence ATACGATGCAG.
Figure 5. DNA curves of 11 different species.
The first exon of β-globin gene of 11 different species.
| Species | ID/Accession | Database | Length |
|---|---|---|---|
| Human | U01317 | NCBI | 92 |
| Chimpanzee | X02345 | NCBI | 105 |
| Gorilla | X61109 | NCBI | 93 |
| Lemur | M15734 | NCBI | 92 |
| Rat | X06701 | NCBI | 92 |
| Mouse | V00722 | NCBI | 93 |
| Rabbit | V00882 | NCBI | 92 |
| Goat | M15387 | NCBI | 86 |
| Bovine | X00376 | NCBI | 86 |
| Opossum | J03643 | NCBI | 92 |
| Gallus | V00409 | NCBI | 92 |
Geometrical center of 11 different species.
| Species | Cycle 1 ux | Cycle 1 uy | Cycle 1 uz | Cycle 2 ux | Cycle 2 uy | Cycle 2 uz | Cycle 3 ux | Cycle 3 uy | Cycle 3 uz | Cycle 4 ux | Cycle 4 uy | Cycle 4 uz | Cycle 5 ux | Cycle 5 uy | Cycle 5 uz | Cycle 6 ux | Cycle 6 uy | Cycle 6 uz |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Human | −4.489 | −15.0275 | 46 | −9.81 | −3.75 | 46 | −6.74725 | 5.947802 | 46 | −0.45055 | 9.55 | 46 | 6.967033 | −1.71703 | 46 | 3.901099 | −11.4203 | 46 |
| Chimpanzee | −4.7981 | −17.5024 | 52.5 | −10.88 | −4.89 | 52.5 | −7.59135 | 5.947115 | 52.5 | −0.70192 | 10.33 | 52.5 | 7.875 | −2.28606 | 52.5 | 4.581731 | −13.1202 | 52.5 |
| Gorilla | −4.5163 | −15.2255 | 46.5 | −9.9 | −3.85 | 46.5 | −6.82065 | 5.942935 | 46.5 | −0.47283 | 9.62 | 46.5 | 7.032609 | −1.76087 | 46.5 | 3.951087 | −11.5516 | 46.5 |
| Lemur | −2.3132 | −12.6181 | 46 | −6.71 | −2.23 | 46 | −3.65385 | 6.035714 | 46 | 2.208791 | 8.78 | 46 | 8.208791 | −1.61538 | 46 | 5.148352 | −9.88187 | 46 |
| Rat | −6.7802 | −11.8489 | 46 | −9.78 | 1.53 | 46 | −3.74176 | 10.74725 | 46 | 3.615385 | 9.04 | 46 | 8.296703 | −4.3489 | 46 | 2.258242 | −13.5604 | 46 |
| Mouse | −5.6882 | −18.2285 | 47 | −13 | −1.65 | 47 | −7.8172 | 13.25 | 47 | 2.844086 | 16.7 | 47 | 12.13441 | 0.172043 | 47 | 6.903226 | −14.7769 | 47 |
| Rabbit | −2.3736 | −13.6401 | 46 | −7.31 | −4.88 | 46 | −5.9011 | 2.239011 | 46 | −2.49451 | 7.46 | 46 | 5.39011 | −1.29121 | 46 | 3.978022 | −8.41484 | 46 |
| Goat | −2.7824 | −16.1882 | 43 | −9.106 | −4.98 | 43 | −6.68235 | 5.679412 | 43 | 1.088235 | 10.82 | 43 | 14.5 | −2.75 | 43 | 5.964706 | −11.0441 | 43 |
| Bovine | −1.7824 | −14.6706 | 43 | −7.747 | −4.76 | 43 | −6.06471 | 4.447059 | 43 | 0.517647 | 10.15 | 43 | 12 | −2.25 | 43 | 5.864706 | −8.96471 | 43 |
| Opossum | −1.8571 | −3.8159 | 46 | −1.92 | 0.266 | 46 | 0.489011 | 2.379121 | 46 | 1.686813 | 0.431 | 46 | 3.032967 | −3.6511 | 46 | 0.620879 | −5.76374 | 46 |
| Gallus | −2.4615 | −11.5604 | 46 | −5.9 | −5.24 | 46 | −4.34615 | 0.898352 | 46 | −0.33516 | 3.341 | 46 | 4.071429 | −2.97527 | 46 | 2.521978 | −9.11813 | 46 |
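A minimal sketch of a geometric center, assuming it is the coordinate-wise mean of the curve's points. That assumption is suggested by the uz column: a sequence of length n gives n - 1 points with z = 1, ..., n - 1, whose mean is n/2, which agrees with the tabulated uz for every species except Mouse (length 93 gives 46.5 under this reading, while the table lists 47). The function names are hypothetical.

```python
def geometric_center(points):
    """Coordinate-wise mean of a list of (x, y, z) points."""
    n = len(points)
    return tuple(sum(p[k] for p in points) / n for k in range(3))

def mean_z(sequence_length):
    """Mean of z = 1..n-1 for a sequence of length n (one point per dinucleotide)."""
    n_points = sequence_length - 1
    return sum(range(1, n_points + 1)) / n_points

# Sequence lengths from the species table: Human 92, Chimpanzee 105, Goat 86.
for name, length in [("Human", 92), ("Chimpanzee", 105), ("Goat", 86)]:
    print(name, mean_z(length))
```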
Mathematical descriptor of 11 different species.
| Species | Cycle 1 | Cycle 2 | Cycle 3 | Cycle 4 | Cycle 5 | Cycle 6 |
|---|---|---|---|---|---|---|
| Human | 48.60017 | 47.18367 | 46.87112 | 46.98303 | 46.55629 | 47.55673 |
| Chimpanzee | 55.54823 | 53.83806 | 53.37834 | 53.51123 | 53.13654 | 54.30821 |
| Gorilla | 49.13718 | 47.69782 | 47.37182 | 47.48703 | 47.06175 | 48.07599 |
| Lemur | 47.75529 | 46.54027 | 46.53795 | 46.88248 | 46.75461 | 47.3303 |
| Rat | 47.98299 | 47.05305 | 47.38675 | 47.01907 | 46.9441 | 48.01026 |
| Mouse | 50.73099 | 48.79265 | 49.45373 | 49.95977 | 48.54146 | 49.74948 |
| Rabbit | 48.03838 | 46.83215 | 46.43098 | 46.6677 | 46.33272 | 46.93223 |
| Goat | 46.03042 | 44.23482 | 43.88519 | 44.35377 | 43.81217 | 44.79453 |
| Bovine | 45.46871 | 43.95081 | 43.65269 | 44.18473 | 43.65795 | 44.31434 |
| Opossum | 46.19535 | 46.04082 | 46.06408 | 46.03293 | 46.24424 | 46.36385 |
| Gallus | 47.49423 | 46.67191 | 46.21359 | 46.12239 | 46.27557 | 46.96276 |
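The tabulated descriptors are numerically consistent with the Euclidean norm of the corresponding geometric center, sqrt(ux² + uy² + uz²). This relationship is inferred from the two tables and is not stated in this excerpt. The sketch below checks it for the Human row; the function name `descriptor` is hypothetical.

```python
import math

def descriptor(center):
    """Euclidean norm of a geometric center (ux, uy, uz).

    Assumption: this is how the tabulated descriptor is derived.
    """
    return math.sqrt(sum(c * c for c in center))

# Human geometric centers for Cycle 1 and Cycle 2, copied from the table.
human_cycle1 = (-4.489, -15.0275, 46.0)
human_cycle2 = (-9.81, -3.75, 46.0)

d1 = descriptor(human_cycle1)
d2 = descriptor(human_cycle2)
print(round(d1, 4), round(d2, 4))
```

Both values agree with the tabulated 48.60017 and 47.18367 to about three decimal places, given the rounding of the published centers.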
Euclidean distance among 11 different species.
| Species | Chimpanzee | Gorilla | Lemur | Rat | Mouse | Rabbit | Goat | Bovine | Opossum | Gallus |
|---|---|---|---|---|---|---|---|---|---|---|
| Human | 0.0020 | 0.0002 | 0.0124 | 0.0109 | 0.0336 | 0.0116 | 0.0236 | 0.0139 | 0.0343 | 0.0189 |
| Chimpanzee | | 0.0018 | 0.0138 | 0.0115 | 0.0330 | 0.0131 | 0.0226 | 0.0137 | 0.0358 | 0.0201 |
| Gorilla | | | 0.0126 | 0.0109 | 0.0336 | 0.0118 | 0.0235 | 0.0139 | 0.0344 | 0.0190 |
| Lemur | | | | 0.0134 | 0.0420 | 0.0080 | 0.0280 | 0.0156 | 0.0236 | 0.0114 |
| Rat | | | | | 0.0155 | 0.0174 | 0.0233 | 0.0318 | 0.0342 | 0.0218 |
| Mouse | | | | | | 0.0443 | 0.0280 | 0.0330 | 0.0644 | 0.0516 |
| Rabbit | | | | | | | 0.0311 | 0.0186 | 0.0238 | 0.0088 |
| Goat | | | | | | | | 0.0131 | 0.0494 | 0.0368 |
| Bovine | | | | | | | | | 0.0370 | 0.0245 |
| Opossum | | | | | | | | | | 0.0169 |
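To read the distance table as a similarity ranking, the sketch below sorts the Human row in ascending order, so that smaller distances mean greater similarity to human. The values are copied from the table; how the distances themselves were normalized is not given in this excerpt, so the code only ranks them.

```python
# Distances from Human to the other 10 species, copied from the table.
human_row = {
    "Chimpanzee": 0.0020, "Gorilla": 0.0002, "Lemur": 0.0124,
    "Rat": 0.0109, "Mouse": 0.0336, "Rabbit": 0.0116,
    "Goat": 0.0236, "Bovine": 0.0139, "Opossum": 0.0343,
    "Gallus": 0.0189,
}

# Sort by distance: closest species first.
ranked = sorted(human_row.items(), key=lambda item: item[1])
for species, dist in ranked:
    print(f"{species:<11}{dist:.4f}")
```

On these numbers, Gorilla and Chimpanzee are the closest to Human, and Mouse and Opossum are the farthest.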
Figure 6. Phylogenic analysis of 11 different species.
Figure 7. The degree of similarity/dissimilarity of the other 10 species with human.