| Literature DB >> 17931659 |
Xiao-Qin Qi1, Jie Wen, Zhao-Hui Qi.
Abstract
We introduce a 3D graphical representation of DNA sequences based on the pairs of dual nucleotides (DNs). Based on this representation, we consider some mathematical invariants and construct two 16-component vectors associated with these invariants. The vectors are used to characterize and compare the complete coding sequence part of beta globin gene of nine different species. The examination of similarities/dissimilarities illustrates the utility of the approach.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17931659 PMCID: PMC7094097 DOI: 10.1016/j.jtbi.2007.08.025
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691
Cartesian 3D coordinates for the sequence ATGGTGCACC of the coding sequence of the first exon of human -globin gene
| Base | DNs | |||
|---|---|---|---|---|
| 1 | AT | 2 | 0 | 1 |
| 2 | TG | 1 | 3 | 2 |
| 3 | GG | 3 | 2 | 3 |
| 4 | GT | 1 | 2 | 4 |
| 5 | TG | 1 | 3 | 5 |
| 6 | GC | 2 | 3 | 6 |
| 7 | CA | 1 | 1 | 7 |
| 8 | AC | 1 | 0 | 8 |
| 9 | CC | 3 | 1 | 9 |
Fig. 1Characteristic curve of the sequence ATGGTGCACC, the dots denote the DNs making up the sequence.
The complete coding sequences of -globin genes of nine species
| Species | Complete coding sequence |
|---|---|
| Human | ACCESSION U01317; REGION: |
| Exon1 | |
| ATGGTGCACCTGACTCCTGAGGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCTGGGCAGGCTGCTGGTGGTCTACCCTTGGACCCAGAGGTTCTTTGAGTCCTTTGGGGATCTGTCCACTCCTGATGCTGTTATGGGCAACCCTAAGGTGAAGGCTCATGGCAAGAAAGTGCTCGGTGCCTTTAGTGATGGCCTGGCTCACCTGGACAACCTCAAGGGCACCTTTGCCACACTGAGTGAGCTGCACTGTGACAAGCTGCACGTGGATCCTGAGAACTTCAGGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATCACTTTGGCAAAGAATTCACCCCACCAGTGCAGGCTGCCTATCAGAAAGTGGTGGCTGGTGTGGCTAATGCCCTGGCCCACAAGTATCACTAA | |
| Goat | ACCESSION M15387; REGION: |
| Exon1 | |
| ATGCTGACTGCTGAGGAGAAGGCTGCCGTCACCGGCTTCTGGGGCAAGGTGAAAGTGGATGAAGTTGGTGCTGAGGCCCTGGGCAGGCTGCTGGTTGTCTACCCCTGGACTCAGAGGTTCTTTGAGCACTTTGGGGACTTGTCCTCTGCTGATGCTGTTATGAACAATGCTAAGGTGAAGGCCCATGGCAAGAAGGTGCTAGACTCCTTTAGTAACGGCATGAAGCATCTTGACGACCTCAAGGGCACCTTTGCTCAGCTGAGTGAGCTGCACTGTGATAAGCTGCACGTGGATCCTGAGAACTTCAAGCTCCTGGGCAACGTGCTGGTGGTTGTGCTGGCTCGCCACCATGGCAGTGAATTCACCCCGCTGCTGCAGGCTGAGTTTCAGAAGGTGGTGGCTGGTGTTGCCAATGCCCTGGCCCACAGATATCACTAA | |
| North American opossum | ACCESSION J03643; REGION: |
| Exon1 | |
| ATGGTGCACTTGACTTCTGAGGAGAAGAACTGCATCACTACCATCTGGTCTAAGGTGCAGGTTGACCAGACTGGTGGTGAGGCCCTTGGCAGGATGCTCGTTGTCTACCCCTGGACCACCAGGTTTTTTGGGAGCTTTGGTGATCTGTCCTCTCCTGGCGCTGTCATGTCAAATTCTAAGGTTCAAGCCCATGGTGCTAAGGTGTTGACCTCCTTCGGTGAAGCAGTCAAGCATTTGGACAACCTGAAGGGTACTTATGCCAAGTTGAGTGAGCTCCACTGTGACAAGCTGCATGTGGACCCTGAGAACTTCAAGATGCTGGGGAATATCATTGTGATCTGCCTGGCTGAGCACTTTGGCAAGGATTTTACTCCTGAATGTCAGGTTGCTTGGCAGAAGCTCGTGGCTGGAGTTGCCCATGCCCTGGCCCACAAGTACCACTAA | |
| Gallus | ACCESSION V00409; REGION: |
| For simplification, only Exon1 | |
| ATGGTGCACTGGACTGCTGAGGAGAAGCAGCTCATCACCGGCCTCTGGGGCAAGGTCAATGTGGCCGAATGTGGGGCCGAAGCCCTGGCCAG | |
| Black lemur | ACCESSION M15734; REGION: |
| For simplification, only Exon1 | |
| ATGACTTTGCTGAGTGCTGAGGAGAATGCTCATGTCACCTCTCTGTGGGGCAAGGTGGATGTAGAGAAAGTTGGTGGCGAGGCCTTGGGCAG | |
| House mouse | ACCESSION V00722; REGION: |
| For simplification, only Exon1 | |
| ATGGTGCACCTGACTGATGCTGAGAAGTCTGCTGTCTCTTGCCTGTGGGCAAAGGTGAACCCCGATGAAGTTGGTGGTGAGGCCCTGGGCAGG | |
| Rabbit | ACCESSION V00882; REGION: |
| For simplification, only Exon1 | |
| ATGGTGCATCTGTCCAGTGAGGAGAAGTCTGCGGTCACTGCCCTGTGGGGCAAGGTGAATGTGGAAGAAGTTGGTGGTGAGGCCCTGGGCAG | |
| Norway rat | ACCESSION X06701; REGION: |
| For simplification, only Exon1 (1 …92) is listed; | |
| ATGGTGCACCTAACTGATGCTGAGAAGGCTACTGTTAGTGGCCTGTGGGGAAAGGTGAACCCTGATAATGTTGGCGCTGAGGCCCTGGGCAG | |
| Cattle | ACCESSION X00376; REGION: |
| For simplification, only Exon1 (1 … 86) is listed; | |
| ATGCTGACTGCTGAGGAGAAGGCTGCCGTCACCGCCTTTTGGGGCAAGGTGAAAGTGGATGAAGTTGGTGGTGAGGCCCTGGGCAG |
The -Matrix of the nine different species presented in Table 2
| Human | Goat | ||||||
|---|---|---|---|---|---|---|---|
| 500.6136 | 396.8885 | 586.6441 | 591.9827 | 456.9633 | 418.3000 | 524.6166 | 563.2798 |
| 0.0609 | 0.0564 | 0.0993 | 0.0451 | 0.0709 | 0.0686 | 0.1030 | 0.0412 |
| 379.2788 | 613.9212 | 416.0631 | 381.3174 | 380.0210 | 614.6291 | 456.6932 | 413.2502 |
| 0.0542 | 0.0677 | 0.0722 | 0.1354 | 0.0458 | 0.0618 | 0.0595 | 0.1350 |
| 432.6606 | 614.6310 | 446.4609 | 423.7280 | 455.4911 | 499.6837 | 521.9378 | 425.9806 |
| 0.0293 | 0.0203 | 0.0113 | 0.0790 | 0.0343 | 0.0206 | 0.0183 | 0.0961 |
| 597.4704 | 487.8472 | 486.8177 | 487.0639 | 569.0279 | 539.2355 | 491.6873 | 515.4189 |
| 0.0519 | 0.0790 | 0.0993 | 0.0384 | 0.0503 | 0.0572 | 0.0892 | 0.0481 |
| North American opossum | Gallus | ||||||
| 523.8582 | 533.3919 | 556.4765 | 444.8475 | 513.3896 | 474.0916 | 615.7531 | 521.0959 |
| 0.0677 | 0.0677 | 0.0926 | 0.0542 | 0.0519 | 0.0564 | 0.0948 | 0.0655 |
| 346.9337 | 518.0918 | 402.6976 | 459.9530 | 417.5562 | 523.9502 | 421.4063 | 431.0943 |
| 0.0519 | 0.0700 | 0.0677 | 0.1242 | 0.0587 | 0.0835 | 0.0429 | 0.1016 |
| 426.4998 | 515.1239 | 495.2860 | 549.4444 | 376.1917 | 699.1648 | 409.4878 | 425.4332 |
| 0.0429 | 0.0248 | 0.0090 | 0.0655 | 0.0361 | 0.0068 | 0.0316 | 0.0858 |
| 643.2122 | 540.4797 | 525.8275 | 525.6048 | 632.2262 | 544.2404 | 464.1535 | 659.8438 |
| 0.0497 | 0.0655 | 0.0790 | 0.0677 | 0.0542 | 0.1151 | 0.0835 | 0.0316 |
| Black lemur | House mouse | ||||||
| 451.0993 | 413.3322 | 561.5003 | 453.7127 | 520.6454 | 492.1484 | 600.5855 | 553.3528 |
| 0.0655 | 0.0609 | 0.1038 | 0.0587 | 0.0609 | 0.0609 | 0.1038 | 0.0429 |
| 442.8868 | 600.2678 | 447.9226 | 413.2137 | 417.1156 | 610.4255 | 391.1613 | 369.3045 |
| 0.0497 | 0.0677 | 0.0722 | 0.1309 | 0.0542 | 0.0655 | 0.0519 | 0.1242 |
| 321.9795 | 414.8371 | 482.6028 | 431.5504 | 409.7837 | 468.5294 | 419.5267 | 436.9504 |
| 0.0271 | 0.0135 | 0.0158 | 0.0790 | 0.0361 | 0.0248 | 0.0113 | 0.0835 |
| 650.6342 | 530.5800 | 526.9254 | 534.0476 | 482.2962 | 548.1338 | 589.0919 | 536.1029 |
| 0.0474 | 0.0564 | 0.1016 | 0.0497 | 0.0609 | 0.0903 | 0.0903 | 0.0384 |
| Rabbit | Norway rat | ||||||
| 443.4465 | 442.0772 | 626.0979 | 512.4670 | 545.4528 | 513.0164 | 551.5921 | 604.1700 |
| 0.0677 | 0.0587 | 0.0971 | 0.0609 | 0.0564 | 0.0609 | 0.0971 | 0.0384 |
| 449.8174 | 577.2756 | 371.9956 | 388.3690 | 393.9856 | 622.8308 | 434.4702 | 409.3697 |
| 0.0429 | 0.0722 | 0.0767 | 0.1332 | 0.0564 | 0.0632 | 0.0564 | 0.1264 |
| 376.8481 | 590.5806 | 470.5930 | 415.0455 | 385.2784 | 364.5492 | 376.9265 | 405.8272 |
| 0.0361 | 0.0158 | 0.0090 | 0.0745 | 0.0451 | 0.0339 | 0.0045 | 0.0700 |
| 572.4836 | 553.8482 | 543.3979 | 589.6406 | 535.1879 | 529.7384 | 513.1191 | 633.0783 |
| 0.0632 | 0.0564 | 0.0971 | 0.0384 | 0.0632 | 0.0858 | 0.0948 | 0.0474 |
| Cattle | |||||||
| 483.0319 | 428.6478 | 545.2948 | 609.2222 | ||||
| 0.0686 | 0.0686 | 0.0915 | 0.0389 | ||||
| 369.6844 | 610.7933 | 423.1906 | 411.7154 | ||||
| 0.0366 | 0.0572 | 0.0618 | 0.1350 | ||||
| 479.0289 | 529.5687 | 471.0091 | 421.5105 | ||||
| 0.0435 | 0.0229 | 0.0160 | 0.0892 | ||||
| 593.1544 | 478.8359 | 526.3381 | 517.2071 | ||||
| 0.0549 | 0.0618 | 0.0938 | 0.0595 | ||||
Fig. 2The comparison of the mean spaces and the distributions of DNs among the nine species in Table 2; i of x-coordinate denotes the ith species in Table 2, ; the value of y-coordinate denotes the distributions of DNs; the value of z-coordinate denotes the mean spaces of DNs.
The similarity/dissimilarity matrix for the complete coding sequences of Table 2 based on the Euclidean distances between the end points of the 16-component vectors of the space-sums of 16 DNs
| Species | Human | Goat | North American opossum | Gallus | Black lemur | House mouse | Rabbit | Norway rat | Cattle |
|---|---|---|---|---|---|---|---|---|---|
| Human | 0 | 8160 | 11 877 | 14 923 | 7077 | 8683 | 5874 | 8801 | 10 650 |
| Goat | 0 | 7790 | 17 978 | 8224 | 12 258 | 9379 | 10 704 | 5732 | |
| North American opossum | 0 | 18 830 | 11 643 | 13 728 | 11 886 | 8483 | 6705 | ||
| Gallus | 0 | 18 706 | 11 792 | 17 419 | 14 367 | 20 194 | |||
| Black lemur | 0 | 12 323 | 5636 | 10 983 | 9938 | ||||
| House mouse | 0 | 10 992 | 8422 | 13 848 | |||||
| Rabbit | 0 | 9813 | 10 067 | ||||||
| Norway rat | 0 | 10 173 | |||||||
| Cattle | 0 |
The similarity/dissimilarity matrix for the complete coding sequences of Table 2 based on the Euclidean distances between the end points of the 16-component vectors of the distributions of the 16 DNs
| Species | Human | Goat | North American opossum | Gallus | Black lemur | House mouse | Rabbit | Norway rat | Cattle |
|---|---|---|---|---|---|---|---|---|---|
| Human | 0 | 0.0398 | 0.0480 | 0.0713 | 0.0320 | 0.0311 | 0.0348 | 0.0358 | 0.0442 |
| Goat | 0 | 0.0474 | 0.0857 | 0.0345 | 0.0442 | 0.0430 | 0.0519 | 0.0243 | |
| North American opossum | 0 | 0.0834 | 0.0424 | 0.0522 | 0.0453 | 0.0450 | 0.0416 | ||
| Gallus | 0 | 0.0834 | 0.0559 | 0.0853 | 0.0715 | 0.0900 | |||
| Black lemur | 0 | 0.0509 | 0.0265 | 0.0547 | 0.0415 | ||||
| House mouse | 0 | 0.0515 | 0.0253 | 0.0479 | |||||
| Rabbit | 0 | 0.0525 | 0.0450 | ||||||
| Norway rat | 0 | 0.0468 | |||||||
| Cattle | 0 |
Fig. 3The degree of similarity of the complete coding sequences of several species with the complete coding sequence of human (a: from [this work, Table 4]; b: from [this work, Table 5]); i of x-coordinate denotes the species of Table 4 (x-coord 1: Goat, x-coord 2: North American opossum, x-coord 3: Gallus, x-coord 4: Black lemur, x-coord 5: House mouse, x-coord 6: Rabbit, x-coord 7: Norway rat, x-coord 8: Cattle).
The similarity/dissimilarity matrix for the complete coding sequences of Table 2 based on the Euclidean distances between the end points of the 16-component vectors of the space-sums of 16 DNs (by using the s-vector derived from the new 3D-DN curve based on the new random matrix )
| Species | Human | Goat | North American opossum | Gallus | Black lemur | House mouse | Rabbit | Norway rat | Cattle |
|---|---|---|---|---|---|---|---|---|---|
| Human | 0 | 10 379 | 12 545 | 19 172 | 9210 | 10 814 | 7897 | 9602 | 12 403 |
| Goat | 0 | 8054 | 20 541 | 11 125 | 13 402 | 10 123 | 12 887 | 6669 | |
| North American opossum | 0 | 19 740 | 13 302 | 14 171 | 12 391 | 9086 | 8976 | ||
| Gallus | 0 | 21 693 | 11 952 | 20 071 | 16 045 | 23 525 | |||
| Black lemur | 0 | 15 087 | 6044 | 13 759 | 13 335 | ||||
| House mouse | 0 | 13 184 | 9340 | 16 032 | |||||
| Rabbit | 0 | 10 148 | 11 375 | ||||||
| Norway rat | 0 | 14 140 | |||||||
| Cattle | 0 |
Fig. 4The degree of similarity of the complete coding sequences of several species with the complete coding sequence of human (a: from [this work, Table 4]; c: from [this work, Table 6]); i of x-coordinate denotes the species of Table 4 (x-coord 1: Goat, x-coord 2: North American opossum, x-coord 3: Gallus, x-coord 4: Black lemur, x-coord 5: House mouse, x-coord 6: Rabbit, x-coord 7: Norway rat, x-coord 8: Cattle).