| Literature DB >> 28193284 |
Abstract
Direct tests of the random or non-random distribution of nucleotides on genomes have been devised to test the hypothesis of neutral, nearly-neutral or selective evolution. These tests are based on the direct base distribution and are independent of the functional (coding or non-coding) or structural (repeated or unique sequences) properties of the DNA. The first approach described the longitudinal distribution of bases in tandem repeats under the Bose-Einstein statistics. A huge deviation from randomness was found. A second approach was the study of the base distribution within dinucleotides whose bases were separated by 0, 1, 2… K nucleotides. Again an enormous difference from the random distribution was found with significances out of tables and programs. These test values were periodical and included the 16 dinucleotides. For example a high "positive" (more observed than expected dinucleotides) value, found in dinucleotides whose bases were separated by (3K + 2) sites, was preceded by two smaller "negative" (less observed than expected dinucleotides) values, whose bases were separated by (3K) or (3K + 1) sites. We examined mtDNAs, prokaryote genomes and some eukaryote chromosomes and found that the significant non-random interactions and periodicities were present up to 1000 or more sites of base separation and in human chromosome 21 until separations of more than 10 millions sites. Each nucleotide has its own significant value of its distance to neutrality; this yields 16 hierarchical significances. A three dimensional table with the number of sites of separation between the bases and the 16 significances (the third dimension is the dinucleotide, individual or taxon involved) gives directly an evolutionary state of the analyzed genome that can be used to obtain phylogenies. An example is provided.Entities:
Keywords: Evolutionary theories; Selective nucleotide interactions; Selective periodicities
Mesh:
Substances:
Year: 2017 PMID: 28193284 PMCID: PMC5307875 DOI: 10.1186/s40659-017-0112-0
Source DB: PubMed Journal: Biol Res ISSN: 0716-9760 Impact factor: 5.612
Fig. 1HIV-1 cDNA sequence
Internucleotide interactions measured by a test in dinucleotides
| Sep |
| Human Ch21 | ||||
|---|---|---|---|---|---|---|
| K+1 |
| 1° DN |
|
| 1° DN |
|
| 1 | 45,469.3 | CG(−) | 13,210.2 | 1,885,266.8 | CG(−) | 882,286.7 |
| 2 | 17,658.5 | CG(+) | 7962.4 |
| AA(+) |
|
| 3 |
|
|
| 143,071.9 | GG(+) | 30.632.7 |
| 4 | 5333.5 | CG(−) | 1125.0 |
| AA(+) |
|
| 5 | 4734.2 | GC(−) | 1125.0 | 130,240.9 | CC(+) | 21,195.2 |
| 6 |
|
|
|
| CC(+) |
|
| 7 | 4204.9 | GC(+) | 1147.2 | 137,408.0 | GG(+) | 36,654.3 |
| 8 | 2683.6 | CG(+) | 988.2 |
| GG(+) |
|
| 9 |
|
|
| 168,931.2 | GG(+) | 38,267.5 |
| 10 | 3621.9 | GC(+) | 1247.9 |
| CC(+) |
|
| 11 | 3181.3 | CG(+) | 1100.9 | 97,021.9 | AA(+) | 12,523.8 |
| 12 |
|
|
|
| TT(+) |
|
| 13 | 2913.5 | GC(+) | 1104.4 | 118,939.4 | TT(+) | 21,745.2 |
| 14 | 3830.5 | CG(+) | 1487.8 |
| AA(+) |
|
| 15 |
|
|
| 108,724.8 | CC(+) | 19,839.1 |
| 16 | 3208.5 | GC(+) | 1196.6 |
| AA(+) |
|
| 17 | 3015.7 | CG(+) | 1130.7 | 73,618.5 | CC(+) | 10,267.1 |
| 18 |
|
|
|
|
|
|
| 19 | 3370.1 | GC(+) | 1349.0 | 76,660.2 | GG(+) | 12,402.2 |
| 20 | 2817.1 | CG(+) | 1137.0 |
|
|
|
| 21 |
|
|
| 81,732.7 | AA(+) | 10,331.6 |
| 22 | 3581.5 | GC(+) | 1454.3 |
|
|
|
| 23 | 3186.5 | CG(+) | 1095.2 | 84,086.3 | AA(+) | 13,354.2 |
| 24 |
|
|
|
|
|
|
| 25 | 3006.4 | GC(+) | 938.3 | 111,182.0 | TT(+) | 22,494.8 |
| 26 | 3251.2 | CG(+) | 1170.9 | 89,507.1 | TT(+) | 15,788.0 |
| 27 |
|
|
| 101,184.2 | GG(+) | 21,244.9 |
M. smithii genome and human chromosome 21
Sep number of sites (plus one) between the first and second base of a dinucleotide, 1° DN the most significant (deviation from neutrality) pair among the 16. (+) or (−) more or less observed than expected pairs, respectively. Italics indicates periods
The analysis of 100 dinucleotides of 25 ATGC tandem repeats
| 2° Base | 0 Separation | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Adenine | Thymine | Guanine | Cytosine | Total | ||||||
| 1° Base | Exp | Obs | Exp | Obs | Exp | Obs | Exp | Obs | Exp | Obs |
| Adenine | 6.25 | 0 | 6.25 | 25 | 6.25 | 0 | 6.25 | 0 | 25 | 25 |
| Thymine | 6.25 | 0 | 6.25 | 0 | 6.25 | 25 | 6.25 | 0 | 25 | 25 |
| Guanine | 6.25 | 0 | 6.25 | 0 | 6.25 | 0 | 6.25 | 25 | 25 | 25 |
| Cytosine | 6.25 | 25 | 6.25 | 0 | 6.25 | 0 | 6.25 | 0 | 25 | 25 |
| Total | 25 | 25 | 25 | 25 | 25 | 25 | 25 | 25 | 100 | 100 |
Total values and its contribution of the most significant dinucleotide
| Sep |
| 1° Pair | 2° Pair | 3° Pair | 4° Pair | 5° Pair | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Pair |
| Pair |
| Pair |
| Pair |
| Pair |
| ||
| 0 | 485 | (GG)↑ | 124 | (CC)↑ | 113 | (GT)↓ | 91 | (GC)↑ | 50 | (TT)↑ | 28 |
| 1 | 94 | (CG)↑ | 36 | (CC)↑ | 25 | (CT)↓ | 12 | (AG)↓ | 6 | (TC)↓ | 4 |
| 2 | 405 | (GG)↑ | 116 | (CC)↑ | 106 | (GC)↑ | 33 | (CG)↑ | 25 | (TG)↓ | 23 |
| 3 | 114 | (GC)↑ | 23 | (AA)↑ | 22 | (TT)↑ | 15 | (TA)↓ | 11 | (CC)↑ | 9 |
| 4 | 47 | (CG)↑ | 20 | (AG)↓ | 8 | (AT)↑ | 6 | (CT)↓ | 4 | (GT)↓ | 2 |
| 5 | 381 | (GG)↑ | 139 | (CC)↑ | 51 | (AG)↓ | 32 | (GC)↑ | 32 | (CG)↑ | 30 |
| 6 | 87 | (GC)↑ | 38 | (TA)↑ | 14 | (TC)↓ | 12 | (GA)↓ | 6 | (AA)↓ | 5 |
| 7 | 37 | (CG)↑ | 17 | (CT)↓ | 7 | (TG)↓ | 3 | (GG)↑ | 3 | (TT)↑ | 2 |
| 8 | 375 | (GG)↑ | 149 | (CC)↑ | 45 | (CG)↑ | 36 | (GC)↑ | 29 | (TG)↓ | 24 |
| 9 | 76 | (GC)↑ | 34 | (GA)↓ | 16 | (TC)↓ | 12 | (GG)↑ | 8 | (CG)↓ | 2 |
| 10 | 49 | (CG)↑ | 28 | (AG)↓ | 6 | (CT)↓ | 6 | (AT)↑ | 4 | (TA)↑ | 1 |
| 11 | 367 | (GG)↑ | 144 | (GC)↑ | 45 | (CC)↑ | 35 | (CG)↑ | 26 | (AG)↓ | 23 |
| 12 | 65 | (GC)↑ | 34 | (GA)↓ | 13 | (TC)↓ | 8 | (CG)↓ | 3 | (CA)↑ | 2 |
| 13 | 70 | (CG)↑ | 38 | (AG)↓ | 13 | (AA)↑ | 5 | (GC)↓ | 3 | (CA)↓ | 2 |
| 14 | 310 | (GG)↑ | 78 | (CG)↑ | 48 | (GC)↑ | 44 | (CC)↑ | 32 | (GA)↓ | 22 |
| 15 | 60 | (GC)↑ | 34 | (GA)↓ | 10 | (CG)↓ | 6 | (TC)↓ | 3 | (CT)↑ | 2 |
| 16 | 52 | (CG)↑ | 27 | (AG)↓ | 12 | (AA)↑ | 3 | (CT)↓ | 2 | (CA)↓ | 2 |
| 17 | 322 | (GG)↑ | 91 | (CG)↑ | 45 | (CC)↑ | 40 | (GC)↑ | 22 | (AA)↑ | 22 |
Separations from 0 to17 sites. D. melanogaster mtDNA
Sep, number of separation sites; Co, contribution of this pair to the total value (in integers); ↑ more observed than expected pairs; ↓ less observed than expected pairs
Total value (in integers) of difference from randomness of dinucleotides from eukaryotes
| Human collagen | Human mtDNA |
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| K |
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
| 0 | 634 | CG− | 200 | 272 | CG− | 89 | 714 | TA− | 256 | 265 | TA− | 92 |
| 1 | 129 | AA+ | 33 | 54 | TC− | 10 | 137 | TC+ | 32 | 70 | CA+ | 16 |
| 2 | 355 | TT+ | 111 | 146 | GG+ | 39 | 25 | GG+ | 8 | 21 | GG+ | 5 |
| 3 | 93 | GG− | 23 | 52 | TG− | 17 | 34 | TT− | 6 | 6 | AA+ | 2 |
| 4 | 110 | GG− | 28 | 41 | GT− | 9 | 23 | CT− | 4 | 20 | GT+ | 6 |
| 5 | 325 | TT+ | 84 | 149 | GG+ | 35 | 18 | GA− | 4 | 15 | AC− | 4 |
| 6 | 70 | TG+ | 19 | 47 | TG− | 10 | 17 | CG− | 5 | 29 | GC− | 7 |
| 7 | 203 | GC− | 46 | 48 | GT− | 13 | 42 | AT− | 11 | 8 | AG+ | 2 |
| 8 | 1058 | GG+ | 457 | 91 | GG+ | 21 | 44 | TA− | 7 | 17 | AA+ | 5 |
| 9 | 198 | CG− | 46 | 62 | GT+ | 14 | 20 | CT+ | 6 | 15 | AA+ | 3 |
| 10 | 84 | GG− | 22 | 23 | GT− | 5 | 20 | TG− | 4 | 19 | GG− | 4 |
| 11 | 341 | TT+ | 70 | 153 | TT+ | 31 | 86 | TT+ | 28 | 21 | GG+ | 8 |
| 12 | 114 | GG− | 36 | 51 | GT+ | 18 | 15 | CT+ | 4 | 6 | CC− | 1 |
| 13 | 86 | GG− | 26 | 41 | GT− | 7 | 8 | GT+ | 2 | 11 | AT− | 4 |
| 14 | 337 | TT+ | 96 | 80 | TT+ | 17 | 31 | GT− | 9 | 17 | GC− | 3 |
| 15 | 66 | TG+ | 18 | 43 | GT+ | 8 | 18 | CG− | 4 | 12 | GC− | 4 |
| 16 | 156 | GC− | 42 | 31 | GT− | 6 | 14 | CA+ | 5 | 9 | TG+ | 2 |
| 17 | 975 | GG+ | 425 | 86 | GG+ | 23 | 13 | TT+ | 4 | 13 | TT+ | 4 |
| 18 | 159 | CG− | 47 | 53 | GA− | 11 | 14 | TG+ | 6 | 12 | AA+ | 2 |
| 19 | 84 | TT− | 18 | 27 | GT− | 8 | 37 | GT+ | 15 | 8 | CT+ | 1 |
| 20 | 325 | TT+ | 100 | 106 | GG+ | 27 | 36 | TT+ | 10 | 4 | AT− | 1 |
| 21 | 97 | GG− | 29 | 46 | TT− | 11 | 32 | GA+ | 4 | 18 | GG+ | 4 |
| 22 | 80 | GG− | 23 | 46 | AT+ | 10 | 42 | AT− | 15 | 7 | AT+ | 1 |
| 23 | 316 | TT+ | 85 | 64 | TT+ | 17 | 30 | GG+ | 8 | 13 | GG+ | 3 |
| 24 | 72 | GG− | 20 | 32 | GT+ | 10 | 9 | TG+ | 2 | 16 | TT+ | 5 |
| 25 | 194 | GC− | 53 | 30 | GT− | 10 | 21 | TG− | 3 | 5 | CG+ | 1 |
| 26 | 1010 | GG+ | 440 | 139 | TT+ | 39 | 18 | GG+ | 5 | 19 | GG+ | 7 |
Bases separated by K sites, and contribution for the most different (from neutrality) pair (MD). Signs (S) indicate more (+) and less (−) observed than expected pairs
Fig. 2The Homo sapiens collagen type I alpha 2 gene
Fig. 3Human mtDNA
Total value (in integers) for differences from randomness of dinucleotides from prokaryote DNA segments
|
|
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| K |
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
| 0 | 56 | TA− | 10 | 114 | TG+ | 21 | 226 | GG+ | 41 | 200 | CG− | 51 |
| 1 | 43 | GC− | 11 | 103 | CC− | 24 | 99 | CG+ | 18 | 181 | CG+ | 73 |
| 2 | 15 | GG+ | 6 | 155 | AT+ | 32 | 146 | GT− | 42 | 139 | AC− | 24 |
| 3 | 10 | TG+ | 3 | 91 | TC+ | 15 | 105 | GC+ | 28 | 86 | CA+ | 30 |
| 4 | 23 | GG− | 9 | 107 | TG+ | 28 | 82 | TC+ | 16 | 68 | AC+ | 22 |
| 5 | 30 | GG+ | 14 | 181 | TT+ | 42 | 111 | GG+ | 32 | 103 | CC+ | 24 |
| 6 | 15 | GG− | 5 | 63 | TA− | 17 | 74 | TC− | 11 | 65 | CA+ | 13 |
| 7 | 15 | CC− | 4 | 69 | CT+ | 15 | 71 | GT+ | 14 | 72 | CG+ | 20 |
| 8 | 21 | GG+ | 7 | 133 | TA+ | 36 | 119 | GT− | 34 | 49 | CC+ | 10 |
| 9 | 19 | AA− | 4 | 51 | TT− | 10 | 52 | TG+ | 15 | 46 | GC+ | 12 |
| 10 | 11 | GT+ | 4 | 139 | CT+ | 37 | 88 | GT+ | 15 | 68 | CG+ | 21 |
| 11 | 25 | GG+ | 9 | 113 | AA+ | 14 | 51 | GG+ | 15 | 60 | CC+ | 15 |
| 12 | 10 | TG+ | 3 | 76 | TA− | 12 | 84 | GC+ | 13 | 67 | CA+ | 15 |
| 13 | 20 | GT+ | 5 | 83 | CA+ | 14 | 71 | GT+ | 18 | 67 | CG+ | 25 |
| 14 | 40 | GG+ | 18 | 69 | AT+ | 13 | 87 | GT− | 22 | 70 | AA+ | 14 |
| 15 | 26 | GG− | 7 | 96 | TC+ | 18 | 79 | GC+ | 14 | 93 | GC+ | 36 |
| 16 | 10 | CT+ | 3 | 110 | AG+ | 20 | 45 | GT+ | 13 | 51 | AA− | 7 |
| 17 | 46 | GG+ | 18 | 100 | TT+ | 16 | 156 | GG+ | 49 | 99 | CC+ | 33 |
| 18 | 5 | GG− | 1 | 91 | GT+ | 13 | 63 | CT+ | 12 | 46 | AA− | 12 |
| 19 | 16 | GT+ | 7 | 81 | CT+ | 12 | 64 | TC+ | 15 | 60 | CG+ | 16 |
| 20 | 37 | GG+ | 15 | 116 | AC− | 17 | 164 | GT− | 41 | 129 | CC+ | 38 |
| 21 | 6 | GG− | 3 | 86 | TA− | 26 | 57 | TG+ | 14 | 42 | GC+ | 16 |
| 22 | 3 | GG− | 1 | 83 | CT+ | 10 | 89 | GT− | 21 | 59 | CG+ | 16 |
| 23 | 29 | GG+ | 10 | 179 | CA− | 25 | 131 | GG+ | 41 | 85 | CA− | 16 |
| 24 | 12 | AG+ | 3 | 39 | CG+ | 7 | 84 | TG+ | 15 | 64 | CA+ | 15 |
| 25 | 25 | GG− | 8 | 85 | AT− | 14 | 35 | GT+ | 7 | 98 | AC+ | 21 |
| 26 | 30 | GG+ | 14 | 96 | TT+ | 15 | 207 | GT− | 45 | 77 | CA− | 18 |
Bases separated by K sites, and contribution for the most different (from neutrality) pair (MD). Signs (S) indicate more (+) and less (−) observed than expected pairs
Fig. 4A gene from Rickettsia prowazekii
Fig. 5A gene from Methanobrevibacter smithii
Total value for differences from randomness of dinucleotides of prokaryote genomes
| Sep |
|
|
| ||||||
|---|---|---|---|---|---|---|---|---|---|
| K+1 |
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
| 1 | 52,775.3 | TA− | 13,986.2 | 53,841.9 | TA− | 18,564.7 | 12,165.2 | GC+ | 6452.1 |
| 2 | 41,043.3 | GC− | 10,401.0 | 6789.9 | AT− | 1175.8 | 2547.1 | GC− | 894.9 |
| 3 | 42,065.9 | CC+ | 7601.9 | 38,411.4 | TT+ | 4899.4 | 6282.7 | CC+ | 1217.4 |
| 4 | 218,522 | CG− | 6086.2 | 18,543.1 | TA− | 4258.9 | 1845.1 | CG− | 444.3 |
| 5 | 7566.5 | GC− | 2818.3 | 17,156.6 | CG− | 4336.5 | 1951.9 | GC− | 955.8 |
| 6 | 20,901.5 | CC+ | 6350.3 | 38,358.8 | TT+ | 5161.0 | 3433.9 | CC+ | 1088.9 |
| 7 | 8446.0 | CG− | 3334.7 | 12,489.1 | TA− | 3005.9 | 1292.4 | CG− | 522.5 |
| 8 | 3225.5 | GC− | 1.381.1 | 11,437.2 | AT− | 2993.4 | 1044.0 | GC− | 398.4 |
| 9 | 21,854.9 | CC+ | 6071.2 | 37,282.8 | AA+ | 7180.4 | 4799.8 | CC+ | 1020.7 |
| 10 | 6589.2 | CG− | 2578.7 | 10,136.9 | TA− | 3065.7 | 945.1 | CG− | 355.0 |
| 11 | 5279.3 | GC− | 2167.0 | 9152.0 | AT− | 2722.3 | 1053.5 | CG+ | 317.4 |
| 12 | 26,475.6 | CC+ | 7117.3 | 47,310.8 | TT+ | 8069.6 | 4791.1 | CC+ | 1220.7 |
| 13 | 3076.9 | CG− | 903.7 | 10,516.9 | TA− | 2962.6 | 993.8 | GC+ | 320.1 |
| 14 | 5795.4 | GC− | 1923.2 | 11,167.0 | AT− | 2546.7 | 1316.5 | CG+ | 380.2 |
| 15 | 20,371.8 | CC+ | 6091.0 | 36,880.5 | TT+ | 5222.8 | 3788.8 | CC+ | 1120.5 |
| 16 | 3547.9 | CG− | 1373.9 | 12,422.9 | TA− | 2664.7 | 908.9 | CG− | 315.9 |
| 17 | 4049.2 | GC− | 1814.6 | 11,416.3 | AT− | 2668.5 | 825.0 | GC− | 259.8 |
| 18 | 21,761.5 | CC+ | 6791.3 | 36,418.3 | TT+ | 4659.1 | 3899.4 | CC+ | 1054.4 |
| 19 | 5049.3 | CG− | 2065.2 | 11,337.4 | TA− | 2340.1 | 1102.6 | GC+ | 365.6 |
| 20 | 3987.2 | GC− | 1363.8 | 9893.4 | AT− | 2764.1 | 742.1 | CG+ | 235.9 |
| 21 | 24,493.5 | CC+ | 6523.1 | 41,508.1 | TT+ | 7240.1 | 4701.4 | CC+ | 1155.7 |
| 22 | 5145.9 | CG− | 1714.5 | 10,525.4 | TA− | 2665.5 | 1169.1 | GC+ | 387.7 |
| 23 | 3732.6 | GC− | 1130.0 | 10,332.5 | AT− | 2729.9 | 861.4 | CG+ | 276.3 |
| 24 | 21,564.0 | CC+ | 6302.9 | 37,752.4 | TT+ | 6432.9 | 4213.2 | CC+ | 1078.3 |
| 25 | 3810.5 | CG− | 1343.6 | 11,857.5 | TA− | 2773.7 | 995.3 | CG− | 334.7 |
| 26 | 3608.2 | GC− | 1447.3 | 11,438.2 | AT− | 2854.9 | 930.1 | CG+ | 270.3 |
| 27 | 20,132.5 | CC+ | 6016.4 | 34,145.0 | TT+ | 5348.5 | 3833.9 | CC+ | 971.9 |
Bases separated (Sep) by K sites, and contribution for the most different (from neutrality) pair (MD). Signs (S) indicate more (+) and less (−) observed than expected pairs
value for difference from randomness and its for the most distant pair. Eukaryotes, prokaryote DNA segments and human mtDNA
| Sep |
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
| (MD)S |
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Eukaryotes | ||||||||||||
| K+1 | Human collagen | Human mtDNA |
|
| ||||||||
| 999 | 452 | GG+ | 188 | 7 | GT+ | 3 | 8 | GA− | 1 | 8 | GA+ | 3 |
| 1000 | 64 | CG− | 22 | 15 | AT+ | 3 | 33 | TG+ | 11 | 4 | CG− | 1 |
| 1001 | 52 | TT− | 17 | 15 | TT+ | 7 | 4 | TT− | 1 | 5 | AC− | 1 |
| 1002 | 153 | TT+ | 48 | 8 | TG− | 4 | 21 | GG+ | 6 | 11 | TC+ | 4 |
| 1003 | 42 | GG− | 12 | 22 | GC+ | 4 | 13 | GA+ | 3 | 21 | TC− | 7 |
| 1004 | 55 | GG− | 16 | 12 | AG− | 4 | 29 | TA+ | 8 | 9 | CG+ | 3 |
| 1005 | 139 | TT+ | 44 | 6 | CT− | 2 | 14 | TT+ | 5 | 18 | TG− | 3 |
| 1006 | 44 | TG+ | 8 | 30 | TT− | 11 | 14 | TG+ | 3 | 5 | TC− | 2 |
| 1007 | 89 | GC− | 27 | 3 | AT− | 0 | 8 | TA+ | 2 | 11 | GA− | 3 |
| 1008 | 467 | GG+ | 196 | 16 | GT+ | 6 | 21 | TT+ | 8 | 9 | AC+ | 3 |
S signs indicate more (+) and less (−) observed than expected pairs. Chi square values rounded to integers. Sep separation of K + 1 sites. MD pair most different from neutrality
Fig. 6Comparison of values between the random and the real M. smithii genomes
Fig. 7Comparison of values between the random and the real human chromosome 21
Periodicities found in the first significant pair in the 19 initial separations among bacteria of the Firmicutes group
| S |
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | TA− | TA− | TA− | TA− | TA− | TA− | TA− | TA− | TA− | TA− | CG− | TA− |
| 1 | GC− | GC− | GC− | GC− | GC− | CG+ | CG+ | CG+ | GT+ | CC− | TA− | CC− |
| 2 | CC+ | CC+ | CC+ | CC+ | CC+ | AT− | AT− | AT− | GG+ | GC− | CC+ | GG+ |
| 3 | CG− | CG− | CG− | CG− | CG− | AT− | AT− | TA− | CG− | TA− | CG− | GC+ |
| 4 | GC− | GC− | GC− | GC− | GC− | AT− | AT− | AT− | GC− | AC+ | GC− | CG+ |
| 5 | CC+ | CC+ | CC+ | CC+ | CC+ | GG+ | GG+ | GG+ | CC+ | CC+ | CC+ | CC+ |
| 6 | CG− | CG− | CG− | CG− | CG− | CG− | CG− | AC+ | CG− | AA− | CG− | GC+ |
| 7 | GC− | GC− | GC− | GC− | GC− | AT− | AT− | AT− | GC− | AT− | GC− | CG+ |
| 8 | CC+ | CC+ | CC+ | CC+ | CC+ | GG+ | GG+ | GG+ | CC+ | CC+ | CC+ | GG+ |
| 9 | CG− | CG− | CG− | CG− | CG− | TA− | TA− | TA− | CG− | CA+ | CG− | GC+ |
| 10 | GC− | GC− | GC− | GC− | GC− | AT− | AT− | AT− | GC− | AC+ | GC− | CG+ |
| 11 | CC+ | CC+ | CC+ | CC+ | CC+ | GG+ | GG+ | GG+ | GG+ | CC+ | CC+ | GG+ |
| 12 | CG− | CG− | CG− | GC+ | CG− | TA− | CG+ | CG+ | GC+ | AT+ | CG− | GC+ |
| 13 | GC− | GC− | GC− | GC− | GC− | CG+ | CG+ | TA+ | CG+ | TA+ | GC− | CG+ |
| 14 | GG+ | CC+ | CC+ | CC+ | CC+ | GG+ | GG+ | GG+ | GG+ | CC+ | CC+ | GG+ |
| 15 | CG− | CG− | CG− | CG− | CG− | CG+ | CG+ | GA+ | CG− | AT+ | CG− | GC+ |
| 16 | GC− | GC− | GC− | GC− | GC− | GC+ | GC+ | AT− | GC− | TA+ | GC− | CG+ |
| 17 | CC+ | CC+ | CC+ | CC+ | CC+ | GG+ | GG+ | GG+ | CC+ | CC+ | CC+ | GG+ |
| 18 | CG− | CG− | CG− | CG− | CG− | TA− | TA− | AT+ | CG− | AT+ | CG− | GC+ |
Bacterial strains of the Firmicutes group. Bacillus: thuringiensis, cereus, anthracis, weihenstephanensis, cytotoxicus, subtilis, atrophaeus, selenitireducens. Staphylococcus aureus; Lactobacillus casei; Clostridium botulinum; Mycoplasma hominis. Separation (S) from 0 to 18 nucleotide sites