Dan Tulpan, Chaouki Regoui, Guillaume Durand, Luc Belliveau, Serge Léger.
Abstract
This paper presents a novel hybrid DNA encryption (HyDEn) approach that uses randomized assignments of unique error-correcting DNA Hamming code words for single characters in the extended ASCII set. HyDEn relies on custom-built quaternary codes and a private key used in the randomized assignment of code words and the cyclic permutations applied on the encoded message. Along with its ability to detect and correct errors, HyDEn equals or outperforms existing cryptographic methods and represents a promising in silico DNA steganographic approach.
Year: 2013 PMID: 23984392 PMCID: PMC3745945 DOI: 10.1155/2013/634832
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The redundant DNA codon table.
| Amino acid | DNA codons | |||||
|---|---|---|---|---|---|---|
| Alanine | GCT | GCC | GCA | GCG | ||
| Arginine | CGT | CGC | CGA | CGG | AGA | AGG |
| Asparagine | AAT | AAC | ||||
| Aspartic acid | GAT | GAC | ||||
| Cysteine | TGT | TGC | ||||
| Glutamic acid | GAA | GAG | ||||
| Glutamine | CAA | CAG | ||||
| Glycine | GGT | GGC | GGA | GGG | ||
| Histidine | CAT | CAC | ||||
| Isoleucine | ATT | ATC | ATA | |||
| Leucine | CTT | CTC | CTA | CTG | TTA | TTG |
| Lysine | AAA | AAG | ||||
| Methionine | ATG | |||||
| Phenylalanine | TTT | TTC | ||||
| Proline | CCT | CCC | CCA | CCG | ||
| Serine | TCT | TCC | TCA | TCG | AGC | AGT |
| Threonine | ACT | ACC | ACA | ACG | ||
| Tryptophan | TGG | |||||
| Tyrosine | TAT | TAC | ||||
| Valine | GTT | GTC | GTA | GTG | ||
| Start (CI) | ATG | |||||
| Stop (CT) | TAA | TAG | TGA | |||
A sample DNA $A_4(8,3)$ Hamming code consisting of 256 code words. Each code word can be associated with an extended ASCII character and used for encoding text messages. The code was obtained with the DNA word design algorithm described in Tulpan and Hoos [26].
| A set with 256 code words | |||||||
|---|---|---|---|---|---|---|---|
| AAAAAAGA | ACTACACT | ATGGAGTT | CCCTTCGA | CTGGTAGT | GGAAAGGT | GTTGTATT | TCGTGTTA |
| AAAAGAAG | ACTACCTA | ATGGGAAG | CCGATTTC | CTGGTTCG | GGATGACA | TAACATAC | TCTCCGAG |
| AAAATGTT | ACTCTCAG | ATGTAAGT | CCGCGCAT | CTTCGGTG | GGCCAAGT | TAACCATA | TCTCCTTA |
| AAACCTGC | ACTGGAGT | ATTCATAC | CCGGCGCG | CTTGACAT | GGCCGACG | TAACGAGG | TCTGCGCA |
| AAACTCAC | ACTTCCGC | ATTCTGCG | CCGTAGCC | CTTGCATG | GGCCTGGA | TAAGAGCA | TCTGGCTC |
| AAAGATCG | ACTTGCAT | ATTTAATC | CCGTTCAG | CTTTCCAC | GGCGTGCC | TAAGTTGA | TCTGTTAC |
| AAATGTGG | ACTTTGGG | ATTTCAGA | CCTACCGG | GAATCATC | GGCTGCAT | TAATAGGC | TGAAAATA |
| AAATTGAG | AGACCCTA | CAAATACG | CCTTCTGT | GACAGCGT | GGGCATAC | TAATGGAA | TGACTCAT |
| AACAGCTG | AGACTTAA | CAAATCTA | CCTTGTCG | GACCAGCT | GGGCTTGG | TAATTACT | TGAGCATC |
| AACCTAGC | AGAGCGGT | CAATATGA | CCTTTGAC | GACCGTTA | GGGGCCCA | TACGCAAA | TGAGGGTT |
| AACGCGTT | AGAGTAAT | CAATTCGC | CGAACGCT | GACGGTAT | GGGGGTTC | TACTTGGG | TGATATAT |
| AACGGTGA | AGATCTTG | CACCTAAT | CGACCTTT | GAGAATTA | GGTAATGG | TAGACTGA | TGATTCGG |
| AACTACGT | AGATGGCT | CACTCGAA | CGAGAAAC | GAGAGAGC | GGTACGTA | TAGAGTAC | TGCATAAG |
| AACTCATA | AGCCAGCA | CAGACAGG | CGAGCGTA | GAGAGTCG | GGTATGCG | TAGGAGTG | TGGGGCGC |
| AAGAAACT | AGCTCGGG | CAGCAACG | CGAGCTCG | GAGTTGTT | GGTTTAGT | TAGTAACC | TGGTTTTT |
| AAGATAAC | AGGACTGT | CAGCCGGC | CGAGTCTT | GATACCCC | GGTTTCCC | TAGTCCGG | TGTCAGAT |
| AAGCACGC | AGGATGAG | CAGGTCGA | CGATGTAC | GATATTGC | GTAACGCG | TATAAATG | TGTGCAAT |
| AAGGTTGT | AGGCCCAT | CAGTGATC | CGCCACGA | GATCATAT | GTACTACG | TATATGGT | TGTGTTGG |
| AATAGTCT | AGGTACTT | CATCGAGC | CGCCTCCC | GATCCCAG | GTAGATCA | TATGTGAA | TTAAGCCG |
| AATCGTTC | AGGTAGGC | CATCTTTG | CGGAAGTA | GATGACTA | GTAGTCGT | TCAAACGC | TTAATTTA |
| AATGCGGG | AGGTGTCC | CATGCTTA | CGGTAACA | GATTGTTG | GTCATATG | TCAAAGTG | TTAGCTGT |
| AATGTGCT | AGTCGAAG | CATGGGGA | CGGTGTTG | GATTTACG | GTCCGAAT | TCAAGAAC | TTAGTCCA |
| AATTGGTT | AGTCGGGA | CCACCGCC | CGTCACAC | GCAGGTCG | GTCCTTAA | TCACAAGA | TTCAAGAC |
| ACACTAGT | AGTGCCGA | CCAGATGC | CGTTAGCT | GCATTCTT | GTCGCAAG | TCAGTGCC | TTCCGCAC |
| ACACTTCC | ATATGCCC | CCAGTATC | CTAACTCC | GCATTTCA | GTCTCCAA | TCATCTTC | TTCGAATA |
| ACAGCTTA | ATCACAAA | CCAGTGGA | CTAGACGG | GCCGAATT | GTGGAGAA | TCCGAGGC | TTGCGTTC |
| ACATCGAA | ATCACCGG | CCATGACC | CTAGAGCC | GCCGCGGT | GTGGCCAT | TCCGCCGA | TTGGGGTA |
| ACCGGATC | ATCCCTGA | CCATGCAA | CTATTACA | GCGAATGT | GTGTCGGT | TCCTGAAG | TTGTCTTG |
| ACCTCAAC | ATCGTAGG | CCCCTACG | CTATTGTT | GCGACATT | GTTATCAC | TCGATGCG | TTTACAGC |
| ACGCATTT | ATCTCTTC | CCCGGAGA | CTCCCAGT | GCGGGTAA | GTTCACTG | TCGGAACA | TTTCCACG |
| ACGCTATG | ATCTTCAC | CCCGGGAG | CTCCGGCC | GCTGAGTG | GTTCCAAC | TCGTAGAG | TTTCGTAG |
| ACGTCGTC | ATGACGTG | CCCTAGTT | CTCGCGGC | GCTGTCCG | GTTGCTCT | TCGTCCAT | TTTGTGTG |
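The error-correcting property of such a code can be illustrated with a minimal sketch. It assumes that the "3" in the code's name is the minimum pairwise Hamming distance, which is what allows a single substituted base to be corrected by choosing the nearest code word. Only the first row of the table above is used here, and the function names `hamming`, `min_distance`, and `correct` are illustrative, not taken from the paper.

```python
from itertools import combinations

# First row of the 256-word table above; a real check would use all 256 words.
CODE_WORDS = [
    "AAAAAAGA", "ACTACACT", "ATGGAGTT", "CCCTTCGA",
    "CTGGTAGT", "GGAAAGGT", "GTTGTATT", "TCGTGTTA",
]

def hamming(u, v):
    """Count the positions at which two equal-length words differ."""
    if len(u) != len(v):
        raise ValueError("words must have equal length")
    return sum(a != b for a, b in zip(u, v))

def min_distance(words):
    """Smallest Hamming distance over all pairs of distinct words."""
    return min(hamming(u, v) for u, v in combinations(words, 2))

def correct(word, words):
    """Nearest-neighbour decoding: return the closest code word."""
    return min(words, key=lambda w: hamming(word, w))

if __name__ == "__main__":
    print(min_distance(CODE_WORDS))
    # One substitution in the last base of CTGGTAGT.
    print(correct("CTGGTAGA", CODE_WORDS))
```

With a minimum distance of 3, any word carrying one substitution is still closer to its original code word than to any other, so nearest-neighbour decoding recovers it.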
Figure 1: Flowcharts for message encryption and decryption with HyDEn.
A sample assignment of code words to ASCII characters.
| DNA code word | ASCII character |
|---|---|
| AAAAAAGA | →space |
| ACTACACT | →A |
| ATGGAGTT | →C |
| CCCTTCGA | →D |
| CTGGTAGT | →K |
| GGAAAGGT | →N |
| GTTGTATT | →T |
| TCGTGTTA | →W |
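A rough sketch of how such an assignment could be used is given below. It encodes a message by concatenating code words and then applies a cyclic rotation to the encoded sequence. The exact permutation scheme and how the private key determines it are not specified in this excerpt, so the `shift` parameter and the functions `rotate`, `encrypt`, and `decrypt` are assumptions made for illustration only.

```python
# Sample assignment from the table above (code word -> character).
SAMPLE = {
    "AAAAAAGA": " ", "ACTACACT": "A", "ATGGAGTT": "C", "CCCTTCGA": "D",
    "CTGGTAGT": "K", "GGAAAGGT": "N", "GTTGTATT": "T", "TCGTGTTA": "W",
}
CHAR_TO_WORD = {ch: word for word, ch in SAMPLE.items()}
WORD_LEN = 8

def rotate(seq, shift):
    """Cyclically rotate seq to the left by shift positions."""
    if not seq:
        return seq
    shift %= len(seq)
    return seq[shift:] + seq[:shift]

def encrypt(message, shift):
    """Map each character to its code word, then rotate the result.

    The shift stands in for a key-derived value (hypothetical).
    """
    dna = "".join(CHAR_TO_WORD[ch] for ch in message)
    return rotate(dna, shift)

def decrypt(dna, shift):
    """Undo the rotation, split into 8-mers, and look up each character."""
    dna = rotate(dna, -shift)
    words = [dna[i:i + WORD_LEN] for i in range(0, len(dna), WORD_LEN)]
    return "".join(SAMPLE[w] for w in words)

if __name__ == "__main__":
    secret = encrypt("ATTACK AT DAWN", shift=5)
    print(secret)
    print(decrypt(secret, shift=5))
```

Because the rotation moves the code-word boundaries, a reader who does not know the shift cannot simply split the sequence into 8-mers and look them up.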
Comparison between HyDEn and other encryption methods. n is the length of a DNA sequence, |m| is the length of the original message, |Ω| is the size of the DNA code, and k is a method-specific parameter that represents the length of the longest complementary pairs in the reference DNA sequence.
[Table body not recoverable. The rows listed the Insertion, Complementary pair, and Substitution methods alongside HyDEn; the column headers and formula cells were lost in extraction.]
A sample DNA $A_4(8,3)$ Hamming code consisting of 1024 code words. Four distinct code words can be associated with one extended ASCII character and used for encoding text messages. The code was obtained with the DNA word design algorithm described in Tulpan and Hoos [26].
| A set with 1024 code words | |||||||
|---|---|---|---|---|---|---|---|
| AAAAAAAG | AAAAAGGA | AAAACTCC | AAAAGCAC | AAACAATA | AAACAGCT | AAACCGTC | AAACGAGG |
| AAACGCCA | AAACGTAT | AAAGATGG | AAAGTCTT | AAAGTGGC | AAATATAA | AAATCTTT | AAATGGCG |
| AAATTGAT | AACACAAA | AACAGACC | AACAGCTA | AACCAGGG | AACCATCA | AACCCTAC | AACCTCGA |
| AACCTGTT | AACGACCG | AACGCATC | AACGGGGA | AACGTTAA | AACTCGTG | AACTGTGC | AAGAAGAC |
| AAGACTAG | AAGAGGTG | AAGATACA | AAGATGGT | AAGATTTC | AAGCGTGA | AAGCTCAT | AAGCTGCC |
| AAGGACGC | AAGGCCTG | AAGGCGCA | AAGGGATA | AAGGTAAG | AAGTAATT | AAGTGCAG | AATAATTA |
| AATACACG | AATACCTT | AATCAACC | AATCCGGA | AATCGCTC | AATGGAGC | AATGGCAA | AATGGTTG |
| AATGTGCT | AATTACTG | AATTAGCA | AATTCAAC | AATTGGGT | AATTTAGG | AATTTTCC | ACAAAGTC |
| ACAACCCG | ACAAGAGT | ACAATTTT | ACACATTG | ACACCAAT | ACACCTCA | ACACTACG | ACACTGTA |
| ACAGCTGC | ACAGGGCC | ACAGGTAG | ACATAACT | ACATACGA | ACATCATG | ACATTCAC | ACCAACCA |
| ACCACTGA | ACCAGTTG | ACCATGAC | ACCCAATT | ACCCCCGG | ACCCGAAG | ACCCGGCT | ACCGACAC |
| ACCGAGTA | ACCGCGAT | ACCGTAGC | ACCTATCG | ACCTGACA | ACCTGCTC | ACCTTTAT | ACGAATGG |
| ACGACGCC | ACGAGGGA | ACGATCAG | ACGCAACA | ACGCCCAA | ACGCCGTT | ACGCGTAC | ACGCTGGG |
| ACGGCCGT | ACGGTTCT | ACGTAGAA | ACGTTAGT | ACTAAATG | ACTACGAA | ACTAGCCC | ACTATCTA |
| ACTATTGC | ACTCATGT | ACTGCACC | ACTGCTTT | ACTGGTCA | ACTGTCAT | ACTTATTC | ACTTCCAG |
| ACTTCGCT | ACTTGTGG | ACTTTAAA | ACTTTGTG | AGAACCAT | AGAATCTC | AGAATGCG | AGACAAAC |
| AGACGCGT | AGACGTTA | AGACTCAA | AGAGAGCA | AGAGCACT | AGAGCCGA | AGAGTATA | AGAGTTCC |
| AGATCGAC | AGATGAGC | AGCAACGT | AGCAATAA | AGCACTTC | AGCAGGCA | AGCATAAT | AGCCAGAT |
| AGCCCACA | AGCCGTGG | AGCGCCCC | AGCGGGAC | AGCGTCAG | AGCTAAGA | AGCTATTT | AGCTCAAG |
| AGCTGCAA | AGCTTGGT | AGGAACTG | AGGACATT | AGGAGTGC | AGGATGTA | AGGCAAGG | AGGCATCC |
| AGGCCTTG | AGGGACAT | AGGGCAAA | AGGGCGGC | AGGGGACC | AGGGGGTT | AGGTAGTC | AGGTCGCG |
| AGGTCTGT | AGGTGTCA | AGGTTCCC | AGGTTTAG | AGTAACAC | AGTAGGTC | AGTATCCT | AGTCAGTG |
| AGTCCAGT | AGTCCGCC | AGTGATCG | AGTGCTAC | AGTGGAAT | AGTGGCGG | AGTGTGGA | AGTTCCTA |
| AGTTGACG | AGTTTATC | ATAAACTT | ATAACATA | ATAATTCA | ATACCCAG | ATACGGTT | ATACTTGC |
| ATAGAAGT | ATAGCGTG | ATAGGTTC | ATAGTGAA | ATATAGGC | ATATCCTC | ATATCTGG | ATATGCCT |
| ATCAAGTG | ATCACGGC | ATCAGCCG | ATCAGTAC | ATCATTGT | ATCCAGCC | ATCCATAG | ATCCCCCT |
| ATCCCTTA | ATCCTAAA | ATCGAACA | ATCGATGC | ATCGCAGG | ATCGTGTC | ATCGTTCG | ATCTAATC |
| ATCTACAT | ATCTCTCC | ATCTGGAG | ATCTTCGC | ATCTTGCA | ATGAAACT | ATGAGATC | ATGAGCGT |
| ATGCATTT | ATGCCAAC | ATGCGGAA | ATGCGTCG | ATGGACTA | ATGGAGGG | ATGGGCAC | ATGGTAGA |
| ATGTACCG | ATGTCGGA | ATGTGAGG | ATTAAAAA | ATTAAGGT | ATTACCCA | ATTAGTCT | ATTATTAG |
| ATTCCATG | ATTCCCGC | ATTCGACA | ATTGACCT | ATTGAGAC | ATTGCTGA | ATTGGGCG | ATTGTATT |
| ATTTCTAT | ATTTGCGA | CAAAATCA | CAAACTGG | CAAAGGAA | CAAATCCC | CAACACTC | CAACCCAT |
| CAACGGCC | CAAGAATT | CAAGCCTA | CAAGCTCT | CAAGGCCG | CAAGGGGT | CAATACAG | CAATATGT |
| CAATGTTG | CAATTGCA | CACAAAAC | CACAAGTA | CACACCCG | CACATTCT | CACCAACG | CACCGCGG |
| CACCTTTC | CACGCAGT | CACGCTTG | CACTACGC | CACTAGAT | CACTCTCA | CACTGAGA | CACTTTAG |
| CAGAAATG | CAGACCAC | CAGACGCT | CAGAGCCA | CAGAGTGT | CAGCACGA | CAGCATAT | CAGCCTCG |
| CAGCGAAC | CAGCTAGT | CAGGAGTC | CAGGCTGC | CAGGGTAA | CAGTATTA | CAGTCACC | CAGTCCGT |
| CATAGCAT | CATAGTTC | CATATAAG | CATATGTT | CATCACCT | CATCCTTT | CATCTCAC | CATCTGCG |
| CATCTTGA | CATGAGGG | CATGATAC | CATGCGAT | CATGTACC | CATGTCTG | CATTCATA | CATTCGGC |
| CATTGGAG | CATTGTCT | CCAAACTA | CCAAAGGG | CCAACAAC | CCAACGTT | CCACAAAG | CCACCCGA |
| CCACGTGG | CCACTGAC | CCAGAAGA | CCAGATCG | CCAGGGTG | CCAGTTTC | CCATAATC | CCATCAGT |
| CCATCGCC | CCATCTAG | CCATGCAT | CCATGGGA | CCATTCTG | CCCAATAT | CCCACATG | CCCAGCCT |
| CCCAGTGC | CCCATTTA | CCCCACCC | CCCCATGA | CCCCCGAA | CCCCGGTC | CCCCTAGG | CCCGACGT |
| CCCGCGGC | CCCGGCTA | CCCGTATT | CCCGTCCG | CCCTCTTC | CCCTTGCT | CCGAGTAG | CCGATAAT |
| CCGATGCG | CCGCAGGC | CCGCCCTC | CCGCGAGA | CCGCGGAT | CCGCTTCA | CCGGAAAC | CCGGCGAG |
| CCGGCTTA | CCGGTCGA | CCGTACCA | CCGTCAAA | CCGTCTCT | CCGTGATG | CCGTGCGC | CCGTTGTA |
| CCTAAAGC | CCTACTCA | CCTAGACG | CCTAGGAC | CCTATGGA | CCTCAATA | CCTCCCCG | CCTCCTAC |
| CCTCGGCA | CCTCTCGT | CCTGAGCT | CCTGCTGG | CCTGGATC | CCTTCCTT | CGAAAAGT | CGAAACCG |
| CGAACGCA | CGAAGTAC | CGACATCT | CGACCGGG | CGACTAGA | CGACTGTT | CGACTTAG | CGAGAGAT |
| CGAGATGC | CGAGCATG | CGAGCTAA | CGATAGTG | CGATGAAG | CGATGCTA | CGATTCGT | CGCACGAG |
| CGCAGAAA | CGCAGCTG | CGCATACC | CGCCAAGC | CGCCACTT | CGCCCGCT | CGCCGTCA | CGCCTTGT |
| CGCGGACT | CGCGGTTC | CGCGTCGC | CGCTCGTA | CGCTCTGG | CGCTGTAT | CGCTTCCA | CGCTTGAC |
| CGGACAGA | CGGACGTC | CGGATCAA | CGGCACAC | CGGCGGTG | CGGCTCCG | CGGGAACA | CGGGCCTT |
| CGGGGTGG | CGGGTATC | CGGGTTAT | CGGTAGGA | CGGTCCAG | CGGTGGCC | CGGTTACT | CGTAAGCC |
| CGTACCGT | CGTATTCG | CGTCAGAA | CGTCCATC | CGTCGCGA | CGTCTGGC | CGTGATTA | CGTGCGCG |
| CGTGGCAC | CGTGTAGT | CGTTAAAC | CGTTCTCC | CGTTGGTT | CTAACTTC | CTAAGACT | CTAATGGC |
| CTACACGT | CTACATAA | CTACCGTA | CTACGAGC | CTACGGAG | CTACTCCA | CTAGCCAC | CTAGGATA |
| CTATAGCT | CTATGGTC | CTATTAAA | CTATTTTT | CTCAACGG | CTCAATCC | CTCACTAA | CTCATCAT |
| CTCCCCTG | CTCCGGGT | CTCGACTC | CTCGCCCA | CTCGGTAG | CTCGTAAC | CTCTCATT | CTCTGCAC |
| CTCTTGTG | CTGAAGAA | CTGAGGTT | CTGATCTG | CTGCAGCG | CTGCGCCC | CTGCGTTA | CTGCTAAG |
| CTGGATGT | CTGGCAAT | CTGGCCGG | CTGGGGGA | CTGGTCCT | CTGTACTT | CTGTATAC | CTGTTAGC |
| CTGTTTCG | CTTAATTT | CTTACGTG | CTTAGCGC | CTTATATC | CTTCACAG | CTTCGTAT | CTTCTACT |
| CTTCTTTG | CTTGAATG | CTTGACGA | CTTGCAGC | CTTGGCTT | CTTGGTCC | CTTGTGTA | CTTTATCA |
| CTTTCGAA | CTTTGAGT | CTTTGCCG | CTTTTGCC | GAAAACGT | GAAACCTG | GAAACGGC | GAAAGTGA |
| GAAATATA | GAAATTAT | GAACAGTG | GAACGAAA | GAAGAACG | GAAGCTAG | GAATACCC | GAATCGAA |
| GAATGGTT | GAATTACT | GAATTCGG | GACAAGCG | GACATCAA | GACATTTG | GACCAAAT | GACCCCGT |
| GACCGGTA | GACGACGA | GACGATCC | GACGCGAC | GACGGACA | GACGGCAG | GACGTCCT | GACTAATA |
| GACTCCTC | GACTCTAT | GACTTGGC | GAGAACAG | GAGACCGA | GAGAGAGG | GAGAGTCC | GAGCCTTA |
| GAGCGGAG | GAGCTCCA | GAGCTTGC | GAGGCCCC | GAGGGGCT | GAGGTATT | GAGGTGGA | GAGTGCTA |
| GAGTTAAC | GAGTTGCG | GATAAACT | GATAGGCA | GATATCTC | GATCATTC | GATCCACA | GATCGCCG |
| GATCGGGC | GATCTATG | GATCTGAT | GATGAGTA | GATGGCGT | GATGTAAA | GATTAAAG | GATTCCCT |
| GATTCTTG | GATTGATC | GCAAACAC | GCAAGATG | GCAATCCT | GCAATGAA | GCACATAT | GCACCAGC |
| GCACCGCG | GCACGCAG | GCACGGGT | GCACTATT | GCAGCAAA | GCAGGCGC | GCAGGTCT | GCATAGCA |
| GCATATGG | GCATTGTC | GCCAAAAA | GCCACTCC | GCCAGGAT | GCCATCGC | GCCCAGAC | GCCCGTCG |
| GCCCTCAT | GCCGCTGT | GCCGGAGG | GCCGGTAC | GCCGTGAG | GCCTACTT | GCCTCACG | GCCTCGGA |
| GCCTGGCC | GCCTTTCA | GCGAAACG | GCGAGAAC | GCGAGTTA | GCGATAGA | GCGCAATC | GCGCACGT |
| GCGCCAAG | GCGGGCAT | GCGGGGTC | GCGGTTGG | GCGTATCC | GCGTCCTG | GCGTGACT | GCGTGGGG |
| GCGTTGAT | GCTACAGT | GCTACGTC | GCTACTAG | GCTAGCGG | GCTCACTG | GCTCCTGA | GCTCGTTT |
| GCTGAATT | GCTGACAA | GCTGAGGC | GCTGCGCA | GCTTCCGC | GCTTTCCG | GCTTTTAC | GGAACAAG |
| GGAACTGT | GGAAGGGG | GGAATAGC | GGACACGA | GGACCATA | GGACCGAT | GGACTCTG | GGAGATTT |
| GGAGCCCG | GGAGGAGT | GGAGGGTA | GGAGTTGA | GGATAAAA | GGATCTTC | GGATGCAC | GGATTGAG |
| GGCAATCT | GGCACCGG | GGCAGTAG | GGCATGGA | GGCCCCAA | GGCCCTTT | GGCCTAAC | GGCGAATC |
| GGCGATGG | GGCGCAAT | GGCGGGCG | GGCGTGTT | GGCTATAC | GGCTGATT | GGCTTAGG | GGGAAATA |
| GGGAAGAT | GGGACCCT | GGGACTAC | GGGCAGCA | GGGCATAG | GGGCGCGG | GGGCGTTC | GGGCTGGT |
| GGGGCGTG | GGGGCTCA | GGGGTCAC | GGGTAAGT | GGGTGGAA | GGGTTTTA | GGTAATTG | GGTAGACC |
| GGTAGCTA | GGTATGAC | GGTCAACG | GGTCACAT | GGTCGGCT | GGTCTCCC | GGTCTGTA | GGTCTTGG |
| GGTGCCTC | GGTGCGGT | GGTGGATG | GGTGGTGC | GGTGTTCT | GGTTACCA | GGTTAGGG | GGTTCTAA |
| GGTTTAAT | GTAAAATC | GTAATGTG | GTACCCCC | GTACGGCA | GTACTTTA | GTAGACAT | GTAGAGCC |
| GTAGCGGA | GTAGGTAA | GTAGTACA | GTAGTCTC | GTATCCGT | GTATCTCA | GTATGACG | GTATGTGC |
| GTCACAAC | GTCACGTT | GTCAGCTC | GTCAGTCA | GTCCAAGA | GTCCCGAG | GTCCGATG | GTCCTCCG |
| GTCGAGAA | GTCGCATA | GTCGGTTT | GTCGTAGT | GTCTAACT | GTCTATTG | GTCTGAAA | GTCTGCGG |
| GTGAAGGC | GTGAGGCG | GTGATACC | GTGATTAA | GTGCACAA | GTGCCAGT | GTGCCGTC | GTGCGCTT |
| GTGCTTCT | GTGGAAAG | GTGGATTC | GTGGGAGC | GTGTAGTA | GTGTCCAC | GTGTCTTT | GTGTGTAG |
| GTGTTATG | GTGTTCGA | GTTAACCC | GTTAAGAG | GTTACCAT | GTTACTGC | GTTAGATT | GTTATGCT |
| GTTCAGTT | GTTCCCTA | GTTCCTCG | GTTCGCAC | GTTCTAGC | GTTGCACT | GTTGGCCA | GTTGGGAT |
| GTTGTCGG | GTTTATGT | GTTTCAGA | GTTTGGTG | GTTTTCTT | TAAAATTG | TAAACAAT | TAAAGACG |
| TAAAGGTC | TAACCGAG | TAACGTGC | TAACTACC | TAACTGGT | TAAGACCT | TAAGAGAC | TAAGCTTC |
| TAAGGAGA | TAAGTCAA | TAAGTGTG | TAATACTA | TAATCCAC | TAATCTCG | TAATTAAG | TACAAACA |
| TACAACTC | TACAGGAG | TACATAGT | TACCATGT | TACCCAGA | TACCTGAA | TACGAAGC | TACGCAAG |
| TACGGTCT | TACGTCGG | TACGTGCC | TACTCGCT | TACTGATG | TACTGCCA | TACTTCTT | TAGACATC |
| TAGATCCG | TAGCACTG | TAGCGATT | TAGCTTAG | TAGGAAAT | TAGGAGCG | TAGGTCTC | TAGGTTGT |
| TAGTAAGA | TAGTCGGG | TAGTGGAT | TAGTGTTC | TAGTTTCA | TATAAGAT | TATAATGC | TATACGTA |
| TATACTCT | TATATTAA | TATCAAAA | TATCCCCC | TATCCTGG | TATCGGTG | TATCGTCA | TATGACAG |
| TATGCATT | TATGCCGA | TATTATTT | TATTGCGG | TATTTGTC | TCAAATGA | TCAACCGC | TCAAGAAA |
| TCAAGGCT | TCAATTCG | TCACAAGT | TCACACAA | TCACGATC | TCACTCGG | TCAGAACC | TCAGAGTT |
| TCAGCCTG | TCAGTAAT | TCAGTGCA | TCATCCCT | TCATCTTA | TCATGGAG | TCATTTGT | TCCAAGGC |
| TCCACACT | TCCATCTG | TCCCCCTT | TCCCCTAG | TCCCGCGC | TCCGATTC | TCCGGGAA | TCCGTGGT |
| TCCTACAG | TCCTCGAC | TCCTGTTT | TCCTTAGA | TCGAAATT | TCGACCCA | TCGAGCTC | TCGATCGT |
| TCGATTAC | TCGCCATA | TCGCCTGT | TCGCGGCC | TCGGACGG | TCGGATCA | TCGGCACG | TCGGCCAC |
| TCGGCGGA | TCGGGTGC | TCGTAGTG | TCGTATAT | TCGTGCCG | TCGTTATC | TCTACGCG | TCTAGTAT |
| TCTATAGG | TCTCAGAG | TCTCATCC | TCTCCGGC | TCTCTAAC | TCTCTGCT | TCTGCTAA | TCTGGACT |
| TCTGTCGC | TCTGTTTG | TCTTAACA | TCTTACGT | TCTTCAAT | TCTTGCAC | TCTTGGTA | TGAAAGTA |
| TGAAGTTT | TGAATCGA | TGAATGAT | TGACCACG | TGACGCCC | TGACGGAA | TGACTTCA | TGAGCAAC |
| TGAGGGGC | TGAGGTCG | TGATACGG | TGATAGCC | TGATCGTT | TGATGACT | TGATTTAC | TGCAAATG |
| TGCACTAT | TGCAGCAC | TGCAGTGA | TGCCAGCG | TGCCCTGC | TGCCGAAT | TGCCTCTC | TGCGACAA |
| TGCGCGGG | TGCGCTTA | TGCGGCTT | TGCGTACG | TGCTCACC | TGCTGGTC | TGCTTTCT | TGGACGAA |
| TGGACTGG | TGGATAAG | TGGCATTA | TGGCCCAT | TGGCGAGC | TGGCGTCT | TGGCTGAC | TGGGAGGT |
| TGGGGCGA | TGGGGGAG | TGGTACCT | TGGTCCGC | TGGTGATA | TGGTTCTG | TGTAACTT | TGTACACA |
| TGTAGAGT | TGTAGCCG | TGTCACGC | TGTCCCTG | TGTCGTAC | TGTCTATT | TGTGAAGA | TGTGCCCT |
| TGTGGGCA | TGTGTCTA | TGTTATAG | TGTTCAGG | TGTTTGCG | TGTTTTGA | TTAACCAA | TTAAGCGG |
| TTAAGTCC | TTACAACA | TTACAGTC | TTACCATT | TTACCTGA | TTACGTTG | TTACTTAT | TTAGACGC |
| TTAGATTA | TTAGCGCT | TTAGTAGG | TTATAAAT | TTATCAGC | TTATTCCG | TTATTGTA | TTCACTTG |
| TTCAGAGC | TTCATATA | TTCATGCG | TTCCAAAC | TTCCCGCA | TTCCGCAA | TTCCTTGG | TTCGAATT |
| TTCGCCAT | TTCGGCCC | TTCGGGTG | TTCTAGGG | TTCTATAA | TTCTCCTA | TTCTCTGT | TTCTGTCG |
| TTCTTGAT | TTCTTTTC | TTGAACGA | TTGACGGT | TTGAGGAC | TTGATTTT | TTGCAGAT | TTGCATGC |
| TTGCCCCG | TTGCGGGG | TTGCTCTA | TTGGCTCC | TTGGGTAT | TTGGTCAG | TTGGTGGC | TTGTAACC |
| TTGTCAAG | TTGTGTGA | TTTAATCG | TTTAGTTA | TTTATAAT | TTTCAAGG | TTTCCTTC | TTTCGCGT |
| TTTCTGGA | TTTGCGAG | TTTGGAAA | TTTGGTGG | TTTGTTAC | TTTTACTC | TTTTGGCT | TTTTTCAA |
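The one-to-many assignment described in the caption (four code words per character) can be sketched as follows. This is only an illustration under stated assumptions: the key-seeded shuffle via `random.Random` stands in for whatever key-dependent randomization HyDEn actually uses, the two-character alphabet is hypothetical, and only the first eight code words of the table are used. Python's `random` module is not a cryptographically secure generator, so this is not a secure construction.

```python
import random

# First row of the 1024-word table above.
WORDS = [
    "AAAAAAAG", "AAAAAGGA", "AAAACTCC", "AAAAGCAC",
    "AAACAATA", "AAACAGCT", "AAACCGTC", "AAACGAGG",
]
ALPHABET = "AB"   # hypothetical two-character alphabet for the demo
PER_CHAR = 4      # four code words per character, as in the caption

def build_assignment(key, alphabet=ALPHABET, words=WORDS, per_char=PER_CHAR):
    """Shuffle the code words with a key-seeded RNG and deal them out."""
    if len(words) < len(alphabet) * per_char:
        raise ValueError("not enough code words for this alphabet")
    shuffled = list(words)
    random.Random(key).shuffle(shuffled)  # assumption: key used as RNG seed
    return {
        ch: shuffled[i * per_char:(i + 1) * per_char]
        for i, ch in enumerate(alphabet)
    }

def encode(message, assignment, rng=None):
    """Pick one of the character's code words at random for each occurrence."""
    rng = rng or random.SystemRandom()
    return "".join(rng.choice(assignment[ch]) for ch in message)

def decode(dna, assignment, word_len=8):
    """Split into fixed-length words and map each back to its character."""
    lookup = {w: ch for ch, ws in assignment.items() for w in ws}
    return "".join(
        lookup[dna[i:i + word_len]] for i in range(0, len(dna), word_len)
    )

if __name__ == "__main__":
    table = build_assignment("my private key")
    dna = encode("ABBA", table)
    print(dna, decode(dna, table))
```

Since each character has several interchangeable code words, repeated characters need not produce repeated 8-mers, which makes simple frequency analysis of the encoded sequence less informative.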