| Literature DB >> 34079580 |
Xue Li1, Ziqi Wei2, Bin Wang1, Tao Song3.
Abstract
DNA computing is a new method based on molecular biotechnology to solve complex problems. The design of DNA sequences is a multi-objective optimization problem in DNA computing, whose objective is to obtain optimized sequences that satisfy multiple constraints to improve the quality of the sequences. However, the previous optimized DNA sequences reacted with each other, which reduced the number of DNA sequences that could be used for molecular hybridization in the solution and thus reduced the accuracy of DNA computing. In addition, a DNA sequence and its complement follow the principle of complementary pairing, and the sequence of base GC at both ends is more stable. To optimize the above problems, the constraints of Pairing Sequences Constraint (PSC) and Close-ending along with the Improved Chaos Whale (ICW) optimization algorithm were proposed to construct a DNA sequence set that satisfies the combination of constraints. The ICW optimization algorithm is added to a new predator-prey strategy and sine and cosine functions under the action of chaos. Compared with other algorithms, among the 23 benchmark functions, the new algorithm obtained the minimum value for one-third of the functions and two-thirds of the current minimum value. The DNA sequences satisfying the constraint combination obtained the minimum of fitness values and had stable and usable structures.Entities:
Keywords: DNA computing; DNA sequence design; ICW; WOA; constraint
Year: 2021 PMID: 34079580 PMCID: PMC8165483 DOI: 10.3389/fgene.2021.644484
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Sequence ACCACAACAACAACACACCC and its complementary sequence.
FIGURE 2Sequence a: AACAACCTCCACACCGAACA and b: TGGTGTTGCTGGTGTAGGTT.
FIGURE 3Flow chart of ICW.
FIGURE 4Pseudo-code of the ICW optimization algorithm.
Result of benchmark functions (F1–F13) with 30 dimensions.
| ID | Metric | ICW | HSWOA | WOA | GA | PSO | FPA | GWO | FA | MFO | TLBO | DE |
| F1 | AVG | 2.71E−91 | 1.41E-03 | 1.03E+03 | 1.83E+04 | 2.01E+03 | 1.18E−27 | 7.11E−03 | 1.01E+03 | 2.17E−89 | 1.33E−03 | |
| STD | 0.00E+00 | 1.24E−90 | 4.91E-30 | 5.79E+02 | 3.01E+03 | 5.60E+02 | 1.47E−27 | 3.21E−03 | 3.05E+03 | 3.14E−89 | 5.92E−04 | |
| F2 | AVG | 7.03E−58 | 1.06E-21 | 2.47E+01 | 3.58E+02 | 3.22E+01 | 9.71E−17 | 4.34E−01 | 3.19E+01 | 2.77E−45 | 6.83E−03 | |
| STD | 0.00E+00 | 2.94E−57 | 2.93E-21 | 5.68E+00 | 1.35E+03 | 5.55E+00 | 5.60E−17 | 1.84E−01 | 2.06E+01 | 3.11E−45 | 2.06E−03 | |
| F3 | AVG | 1.11E+04 | 5.39E-07 | 2.65E+04 | 4.05E+04 | 1.41E+03 | 5.12E−05 | 1.66E+03 | 2.43E+04 | 3.91E−18 | 3.97E+04 | |
| STD | 0.00E+00 | 4.54E+03 | 2.93E-06 | 3.44E+03 | 8.21E+03 | 5.59E+02 | 2.03E−04 | 6.72E+02 | 1.41E+04 | 8.04E−18 | 5.37E+03 | |
| F4 | AVG | 3.22E+01 | 0.07E+00 | 5.17E+01 | 4.39E+01 | 2.38E+01 | 1.24E−06 | 1.11E−01 | 7.00E+01 | 1.68E−36 | 1.15E+01 | |
| STD | 0.00E+00 | 0.07E+01 | 0.39E+00 | 1.05E+01 | 3.64E+00 | 2.77E+00 | 1.94E−06 | 4.75E−02 | 7.06E+00 | 1.47E−36 | 2.37E+00 | |
| F5 | AVG | 2.76E+01 | 2.78E+01 | 1.95E+04 | 1.96E+07 | 3.17E+05 | 2.70E+01 | 7.97E+01 | 7.35E+03 | 2.54E+01 | 1.06E+02 | |
| STD | 0.25E0+00 | 0.58E+00 | 0.76E+00 | 1.31E+04 | 6.25E+06 | 1.75E+05 | 7.78E−01 | 7.39E+01 | 2.26E+04 | 4.26E−01 | 1.01E+02 | |
| F6 | AVG | 0.11E+00 | 0.33E+00 | 3.11E+00 | 9.01E+02 | 1.87E+04 | 1.70E+03 | 8.44E−01 | 6.94E−03 | 2.68E+03 | 3.29E−05 | 1.44E−03 |
| STD | 0.03E+00 | 0.18E+04 | 0.53E+00 | 2.84E+02 | 2.92E+03 | 3.13E+02 | 3.18E−01 | 3.61E−03 | 5.84E+03 | 8.65E−05 | 5.38E−04 | |
| F7 | AVG | 1.29E−03 | 1.42E-03 | 1.91E−01 | 1.07E+01 | 3.41E−01 | 1.70E−03 | 6.62E−02 | 4.50E+00 | 1.16E−03 | 5.24E−02 | |
| STD | 6.37E-05 | 1.20E−03 | 1.14E−03 | 1.50E−01 | 3.05E+00 | 1.10E−01 | 1.06E−03 | 4.23E−02 | 9.21E+00 | 3.63E−04 | 1.37E−02 | |
| F8 | AVG | −1.11E+04 | −5.08E+03 | − | −3.86E+03 | −6.45E+03 | −5.97E+03 | −5.85E+03 | −8.48E+03 | −7.76E+03 | −6.82E+03 | |
| STD | 5.26E+02 | 1.50E+03 | 6.95E+02 | 4.51E+00 | 2.49E+02 | 3.03E+02 | 7.10E+02 | 1.16E+03 | 7.98E+02 | 1.04E+03 | 3.94E+02 | |
| F9 | AVG | 9.01E+00 | 0.00E+00 | 9.04E+00 | 2.87E+02 | 1.82E+02 | 2.19E+00 | 3.82E+01 | 1.59E+02 | 1.40E+01 | 1.58E+02 | |
| STD | 0.00E+00 | 1.44E+01 | 0.00E+00 | 4.58E+00 | 1.95E+01 | 1.24E+01 | 3.69E+00 | 1.12E+01 | 3.21E+01 | 5.45E+00 | 1.17E+01 | |
| F10 | AVG | 2.78E−15 | 7.40E+00 | 1.36E+01 | 1.75E+01 | 7.14E+00 | 1.03E−13 | 4.58E−02 | 1.74E+01 | 6.45E−15 | 1.21E−02 | |
| STD | 0.00E+00 | 1.80E−15 | 9.89E+00 | 1.51E+00 | 3.67E−01 | 1.08E+00 | 1.70E−14 | 1.20E−02 | 4.95E+00 | 1.79E−15 | 3.30E−03 | |
| F11 | AVG | 2.89E-04 | 1.01E+01 | 1.70E+02 | 1.73E+01 | 4.76E−03 | 4.23E−03 | 3.10E+01 | 3.52E−02 | |||
| STD | 0.00E+00 | 0.00E+00 | 1.58E+03 | 2.43E+00 | 3.17E+01 | 3.63E+00 | 8.57E−03 | 1.29E−03 | 5.94E+01 | 0.00E+00 | 7.20E−02 | |
| F12 | AVG | 5.06E-03 | 6.90E−00 | 3.39E-01 | 4.77E+00 | 1.51E+07 | 3.05E+02 | 4.83E−02 | 3.13E−04 | 2.46E+02 | 2.25E−03 | |
| STD | 2.54E-03 | 7.06E−00 | 0.21E+00 | 1.56E+00 | 9.88E+06 | 1.04E+03 | 2.12E−02 | 1.76E−04 | 1.21E+03 | 7.45E−06 | 1.70E−03 | |
| F13 | AVG | 0.10E-00 | 4.52E−00 | 1.88E+00 | 1.52E+01 | 5.73E+07 | 9.59E+04 | 5.96E−01 | 2.73E+07 | 7.89E−02 | 9.12E−03 | |
| STD | 9.40E-02 | 9.14E−00 | 3.66E+01 | 4.52E+00 | 2.68E+07 | 1.46E+05 | 2.23E−01 | 9.62E−04 | 1.04E+08 | 8.78E−02 | 1.16E−02 |
Results of benchmark functions (F14–F23) with 30 dimensions.
| ID | Metric | ICW | HSWOA | WOA | GA | PSO | FPA | GWO | FA | MFO | TLBO | DE |
| F14 | AVG | 3.73E-00 | 2.18E−00 | 2.11E+00 | 1.39E+00 | 9.98E−01 | 4.17E+00 | 3.51E+00 | 2.74E+00 | 1.23E+00 | ||
| STD | 4.17E-00 | 2.11E−00 | 2.49E+00 | 4.52E−16 | 4.60E−01 | 2.00E−04 | 3.61E+00 | 2.16E+00 | 1.82E+00 | 4.52E−16 | 9.23E−01 | |
| F15 | AVG | 5.25E−04 | 5.72E−04 | 3.33E−02 | 1.61E−03 | 6.88E−04 | 6.24E−03 | 1.01E−03 | 2.35E−03 | 1.03E−03 | 5.63E−04 | |
| STD | 1.30E-04 | 2.54E−04 | 3.24E−04 | 2.70E−02 | 4.60E−04 | 1.55E−04 | 1.25E−02 | 4.01E−04 | 4.92E−03 | 3.66E−03 | 2.81E−04 | |
| F16 | AVG | −1.03E+00 | − | −3.78E−01 | − | − | − | − | − | − | − | |
| STD | 1.94E-10 | 1.32E−08 | 4.20E−07 | 3.42E−01 | 2.95E−03 | 6.78E−16 | 6.78E−16 | 6.78E−16 | 6.78E−16 | 6.78E−16 | 6.78E−16 | |
| F17 | AVG | 3.97E−01 | 3.97E−01 | 5.24E−01 | 4.00E−01 | |||||||
| STD | 8.20E-07 | 1.19E−07 | 2.70E−05 | 6.06E−02 | 1.39E−03 | 1.69E−16 | 1.69E−16 | 1.69E−16 | 1.69E−16 | 1.69E−16 | 1.69E−16 | |
| F18 | AVG | 3.10E+00 | ||||||||||
| STD | 1.65E-14 | 5.98E-07 | 4.22E-15 | 0.00E+00 | 7.60E−02 | 0.00E+00 | 4.07E−05 | 0.00E+00 | 0.00E+00 | 0.00E+00 | 0.00E+00 | |
| F19 | AVG | − | −3.85E+00 | −3.42E+00 | − | − | − | − | − | − | − | |
| STD | 6.85E-06 | 3.19E−03 | 2.70E−03 | 3.03E−01 | 1.24E−03 | 3.16E−15 | 3.14E−03 | 3.16E−15 | 1.44E−03 | 3.16E−15 | 3.16E−15 | |
| F20 | AVG | −3.27 | −2.98105 | −1.61351 | −3.11088 | − | −3.25866 | −3.28105 | −3.23509 | −3.24362 | −3.27048 | |
| STD | 0.057049 | 0.060296 | 0.376653 | 0.46049 | 0.029126 | 0.019514 | 0.064305 | 0.063635 | 0.064223 | 0.15125 | 0.058919 | |
| F21 | AVG | −8.7129 | −7.04918 | −6.66177 | −4.14764 | −5.21514 | −8.64121 | −7.67362 | −6.8859 | −8.64525 | −9.64796 | |
| STD | 0.011404 | 2.4655 | 3.629551 | 3.732521 | 0.919578 | 0.008154 | 2.563356 | 3.50697 | 3.18186 | 1.76521 | 1.51572 | |
| F22 | AVG | −8.1948 | −8.18178 | −5.58399 | −6.01045 | −5.34373 | −10.4014 | −9.63827 | −8.26492 | −10.2251 | −9.74807 | |
| STD | 0.015752 | 2.9941 | 3.29202 | 2.605837 | 1.962628 | 0.053685 | 0.000678 | 2.293901 | 3.076809 | 0.007265 | 1.987703 | |
| F23 | AVG | −8.7416 | −9.34238 | −4.69882 | −4.72192 | −5.29437 | −10.0836 | −9.75489 | −7.65923 | −10.0752 | − | |
| STD | 0.011063 | 2.8113 | 2.414737 | 3.256702 | 1.742618 | 0.356377 | 1.721889 | 2.345487 | 3.576927 | 1.696222 | 8.88E−15 |
FIGURE 5Test functions of functions F7, F12, and F20.
Comparing sequences from NACS/Seq, DEPT, MO-ABC, pMO-ABC, and HSWOA.
| Seq. | C | P | H | S | Tm | GC% | Seq. | C | P | H | S | Tm | GC% |
| CCTCTCCATCCTTATCCTTC | 0 | 0 | 35 | 66 | 60.88 | 50 | CTCTCATCTCTCCGTTCTTC | 0 | 0 | 37 | 58 | 61.43 | 50 |
| CCAGACCAATACAGAACCAC | 0 | 0 | 50 | 57 | 62.54 | 50 | TATCCTGTGGTGTCCTTCCT | 0 | 0 | 45 | 57 | 64.46 | 50 |
| CTCCTCTTCTCCTTCTTCTC | 0 | 0 | 28 | 82 | 60.72 | 50 | GTATTCCAAGCGTCCGTGTT | 0 | 0 | 55 | 49 | 65.29 | 50 |
| CACAACCAATCACTCTCACC | 0 | 0 | 38 | 65 | 63.10 | 50 | TCTCTTACGTTGGTTGGCTG | 0 | 0 | 51 | 53 | 64.63 | 50 |
| CCACCTGACCGACTAATAAC | 0 | 0 | 48 | 61 | 62.02 | 50 | CTCTTCATCCACCTCTTCTC | 0 | 0 | 43 | 58 | 61.38 | 50 |
| CCAACCACTCTTCTACAACC | 0 | 0 | 37 | 72 | 62.47 | 50 | ATTCTGTTCCGTTGCGTGTC | 0 | 0 | 52 | 56 | 65.82 | 50 |
| CCTTCTTCTCTCTCTCTCTC | 0 | 0 | 30 | 75 | 60.13 | 50 | AAACCTCCACCAACACACCA | 9 | 0 | 55 | 43 | 66.71 | 50 |
| CCATTCCTTAACCTCTCTCC | 0 | 0 | 59 | 39 | 61.39 | 50 | GTAAGGAAGGCAAGGCAGAA | 0 | 0 | 42 | 54 | 64.70 | 50 |
| ACACACACACACACACACAC | 0 | 0 | 27 | 49 | 65.85 | 50 | GTTGGTGGTTGTTGGTGGTT | 0 | 0 | 46 | 36 | 66.00 | 50 |
| GGAAGGAGGAGGAAGAAGAA | 0 | 0 | 37 | 45 | 62.83 | 50 | GGAGACGGAATGGAAGAGTA | 0 | 0 | 44 | 55 | 62.93 | 50 |
| GAGAGAAGAGAAGAGGCCAA | 0 | 0 | 39 | 53 | 63.17 | 50 | CCATTCTTCTCTTCTCTCCC | 9 | 0 | 67 | 22 | 61.39 | 50 |
| ACCACAACAACAACACACCC | 9 | 0 | 29 | 55 | 65.96 | 50 | AGGAGAGGAGAGGAGGAAAA | 16 | 0 | 31 | 53 | 63.80 | 50 |
| GGAGCAATGGAGAATAAGGG | 9 | 0 | 48 | 47 | 62.42 | 50 | ATAAGAGAGAGAGAGAGGGG | 16 | 0 | 34 | 51 | 61.11 | 50 |
| CCATACCAGCCAACCGAAAA | 16 | 0 | 42 | 56 | 65.33 | 50 | GAGCCAACAGCCAACCAAAA | 16 | 0 | 48 | 45 | 66.40 | 50 |
| GGTGGTATTGGTGGTATTGG | 0 | 0 | 47 | 47 | 62.64 | 50 | CTCGTCTAACCTTCTTCAGC | 0 | 0 | 63 | 51 | 62.28 | 50 |
| CTTCTCTTCTCTTGCCGCTT | 0 | 0 | 39 | 56 | 64.70 | 50 | CTGTGTGGAATGCAAGGATG | 0 | 0 | 64 | 48 | 63.82 | 50 |
| CTCTCTCTCTCACTCTCTCA | 0 | 0 | 41 | 48 | 61.32 | 50 | CGAGCGTAGTGTAGTCATCA | 0 | 0 | 63 | 69 | 63.56 | 50 |
| AACAACCTCCACACCGAACA | 0 | 0 | 62 | 32 | 66.69 | 50 | AGTTACAGGACACCACCGAT | 0 | 0 | 65 | 51 | 66.39 | 50 |
| TGTGGTTGGTTAGTCGGTTG | 0 | 0 | 46 | 49 | 63.80 | 50 | CAGTAGCAGTCATAACGAGC | 0 | 0 | 64 | 56 | 62.69 | 50 |
| TGGTGTTGCTGGTGTAGGTT | 0 | 0 | 48 | 51 | 66.46 | 50 | GCATAGCACATCGTAGCGTA | 0 | 0 | 59 | 54 | 64.60 | 50 |
| CTCTCATTCCTTCTTACCCC | 16 | 0 | 43 | 51 | 61.40 | 50 | TGGACCTTGAGAGTGGAGAT | 0 | 0 | 62 | 50 | 64.44 | 50 |
FIGURE 6Average fitness of continuity and hairpin for ICW and other algorithms.
FIGURE 9Variance of different algorithms.
Comparing the melting temperatures of the various algorithm sequences.
| ICW | NACST/Seq | DEPT | MO-ABC | pMO-ABC | HSWOA | |
| Var | 1.24 | 4.33 | 3.37 | 4.37 | 4.91 | 1.86 |
FIGURE 7Average fitness of H-measure and similarity for ICW and other algorithms.
FIGURE 8Value of i for different algorithms.
FIGURE 10(A) The structure of sequence ACAGTCGTTAAATGGGAGTC in NUPACK. (B) The structure of sequence CCTCTCCATCCTTATCCTTC in NUPACK.