| Literature DB >> 27689089 |
Pavani Yashodha De Silva1, Gamage Upeksha Ganegoda1.
Abstract
With the exponential growth in the capacity of information generated and the emerging need for data to be stored for prolonged period of time, there emerges a need for a storage medium with high capacity, high storage density, and possibility to withstand extreme environmental conditions. DNA emerges as the prospective medium for data storage with its striking features. Diverse encoding models for reading and writing data onto DNA, codes for encrypting data which addresses issues of error generation, and approaches for developing codons and storage styles have been developed over the recent past. DNA has been identified as a potential medium for secret writing, which achieves the way towards DNA cryptography and stenography. DNA utilized as an organic memory device along with big data storage and analytics in DNA has paved the way towards DNA computing for solving computational problems. This paper critically analyzes the various methods used for encoding and encrypting data onto DNA while identifying the advantages and capability of every scheme to overcome the drawbacks identified priorly. Cryptography and stenography techniques have been analyzed in a critical approach while identifying the limitations of each method. This paper also identifies the advantages and limitations of DNA as a memory device and memory applications.Entities:
Year: 2016 PMID: 27689089 PMCID: PMC5027317 DOI: 10.1155/2016/8072463
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Schematic representation of storing data on DNA.
Figure 2Block diagram of encrypting a message using DNA hybridization technique.
Figure 3Block diagram of decrypting a message using DNA hybridization.
Figure 4Block diagram of encryption process using DNA Chromosome Indexing.
Figure 5Block diagram of decryption process using DNA Chromosome Indexing.
Comparison of encoding models.
| Encoding model | Advantages | Disadvantages |
|---|---|---|
| Microvenus project | Laid the foundation for storing abiotic information in DNA | Being inaccurate and not distinctively decodable |
|
| ||
| Genesis project | Laid the research work to explore the intricate relationship between biology, belief systems, information technology, dialogical interaction, ethics, and the Internet | Inaccurate as the original sentence was altered during mutation at the presence of ultraviolet light |
|
| ||
| PCR based encoding models | High security because of the size of the microdots and even if an adversary identifies the microdot it would be extremely difficult without the knowledge of the primer sequence | Insertion of errors in template region making it unmanageable to recover the encoded data |
|
| ||
| Alignment based encoding models | Independent of Polymerase Chain Reaction | Multiplication of cassettes leads to redundant volumes |
|
| ||
| Rewritable and random access based DNA storage system | Random access to data blocks of DNA which promotes nonlinear access | High cost |
|
| ||
| Next generation digital information storage | Employment of one-bit representation per base | Cost is unfeasible |
|
| ||
| Encoding scheme for small text files | High volume data storage density | Have not proceeded in implementing the biological protocols to insert the sequence in genome of bacteria |
Comparison of encryption codes.
| Huffman code | Comma code | Alternate code | Comma-free code | Improved Huffman code | Perfect genetic code | |
|---|---|---|---|---|---|---|
| Base-to-character ratio | ~2.2 | ~6 | ~6 | Variable | ~3.5 | Variable |
|
| ||||||
| Economical | Very economical | No | No | Yes | ||
|
| ||||||
| Long term storage | No | Yes | Yes | |||
|
| ||||||
| Error correcting | Yes | Yes | Yes | |||
|
| ||||||
| Protection from mutation | Yes | Yes | No | |||
|
| ||||||
| Isothermal melting temperature | Yes | Yes | ||||
|
| ||||||
| Synthetic DNA | Yes | Yes | ||||
|
| ||||||
| Special features | Uses the principle of varying the length of symbols used for representation based on the recurrence of a character | Consists of fixed length reading frames of 6 bases including the comma, G | Fixed length base frames without commas to separate the frames | Stores text, images, and music in DNA | 70% more efficient than the other codes due to the use of a variable code length | |
Comparison of the performance of cDNA secret writing techniques.
| Features | DNA hybridization technique | Chromosome DNA indexing |
|---|---|---|
| Running time | Less | More |
|
| ||
| Size of the key | Large depending on the input | Large independent of the input |
|
| ||
| Strength of the algorithm | High based on the type, size, and the randomness of the key | High based on the key type and key size and the randomly produced index |
|
| ||
| Memory space | Needs more memory space for storing the lengthy key and performing the operations involving it | More than the hybridization type because of the huge key length and the index array involved |
|
| ||
| Cost | High | High |
|
| ||
| Longevity | Believed to withstand any duration | Believed to withstand any duration |