| Literature DB >> 31139727 |
Brian J Cafferty1, Alexei S Ten2, Michael J Fink1, Scott Morey2, Daniel J Preston1, Milan Mrksich2,2, George M Whitesides1,3,4.
Abstract
Although information is ubiquitous, and its technology arguably among the highest that humankind has produced, its very ubiquity has posed new types of problems. Three that involve storage of information (rather than computation) include its usage of energy, the robustness of stored information over long times, and its ability to resist corruption through tampering. The difficulty in solving these problems using present methods has stimulated interest in the possibilities available through fundamentally different strategies, including storage of information in molecules. Here we show that storage of information in mixtures of readily available, stable, low-molecular-weight molecules offers new approaches to this problem. This procedure uses a common, small set of molecules (here, 32 oligopeptides) to write binary information. It minimizes the time and difficulty of synthesis of new molecules. It also circumvents the challenges of encoding and reading messages in linear macromolecules. We have encoded, written, stored, and read a total of approximately 400 kilobits (both text and images), coded as mixtures of molecules, with greater than 99% recovery of information, written at an average rate of 8 bits/s, and read at a rate of 20 bits/s. This demonstration indicates that organic and analytical chemistry offer many new strategies and capabilities to problems in long-term, zero-energy, robust information storage.Entities:
Year: 2019 PMID: 31139727 PMCID: PMC6535762 DOI: 10.1021/acscentsci.9b00210
Source DB: PubMed Journal: ACS Cent Sci ISSN: 2374-7943 Impact factor: 14.553
Correspondence of an Alphanumeric Character (the Letter “K”) Encoded in ASCII in Binary, and in Four Molbits as Oligopeptides
Figure 1Design of oligopeptide molbits and spectrum of all 32 molbits in a single mixture. (a) Oligopeptide molbits contain an information region that consists of one to five amino acids (chosen from 2-aminobutyric acid, alanine, arginine, glycine, leucine, phenylalanine, proline, tyrosine, valine), which provides a distinguishable mass-to-charge ratio for each peptide (a difference of 6–42 a.m.u.), a charge residue (trimethyl lysine), and an anchor residue (terminal cysteine). The N-terminus is capped by an acetyl group for chemical stability. (b) Schematic showing an example of the immobilization of two oligopeptides (corresponding to molbit 1 and molbit 2 in panel c) to a maleimide-terminated monolayer for storage. Prior to conjugation of oligopeptide(s), the monolayer consists of a mixture of triethyleneglycol undecanethiol (EG3-capped alkanethiol) terminating in either an alcohol or maleimide. (c) A spectrum of a SAMDI spot containing all 32 molbits; the intensity was normalized to the highest signal. Oligopeptides were grouped by molecular weight into sets of eight, representing a byte of information (4 bytes total). The single-letter codes of residues in the information region are listed above each peak in the mass spectrum (see Table S1 for a full list of peptide sequence and corresponding masses, and Figure S1 for a detailed spectrum). The observed masses are for mixed disulfides derived from a EG3-capped alkanethiol and the oligopeptide conjugated to a maleimide-terminated EG3-capped alkanethiol.
Figure 2Overview of “writing” using mixtures of molbits and “reading” using SAMDI MS. “Writing” is performed by first translating information (here, the alphanumeric characters of Feynman’s lecture “There is plenty of room at the bottom”) into binary. Binary information is converted to oligopeptides immobilized on a self-assembled monolayer, for storage. A MALDI-TOF mass spectrometer analyzes (“reads”) these plates. A program decodes the information in the spectra and generates a bitstring that is used to regenerate the original text. Recovery of information was determined by (number of correctly identified molbits)/(total number of molbits) × 100%.
Figure 3JPEG images encoded using mixtures of molbits. (a) Image of Claude Shannon, “the father of information theory,” encoded in 6094 molbytes, and (b) image of the woodblock print entitled The Great Wave off Kanagawa, by Hokusai, encoded in 5953 molbytes. Each image was encoded on a single plate and decoded with zero errors (100% recovery). Images were encoded as JPEG files as described in the Supporting Information. See Figure S2 for an example of the resolution of an encoded JPEG file. Photograph of Claude Shannon by Alfred Eisenstaedt/The LIFE Picture Collection/Getty.