| Literature DB >> 30717360 |
Abstract
Extant life uses two kinds of linear biopolymers that mutually control their own production, as well as the cellular metabolism and the production and homeostatic maintenance of other biopolymers. Nucleic acids are linear polymers composed of a relatively low structural variety of monomeric residues, and thus a low diversity per accessed volume. Proteins are more compact linear polymers that dispose of a huge compositional diversity even at the monomeric level, and thus bear a much higher catalytic potential. The fine-grained diversity of proteins makes an unambiguous information transfer from protein templates too error-prone, so they need to be resynthesized in every generation. But proteins can catalyse both their own reproduction as well as the efficient and faithful replication of nucleic acids, which resolves in a most straightforward way an issue termed "Eigen's paradox". Here the importance of the existence of both kinds of linear biopolymers is discussed in the context of the emergence of cellular life, be it for the historic orgin of life on Earth, on some other habitable planet, or in the test tube. An immediate consequence of this analysis is the necessity for translation to appear early during the evolution of life.Entities:
Keywords: digit multiplicity; diversity; encoding; function; information; translation; transmission
Year: 2019 PMID: 30717360 PMCID: PMC6463190 DOI: 10.3390/life9010017
Source DB: PubMed Journal: Life (Basel) ISSN: 2075-1729
String lengths M, binary string compression factors ‡ and approximate SI values for strings of different integer-digit multiplicities # bearing approximately the same storage capacity N ≈ 1012 §.
| Digit Multiplicity # | String Length | String Compression ‡ | Shannon Information | Storage Capacity | ||
|---|---|---|---|---|---|---|
|
|
|
| ||||
| 2 | 40 | 1.00 | 40 | 240 ≈ 1012 | ||
| 3 | 26 | 1.54 | ≈ 41.2 | 326 ≈ 1012 | ||
| 4 | 20 | 2.00 | 40 | 420 ≈ 1012 | ||
| 5 | 17-18 | 2.35–2.22 | ≈ 39.5–41.8 | 517–18 ≈ 1011–12 | ||
| 8 | 13-14 | 3.08–2.85 | ≈ 39.0–42.0 | 813–14 ≈ 1011–12 | ||
| 9 | 12-13 | 3.33–3.08 | ≈ 38.0–41.2 | 912–13 ≈ 1011–12 | ||
| 16 | 10 | 4.00 | 40 | 1610 ≈ 1012 | ||
| 20 | 9-10 | 4.44–4.00 | ≈ 38.9–43.2 | 209–10 ≈ 1011–13 | ||
| 27 | 9 | 4.44 | ≈ 42.8 | 279 ≈ 1012 | ||
| 64 | 6-7 | 6.67–5.71 | ≈ 36.0–42.0 | 646–7 ≈ 1011–12 | ||
| 81 | 6 | 6.67 | ≈ 38.0 | 816 ≈ 1011–12 | ||
| 256 | 5 | 8.00 | 40 | 2565 ≈ 1012 | ||
# b’ = b2, b3, b4 (three different codon lengths); b’ = reduced through redundancy from higher b’, cf. Section 3. ‡ cf. complexity Ψ = bits per monomer (e.g. per nucleotide, codon, amino acid) [9]. § precision within ±1 order of magnitude.
Figure 1Template-copying and translating an exemplary string of total composed of b different digits transmitting, according to Equation (3), a given amount of Shannon Information SI(M) from one parental string to others. Upper part: Transcription/Replication. A single complement-copying event (vertical arrow) is realised during, both, transcription (or reverse transcription) and the first step of replication; the second (horizontal) step applies to replication only. Complement-copying rules (small round-edged shaded frames) by virtue of a minimal requirement for self-complementary digits, i.e., bits {0,1}, trits {0,1,2} and quits {0,1,2,3}. Lower part: Translation of consecutive blocks (B = bytes, T = trytes, Q = quytes) of 2–4 digits (A bits, B trits, C quits). The reading frame (underligned coloured digits) is translated into products (strings of letters) of a condensed residue number M’, higher digit multiplicity (diversity) b’ = b2–4, and unchanged SI’. Frameshifts 1–2 for 2-digit blocks, 1–3 for 3-digit blocks and 1–4 for 4-digit blocks generate alternative translation products of the same length M’, diversity b’ and SI’ but radically different sequences. (A) Binary digit (bit) strings, replicated and translated from 2–bit bytes (2B), 3–bit bytes (3B) and 4–bit bytes (4B). (B) Ternary digit (trit) strings, replicated and translated from 2–trit trytes (2T), 3–trit trytes (3T) and 4–trit trytes (4T). (C) Quaternary digit (quit) strings, replicated and translated from 2–quit quytes (2Q), 3–quit quytes (3Q) and 4–trit quytes (4Q, code and translation products not shown). Shadowed large frame: the current natural (biotic) memory system are quit strings being translated from reading frames of consecutive 3Q utilizing b’ = 64 codons that are reduced, mainly for fidelity reasons, to b’ = 20 effectively translated digits, viz. the “universal genetic code” for 20 different amino acids and a stop signal (i.e., lack of amino acid). Reproduced and modified from The Handbook of Astrobiology; published by CRC Press, 2019 © Taylor & Francis [21].
Figure 2Translation of parts of low-digit memory polymers into high-digit functional polymers as a means to achieve inheritable exponential population growth. The expectedly much higher catalytic potential of high-digit polymers (gene products) allows for more efficient use of the templating ability of the low-digit memory polymers (containing genes), both in terms of copying fidelity Q = (1–µ), where µ denotes mutation probability of every digit, and population growth order p, where 0 < p < 1 defines the parabolic growth regime and p = 1 the exponential growth regime, in populations of replicators x that grow in time t, as in dx = k, where each replicator population i replicates with an apparent replication rate konstant k