| Literature DB >> 33880685 |
Abstract
Of Chargaff's four "rules" on DNA base frequencies, the functional interpretation of his second parity rule (PR2) is the most contentious. Thermophile base compositions (GC%) were taken by Galtier and Lobry (1997) as favoring Sueoka's neutral PR2 hypothesis over Forsdyke's selective PR2 hypothesis, namely that mutations improving local within-species recombination efficiency had generated a genome-wide potential for the strands of duplex DNA to separate and initiate recombination through the "kissing" of the tips of stem-loops. However, following Chargaff's GC rule, base composition mainly reflects a species-specific, genome-wide, evolutionary pressure. GC% could not have consistently followed the dictates of temperature, since it plays fundamental roles in both sustaining species integrity and, through primarily neutral genome-wide mutation, fostering speciation. Evidence for a local within-species recombination-initiating role of base order was obtained with a novel technology that masked the contribution of base composition to nucleic acid folding energy. Forsdyke's results were consistent with his PR2 hypothesis, appeared to resolve some root problems in biology and provided a theoretical underpinning for alignment-free taxonomic analyses using relative oligonucleotide frequencies (k-mer analysis). Moreover, consistent with Chargaff's cluster rule, discovery of the thermoadaptive role of the "purine-loading" of open reading frames made less tenable the Galtier-Lobry anti-selectionist arguments.Entities:
Keywords: Base composition; Purine-loading; Speciation; Stem-loops; Taxonomy; Thermoadaptation
Mesh:
Year: 2021 PMID: 33880685 PMCID: PMC8057000 DOI: 10.1007/s10709-021-00119-5
Source DB: PubMed Journal: Genetica ISSN: 0016-6707 Impact factor: 1.633
Fig. 1Determination of the base order-dependent component of stem-loop (fold) potential by subtracting the base composition-dependent component from total stem-loop potential. A natural sequence (horizontal red line at left) when optimally folded (vertical arrow at left) is calculated to have a certain stability (e. g. − 30 kcal/mol). Its base order is then randomized to produce ten shuffled sequences that share only their base compositions with the originating natural sequence. These are then optimally folded to obtain corresponding stability values. Idiosyncrasies, due to the base order that each randomized sequence has acquired due to the shuffling, are averaged out (at right) to determine the contribution of base composition to the total fold potential. The contribution of base order is determined by subtraction. This figure is with permission
reproduced from Forsdyke (2016)