| Literature DB >> 36178491 |
Mahsa Alemrajabi1, Ksenia Macias Calix2, Raquel Assis3,4.
Abstract
Epistasis is an evolutionary phenomenon whereby the fitness effect of a mutation depends on the genetic background in which it arises. A key source of epistasis in an RNA molecule is its secondary structure, which contains functionally important topological motifs held together by hydrogen bonds between Watson-Crick (WC) base pairs. Here we study epistasis in the secondary structure of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by examining properties of derived alleles arising from substitution mutations at ancestral WC base-paired and unpaired (UP) sites in 15 conserved topological motifs across the genome. We uncover fewer derived alleles and lower derived allele frequencies at WC than at UP sites, supporting the hypothesis that modifications to the secondary structure are often deleterious. At WC sites, we also find lower derived allele frequencies for mutations that abolish base pairing than for those that yield G·U "wobbles," illustrating that weak base pairing can partially preserve the integrity of the secondary structure. Last, we show that WC sites under the strongest epistatic constraint reside in a three-stemmed pseudoknot motif that plays an essential role in programmed ribosomal frameshifting, whereas those under the weakest epistatic constraint are located in 3' UTR motifs that regulate viral replication and pathogenicity. Our findings demonstrate the importance of epistasis in the evolution of the SARS-CoV-2 secondary structure, as well as highlight putative structural and functional targets of different forms of natural selection.Entities:
Keywords: Compensatory evolution; Coronavirus; Epistasis; SARS-CoV-2; Secondary structure
Year: 2022 PMID: 36178491 PMCID: PMC9523185 DOI: 10.1007/s00239-022-10073-1
Source DB: PubMed Journal: J Mol Evol ISSN: 0022-2844 Impact factor: 3.973
Fig. 1Compensatory evolution in a stem loop of a secondary structure. The ancestral stem loop (top) contains a WC base pair (blue). A mutation of one nucleotide (G) results in either UP nucleotides (middle left) or a weak G·U wobble base pair (middle right). In both scenarios, a compensatory mutation of the opposing nucleotide restores WC base pairing (bottom), though it may have more time to arise through the more stable G·U intermediate (bottom right) (Color figure online)
Fig. 2Derived allele frequencies at WC and UP sites. Distributions of derived allele frequencies for A) all ancestral WC and UP sites and B) WC → GU and WC → UP mutations. * (see “Methods” section for details)
Fig. 3Identification of WC sites under the strongest and weakest epistatic constraint. Log-transformed derived allele frequency spectrum for ancestral WC sites. Red and blue dashed lines show cutoffs used for selecting WC sites under the strongest and weakest epistatic constraint, respectively. Table S1 provides details for these sites, which are displayed in red and blue text (Color figure online)
Fig. 4Motifs containing WC sites under the strongest and weakest epistatic constraint. WC sites colored in red and blue correspond to those from Fig. 3 that are under the strongest and weakest epistatic constraint, respectively (Color figure online)