| Literature DB >> 36097552 |
Shaorong Fan1,2, Wenju Sun1, Ligang Fan1,2,3, Nan Wu1,2, Wei Sun1, Haiqian Ma1, Siyuan Chen4, Zitong Li4, Yu Li4, Jilin Zhang2, Jian Yan1,2,3.
Abstract
The binding of SARS-CoV-2 nucleocapsid (N) protein to both the 5'- and 3'-ends of genomic RNA has different implications arising from its binding to the central region during virion assembly. However, the mechanism underlying selective binding remains unknown. Herein, we performed the high-throughput RNA-SELEX (HTR-SELEX) to determine the RNA-binding specificity of the N proteins of various SARS-CoV-2 variants as well as other β-coronaviruses and showed that N proteins could bind two unrelated sequences, both of which were highly conserved across all variants and species. Interestingly, both sequences are virtually absent from the human transcriptome; however, they exhibit a highly enriched, mutually complementary distribution in the coronavirus genome, highlighting their varied functions in genome packaging. Our results provide mechanistic insights into viral genome packaging, thereby increasing the feasibility of developing drugs with broad-spectrum anti-coronavirus activity by targeting RNA binding by N proteins.Entities:
Keywords: Conserved; Coronavirus; Nucleocapsid protein; RNA binding specificity; SARS-CoV-2; Virion assembly
Year: 2022 PMID: 36097552 PMCID: PMC9454191 DOI: 10.1016/j.csbj.2022.09.007
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1RNA binding specificity of beta-coronavirus. a. Network analysis reveals the similarity of RNA binding specificity of RNA binding proteins, including all human RBPs with available motifs and nucleocapsid proteins of different beta-coronaviruses. Note that the subnetwork of coronavirus nucleocapsid proteins is disconnected from other subnetworks and zoomed in for clarification. Diamonds indicate individual motifs, and circles denote RBPs. The RBP is connected to its own binding motif. The edge between motifs is drawn if the SSTAT similarity score >1.0E−5. For details, please see Supplementary File S1. b. Network analysis of the similarity of RNA binding specificity of N proteins in different variants of SARS-CoV-2. The N protein of different SARS-CoV-2 variants is connected to its own binding motif. The edge between motifs is drawn if the SSTAT similarity score > 1.0E-5. c. The comparison of binding site density of N protein on human transcriptome and viral genome. Left, binding density of motif 1. Right, binding density of motif 2. The ellipse illustrates binding density on respective viral genome, and the box plot represents binding density on human transcriptome. Green, BAT-COV; Red, MERS-COV; Blue, SARS-COV; Yellow, SARA-COV-2. d. The binding site density (KDE, bandwidth = 0.1) of N protein along SARS-CoV-2 genome. Upper, motif 1. Lower, motif 2. A binding site is significantly detected when the sequence matches the motif (P < 0.05). e. The binding site density (KDE, bandwidth = 0.1) of different N proteins along the corresponding viral genomes. Upper, motif 1. Lower, motif 2. A binding site is significantly detected when the sequence matches the motif (P < 0.05). Smoothened color curves represent different viruses. f. The cartoon illustrates the model of the process of viral genome packaging and virion particle assembly. Upper, the structure of N protein, including 5′-arm, RNA binding domain (RBD1), intrinsically disordered linker region (IRD), dimerization domain (RBD2) and 3′-arm. Lower, model of the viral genome packaging and virion assembly. In step 1, N protein forms homodimer and bind to 5′ and 3′UTRs. IDRs between different N proteins promote formation of LLPS. Subsequently in step 2, viral RNA full of motif 2 is exposed to more N proteins. Even binding of N proteins to motif 2 increases the solubility that prevents intermingling between large RNA molecules and dissolves the condensates. Finally, in step 3, the dissolved LLPS releases the packaged RNA genome and makes the N protein accessible to interact with other viral proteins, such as M and E proteins, to complete the virion assembly. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)