| Literature DB >> 33143071 |
Mariola Dutkiewicz1, Jakub Kuczynski1, Michal Jarzab1, Aleksandra Stachowiak1, Agata Swiatkowska1.
Abstract
Here we present a set of new structural elements formed within the open reading frame of the virus, which are highly probable, evolutionarily conserved and may interact with host proteins. This work focused on the coding regions of the CVB3 genome (particularly the V4-, V1-, 2C-, and 3D-coding regions), which, with the exception of the cis-acting replication element (CRE), have not yet been subjected to experimental analysis of their structures. The SHAPE technique, chemical modification with DMS and RNA cleavage with Pb2+, were performed in order to characterize the RNA structure. The experimental results were used to improve the computer prediction of the structural models, whereas a phylogenetic analysis was performed to check universality of the newly identified structural elements for twenty CVB3 genomes and 11 other enteroviruses. Some of the RNA motifs turned out to be conserved among different enteroviruses. We also observed that the 3'-terminal region of the genome tends to dimerize in a magnesium concentration-dependent manner. RNA affinity chromatography was used to confirm RNA-protein interactions hypothesized by database searches, leading to the discovery of several interactions, which may be important for virus propagation.Entities:
Keywords: CVB3; Coxsackie B3 virus; RNA motif; RNA secondary structure; RNA structural element; RNA structure of coding region; RNA virus; RNA–protein interaction; coxsackievirus B3; enterovirus
Mesh:
Substances:
Year: 2020 PMID: 33143071 PMCID: PMC7692623 DOI: 10.3390/v12111232
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Primers used in this study. F-used as forward primers; R-used as reverse primers, also 5′-fluorescently labeled with VIC, FAM, PET, or NED where indicated.
| No. | Primer Name | Nucleotide Sequence |
|---|---|---|
| 1 | 1F | 5′-TAATACGACTCACTATAGGAAACAGCCTGTGGGTTGATCC-3′ |
| 2 | 1R | 5′- CTAGCATTCAGCCTGGTCTC-3′ |
| 3 | 1–2F | 5′-TAATACGACTCACTATAGGTCTAATACAGACATGGTGCGAAG-3′ |
| 4 | 1–2R | 5′-TCTGCCATTGCACAGAGTCAAG-3′ |
| 5 | 4F | 5′-TAATACGACTCACTATAGGTGTTGCTTCAGATGAGTATACCG-3′ |
| 6 | 4R | 5′-AAATTCAGACCATCCGTCATAG-3′ |
| 7 | 6–7F | 5′-TAATACGACTCACTATAGGCGAGTTCCTGAACAGACTTAAAC-3′ |
| 8 | 6–7R | 5′-TAGAGAGTATCTGACCTGTGTTC-3′ |
| 10 | 8–9F | 5′-TAATACGACTCACTATAGGCAATTAACACCAGCAAGTTTCC-3′ |
| 11 | 8–9R | 5′-TTCCTTTCGCTACCTTCTCTATG-3′ |
| 12 | 9F | 5′-TAATACGACTCACTATAGGGTTTCCAGTCATCAACACACC-3′ |
| 13 | 9R | 5′-GGAGTTGCACAAGTAGTCAATG-3′ |
| 14 | 9–10F | 5′-TAATACGACTCACTATAGGGAACCTACCAATGGTGACTTATG-3′ |
| 15 | 9–10R | 5′-GATTCGTGTATGTCTTTCATGG-3′ |
| 16 | 10F | 5′-TAATACGACTCACTATAGGCTTGGATACACGCACAAAGAG-3′ |
| 17 | 10R | 5′-GCACCGAATGCGGAGAATTTAC-3′ |
| 18 | 10+(R) | 5′-TTTTTTTTTTTCCGCACCGAATGC-3′ |
Figure 1Scheme of coxsackievirus B3 genome organization with division into fragments, ca. 750 nt each. Fragments examined in this study are highlighted with colors corresponding to respective regions of the viral genome; UTR—untranslated region; VP1, VP2, VP3, VP4—capsid proteins; 2A, 2B, 2C, 3A, 3B, 3C, 3D—non-structural proteins.
Figure 2New RNA structural elements identified in the coding part of the Coxsackie B3 virus genome. Most probable secondary structural models of 21 new motifs with experimental mapping results depicted according to the key in the figure. The color of the letters corresponds to the probability values calculated by RNAstructure, according to the key in the figure. Motifs marked in red, orange, and yellow have the highest probability of occurrence in the analyzed RNA fragment and in the corresponding region of the viral genome, so they are considered structurally well-defined. Predicting secondary structure of partly overlapping fragments: F1 and F1–2.
Figure 3Secondary structure model of F10 RNA, without the last 120 nt of the known pseudoknot. The free energy of the structure is −286.4 kcal/mol. This model displays NMIA modification sites from the SHAPE reaction, DMS modification sites, RNase L digestion, and Pb2+ cleavages, according to the key. In addition, the most interesting structural motifs are marked with their names.
Figure 4(A) Consensus secondary structures generated by the RNAalifold program for particular RNA motifs identified in the coding part of the Coxsackie B3 virus genome, the “Nancy” strain and 19 other CVB3 strains/isolates. The colors indicate structural conservation according to the key in the figure. (B) Consensus secondary structures generated by RNAalifold program for particular RNA motifs identified in the coding part of the Coxsackie B3 virus genome, the “Nancy” strain and some of 11 other enterovirus strains we investigated. (CVA16-U05876.1|CAU05876 coxsackievirus A16 G-10; CVA21-D00538.1|CXA21CG Human coxsackievirus A21 (strain Coe); CVB1-M16560.1|CXA1G Coxsackievirus B1; CVB2-AF085363.1|Coxsackievirus B2 strain Ohio-1; CVB3-JX312064.1|Human coxsackievirus B3 strain Nancy; CVB3W-U57056.1|CXU57056 Coxsackievirus B3 Woodruff variant; CVB4-X05690.1|Coxsackievirus B4; CVB5-X67706.1|Coxsackievirus B5; PV1-K01392.1|POL3L37 Poliovirus P3/Leon/37 (type 3; PV2-X00595.1|Poliovirus type 2 genome (strain Sabin 2); PV3-V01150.1|Human poliovirus strain Sabin 1; ECH9-X84981.1|Echovirus 9 (strain Hill); E71-AF176044.1|Enterovirus 71.) The colors indicate structural conservation according to the key in the figure.
Figure 5Dimerization of the F10 RNA, the 3′-terminal region of the CVB3 genome. (A) Impact of ionic and buffer concentrations; (B) impact of divalent magnesium ions on dimerization process; (C) inhibition of homodimer formation in the presence of “antydimer” (DNA 18-mer), complementary to the F10 in the region of the potential dimerization site; and (D) computer predictions of the region responsible for F10-homodimer formation.
Features of the new RNA elements/motifs found in the CVB3 genome. The structure motifs were experimentally characterized with the indicated method/s, (SHAPE) or (SHAPE, Pb2+, DMS)—(Figure 2 and Figure 3, and Figure S1); (−)—experimentally uncharacterized; (H)—structure predicted by RNAstructure with high probability; (L)—structure predicted with low or moderate probability (Figure 2 and Figure 3, and Figure S1); (CVB3)—motif specific for different CVB3 strains according to RNAalifold; (E)—motif also conserved in other enteroviruses (Figure 4A (CVB3), Figure 4B(E), and Figure S2); potential interactions with proteins were proposed with help of the ATtRACT database [40]; their gene names are indicated in this table (Table S4). * Interactions with proteins and F10 RNA, which were experimentally confirmed in RNA-centric affinity chromatography (Figure S5); nt-length of a motif/number of nucleotides.
| RNA Structure Motif | Length (nt) | Genome Region | RNA Fragment in which | RNA Fragment in which | Motif | Potential Interactions with Proteins According to |
|---|---|---|---|---|---|---|
| SL665-748 | 84 | 5′UTR | F1, F1–2 (SHAPE) | F1 (H), | (some | CELF1, CELF 2, ELAVL2, ENOX1, GRSF1, HNRNP (A1, A2B1, F, H1, H2, H3, K, L), IGF2BP2, IGF2BP3, KHDRBS1, KHSRP, NOVA1, NOVA 2, OAS1, PCBP2, PIWIL1, PPIE, PTBP1, RBMS3, RBMX, RNASEL, SRP (19, 54, 68), SRSF (2, 5, 9, 10), TIA1, TIAL1, XPO5, YTHDC1, ZFP36, ZRANB2 |
| SL799-854 | 54 | V4 | - | F1–2 (H) | (CVB3) | A1CF, HNRNPL, IGF2BP3, MBNL1, NOVA1, NOVA2, PABPC1, PPIE, PTBP1, RBMY1A1, SRSF3, SRSF9, TIA1, TIAL1, XPO5, YBX1, YTHDC1, |
| SL2313-2343 | 31 | V1 | - | F4 (H) | (some | HNRNPL, HNRNPLL, IGF2BP2, IGF2BP3, NOVA1, PABPC1, PTBP1, RBMX, SAMD4A, SRP14, ZNF346 |
| SL2562-2590 | 29 | V1 | - | F4 (H) | (some | FUS, HNRNPL, KHSRP, MBNL1, NOVA1, NOVA2, RBMY1A1, SRSF (1, 2, 5, 6), YTHDC1, ZRANB2, RBMX, |
| SL2664-2719 | 55 | V1 | F4 (SHAPE) | F4 (H) | (CVB3) | CELF2, CPEB4, ELAVL2, FUS, IGF2BP2, IGF2BP3, KHDRBS1, LIN28A, MBNL1, NOVA1, NUDT21, PTBP1, RBFOX1, RNASEL, SRP54, SRP68, SRSF (1, 2, 5, 6, 9), SSB, TIA1, TIAL1, YTHDC1, ZRANB2, |
| SL4558-4656 | 99 | 2C | F6–7 (SHAPE) | F6–7 (H) | (E) | ACO1, CELF1, CELF2, CMTR1, ELAVL (1, 2, 4), ESRP2, FUS, GRSF1, HNRNP (F, H1, H2, H3, K, L), KHSRP, MBNL1, NOVA1, NOVA2, NUDT21, NXF1, PTBP1, RBM28, RC3H1, SRP14, SRSF (1, 2, 3, 6, 9), SSB, TIA1, TIAL1, TRA2B, XPO5, YBX1, ZFP36, ZNF346, |
| SL4765-4802 | 38 | 2C | F6–7 (SHAPE) | F6–7 (H) | (E) | CELF2, ELAVL (1, 2, 4), HNRNPL, NOVA1, NOVA2, OAS1, PHAX, PPIE, PTBP1, RC3H1, SRP54, SRSF (2, 3, 9), SSB, TIA1, TIAL1, YBX1, YBX2, ZRANB2 |
| SL6236-6260 | 25 | 3D | F8–9 (SHAPE) | F8–9 (H), | (CVB3) | ADAR, CMTR1, DDX58, DHX9, HNRNPL, MBNL1, OAS1, PIWIL1, PTBP1, QKI, SRSF3, YBX1, YBX2, YTHDC1 |
| SL6289-6308 | 20 | 3D | F8–9, F9 (SHAPE) | F8–9 (H), | (CVB3) | F2, GRSF1, HNRNP (F, H1, H2, H3, L), KHSRP, NONO, PABPN1, PTBP1, PTBP2, SRP (19, 54, 68), SRSF (10, 2, 3, 5), TRA2B, YBX1, ZFP36, |
| SL6376-6399 | 24 | 3D | F8–9, F9 (SHAPE) | F8–9 (H), | (CVB3) | CELF2, NUDT21, PABPN1, PTBP1, QKI, RNASEL, SF1, SRP19 |
| SL6476-6495, | 20 | 3D | F9 (SHAPE) | F9 (H) | (CVB3) | ELAVL (1, 2, 4), HNRNPL, IGF2BP3, NXF1, PTBP1, SSB, ZRANB2 |
| SL6581-6597 | 17 | 3D | F9–10 (SHAPE) | F9–10 (H), | (CVB3) | CELF1, CELF2, HNRNPA1, HNRNPL, KHSRP, SRSF (1, 2, 3, 9), YBX1, |
| SL6689-6722 | 34 | 3D | F9–10 (SHAPE) | F9–10 (H), | (E) | CELF1, FXR2, |
| SL6738-6761 | 24 | 3D | - | F10 (H) | (E) | CELF1, ESRP1, GRSF1, |
| SL6810-6826 | 17 | 3D | F9–10, F10 (SHAPE, Pb2+, DMS) | F9–10 (H), | (E) | HNRNPL, NOVA1, NOVA2, QKI, SF1, TIAL1 |
| SL6854-6878 | 25 | 3D | F10 (SHAPE, Pb2+, DMS) | F10 (H) | (E) | EIF4B, ESRP1, GRSF1, |
| SL6891–6922 | 32 | 3D | F10 (SHAPE, Pb2+, DMS) | F10 (L) | (E) | CELF2, CMTR1, FUS, |
| SL6940-6957 | 18 | 3D | F9–10, F10 (SHAPE, Pb2+, DMS) | F9–10 (H), F10 (H) | (E) | F2, MBNL1, PTBP1, SAMD4A, SRSF1 |
| SL6987-7036 | 50 | 3D | F10 (SHAPE, Pb2+, DMS) | F10 (L) | (E) | CPEB4, ELAVL2, F2, GRSF1, |
| E-10 like/ | 36 | 3D | F10 (SHAPE, Pb2+, DMS) | F10 (L) | (CVB3) | CELF1, CELF2, GRSF1, |
| SL7247-7291 | 45 | 3D | F10 (SHAPE, Pb2+, DMS) | F10 (L) | (E) | AGO1, DHX9, ELAVL1, ELAVL2, F2, OAS1, PCBP1, |
Figure 6The proteins identified in the RNA-centric affinity chromatography that were found to be interacting with F10 RNA fragment from the 3′-terminus of the CVB3 genome. Possible localizations of the RNA–protein interactions for several proteins is given, based on the presence of a predicted sequence binding site (ATtRACT database) within the F10 sequence. The figure shows the gene names rather than full protein names to minimize the space required. Different colors represent distinct proteins.