| Literature DB >> 35215915 |
Marta Soszynska-Jozwiak1, Agnieszka Ruszkowska1, Ryszard Kierzek1, Collin A O'Leary2, Walter N Moss2, Elzbieta Kierzek1.
Abstract
SARS-CoV-2 belongs to the Coronavirinae family. Like other coronaviruses, SARS-CoV-2 is enveloped and possesses a positive-sense, single-stranded RNA genome of ~30 kb. Genomic RNA is used as the template for replication and transcription. During these processes, positive-sense genomic RNA (gRNA) and subgenomic RNAs (sgRNAs) are created. Several studies presented the importance of the genomic RNA secondary structure in SARS-CoV-2 replication. However, the structure of sgRNAs has remained largely unsolved so far. In this study, we probed the sgRNA M model of SARS-CoV-2 in vitro. The presented model molecule includes 5'UTR and a coding sequence of gene M. This is the first experimentally informed secondary structure model of sgRNA M, which presents features likely to be important in sgRNA M function. The knowledge of sgRNA M structure provides insights to better understand virus biology and could be used for designing new therapeutics.Entities:
Keywords: COVID-19; RNA structure; SARS-CoV-2; chemical mapping; sgRNA; sgRNA M; subgenomic RNAs
Mesh:
Substances:
Year: 2022 PMID: 35215915 PMCID: PMC8878378 DOI: 10.3390/v14020322
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Primers for polymerase chain reaction used to obtain DNA template for sgRNA M. The underlined nucleotide residues are the polymerase T7 promoter.
| Primer Name | Primer Length (nt) | Sequence 5′→3′ |
|---|---|---|
| FM | 46 | GCG |
| RM | 29 | TTACTGTACAAGCAAAGCAATATTGTCAC |
| F1 | 54 | CTTGTAGATCTGTTCTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGG |
| F2 | 48 | GTAACAAACCAACCAACTTTCGATCT CTTGTAGATCTGTTCTCTAAAC |
| F3 | 45 | ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCG |
| FC | 29 | TTCTGCAG ATTAAAGGTTTATACCTTCCC |
| RC | 37 | ATGAATTCTTACTGTACAAGCAAAGCAATATTGTCAC |
| M13F | 24 | CGCCAGGGTTTTCCCAGTCACGAC |
Primers for reverse transcription. Each primer was labeled with 6-FAM or 5-JOE at the 5′ end.
| Primer Name | Primer Length (nt) | Sequence 5′→3′ |
|---|---|---|
| M1 | 27 | TTACTGTACAAGCAAAGCAATATTGTC |
| M2 | 21 | CAGCTCCGATTACGAGTTCAC |
| M3 | 24 | CAAGCTAAAGTTACTGGCCATAAC |
Figure 1Predicted probability of nucleotides being paired or single-stranded in sgRNA M using the RNAstructure program. Probability lower than 50% is not colored. The partition function calculation incorporated restraints from strong reactivity of DMS and CMCT as well as SHAPE reactivities converted to pseudo-energies.
Figure 2sgRNA M model predicted by RNAstructure 6.2 using experimental data as constraints. Strong DMS and CMCT modifications, as well as SHAPE reactivities converted to pseudo-free energies, were used. The numbering of sgRNA M is from its 5′ end. The AUG start codon spans nucleotides 120–122. Red nucleotides indicate TRS sequences. Hairpins SL1, SL2 and SL3, are indicated.
Figure 3Comparison of the secondary structure of sgRNA M and its corresponding region of SARS-CoV-2 genome mapped in cells. Blue rectangle indicates the same base pairs within the sgRNA M model and the corresponding region of the SARS-CoV-2 genome mapped in cells [20]. Yellow rectangle indicates motifs of the leader sequence.
Mutations in the sgRNA M coding region in SARS-CoV-2 variants.
| Nucleotide Mutations in Coding Region 1 | SARS-CoV-2 Variants and Variants Representatives 2 | Amino Acid Mutation 3 | Influence of Nucleotide Mutations on RNA Structure 4 |
|---|---|---|---|
| 8(127) A > G | Omicron-B.1.1.529 (EPI ISL 6704867) | 3D > G | consistent mutation |
| 46(165) C > A | AT.1 (EPI ISL 1652580) | 16L > I | inconsistent mutationG-C > G..A |
| 55(174) C > G | Omicron-B.1.1.529 (EPI ISL 6704867) | 19Q > E | single-stranded region—no influence |
| 82(201) U > C | R.1 (EPI ISL 1123466) | 28F > L | single-stranded region—no influence |
| 84(203) C > U | B.1.619.1 (EPI ISL 2361101) | - | single-stranded region—no influence |
| 85(204) C > U | C.1.2 (EPI ISL 2942287) | 29L > F | single-stranded region—no influence |
| 159(278) C > U | A.2.5.2 (EPI ISL 1502915) | - | consistent mutation |
| 187(306) G > A | Omicron-B.1.1.529 (EPI ISL 6704867) | 63A > T | inconsistent mutations |
| 213(332) C > U | B.1.466.2 (EPI ISL 1533080) | - | consistent mutation |
| 241(361) G > U | Delta-AY.27 B.1.617.2 (EPI ISL 3910943) | 81A > S | single-stranded region—no influence |
| 245(364) U > C | Delta-B.1.617.2 (EPI ISL 1758376) | 82I > T | single-stranded region—no influence |
| 245(364) U > G | Kappa-B.1.617.1 (EPI ISL 1384866) | 82I > S | single-stranded region—no influence |
| 279(398) C > G | B.1.177.82 (EPI ISL 617709) | - | single-stranded region—no influence |
| 372(491) C > U | Lambda-C.37 (EPI ISL 1138413) | - | single-stranded region—no influence |
| 373(492) C > U | AV.1 B.1.1.482.1 (EPI ISL 2179526) | 125H > Y | single-stranded region—no influence |
| 450(569) U > C | B.1.258.17 (EPI ISL 618584) | - | single-stranded region—no influence |
| 621(740) C > U | A.2.5.2 (EPI ISL 1502915) | - | single-stranded region—no influence |
1—The first number in the column indicates the nucleotide number from the 5′ end of the M coding region; number in parenthesis means the corresponding nucleotide number of sgRNA M, as in Figure 2. 2—Variants and accession number of variant representatives in GISAID. 3—Number before mutation means amino acid number in M protein. 4—Consistent mutation means single point mutation that preserves pairing. “-“ indicates Watson-Crick pair, “●” shows GU wobble pair, “..” indicates no Watson-Crick and GU interactions.