Literature DB >> 23064116

The influenza A segment 7 mRNA 3' splice site pseudoknot/hairpin family.

Walter N Moss1, Lumbini I Dela-Moss, Salvatore F Priore, Douglas H Turner.   

Abstract

The 3' splice site of the influenza A segment 7 transcript is utilized to produce mRNA for the critical M2 ion-channel protein. In solution a 63 nt fragment that includes this region can adopt two conformations: a pseudoknot and a hairpin. In each conformation, the splice site, a binding site for the SF2/ASF exonic splicing enhancer and a polypyrimidine tract, each exists in a different structural context. The most dramatic difference occurs for the splice site. In the hairpin the splice site is between two residues that are involved in a 2 by 2 nucleotide internal loop. In the pseudoknot, however, these bases are canonically paired within one of the pseudoknotted helices. The conformational switching observed in this region has implications for the regulation of splicing of the segment 7 mRNA. A measure of stability of the structures also shows interesting trends with respect to host specificity: avian strains tend to be the most stable, followed by swine and then human.

Entities:  

Keywords:  conformational switch; influenza; pseudoknot; splicing; structure mapping; virus

Mesh:

Substances:

Year:  2012        PMID: 23064116      PMCID: PMC3597570          DOI: 10.4161/rna.22343

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


Overview

Influenza viruses are divided into three major clades: influenza A, B and C. Of these, influenza A and B are the most dangerous as they cause seasonal epidemics. There are an estimated three to five million serious infections yearly, resulting in approximately 500,000 deaths. The economic cost of seasonal influenza is very steep. In the US alone the yearly cost is $10.4 billion in direct medical expenses and $16.3 billion in lost wages due to illness or death. Even more troubling is the propensity of influenza A strains to cause pandemic outbreaks. Influenza A is one of the major killers of the 20th century. The Spanish Flu of 1918, caused by a pandemic H1N1 avian strain, killed at least 20 million people and perhaps as many as 100 million. A major factor in the rise of pandemics is the propensity of strains from different host species, e.g., human and avian, to combine and form a novel strain, to which humans are immunologically naive., Re-assortment in influenza is possible because the virus has a segmented genome. There are eight negative-sense RNA segments that make up the influenza A genome. Viral RNAs (vRNAs) bind to multiple copies of virally encoded NP protein, and the heterotrimeric viral polymerase to form vRNPs, which are uniquely packaged into active virions. RNA structure plays an important role in the formation of the vRNP and in the replication of the vRNA. The 5′ and 3′ untranslated regions of the vRNAs have conserved complementary regions which allow for the formation of long range base pairs.- The association of these ends forms the promoter necessary for the initiation of RNA synthesis and RNA structure may influence which positive-sense RNA [(+)RNA] is produced-: protein coding mRNA or template complementary RNA (cRNA) that is used for producing more vRNA. This interaction is stable under physiological conditions; the particular structure, however, remains controversial, with two competing models: a stem-like “panhandle” structure, or a more complex “corkscrew” model. Beyond this region there is little evidence for additional structure in the vRNA. Calculations of global folding free energy indicate that the vRNA is much less stable than the (+)RNA and shows little propensity for maintaining RNA structure in influenza A strains. In contrast to the vRNA, the influenza A (+)RNA shows evidence for stable, global RNA structure in four of the eight viral segments: segments 1, 5, 7 and 8. Additionally, a survey for local RNA structure revealed 20 regions where the (+)RNA showed a propensity for forming unusually stable and conserved predicted RNA structure. Several sites also occurred in regions with suppressed third codon variability, which indicates the possible constraint of maintaining RNA structure. Five regions of special interest occurred within or near to functional sites, showed suppressed codon evolution and were within or near to predicted structured regions. These sites occurred in segments 8, 7 and 2. The best characterized of these predicted regions occurs at the 3′ splice site in the segment 7 transcript. Segment 7 encodes the M1 matrix protein and the alternatively spliced M2 ion channel protein as well as the less well-understood M3 polypeptide and M4 protein. The conserved structural region encompasses the 63 nt surrounding the splice site and, in addition to the splice site, contains multiple splicing signals (Fig. 1): a binding site for the SF2/ASF exonic splicing enhancer, a polypyrimidine tract and a putative branch point signal, are all contained within this region. This region folds as both a pseudoknot and a hairpin, and each conformation places these splicing signals in different structural contexts (Fig. 1).. When folded in the presence of Mg2+ the ratio of each conformation is roughly 50/50, which implies a similar free energy of folding for each structure. Such a delicate equilibrium may be easily influenced by changes in the cellular environment or the effects of proteins. A similar, but structurally distinct, conformational switch, from hairpin to pseudoknot, was discovered in the 3′ splice site of the influenza segment 8 transcript. The structures discovered in segment 8 comprise a distinct family of structured RNAs conserved between influenza A and B. Segment 7, in contrast, is not spliced in influenza B and the pseudoknot/hairpin are not predicted to form in influenza B.

Figure 1. Cartoon of the influenza A segment 7 transcript indicating the M1 and M2 open reading frames (ORFs) with orange and green, respectively. The 3′ end of the M1 ORF lies within the M2 coding region and is indicated with a dashed line. The splice sites are indicated with red arrows. Below are the two secondary structures of the 3′ splice site region where the branch point, polypyrimidine tract, and SF2/ASF exonic splicing enhancer site are indicated with purple, blue and orange, respectively.

Figure 1. Cartoon of the influenza A segment 7 transcript indicating the M1 and M2 open reading frames (ORFs) with orange and green, respectively. The 3′ end of the M1 ORF lies within the M2 coding region and is indicated with a dashed line. The splice sites are indicated with red arrows. Below are the two secondary structures of the 3′ splice site region where the branch point, polypyrimidine tract, and SF2/ASF exonic splicing enhancer site are indicated with purple, blue and orange, respectively. RNA secondary structure is known to play important roles in alternative splicing. In particular, hiding or revealing splice sites- and protein binding sites- are mechanisms used in nature to regulate the splicing of mRNA. The splicing of the M2 mRNA of influenza A is timed to produce this protein late in viral infection, and this product is roughly 5% as abundant as the M1 transcript. Thus, RNA conformational switching could be involved in the regulation of the amount and timing of splicing. This raises the possibility of specifically targeting either or both conformations of the 3′ splice site of segment 7 to modulate biological activity or for the application of oligonucleotide- or small-molecule, therapeutics. For example, the hairpin and pseudoknot structures were probed with a library of 861 unique pentamer and hexamer oligonucleotides. The observed binding pattern was unique and specific to each conformation.

RNA Structures

The hairpin

The hairpin conformation and structurally relevant mutations of the 3′ splice site are shown in Figure 2A. P1 and P2 are common structures between the hairpin and pseudoknot. P1, which contains the polypyrimidine tract and putative branch point, is less stable and more accessible to enzymes in the hairpin conformation. P2 and P3 are separated by a 2 by 2 nucleotide internal loop and the splice site occurs within the 5′ side of this loop. The nucleotides in this loop likely form non-canonical GA and GG pairing interactions. Similar GA/GG loops are observed in the HIV-1 Rev binding domain and the ribosomal loop-E motif and are important for protein recognition. The P3 hairpin loop is comprised of a hexamer terminal loop and a six base pair stem, which is interrupted by a single bulged A at nt 730. Most of the terminal loop and the 3′ half of P3 comprise the key residues for binding the SF2/ASF exonic splicing enhancer protein(Fig. 1). This purine-rich binding site extends into the 3′ side of the 2 by 2 nt internal loop and two nts of the P2 stem (Fig. 1).

Figure 2. Consensus secondary structures of the hairpin (A) and pseudoknot (B) conformations. Structurally relevant mutations that occurred with a frequency greater than two in the alignment of all available sequences are annotated: consistent mutations, single point mutations that preserve pairing, are indicated with green, while compensatory mutations, double point mutations that preserve pairing, are indicated in blue.

Figure 2. Consensus secondary structures of the hairpin (A) and pseudoknot (B) conformations. Structurally relevant mutations that occurred with a frequency greater than two in the alignment of all available sequences are annotated: consistent mutations, single point mutations that preserve pairing, are indicated with green, while compensatory mutations, double point mutations that preserve pairing, are indicated in blue. The hairpin structure is well conserved. Canonical base pairing is, on average, 96.8% conserved (Table 1). Each helix is supported by at least one compensatory (double point mutation that preserves base pairing) or consistent mutation (single point mutation that preserves canonical pairing; Fig. 2A).

Table 1. Canonical and non-canonical base pair composition for helixes of the pseudoknot, top, and P3′ of the hairpin conformation, bottom

Single point mutations that would preserve canonical pairing are in green, while double point mutations that would preserve pairing are in blue. Areas where the most frequent pairing is expected to be non-canonical are indicated with italics. Numbers are from all available influenza A sequences (NCBI).

Single point mutations that would preserve canonical pairing are in green, while double point mutations that would preserve pairing are in blue. Areas where the most frequent pairing is expected to be non-canonical are indicated with italics. Numbers are from all available influenza A sequences (NCBI).

The pseudoknot

In the pseudoknot, nucleotides that make up P3 in the hairpin conformation are re-arranged to make P3′, and nucleotides 714 to 717 are paired in the P0 helix (Fig. 2B). In this structure the 3′ splice site is canonically paired in the middle of the P0 helix. In addition to four canonical pairs, P3′ has several non-canonical pairing possibilities (Fig. 2B). The loop of P3′ may contain three continuous GA pairs. Continuous stretches of three purine-purine pairs (the 3Rs motif) are especially stabilizing in internal loops. P0 and P3′ are both well conserved, with canonical pairing 99.0 and 100.0% conserved, respectively (Table 1). There are five consistent mutations in P0, but only a single change at C720 in P3′. Interestingly, when mutations occur within the loop of P3′, they always lead to purines at these positions (Table 1). This observation supports the potential formation of a structure similar to the 3Rs motif.

Evolutionary constraints and species distribution

The 3′ splice site structured region was initially discovered by identifying parts of the coding RNA with constrained evolution of synonymous sites. This implies a need to maintain RNA secondary structure in addition to protein sequence, which reduces the allowable synonymous site substitutions. Indeed, the sequence in this structured region has multiple constraints on its evolution: in addition to encoding both the hairpin and pseudoknot structures, it must maintain the M1 open reading frame (ORF) and, after nt 714, the M2 ORF, an SF2/ASF protein binding site, and the polypyrimidine and branch point sequences (Fig. 1). These constraints explain the small number of double point mutations (compensatory changes) that preserve canonical base pairing compared with single point mutations (consistent changes; Table 1). For example, residues 689 and 702 are most often an AU pair (Fig. 2B), but mutate to GU and AC pairs with much higher frequency than the GC double point mutation. In no case do compensatory changes outnumber single mutations and many single mutations resulted in non-canonical pairs. Mutations from canonical to non-canonical pairs resulted in primarily CA pairs followed by GA pairs (Table 1). CA and GA pairs are commonly observed non-canonical pairs that are able to maintain helicicity in RNA structures-; they also play important roles in molecular recognition., Thus, it may be hard to make double point mutations that do not alter the protein coding sequence, as synonymous sites are rarely paired in these structures. An interesting trend was observed when all available influenza A sequences for the 3′ splice site structure were sorted by expected stability. Sequences with the highest fraction of canonical and GC pairs, which are expected to stabilize structure, were overwhelmingly comprised of avian specific strains. As the fraction of canonical and GC pairs decreased, the percentage of avian specific strains decreased. Sequences with the smallest fraction of canonical and GC pairs were mainly from human specific strains with the percentage decreasing with increasing canonical and GC pairing potential. Swine specific strains tended to fall in between. Interestingly, no significant difference was observed in the stability of hairpin vs. pseudoknot in terms of host-specific structural stability. Perhaps the ratio of each conformation needs to be maintained irrespective of host. This trend in host-specific structural stability was explored in the context of whole coding region folding thermodynamics and was found to be a general phenomenon in influenza A. In general, avian sequences are more stable than human sequences and swine specific strains fall in between. This trend parallels the temperature at which the virus must replicate. The human and swine respiratory tract, and avian gut are 33, 37 and 42°C, respectively. Perhaps the RNA structure is optimized to perform its function at each temperature.

Concluding Remarks

A new structured RNA family has been characterized in influenza A. This family joins the structured splice site from segment 8 in the growing collection of known influenza RNA structures. Both of these structured regions are proposed to influence splicing of their respective mRNAs. Conformational switching places sites that are functionally important for splicing in different structural contexts. In particular, these sites are expected to be more accessible in the hairpin conformation than in the pseudoknot. Switching between hairpin and pseudoknot may be a conserved mechanism for modulation of splicing in influenza. Such a switch makes an attractive target for RNA therapeutics as either structure may be specifically targeted with small molecules or oligonucleotides to inhibit the virus. A seed alignment for this structured region, created by collapsing alignments to only include non-redundant sequences, has been submitted to the Rfam database. Click here for additional data file.
  48 in total

1.  Identification of a specific exon sequence that is a major determinant in the selection between a natural and a cryptic 5' splice site.

Authors:  L Domenjoud; H Gallinaro; L Kister; S Meyer; M Jacob
Journal:  Mol Cell Biol       Date:  1991-09       Impact factor: 4.272

2.  A secondary structure at the 3' splice site affects the in vitro splicing reaction of mouse immunoglobulin mu chain pre-mRNAs.

Authors:  A Watakabe; K Inoue; H Sakamoto; Y Shimura
Journal:  Nucleic Acids Res       Date:  1989-10-25       Impact factor: 16.971

Review 3.  Phylogenetic comparative analysis and the secondary structure of ribonuclease P RNA--a review.

Authors:  N R Pace; D K Smith; G J Olsen; B D James
Journal:  Gene       Date:  1989-10-15       Impact factor: 3.688

4.  Genomic RNAs of influenza viruses are held in a circular conformation in virions and in infected cells by a terminal panhandle.

Authors:  M T Hsu; J D Parvin; S Gupta; M Krystal; P Palese
Journal:  Proc Natl Acad Sci U S A       Date:  1987-11       Impact factor: 11.205

5.  5' and 3' terminal nucleotide sequences of the RNA genome segments of influenza virus.

Authors:  J S Robertson
Journal:  Nucleic Acids Res       Date:  1979-08-24       Impact factor: 16.971

6.  A secondary structure that contains the 5' and 3' splice sites suppresses splicing of duck hepatitis B virus pregenomic RNA.

Authors:  Daniel D Loeb; Amanda A Mack; Ru Tian
Journal:  J Virol       Date:  2002-10       Impact factor: 5.103

7.  The 3' and 5'-terminal sequences of influenza A, B and C virus RNA segments are highly conserved and show partial inverted complementarity.

Authors:  U Desselberger; V R Racaniello; J J Zazra; P Palese
Journal:  Gene       Date:  1980-02       Impact factor: 3.688

8.  Mutational analysis of the influenza virus cRNA promoter and identification of nucleotides critical for replication.

Authors:  Mandy Crow; Tao Deng; Mark Addley; George G Brownlee
Journal:  J Virol       Date:  2004-06       Impact factor: 5.103

9.  The 3' splice site of influenza A segment 7 mRNA can exist in two conformations: a pseudoknot and a hairpin.

Authors:  Walter N Moss; Lumbini I Dela-Moss; Elzbieta Kierzek; Ryszard Kierzek; Salvatore F Priore; Douglas H Turner
Journal:  PLoS One       Date:  2012-06-07       Impact factor: 3.240

10.  RNA folding affects the recruitment of SR proteins by mouse and human polypurinic enhancer elements in the fibronectin EDA exon.

Authors:  Emanuele Buratti; Andrés F Muro; Maurizio Giombi; Daniel Gherbassi; Alessandra Iaconcig; Francisco E Baralle
Journal:  Mol Cell Biol       Date:  2004-02       Impact factor: 4.272

View more
  18 in total

1.  Evidence of pervasive biologically functional secondary structures within the genomes of eukaryotic single-stranded DNA viruses.

Authors:  Brejnev Muhizi Muhire; Michael Golden; Ben Murrell; Pierre Lefeuvre; Jean-Michel Lett; Alistair Gray; Art Y F Poon; Nobubelo Kwanele Ngandu; Yves Semegni; Emil Pavlov Tanov; Adérito Luis Monjane; Gordon William Harkins; Arvind Varsani; Dionne Natalie Shepherd; Darren Patrick Martin
Journal:  J Virol       Date:  2013-11-27       Impact factor: 5.103

2.  RNA structural constraints in the evolution of the influenza A virus genome NP segment.

Authors:  Alexander P Gultyaev; Anton Tsyganov-Bodounov; Monique I J Spronken; Sander van der Kooij; Ron A M Fouchier; René C L Olsthoorn
Journal:  RNA Biol       Date:  2014-07-23       Impact factor: 4.652

3.  Mapping the RNA structural landscape of viral genomes.

Authors:  Ryan J Andrews; Levi Baber; Walter N Moss
Journal:  Methods       Date:  2019-11-08       Impact factor: 3.608

4.  The RNA pseudoknots in foot-and-mouth disease virus are dispensable for genome replication, but essential for the production of infectious virus.

Authors:  Joseph C Ward; Lidia Lasecka-Dykes; Chris Neil; Oluwapelumi O Adeyemi; Sarah Gold; Niall McLean-Pell; Caroline Wright; Morgan R Herod; David Kealy; Emma Warner; Terry Jackson; Donald P King; Tobias J Tuthill; David J Rowlands; Nicola J Stonehouse
Journal:  PLoS Pathog       Date:  2022-06-06       Impact factor: 7.464

Review 5.  In silico discovery and modeling of non-coding RNA structure in viruses.

Authors:  Walter N Moss; Joan A Steitz
Journal:  Methods       Date:  2015-06-23       Impact factor: 3.608

6.  Identification of conserved RNA secondary structures at influenza B and C splice sites reveals similarities and differences between influenza A, B, and C.

Authors:  Lumbini I Dela-Moss; Walter N Moss; Douglas H Turner
Journal:  BMC Res Notes       Date:  2014-01-09

7.  Secondary structure of a conserved domain in an intron of influenza A M1 mRNA.

Authors:  Tian Jiang; Scott D Kennedy; Walter N Moss; Elzbieta Kierzek; Douglas H Turner
Journal:  Biochemistry       Date:  2014-08-06       Impact factor: 3.162

Review 8.  Microarrays for identifying binding sites and probing structure of RNAs.

Authors:  Ryszard Kierzek; Douglas H Turner; Elzbieta Kierzek
Journal:  Nucleic Acids Res       Date:  2014-12-12       Impact factor: 16.971

9.  Genome-wide analyses of Epstein-Barr virus reveal conserved RNA structures and a novel stable intronic sequence RNA.

Authors:  Walter N Moss; Joan A Steitz
Journal:  BMC Genomics       Date:  2013-08-09       Impact factor: 3.969

Review 10.  Influenza viruses and mRNA splicing: doing more with less.

Authors:  Julia Dubois; Olivier Terrier; Manuel Rosa-Calatrava
Journal:  mBio       Date:  2014-05-13       Impact factor: 7.867

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.