| Literature DB >> 36230922 |
Nhan L T Nguyen1, Nelly Panté1.
Abstract
Influenza viruses deliver their genome into the nucleus of infected cells for replication. This process is mediated by the viral nucleoprotein (NP), which contains two nuclear localization sequences (NLSs): NLS1 at the N-terminus and a recently identified NLS2 (212GRKTR216). Through mutagenesis and functional studies, we demonstrated that NP must have both NLSs for an efficient nuclear import. As with other NLSs, there may be variations in the basic residues of NLS2 in different strains of the virus, which may affect the nuclear import of the viral genome. Although all NLS2 variants fused to the GFP mediated nuclear import of GFP, bioinformatics showed that 98.8% of reported NP sequences contained either the wild-type sequence 212GRKTR216 or 212GRRTR216. Bioinformatics analyses used to study the presence of NLS2 variants in other viral and nuclear proteins resulted in very low hits, with only 0.4% of human nuclear proteins containing putative NLS2. From these, we studied the nucleolar protein 14 (NOP14) and found that NLS2 does not play a role in the nuclear import of this protein but in its nucleolar localization. We also discovered a functional NLS at the C-terminus of NOP14. Our findings indicate that NLS2 is a highly conserved influenza A NP sequence.Entities:
Keywords: NLS; NOP14; influenza A virus; nuclear import; nuclear localization sequence; nucleolar protein 14; nucleoprotein
Mesh:
Substances:
Year: 2022 PMID: 36230922 PMCID: PMC9563117 DOI: 10.3390/cells11192957
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 7.666
Primers used in this study.
| Construct Name | Primer Sequence (F Indicates Forward; R Indicates Reverse) |
|---|---|
| NLS1 MT | F′5′-GATCCAATGGCGTCTCAAGGCACCAAACGATCATATGAACAATGCCG-3′ |
| NLS2 MT | F′5′-GAGGGGTGAAAATGGAGCAAAGACAGCGCCGAATC-3′ |
| NOP14-mutNLS2 | F′5′-ACGGCCCACGACGTGGGACTGCCC-3′ |
| NOP14-mutSeq3 and NOP14-mutNLS2/Seq3 | F′5′-TCTGGCGGCGGCAGCGTTCAAAAAAACGCG-3′ |
| P2 | F′5′-TGAAAATGGAAAAAAGACAAGGCCG-3′ |
| P3 | F′5′-AAATGGACGAAGGACAAGGCCGG-3′ |
| P5 | F′5′-ACGAAAGACAAAGCCGGATCCAC-3′ |
| P2P3 | F′5′-TGAAAATGGAAAAAGGACAAGGCCG-3′ |
| P2P5 | F′5′-TGAAAATGGAAAAAGGACAAGGCCG-3′ |
| P3P5 | F′5′-ACGAAGGACAAAGCCGGATCCAC-3′ |
| P2P3P5 | F′5′-AAATGGAAAAAGGACAAAGCCGG-3′ |
Figure 1Subcellular localization of WT and mutant NPs. (A) Schematic representations of WT NP and its mutants (NLS1 MT and NLS2 MT). Basic residues of NLS1 and NLS2 are indicated in bold. Alanine substitutions of K or R are shown in red. (B) Confocal images of HeLa cells transfected with a plasmid expressing WT NP at 30 h post-transfection in the presence or absence of LMB. (C) Confocal images of HeLa cells transfected with plasmids expressing WT NP, NLS1, or NLS2 mutant NP at 30 h post-transfection in the presence of LMB. For B and C, LMB was added 12 h post-transfection to inhibit nuclear export of NP. Samples were prepared for indirect immunofluorescence microscopy using an antibody against NP (red), and nuclei were stained with DAPI. Scale bars for (B,C), 10 μm. (D) Quantification of the ratio of nuclear to cytoplasmic fluorescence (Fn/c) from the experimental conditions shown in (C). Shown is the means ± standard error of the means scored from 85–100 cells for each condition from three independent experiments. (NS, not significant; **** p < 0.0001, Kruskal–Wallis test).
Name of NLS2 variants.
| NLS2 Variant | Position at the Importin-α Major Binding Site 1 |
|---|---|
| WT | 212G R K T R216 |
| P2: R→ K at 213 | 212G |
| P3: K→ R at 214 | 212G R |
| P5: R→ K at 216 | 212G R K T |
| P2P3: R→ K at 213 and K→ R at 214 | 212G |
| P2P5: R→ K at 213 and R→ K at 216 | 212G |
| P3P5: K→ R at 214 and R→ K at 216 | 212G R |
| P2P3P5: R→ K at 213 and K→ R at 214 and R→K at 216 | 212G |
1 Substituted amino acids are in red.
Figure 2Bioinformatics of the sequence GRKTR and its variants for NP from different influenza A virus strains. (A) The sequence GRKTR and its variants were entered into the BLASTP sequence alignment algorithm, and the number of hits for each variant is reported for NP from influenza A viruses. As controls, a random peptide consisting of five amino acids (GLVTV) and the SV40 Large T-antigen NLS (PKKKRK) were also used in the BLASTP analysis. The amino acids in red indicate substitutions from the WT NLS2. (B) Percentage of NLS2 variants in NP from different strains of influenza A viruses. (C) Percentage of K and R at P2, P3, and P5 of NP from different influenza A virus strains.
Figure 3Functional analysis of NLS2 variants naturally found in NP of different influenza A virus strains. (A) Schematic representation of the chimeric proteins containing NLS1 and NLS2 variants fused to the C-terminus of 5GFP. The amino acids in red indicate substitutions from the WT NLS2 sequence. (B) Confocal images of HeLa cells transfected with plasmids expressing 5GFP, 5GFP-NLS1, or five NLS2 variants fused to 5GFP 24 h post-transfection. Nuclei were stained with DAPI. Scale bar, 10 μm. (C) Quantification of the ratio of nuclear to cytoplasmic fluorescence (Fn/c) from the experimental conditions shown in B. Shown is the means ± standard error of the means scored from 85–100 cells for each condition from three independent experiments. (NS, not significant; **** p < 0.0001, one-way ANOVA followed by Tukey’s tests).
Viral proteins from DNA virus families containing variants of the sequence GRKTR and their predicted NLSs.
| Protein 1 | Protein Function 2 [References] | Virus/Host | Putative NLS2 | Predicted NLS 3 |
|---|---|---|---|---|
|
| ||||
| Minor core | Participates in capsid assembly in the nucleus [ | Harbour porpoise adenovirus 1 | P2: | 21RKRKTPKREPKTEIKIERVKTEDVKPFKKGKRRKH55 |
| Human mastadenovirus B and several human adenoviruses (16, 3 + 7, 68, 66, 7d2) | P2P3: 117GKRTR121 | 297YKPPKRQYRKRKTRRVRQGRR317 | ||
| Precursor terminal protein pTP | Participates in viral replication [ | Titi monkey adenovirus ECC-2011 | P3: | 339GARPGLRRRPTAGRR353 |
| Squirrel monkey adenovirus | P3: | 389RLPIRRRRRRAPP401 | ||
| Viral RNA splicing factor L4-33 kDa | Required for genome packaging and capsid assembly in the nucleus [ | Deer mastadenovirus B and murine adenovirus 3 | P2: | 126RGRRR130 |
| Late L2 mu core | Condenses the viral pro-chromatin for encapsidation (reviewed | Murine adenovirus 2 and canine adenovirus 1 | P3: | 15RSRRLRRRLGGGGCSS |
| Encapsidation protein L1-52/55 kDa | Involved in genome packaging in the nucleus [ | Duck adenovirus 4 | P3: | No NLS predicted |
| Hexon-associated structural protein pVIII precursor | Capsid assembly in the nucleus by connecting the major structural units with each other and with the viral core (reviewed in [ | Bovine adenovirus 1 | P3P5: 111GRRTK115 | No NLS predicted |
| Viral transcription factor L4-22 kDa | Required for genome packaging and capsid assembly in the nucleus [ | Murine adenovirus 3 | P2: | No NLS predicted |
|
| ||||
| DNA polymerase processivity subunit | Involved in viral DNA replication [ | Wood mouse herpesvirus, murid gammaherpesvirus 4, and 68 | P2: | 373KRPPPKKEKEPTPKRPK389 |
| Tegument protein UL32 | Associates with nuclear capsids prior to DNA encapsidation and preserves the integrity of capsids through secondary envelopment [ | Cynomolgus macaque cytomegalovirus strain Ottawa | P5: | 546PKAKRRLILKPKTKKNVPKPKP567 |
| Tegument protein VP22 | Regulates the activity of the viral endonuclease vhs [ | Pteropus lylei-associated alpha herpesvirus | P3: | 89RRGRGAARPAAARAPTARRAPASGGAASARGTRGAAAS126 |
| Assembly protein M80 | Coordinates capsid assembly in the nucleus [ | Murine betaherpesvirus 1 | P2P3: 504GKRTR508 | 507G |
|
| ||||
| Capsid protein | Binds and transports the viral genome through the NPC [ | Capybara-associated cyclovirus 1 | P3: | 5RRFK |
|
| ||||
| X protein | Regulates transcription through direct interaction with different transcription factors [ | Human hepatitis B virus | P2P5: | No NLS predicted |
1 Accession numbers of the proteins are listed in Table S3. 2 All listed proteins are known to function in the nucleus of infected cells. 3 NLSs were predicted using NLStradamus [48]. Overlapping regions of the predicted NLS containing GRKTR or its variants are highlighted in red.
Figure 4Bioinformatics of the sequence GRKTR and its variants present in proteins from DNA virus families that localize in the nucleus of infected cells. (A,B) The sequence GRKTR and its variants (listed in Table 2), the control peptide GLVTV, and the SV40 large T-antigen NLS were entered into the BLASTP algorithm. Only the numbers of hits for viral proteins that are known to function in the nucleus of infected cells are reported. The amino acids in red indicate substitutions from the WT NLS2. (C,D) Percentage of the sequence GRKTR and its variants present in the total viral proteins with these sequences from the Adenoviridae (C) and the Herpesviridae (D) families.
Figure 5Bioinformatics of the sequence GRKTR and its variants for nuclear proteins. (A) The sequence GRKTR and its variants (listed in Table 2), the control peptide GLVTV, and the SV40 large T-antigen NLS were entered into the BLASTP algorithm. Only the number of hits for proteins that are known to localize in the nucleus of cells from humans and several model organisms is reported. The amino acids in red indicate substitutions from the WT NLS2. (B) Percentage of BLAST hits for nuclear proteins containing each of the studied sequences. (C) Percentage of nuclear proteins containing each of the studied sequences.
Human nuclear proteins containing GRKTR variants found in one or more other organisms and their predicted NLSs.
| Protein 1 | Organism | Putative NLS2 | Predicted NLS 2 |
|---|---|---|---|
| Nucleolar protein 14 (NOP14) [ |
| WT: | 3KAKKVGARRKASGAPAGARGGPAKA27 |
| Nuclear mitotic apparatus protein 1 isoform X1 [ |
| WT: | 2083RRGASKKALSKASP2096 |
| Spliceosomal factor RED [ |
| WT: | 73RRRKKKS79, 541KRK543 |
| Serum response factor [ |
| P2: | 135KP |
| Replication factor C subunit 3 [ |
| P2: | No NLS predicted |
| Transcription factor Sp1 [ |
| P3: | No NLS predicted |
| ETS-related transcription factor Elf-1 [ |
| P5: | 171QRKRKK |
| PDZ domain-containing protein 2 [ |
| P2P3: | 99KRRGGKKRK107 |
| Nucleolin [ |
| P2P5: | 277AAPGKRKKEMTKQKEAPEAKK297 |
| DnaJ homolog subfamily C member 21 (DNJC21) [ |
| P2P5: | 181KRAMEKENKKIRDRARKEKNELVRQLVAFIRKRDKRVQAHRKLV224 |
| Chromo-domain helicase DNA-binding protein 6 (CHD6) [ |
| P2P5: | 175GSRTKSKKASREQGPTPVERKKKGKRK201 |
| Lupus La protein [ |
| P2P5: | 328KWKSKGRRFKGKGKGNKAAQPGSGKGKV355 |
| Brefeldin A-inhibited guanine nucleotide-exchange protein 1 [ |
| P2P5: | No NLS predicted |
| SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily B member 1 [ |
| P2P5: | No NLS predicted |
| Lysine-specific demethylase PHF2 [ |
| P2P3P5: | 63KKKR66, 888KKR890, 942KNRKKKNTKRKP953 |
| Thymocyte nuclear protein 1 [ |
| P2P3P5: | 26RPRKRQTGTAGPDRKKLSGKR46 |
1 All listed proteins are known to function in the nucleus of infected cells. 2 NLSs were predicted using NLStradamus [48]. Overlapping regions of the predicted NLS containing GRKTR or its variants are highlighted in red.
Figure 6Functional analysis of NLS2 and a predicted NLS of NOP14. (A) Schematic representation of the chimeric proteins NOP14-mutNLS2, NOP14-mutSeq3, and NOP14-mutNLS2/Seq3 fused to GFP. Basic residues of NLS2 and Seq3 are indicated in bold. The amino acids in red indicate substitutions from the WT NOP14 sequence. (B) Confocal images of HeLa cells transfected with plasmids expressing the proteins shown in A 24 h post-transfection. Nuclei were stained with DAPI. Scale bar, 10 μm. (C) Quantification of the ratio of nuclear to cytoplasmic fluorescence (Fn/c) from the experimental conditions shown in B. Shown is the means ± standard error of the means scored from 85–100 cells for each condition from three independent experiments. (NS, not significant; **** p < 0.0001, one-way ANOVA followed by Tukey’s tests).
Figure 7Protein sequence alignments of the putative NLSs of NOP14 from five different organisms. The conserved basic residues are in red. The sequences are from humans, mouse, zebrafish (Danre), fruit fly (Drome), and yeast. An asterisk (*) indicates positions that have single, fully conserved residues in all five sequences; a colon (:) indicates conserved amino acids of strongly similar properties; and a period (.) indicates conservation between amino acids of weakly similar properties.
Figure 8NLS2 plays a role in the nucleolar localization of NOP14. (A) Confocal images of HeLa cells transfected with plasmids expressing NOP14-GFP or NOP14-mutNLS2 24 h post-transfection. Samples were prepared for indirect immunofluorescence microscopy using an antibody against fibrillarin (red), and nuclei were stained with DAPI. Scale bar, 10 μm. (B) Quantification of the ratio of nucleolar to nuclear fluorescence (Fnucleolus/n) from the experimental conditions shown in A. Shown are the means ± standard error of the means scored from 50–80 cells for each condition from three independent experiments. (**** p < 0.0001, one-way ANOVA followed by Tukey’s tests).
Alignments of several NLSs indicating their binding to importin-α deduced from the crystal structure of importin-α in complex with the NLS.
| NLS Type | Minor Binding Site | Linker | Major Binding Site | PDB id |
|---|---|---|---|---|
| SV40 large T-antigen | K | K | 1EJL/1BK6 1 | |
| hPLSCR1-NLS | G | 1Y2A | ||
| hPLSCR4-NLS | I | 3Q5U | ||
| Guα -NLS | K | 3ZIN | ||
| A89-NLS | K | 4B8P 2 | ||
| B54-NLS | K | 2YNS 2 | ||
| TPX2 | K | V | 3KND | |
| C-Myc | K | A | 1EE4 1 | |
| Nucleoplasmin | K | TKKAG | K | 1EJY/1EE5 1 |
| Kap60-IBB | R | TQQVELRKAKRDEA | A | 1WA5 1 |
| h1NLS | K | DSDDWSES | S | 4XZR 1 |
| h2NLS | K | QISTDNEAKMQIQEEKS | K | 4PVZ 1 |
| hRCC1 | K | PPADAIP | S | 5TBK |
| yRCC1 | K | TNGDASGAH | K | 5T94 1 |
| BFDV Cap NLS | Y R | 4HTV | ||
| Influenza A NP-NLS1 | K | 4ZDU | ||
| WT NLS2 | R | G | 5V5O | |
| NLS2 P3 variant | R | G | 5V5P |
1 Denotes yeast importin-α (Kap60). 2 Denotes rice importin-α. In all other cases, mammalian importin-α was co-crystallized with NLSs. Basic residues at P2 (major binding site) or P2’ (minor binding site) are indicated in bold.