| Literature DB >> 23749060 |
John L Goodier1, Ling E Cheung, Haig H Kazazian.
Abstract
LINE1s occupy 17% of the human genome and are its only active autonomous mobile DNA. L1s are also responsible for genomic insertion of processed pseudogenes and >1 million non-autonomous retrotransposons (Alus and SVAs). These elements have significant effects on gene organization and expression. Despite the importance of retrotransposons for genome evolution, much about their biology remains unknown, including cellular factors involved in the complex processes of retrotransposition and forming and transporting L1 ribonucleoprotein particles. By co-immunoprecipitation of tagged L1 constructs and mass spectrometry, we identified proteins associated with the L1 ORF1 protein and its ribonucleoprotein. These include RNA transport proteins, gene expression regulators, post-translational modifiers, helicases and splicing factors. Many cellular proteins co-localize with L1 ORF1 protein in cytoplasmic granules. We also assayed the effects of these proteins on cell culture retrotransposition and found strong inhibiting proteins, including some that control HIV and other retroviruses. These data suggest candidate cofactors that interact with the L1 to modulate its activity and increase our understanding of the means by which the cell coexists with these genomic 'parasites'.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23749060 PMCID: PMC3753637 DOI: 10.1093/nar/gkt512
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.pc-L1-1FH immunoprecipitates basal L1 RNP complexes from 293T cell lysates after α-FLAG agarose purification. (A) Structure of FLAG-HA-tagged pc-L1-1FH cloned in vector pcDNA6 myc/his B. RT: ORF2 reverse transcriptase domain; EN: endonuclease domain; PCMV: CMV promoter; BGH An: bovine growth hormone polyadenylation signal. (B) FLAG-tagged ORF1p expressed from the construct pc-L1-1FH binds α-FLAG agarose independent of RNase digestion (lanes 5 and 8), but untagged ORF1p (construct pc-L1-RP) will not bind (lane 6). (C and D) Detection of L1 proteins in the RNP IP. Lanes 1–4: input lysates; lanes 6–9: immunoprecipitates; lanes 1, 2, 6 and 7: cytoplasmic fractions; lanes 3, 4, 8 and 9: nuclear fractions. (C) FLAG-HA-tagged ORF1p, detected by α-FLAG antibody. Putative ORF1p dimer and trimer bands are visible in IP samples (lanes 7 and 9). The reason for their absence in lysate samples is unclear (lanes 2 and 4). IP purification factors were determined for cytoplasmic (lanes 2 versus 7, 26-fold) and nuclear (lanes 4 versus 9, 42-fold) fractions and are presented in Supplementary Table S3. Lane labels are at the bottom of panel D. (D) ORF2p detected by α-ORF2-N (154–167) antibody (lanes 7 and 9). ORF2p in nuclear lysate samples is below the level of detection (lanes 3 and 4). (E) ORF2p reverse transcriptase activity detected in both nuclear and cytoplasmic IP reactions containing pc-L1-1FH (lanes 3 and 5), but not in reactions with the empty vector (lanes 2 and 4). RT- control: the RT incubation step was omitted and 2 µl of pc-L1-1FH immunoprecipitate was added directly to the PCR reaction. No PCR product was detected (lanes 1 and 6). The assay is described in Kulpa and Moran (51). (F) L1 RNA detected by RT–PCR (lanes 3 and 5). RT-: RT enzyme was omitted from the cDNA synthesis step using pc-L1-1FH immunoprecipitates (lanes 1 and 6). (G) FLAG-HA-tagged ORF2p is detected in nuclear and cytoplasmic extracts after IP of pc-L1-2FH. (H) The purity of nuclear and cytoplasmic whole-cell lysate fractions is shown by western blotting. α-HDAC1 is a strictly nuclear protein (54) and α-MEK1/2 is cytoplasmic (55). (I) Immunoprecipitated samples resolved on silver-stained polyacrylamide gels. To support protein identification data from complex IP samples, selected prominent band regions were excised for additional MS sequencing. Both cytoplasmic (left) and nuclear (right) IP fractions are shown.
Figure 3.Ectopically expressed and endogenous proteins associate with L1 complexes in multiple cell lines. (A) V5-, 6xMyc- or GFP-tagged proteins exogenously expressed in 293T cells specifically co-immunoprecipitate with pc-L1-1FH, but not empty vector (pcDNA6 myc/his B) [IP: α-FLAG affinity gel, western blotting (WB): α-V5, α-Myc or α-GFP]. IP reactions were in the presence or absence of 15 μg/ml RNase (lanes 3–5). Lysate input samples are also shown (lanes 1 and 2). Several protein panels are reproduced from Goodier et al. (29). GFP-mIGF2BP1 is derived from mouse.
The bottom-most panel is representative of tagged ORF1p in the input and IP fractions and confirms that RNase treatment does not affect ORF1p immunoprecipitation on α-FLAG agarose. Molecular weights shown include the epitope tag. The protein standard is See Blue Plus 2 (Invitrogen). (B) Co-IP of endogenous ORF1p from 2102Ep cells by selected V5-tagged proteins (IP: α-V5/IgG affinity gel). Upper rows: detection of ORF1p (WB: α-ORF1 AH40). Asterisk indicates proteins that strongly co-IP endogenous ORF1p. ‘o’ marks proteins that clearly associate with ORF1p on gel overexposure. Lower rows confirm successful IP of the test proteins (WB: α-V5). Exposure times are not necessarily the same for each lane. Input lysate fractions are shown in Supplementary Figure S1. (C) Co-IP of selected endogenous proteins by pc-L1-1FH from 293T cells (IP: α-FLAG affinity gel, WB: various antibodies). The antibody name is followed by the expected protein molecular weight. NCL has an expected weight of 77 kDa, but observed molecular weight of ∼100 kDa. As previously reported (27), an antibody against DDX39B [UAP56; (50)] detects a dominant band of 55 kDa in cytoplasmic lysates, and a smaller isoform of 49 kDa (the expected size for DDX39B) that co-IPs with tagged ORF1p. In total, 12 antibodies were tested; α-PCBP2 (Abnova), α-FBL (Santa Cruz) and α-TOP1 (Spring) failed to detect their endogenous targets in pc-L1-1FH immunoprecipitates. The bottom-most panel shows efficient immunoprecipitation of tagged ORF1p detected by α-FLAG antibody.
Figure 6.Some L1-associated proteins strongly inhibit L1 retrotransposition in 293T cells. (A) The 99-PUR-RPS-EGFP was co-transfected in 293T cells with empty vector (pcDNA3) or test constructs expressing tagged proteins. Five days later, percentages of EGFP-positive cells were determined by flow cytometry. Each plasmid pair was transfected in four replicate wells, and results are normalized to pcDNA3 vector control (black bar). Proteins are ordered by their effect on retrotransposition. (B) To control for any off-target effects, test constructs were co-transfected with pCEP-EGFP, a plasmid that constitutively expresses EGFP. Four days later, cells were assayed for gain or loss of fluorescent cells (panel below). Fluorescence is normalized to pcDNA3 control (table, top row). Readings of ≤80% are marked below with ‘+’. Standard deviation for four replicates is shown (bottom row). These results were then used to adjust the retrotransposition levels of Figure 1A by dividing by the average of the pCEP-EGFP expression readings. P-values were calculated by two-tailed t-test and are indicated above each histogram bar (*P < 0.05, **P < 0.01, ***P < 0.001). The inset table summarizes the number of proteins that fall into each retrotransposition percent range. (C) Results of MultiTox-Fluor Multiplex Cytotoxicity assay (Promega) for potential cell toxicity caused by overexpression of test proteins. Test constructs were transfected in 96-well plates and assayed at 3 days. The histogram shows ratios of live to dead cell readings normalized to empty vector control.
Figure 4.snRNAs, scRNAs (left) and selected mRNAs (right) are detected in L1 RNP immunoprecipitates by RT–PCR. Results are for whole-cell lysates (lanes 1–4) and immunoprecipitates (lanes 5–8) from empty vector (pcDNA6 myc/his B; lanes 1, 3, 5 and 7) and pc-L1-1FH (lanes 2, 4, 6 and 8) transfected cells. RT enzyme was omitted (lanes 1, 2, 5 and 6) or included (lanes 3, 4, 7 and 8).
Summary of analyses of proteins that co-IP with the L1
| Gene symbol | Alternate symbol | Protein name | Co-IPs with pc-L1-1FH | With ORF1p in granules | Nuclear (Nu) / cytoplasmic (C) extracts |
|---|---|---|---|---|---|
| C3orf26 | MGC4308 | Chromosome 3 open reading frame 26 | Y(a) | N | Nu |
| C17orf79 | COPR5 | Chromosome 17 open reading frame 79, cooperator of PRMT5 | Beads | nd | Nu |
| C22orf28 | HSPC117 | Chromosome 22 open reading frame 28, tRNA-splicing ligase RtcB homolog | N | Y | C |
| CAT | Catalase | nc | nc | C | |
| CCNT1 | CYCT1 | Cyclin T1 | ? | N | |
| CCT2 | TCP-1-β | Chaperonin containing TCP1, subunit 2 (β) | N | N | C |
| CCT4 | TCP1 Δ | Chaperonin containing TCP1, subunit 4 (Δ)/stimulator of TAR RNA binding, | nc | nc | C |
| CCT6B | TCP-1-ζ-2 | Chaperonin containing TCP1, subunit 6B (ζ 2) | N | nd | C |
| CCT8 | TCP-1-θ | Chaperonin containing TCP1, subunit 8 (θ) | N | nd | C |
| CDC5L | CDC5 cell division cycle 5-like ( | ? | N | Nu | |
| CSDA | DBPA | Cold shock domain protein A | Y(27) | Y | Nu/C |
| DARS | Aspartyl-tRNA synthetase | nc | nc | C | |
| DDX17 | p82 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 17, isoform p82 variant | Y(a) | N | Nu |
| DDX21 | RH-II/GuA | DEAD (Asp-Glu-Ala-Asp) box polypeptide 21 | Y(a) | N | Nu |
| DDX23 | PRPF28 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 23 | nc | nc | C |
| DDX39A | BAT1, URH49 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 39 | Y(a,b,c) | N | Nu |
| DDX5 | p68 | DEAD (Asp-Glu-Ala-Asp) box polypeptide 5 | Y(a) | N | Nu |
| DHX9 | DDX9, RHA | DEAH (Asp-Glu-Ala-His) box polypeptide 9 | ? | o | Nu |
| DKC1 | DKC1 dyskeratosis congenita 1, dyskerin | N | nd | C | |
| EIF4B | Eukaryotic translation initiation factor 4B | beads | Y | Nu/C | |
| ELAVL1 | HUR | Embryonic lethal abnormal vision n, | Y(a,b) | Y | Nu |
| FAM120A | C9orf10 | Family with sequence similarity 120A/oxidative stress-associated Src activator | nc | nc | Nu |
| FAM98A | Family with sequence similarity 98, member A | Y(a) | Y | C | |
| FBL | Fibrillarin | ? | N | Nu | |
| H1FX | Histone H1x | Y(a,b) | N | Nu | |
| HEXIM1 | HIS1 | Hexamethylene bis-acetamide inducible 1 | Y(a,c) | N | C |
| HIST1H1B | H1B | Histone cluster 1, H1b | nc | nc | Nu/C |
| HIST1H1C | H1.2, H1C | Histone 1 H1C | Y(a) | N | Nu/C |
| HIST1H1E | H1.4, H1E | Histone cluster 1, H1e | nc | nc | Nu |
| HNRNPA1 | Heterogeneous nuclear ribonucleoprotein A1 | Y(a,b) | Y | Nu | |
| HNRNPA2B1 | Heterogeneous nuclear ribonucleoprotein A2/B1, transcript variant B1 | Y(a) | N | Nu | |
| HNRNPAB | ABBP1 | Heterogeneous nuclear ribonucleoprotein A/B/APOBEC1-binding protein 1 | Y(a) | Y | Nu |
| HNRNPC | HNRNP C1/C2 | Heterogeneous nuclear ribonucleoprotein C (C1/C2) | Y(a,b) | N | Nu |
| HNRNPH3 | Heterogeneous nuclear ribonucleoprotein H3 | ? | N | Nu | |
| HNRNPK | Heterogeneous nuclear ribonucleoprotein K | ? | N | Nu/C | |
| HNRNPL | Heterogeneous nuclear ribonucleoprotein L, | Y(a) | N | Nu | |
| HNRNPR | Heterogeneous nuclear ribonucleoprotein R | ? | N | Nu | |
| HNRNPU | SAF-A | Heterogeneous nuclear ribonucleoprotein U (scaffold attachment factor A) | Y(a,b) | N | Nu |
| IGF2BP1 | IMP1, ZBP1 | Insulin-like growth factor 2 mRNA binding protein 1 | Y(a) | Y | Nu/C |
| IGF2BP2 | IMP2 | Insulin-like growth factor-binding protein 2, 36 kDa | nc | nc | C |
| ILF2 | NF45 | Interleukin enhancer-binding factor 2 | Y(a) | o | Nu |
| ILF3 | NFAR, NF90 | Interleukin enhancer-binding factor 3 | Y(a,b) | N | Nu |
| IVNS1ABP | Influenza virus NS1A-binding protein | Beads | Y | C | |
| KPNA2 | Karyopherin α 2 (RAG cohort 1, importin α 1) | Y(a) | nd | C | |
| KRI1 | KRI1 homolog ( | nc | nc | Nu | |
| LARP1 | La ribonucleoprotein domain family, member 1 | Y(c) | Y | Nu/C | |
| LUC7L3 | CROP | Cisplatin resistance-associated-overexpressed protein | nc | nc | C |
| MARS | Methionyl-tRNA synthetase | nc | nc | C | |
| MATR3 | Matrin 3 | nc | nc | Nu | |
| MEPCE | BCDIN3 | 7SK snRNA methylphosphate capping enzyme | Y(a) | N | C |
| MOV10 | KIAA1631 | Moloney leukemia virus 10, homolog (mouse) | Y(a,b) | Y | Nu/C |
| NAT10 | N-acetyltransferase 10 (GCN5-related) | ? | N | Nu | |
| NCL | Nucleolin | Y(a,c) | N | Nu/C | |
| NOP56 | NOL5A | Nucleolar protein 56 | Y(a) | N | Nu |
| NPM1 | B23 | Nucleophosmin 1 | Y(c) | N | Nu |
| NUSAP1 | Nucleolar and spindle-associated protein 1 | ?,Y(b) | N | C | |
| PABPC1 | Poly(A)-binding protein, cytoplasmic 1 | Y(a,c) | Y | Nu/C | |
| PABPC4 | Poly(A)-binding protein, cytoplasmic 4 | nc | nc | Nu/C | |
| PCBP2 | HNRNPE2 | Poly(rC)-binding protein 2 | Y(a) | Y | Nu/C |
| PDIA3 | Protein disulfide isomerase family A, member 3 | nc | nc | C | |
| PPAN | Peter pan homolog ( | nc | nc | C | |
| PPM1B | Protein phosphatase, Mg2+/Mn2+ dependent, 1B | nc | nc | C | |
| PRPF4 | PRP4 pre-mRNA processing factor 4 homolog (yeast)/ U4/U6 small nuclear ribonucleoprotein Prp4 | N | nd | C | |
| PRPF19 | PRP19/PSO4 pre-mRNA processing factor 19 homolog ( | N | N | C | |
| PRPF31 | PRP31 pre-mRNA processing factor 31 homolog ( | beads | Y | C | |
| PTBP1 | HNRNPI | Polypyrimidine tract binding protein 1 | nc | nc | C |
| PURA | Purine-rich element-binding protein A | Y(a,b) | N | Nu | |
| RALY | HNRNPCL2 | RNA-binding protein, autoantigenic (hnRNP-associated with lethal yellow homolog (mouse)) | Y(a,b) | N | Nu |
| RBMX | HNRNPG | RNA-binding motif protein, X-linked | Y(a) | N | Nu |
| RIOK1 | RIO kinase 1 (yeast) | Y(a) | Y | C | |
| RNMT | RG7MT1 | RNA (guanine-7-) methyltransferase | nc | nc | Nu |
| SART1 | Squamous cell carcinoma antigen recognized by Tcells/U4/U6.U5 tri-snRNP-associated protein 1 | Beads | N | Nu | |
| SERBP1 | PAIRBP1, CGI-55 | SERPINE1 mRNA-binding protein 1 | Y(27) | Y | Nu |
| SF3B1 | PRPF10 | Splicing factor 3B, subunit 1 | nc | nc | Nu/C |
| SF3B3 | SAP130 | Splicing factor 3B, subunit 3 | ?,Y(b) | N | Nu/C |
| SNRNP70 | U1 Small nuclear ribonucleoprotein 70 kDa | Y(a) | N | C | |
| SNRPD3 | Small nuclear ribonucleoprotein polypeptide D3 polypeptide 18 kDa | Beads | N | C | |
| SNX8 | Sorting nexin 8 | nc | nc | C | |
| SPIN1 | Spindlin 1 | nc | nc | Nu/C | |
| SR140 | U2SURP | U2 snRNP-associated SURP domain containing | Beads | N | Nu |
| SRP14 | Signal recognition particle 14 kDa | Y(a) | N | Nu | |
| SRSF1 | ASF, SF2, SF2p33 | Serine/arginine-rich splicing factor 1 isoform 1 | Y(a,b) | Y | Nu/C |
| SRSF6 | SFRS6, SRP55 | Splicing factor, arginine/serine-rich 6 | Y(a,b) | N | C |
| SRSF10 | FUSIP1, SRp38 | Serine/arginine-rich splicing factor 10 | Y(a) | Y | Nu |
| SSB | La, LARP3 | Sjogren syndrome antigen B (autoantigen La) | Y(c) | N | Nu/C |
| STAU1 | Staufen, RNA-binding protein, homolog 1 ( | Y(a) | Y | C | |
| STAU2 | Staufen, RNA-binding protein, homolog 2 ( | Y(a,b) | Y | C | |
| STK38 | Serine/threonine protein kinase 38 | Beads | Y | Nu/C | |
| SYNCRIP | hnRNPQ | Synaptotagmin binding, cytoplasmic RNA interacting protein | N | Y | Nu/C |
| TAB1 | MAP3K7IP1 | TGF-β activated kinase 1/MAP3K7-binding protein 1 | ? | N | C |
| TOP1 | DNA topoisomerase I | nc | nc | Nu | |
| TRA2A | Transformer-2 α homolog ( | Y(a) | N | Nu | |
| TRA2B | SFRS10 | Transformer 2 β homolog ( | Y(a) | N | Nu |
| TROVE2 | Ro60, SSA | 60 kDa Ro protein, Sjogren syndrome antigen A2 | Y(a,c) | Y | Nu/C |
| XRCC5 | KU80 | X-ray repair complementing defective repair in Chinese hamster cells 5 (double-strand-break rejoining) | Beads | N | Nu/C |
| YBX1 | YB1, NSEP1 | Y box-binding protein 1 | Y(a,b,c) | o | Nu/C |
Figure 2.Proteins identified with the L1 form a tight network of interactions dominated by RNA-binding proteins. (A) Pie chart of results of DAVID (Database for Annotation, Visualization and Integrated Discovery) analysis showing selected functional categories for the 96 candidate proteins (42). Protein counts for each category are shown within the slices. Protein names are listed in Supplementary Table S4. (B) STRING (Search tool for the retrieval of interacting genes/proteins)-derived network of known protein–protein interactions among the 96 candidate proteins. The confidence view is shown in Jensen et al. (43).
Figure 5.Many L1-associated proteins co-localize with EGFP-ORF1p in cytoplasmic granules of unstressed 293T cells. (A–W) Construct ORF1-EGFP-L1-RP was co-transfected with V5-tagged proteins in all cases except (B) FL-CSDA, (G) RFP-HNRNPA1 and (H) mouse GFP-mIGF2BP1 (the latter being co-transfected with pc-L1-1FH, which was detected by α-FLAG antibody). Only overlaid confocal micrographs are shown. (X) Endogenous LARP1 protein co-localizes with ORF1-EGFP-L1-RP in 293T cells. (Y) Endogenous ORF1p and PCBP2 co-localize in some cytoplasmic granules of 2102Ep cells (shown by arrows). (Z) Endogenous ORF1p and fibrillarin (FBL) co-localize in nucleoli of 2102Ep cells. ORF1p is typically found in nucleoli of only a minor percentage of cells (53). Enlargement of two nucleoli are shown. In Y and Z endogenous, ORF1p is detected by α-ORF1 AH40.1 antibody.