Literature DB >> 30544263

Unexpected diversity in eukaryotic transcription revealed by the retrotransposon hotspot family of Trypanosoma brucei.

Francesca Florini1,2, Arunasalam Naguleswaran1, Walid H Gharib3, Frédéric Bringaud4, Isabel Roditi1.   

Abstract

The path from DNA to RNA to protein in eukaryotes is guided by a series of factors linking transcription, mRNA export and translation. Many of these are conserved from yeast to humans. Trypanosomatids, which diverged early in the eukaryotic lineage, exhibit unusual features such as polycistronic transcription and trans-splicing of all messenger RNAs. They possess basal transcription factors, but lack recognisable orthologues of many factors required for transcription elongation and mRNA export. We show that retrotransposon hotspot (RHS) proteins fulfil some of these functions and that their depletion globally impairs nascent RNA synthesis by RNA polymerase II. Three sub-families are part of a coordinated process in which RHS6 is most closely associated with chromatin, RHS4 is part of the Pol II complex and RHS2 connects transcription with the translation machinery. In summary, our results show that the components of eukaryotic transcription are far from being universal, and reveal unsuspected plasticity in the course of evolution.
© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30544263      PMCID: PMC6393297          DOI: 10.1093/nar/gky1255

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Trypanosoma brucei spp are unicellular eukaryotes that are pathogenic for humans and domestic animals. They are members of the family of Trypanosomatidae, which includes other parasites such as T. cruzi, the causative agent of Chagas′ Disease, and various species of Leishmania. Most of these parasites have a life cycle that is split between a mammalian host and an insect host which, in the case of Trypanosoma brucei brucei, is the tsetse fly. In the mammal, bloodstream form trypanosomes evade the host immune system by frequent changes of their variant surface glycoprotein (VSG) coat (1). This is shed when the parasites are ingested by a tsetse fly and differentiate to procyclic forms in the fly midgut. Trypanosomatids diverged early from other eukaryotes and have several unusual characteristics, such as a highly condensed mitochondrial genome (the kinetoplast), RNA editing via guide RNAs, and universal trans-splicing of nuclear-encoded mRNAs (2). Moreover, African trypanosomes harbour the only RNA polymerase I (Pol I) known to transcribe mRNAs as well as ribosomal RNAs. The most prominent of these are the mRNAs coding for VSG in bloodstream forms and the procyclin coat of the tsetse midgut forms (3). Transcription of these genes by Pol I ensures higher expression, since Pol I is known to initiate transcription ten times faster than RNA polymerase II (Pol II) (4). Transcription by Pol II is also unusual, sharing some features with prokaryotes and others with eukaryotes. In archaea and bacteria, several genes, which are normally functionally related to each other, are transcribed as a long transcript starting from a single promoter and translated directly from this polycistronic transcript (5). At the other extreme, the majority of eukaryotic genes are monocistronic, with each having its own promoter. There are exceptions to this: some nematodes, including C. elegans, are known to have a portion of their genomes organized in operons (6–8), and some genes in Drosophila and other organisms are transcribed as dicistrons (9). Trypanosomatids, though, are the only eukaryotes in which all genes transcribed by Pol II are organized in long polycistronic units containing up to 100 genes (10,11). The polycistronic RNAs are co-transcriptionally processed, giving rise to monocistronic mRNAs that are capped with a 39-nt spliced leader (SL) and polyadenylated. Trans-splicing of the SL and polyadenylation are both guided by pyrimidine-rich tracts followed by an AG dinucleotide (12–16) which resemble the signals for cis-splicing in other eukaryotes. There are approximately 100 tandem copies of the SL gene (17). It is synthesized as a precursor, and is the only Pol II-transcribed sequence with a promoter in front of each copy (18,19). For protein-coding gene arrays, transcription seems to start over broad regions of around 10 kb (20,21). Genome-wide studies demonstrated that the boundaries of different polycistronic units in T. b. brucei are marked by distinct histone variants and different histone modifications. The histone variants H2AZ and H2BV and the modifications H3K4me3 and H3K10ac are enriched at transcription start sites, while the histone variants H3V and H4V are more abundant at transcription termination sites (22–25). Trypanosomatids also contain a modified nucleotide, base J (26), that is enriched at the end of polycistronic transcription units and promotes termination by Pol II (27). The divergence of trypanosomatids from other eukaryotes explains the difficulties encountered in identifying the canonical transcription factors that are normally present in eukaryotes. Until recently, it was thought that these were completely absent, but it is now accepted that most of the basal transcription factors are present, although not particularly well conserved (17,28–30). Trypanosomatids also lack recognisable orthologues of a number of factors required for transcription elongation (31,32) and mRNA export (33). In yeast and humans, transcription and mRNA export are connected via the TREX complex (34), but, in trypanosomatids, no components of this complex have been identified apart from TbSub2 (33,35). It was recently shown, however, that mRNA export occurs cotranscriptionally (36). Another peculiarity of trypanosomatid Pol II is that the major subunit, RPB1, lacks the characteristic heptapeptide repeats found in the C-terminal domain (CTD) in other eukaryotes (37). Differential phosphorylation of these repeats is required for the CTD to recruit different factors during the transcriptional cycle (38–41). Despite lacking the heptapeptide repeats, it has recently been demonstrated that C-terminus of T. brucei Pol II is essential for transcription (37) and that it contains several phosphorylation sites (42), some of which are required for its activity (43). In an attempt to identify trypanosome-specific transcription factors associated with Pol II, two groups performed tandem affinity purification with different subunits as bait. In both cases, members of the retrotransposon hotspot 4 (RHS) family were identified in the core complex (44,45). More recently, it was demonstrated that TbRRM1, a nucleoprotein that modulates chromatin structure in T. b. brucei interacted with several RHS (46). RHS were first identified in 2002 as a multigene family clustered at subtelomeres (47). Their name derives from a retrotransposon insertion site in the 5′ portion of the coding region, which is predicted to disrupt the function of ∼60% of the genes. Bringaud et al. identified 52 RHS (pseudo)genes in T. b. brucei (TREU927) and classified them into 6 sub-families, RHS1 to RHS6, based on the sequence similarities of their C-termini. The only recognizable feature of RHS proteins is an ATP/GTP binding domain. Following the completion of the T. b. brucei genome sequence, more RHS genes have been annotated—118 at present—and a seventh sub-family (RHS7) was added. RHS proteins are expressed in bloodstream and procyclic forms. Five sub-families were localized to the nucleus, while RHS2 showed a perinuclear localization (47). Based on their known interactions with Pol II and a chromatin-associated protein, we hypothesized that RHS proteins could be trypanosome-specific factors playing a role in transcription. In the present work, we analyse the RHS2, RHS4 and RHS6 sub-families. We show that all three sub-families associate with regions of the genome that are transcribed by Pol II. Furthermore, depletion of RHS reduces nascent RNA synthesis and leads to accumulation of RNA in the nucleus, suggesting that they play a role in Pol II transcription elongation and mRNA export.

MATERIALS AND METHODS

Cell culture

Procyclic forms of T. brucei brucei EATRO 1125, initially derived from AnTat 1.1 bloodstream forms (48), and the derivative for inducible RNAi, AnTat1.1 90-13 (49), were used for the experiments performed in this study. Procyclic forms were cultured in SDM-79 (50) containing 10% foetal bovine serum (FBS) and 20 mM glycerol at 27°C. Parasites were maintained at cell densities between 106 and 107 cells ml−1. Stable transfections of the parasites were performed by electroporation with 10 μg linearized plasmids as previously described (51). Selection of clones was achieved by limiting dilution in medium supplemented with antibiotics (phleomycin at 1.5 μg ml−1 for RNAi constructs; puromycin at 1 μg ml−1 for HA-tagging constructs). For induction of RNAi, 1 μg ml−1 tetracycline was added to the culture medium.

Primers and constructs

Synthetic restriction sites are underlined. RHS2RNAiFW: GGGGGATCCAATTGAGAGTGTGGGCGAAC RHS2RNAiRV: GGGCTCGAGATTACCACTTGCCAGAACGG RHS4RNAiFW: GGGGGATCCCAGTTGAGACTCATTGGGCA RHS4RNAiRV: GGGCTCGAGCCTCACCTCCAGCTCTATCG RHS6RNAiFW: GGATCCCTGATGCAATTGCTGAGGAT RHS6RNAiRV: CTCGAGTATTCGCCACTTCTCTTGCC RHS6-HA-ApaI: TAGGGCCCAGTGTCTTGAGGCATGTAGAGG RHS6-HA-NotI: TAGCGGCCGCAATTCGTTATTCGCCACTTCTCTTGC RHS4-HA-Not: TAGCGGCCGCAACGCGTCTTCACCTCCTTCCAC RHS4-HA-Apa: TAGGGCCCTCGAGTTTTTAAAGTTATCG RHS2-HA-Apa: TAGGGCCCAGGACGGGACTACTTACATG RHS2-HA-Not: TAGCGGCCGCAAGGCAGCGGGGCCACCAGCAATAG SL oligo (FISH): Cy5-CAATATAGTACAGAAACTGT Oligo d(T) probe (FISH): Cy3-TTTTTTTTTTTTTTTTTTTT SL probe (Northern blot): CAATATAGTACAGAAACTGTTCTAATAATAGCGTT (52) 18S probe (Northern blot): GTTCGTCTTGGTGCGGTCTA qPCR oligos: Procycl_Pro_67s: GAGCTTAATGTCCTTTTCA Procycl_Pro_191as: CAAAACAACCATATCACTTC QChIP RLP10 F: AAGAGCATGCCAGCAAATC QChIP RLP10 R: GGGTCAGCGATATACTCCGT QChIP alpha tub F: CCGTCACGTGTAAGATGAGC QChIP alpha tub R: ACTCCACACAGCGGAAGAG 18S_up_377s: CCATGCTCTCTCGTGTGTGTA (53) 18S_up_265as: TTCCTCAAGGCGTCACTCTATC (53) All plasmid inserts were generated by PCR using genomic DNA from T. b. brucei AnTat 1.1 as a template. RNAi constructs were generated by cloning the PCR insert into the stem loop vector pSL-Comp1 (54). HA-tagging constructs were derivatives of the cassette-type construct pC-PTP-NEO (55).

Northern and western blot analysis

Total RNA isolation with guanidine thiocyanate and Northern blot analysis were performed according to standard procedures (56,57). 10 μg of RNA were loaded in each lane. Radioactively labeled probes were prepared using a Megaprime DNA labelling kit (Amersham Biosciences) according to manufacturer′s instructions. A 5′-labeled antisense oligonucleotide probe for the 18S RNA was used as loading control (58). Blots were hybridized at 65° and washed in 0.2× SSC/0.1% SDS for PCR-based probes, or in 1× SSC/0.1% SDS for 18S. For the SL probe, hybridization was performed at 37° and washed in 4× SSC/0.1% SDS at RT. The blots were exposed to Phosphorimager screens and scanned with Typhoon FLA 7000 (GE Healthcare Life Sciences). All signals were normalized in Fiji. For Western blot analysis protein samples (2 × 106 cell equivalents per lane) were separated on 12% SDS-polyacrylamide gels and transferred to Immobilon-P membrane (Millipore). Membranes were incubated overnight with primary antibodies: anti-RHS2, anti-RHS4 and anti-RHS6 (Rat, 1:10 000) (47), anti-HA 3F10 (Rat, 1:2500, Sigma-Aldrich), anti-Alba3 (Rabbit, 1:250) (54). Secondary antibodies were used 1:5000: swine anti-rabbit HRP (Dako Denmark), donkey anti-rat HRP (Invitrogen). The blots were incubated with Super Signal West Pico PLUS Chemiluminescent substrate (Thermo Scientific) and detected with an Amersham Imager 600.

Immunofluorescence and fluorescence in situ hybridization (FISH)

For immunofluorescence, cells were harvested by centrifugation, washed once in PBS, fixed in 4% paraformaldehyde in PBS and permeabilized with Triton X-100 (0.1% in PBS). After blocking in PBS/3% BSA, cells were incubated with primary antibody (1:1000 for anti-RHS antibodies, 1:250 for anti-HA) for 1 h at room temperature, washed 3 times in PBS and subsequently incubated with secondary antibody (1:2000, Alexa Fluor 488 donkey anti-rat, ThermoFisher) for 1 h at room temperature. Cells were then washed 3 times in PBS and stained with Hoechst 33342 prior to embedding in Mowiol (Sigma Aldrich). RNA FISH was performed according to Cassola and colleagues (59): cells were permeabilized and simultaneously blocked for 30 min in PBS/0.5% saponin/2% BSA, and pre-hybridized for 1 h in hybridization solution (2% BSA, 5x Denhardt′s solution, 4× SSC, 5% dextran sulphate, 35% formamide, 0.5 μg μl−1 tRNA). Cy3-labelled oligo-d(T)20 and Cy5-labelled SL-oligo (2 ng μl−1 in hybridization solution) were hybridized overnight at room temperature. Cells were then washed with decreasing concentrations of SSC (from 4× to 1×), and nuclei were stained with Hoechst 33342. To block transcription, cells were treated with 10 μg ml−1 Actinomycin D (Sigma Aldrich) for 2 h. In parallel, a control sample was treated with the same volume of DMSO for the same length of time. Images were captured with a Leica DM 5500 B microscope and analysed using LAS AF software (Leica) and Fiji.

Flow cytometry

2 × 106 cells were collected, washed once in cold PBS and resuspended in 300 μl PBS. Cells were then fixed by drop-wise addition of 700 μl of ice-cold EtOH. After an incubation on ice for 1 h, cells were harvested again by centrifugation and resuspended in 500 μl of PBS containing 50 μg ml−1 propidium iodide (Sigma Aldrich), 0.1 mg ml−1 RNase A and 0.05% Triton X-100. Cells were incubated at 37°C for 30 min and 104 cells per sample were analysed with a NovoCyte flow cytometer (ACEA Biosciences).

Co-immunoprecipitation and mass spectrometry

For isolation of proteins interacting with RHS2-HA or RHS6-HA, 2 × 108 cells were harvested by centrifugation at 2000 rpm for 5 min and washed once in PBS. Cells were then cross-linked by addition of 900 μl 0.1% paraformaldehyde for 8 min, after which the reaction was blocked by addition of 100 μl 1.25 M glycine. After 5 min, cells were pelleted, washed once in PBS and resuspended in lysis buffer (20 mM Tris, 140 mM KCl,1.8 mM MgCl2, 0.1% NP40, 10% glycerol, supplemented with EDTA-free protease inhibitor cocktail (Roche)). Cells were sonicated 3 times for 10 s with a Branson Digital Sonifier at 10% amplitude, interspersed with 30-second intervals on ice. The sonicate was placed at 4° for 30 min, after which it was incubated overnight with 25 μl Pierce Anti-HA Magnetic Beads (ThermoScientific), previously washed 3 times in 1 ml lysis buffer. For RNase and DNase controls, lysates and beads were supplemented with RNase A or DNase I at a final concentration of 100 μg ml−1 during the overnight incubation. An untagged culture was always used in parallel as a negative control. Protein cross-linking was reversed by incubation of the sample in Laemmli Buffer for 20 min at 95°C. Samples were loaded on an SDS-polyacrylamide gel and run until they had just entered the 12% resolving gel. Proteins were then gel-extracted, subjected to trypsin digestion and analysed by LC–tandem MS. Protein identification was performed at the Proteomics and Mass Spectrometry facility at University of Bern, CH.

Chromatin immunoprecipitation (ChIP) and RNA immunoprecipitation (RIP)

ChIP experiments were performed as described previously (60,61) with some modifications. 108 parasites were washed once in PBS and fixed with 1% paraformaldehyde (w/v) in PBS for 8 min at room temperature; the reaction was stopped by adding glycine to a final concentration of 125 mM for 5 min. Cells were then washed once in PBS and resuspended in lysis buffer (50 mMTris-HCl, pH 8, 10 mM EDTA, 1% SDS, and EDTA-free protease inhibitor cocktail (Roche)). Subsequently, the lysate was sonicated with a Bioruptor (Diagenode) using the following settings: Power ‘L’, ON/OFF (Sec) 30/90 for 7 min, to produce fragments <1 kb. The soluble chromatin was diluted 1:100 in RIPA buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 0.5 mM EGTA, 1% Triton X-100, 0.1% SDS, 0.1% sodium deoxycholate, 140 mM NaCl) and used for immunoprecipitation. Anti-RHS (47) and anti-RPB1 antibodies immobilized on Dynabeads Sheep Anti-Rat IgG (Invitrogen) were used for immunoprecipitation. For the Pol I control, a monoclonal Anti-RPA1 antibody (62) was immobilized on Dynabeads Pan Mouse IgG (Invitrogen). For RNase treatment, lysates and beads were supplemented with RNase A at a final concentration of 100 μg ml−1 during the overnight incubation. DNA was eluted from the beads by digestion with Proteinase K at 68°C for 2 h, purified by phenol-chloroform extraction and ethanol precipitation, and quantified using a Qubit dsDNA HS Assay kit (Invitrogen). For qPCR, 1 ng DNA was added to each reaction; qPCR reactions were performed in triplicate. qPCR was performed using MESA Green qPCR MasterMix Plus for SYBR Assay (Eurogentec) in the ABI Prism 7000 sequence detection system (Applied Biosystems). The data were analyzed using 7000 System SDS software v1.2 (Applied Biosystems). Primers used for qPCR are listed above. Illumina sequencing was performed at Fasteris (Geneva) using the Hiseq system. RNA IP experiments were performed using cells expressing RHS2-HA and RHS6-HA as described previously (63). For isolation of RNA bound to RHS, anti-HA magnetic beads (Pierce) were used, and the RNA-protein complexes were resuspended in 20 mM TrisHCl (pH 7.5), 5 mM EDTA, 50mMNaCl, 0.1% SDS, and 50 μg ml−1 proteinase K and incubated at 68°C for 40 min. RNA was purified by phenol-chloroform extraction and ethanol precipitation, subjected to DNase treatment, and quantified using a Qubit RNA HS Assay kit (Invitrogen).

5-Ethynyl uridine incorporation and processing for GRO-Seq

Procyclic forms at a density of 7–8 × 106 ml−1 were pulsed for 10 min with 200 μM 5-ethynyl uridine (5-EU; Jena Biosciences). Cells were spun down and nuclei were isolated as described by Murphy et al. (64) with some modifications. Briefly, cell pellets were resuspended in ice-cold buffer (0.5 M sucrose, 50 mM KCl, 5 mM MgCl2, 50 mM TrisHCl, pH 7.4) and incubated on ice. After 5 min, NP-40 was added to final concentration of 0.1%, the sample was vortexed for 10 s and nuclei were pelleted by centrifugation at 3300g at 4°C for 3 min. Total RNA was isolated from nuclear pellets and stored at –70°C. The amount of RNA was measured using a Qubit RNA HS assay kit (Invitrogen). Copper-dependent click-it chemistry was used to label nascent RNAs that incorporated 5-EU with biotin as described (65), starting with 2 mg total RNA. Biotin-ligated RNA was fragmented to ∼250 bp using a magnesium-dependent RNA fragmentation module (New England Biolabs) and the biotinylated RNA fragments were purified using Dynabeads™ M280 streptavidin (ThermoFisher Scientific) and quantitated using a Qubit RNA HS assay kit. Purified nascent RNAs were used for cDNA library preparation using Illumina Truseq RNA sample preparation kit and sequenced at Fasteris (Geneva) as described above.

NGS sequencing and bioinformatics

Illumina reads were mapped and analysed using the Galaxy suite version 18.05 (66), as described in Naguleswaran et al. (67). For Figures 4A and 4B, the Chip-seq reads were mapped against release 8.1 of T. b. brucei TREU927 genome downloaded from TritrypDB (http://tritrypdb.org/tritrypdb/). The BWA software (Li H., 2013, Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv:1303.3997v1) with default parameters was launched in paired-end and single-end modes with respect to the replicate sequencing libraries. We discarded alignment matches shorter than 19 bp and allowed gaps shorter than 100 bp. The algorithm uses a scoring scheme (Z-dropoff) which avoids unnecessary extension of poor matching regions and reduces poor alignments in good matching regions by dynamically calculating the alignment score. the full list of the default parameters can be found on the software website (http://bio-bwa.sourceforge.net/bwa.shtml). The output of BWA was sorted and indexed using SAMtools (68). Peak calling was performed using MACS2 software (69) on RHS2, RHS4, RHS6 and Pol II using the replicates for each run. Reads from duplicate controls (DNA extracted from the same chromatin samples used for ChIP) were used as input.
Figure 4.

RHS2, RHS4 and RHS6 are globally associated with actively transcribing Pol II (A) Pairwise Spearman correlations of Pol II, RHS2, RHS4 and RHS6. Each dot represents the natural logarithm of the average read counts of a 10kb bin plotted pairwise between samples. The sub-figures over the diagonal (upper left to lower right corners) represent the distribution of Pol II, RHS2, RHS4 and RHS6, respectively. The correlation values corresponding to their mirror correlation graphs are above the diagonal. (B) Alignment of RHS2, 4 and 6 as a function of Pol II enrichment. On the x-axis the ‘0’ corresponds to the summits of the peaks of Pol II ±5 kb. The colour values correspond to the coverage/depth (reads per kb per million reads mapped) over a certain position in a 10kb window. (C) RHS2, RHS4, RHS6 and Pol II occupancy across a region of chromosome 6 that includes a procyclin locus that is transcribed by Pol I (3). The flanking genes are transcribed by Pol II. Spliced leader addition sites are from Nilsson and colleagues (100). See Supplemental Figure S3b for additional controls. (D) RHS2, RHS4, RHS6 and Pol II occupancy across a region of chromosome 4 reveals that RHS proteins are associated with actively transcribed regions. Reads obtained from input chromatin are shown in black. Spliced leader addition sites are from Nilsson and colleagues (100). RNA-seq reads are from early procyclic forms (67). Genes are depicted as yellow bars and annotated according to TriTrypDB. (E) RHS2, RHS4, RHS6 and Pol II occupancy across part of the SL locus on chromosome 9. Reads obtained from input chromatin are shown in black. The 5′ ends of the spliced leader precursors are from reference 100.

The pairwise Spearman correlations in Figure 4A were computed using the R package GGally (B. Schloerke et al., 2018, GGally: Extension to ‘ggplot2′. R package version 1.4.0). To generate the heatmap in Figure 4B, alignment coverage was normalized to Reads Per Kilobase Million (RPKM) using deepTools (70) with the option normalize using RPKM and converted to BigWig format. The normalized alignment as well as the peak enriched intervals (narrow peaks outputs) were used as input files in R Bioconductor package EnrichedHeatmap (Zuguang Gu, 2017, EnrichedHeatmap: Making Enriched Heatmaps. R package) was used to compute the heatmap. Raw read files are deposited at the European Nucleotide Archives (ENA) http://www.ebi.ac.uk/ena as study PRJEB29185.

RESULTS

RHS are required for growth and viability

Of the 118 genes annotated as RHS genes in the genome of T. b. brucei TREU927, 78 are considered to be pseudogenes, either because of the insertion of a retrotransposon or because the open reading frame is truncated. Figure 1 shows the 40 ‘full length’ RHS grouped into seven sub-families based on similarities in their C-termini. RHS are highly expressed by different life-cycle stages (67), with the exception of the RHS7 sub-family which is only expressed in the tsetse salivary glands (AN and IR, unpublished). We chose to focus on three sub-families—RHS2 because of the unusual localization of these proteins compared to other RHS, RHS4 because of their association with Pol II (44,45) and RHS6 because it was annotated as the single member of its sub-family. Bringaud et al. reported that RHS2 was perinuclear (47). In our hands, and using the same polyclonal antisera, but a different stock of T. brucei, RHS2 was mainly in the cytoplasm (Figure 2A). RHS4 and RHS6 were both nuclear (Figure 2A), as reported previously (47).
Figure 1.

RHS are grouped into seven sub-families neighbour-joining phylogenetic tree, without distance correction, of protein-coding RHS in T. b. brucei TREU927. The tree was obtained with the Multiple Sequence Alignment tool Clustal Omega. Gene IDs and annotations are from Tritrypdb (http://tritrypdb.org/tritrypdb/).

Figure 2.

RHS are required for growth and viability of T. brucei procyclic forms (A) Immunolocalization of RHS proteins. Cells were incubated with anti-RHS2, anti-RHS4 or anti-RHS6 polyclonal antisera (first panel) and DAPI (second panel). The third and fourth panels show the merged fluorescence and phase contrast, respectively. Scale bar = 5 μm. (B) Effect of RHS RNAi on growth. The graphs display cumulative cell numbers from three biological replicates of each RNAi line. Uninduced cultures (–Tet) and cultures induced with tetracycline (+Tet) were monitored daily for 6 days. The efficiency of knockdown by RNAi was assessed by northern and western blot analysis on day 2 after induction of RNAi, and quantified with Fiji 2.0 in biological triplicates. Error bars = SD. 18S ribosomal RNA (northern blot) and Alba3 (Western blot) were used as loading controls. See also Supplementary Figure S1.

RHS are grouped into seven sub-families neighbour-joining phylogenetic tree, without distance correction, of protein-coding RHS in T. b. brucei TREU927. The tree was obtained with the Multiple Sequence Alignment tool Clustal Omega. Gene IDs and annotations are from Tritrypdb (http://tritrypdb.org/tritrypdb/). RHS are required for growth and viability of T. brucei procyclic forms (A) Immunolocalization of RHS proteins. Cells were incubated with anti-RHS2, anti-RHS4 or anti-RHS6 polyclonal antisera (first panel) and DAPI (second panel). The third and fourth panels show the merged fluorescence and phase contrast, respectively. Scale bar = 5 μm. (B) Effect of RHS RNAi on growth. The graphs display cumulative cell numbers from three biological replicates of each RNAi line. Uninduced cultures (–Tet) and cultures induced with tetracycline (+Tet) were monitored daily for 6 days. The efficiency of knockdown by RNAi was assessed by northern and western blot analysis on day 2 after induction of RNAi, and quantified with Fiji 2.0 in biological triplicates. Error bars = SD. 18S ribosomal RNA (northern blot) and Alba3 (Western blot) were used as loading controls. See also Supplementary Figure S1. Since nothing is known about the function of RHS proteins, we performed RNAi using tetracycline inducible constructs that were targeted against specific regions in the coding region of each sub-family; these were designed to knock down all members of a particular sub-family at the same time (Figure 2B). Depletion of both RHS2 and RHS6 resulted in growth arrest by day 3 post induction. The phenotype upon knockdown of RHS4 was even more extreme, resulting in the onset of cell death by day 4 and complete clearance of the culture by day 6. The efficiency of RNAi, and its impact on the expression of other RHS, was assessed by Northern and Western blot analyses on day 2 post induction (Figure 2B and Supplementary Figure S1), when the cells were still growing normally. Whereas the depletion of RHS2 and RHS4 were both very efficient, residual RHS6 protein was clearly visible. Knockdown of any one of these sub-families did not affect the expression of the others, indicating that there was no crosstalk or compensatory mechanisms (Supplementary Figure S1). To investigate whether cells that had stopped dividing were stalled at a particular phase of the cell cycle, we took advantage of the fact that these can be distinguished based on the configuration of the nucleus and the kinetoplast DNA (kDNA) in the single mitochondrion. In G1/GO, cells contain one nucleus and one kDNA (1N1K), cells in S phase duplicate and segregate their kDNA ahead of their nuclear DNA (1N2K) and post-mitotic cells have the configuration 2N2K. A small proportion of cells in a culture contains kDNA but no nuclear DNA (0N1K), presumably due to a defect in DNA replication and /or nuclear segregation. These cells are termed zoids. When the three RNAi lines were analysed for cell-cycle defects, both RHS2 and RHS4 knockdowns showed a slight increase in the number of zoids (Figure 3A). This phenotype was much more pronounced in RHS 6 knockdown, however, reaching 25% by day 2. Propidium iodide staining showed an increase in cells with a nuclear content <2n (presumably corresponding to zoids), but no increase in cells with a nuclear content ≥4n (Figure 3B). This points to a defect in DNA replication rather than nuclear segregation.
Figure 3.

Depletion of RHS6 causes a defect in DNA replication (A) Effect of RHS RNAi on the cell cycle. Configurations of nuclear and kDNA were determined in uninduced (–Tet) and induced (+Tet) cells (N = 200 per sample). The analysis was performed at day 2 post induction. P-values are shown for unpaired, two-tailed t-tests. Error bars = SD. (B) DNA content and proportion of cells in different phases of the cell cycle. The figure shows one of three representative experiments. Cultures of the RHS6 RNAi cell line that were not induced (red), or induced for 2 days (grey), were analysed by flow cytometry following propidium iodide staining. The inset shows cells stained with DAPI. Arrowheads show two zoids (cells with kDNA, but no nucleus). Scale bar = 5 μm.

Depletion of RHS6 causes a defect in DNA replication (A) Effect of RHS RNAi on the cell cycle. Configurations of nuclear and kDNA were determined in uninduced (–Tet) and induced (+Tet) cells (N = 200 per sample). The analysis was performed at day 2 post induction. P-values are shown for unpaired, two-tailed t-tests. Error bars = SD. (B) DNA content and proportion of cells in different phases of the cell cycle. The figure shows one of three representative experiments. Cultures of the RHS6 RNAi cell line that were not induced (red), or induced for 2 days (grey), were analysed by flow cytometry following propidium iodide staining. The inset shows cells stained with DAPI. Arrowheads show two zoids (cells with kDNA, but no nucleus). Scale bar = 5 μm. Taken together, our RNAi data show that RHS2, RHS4 and RHS6 are all required for normal cell growth and viability; furthermore, loss of RHS6 causes a defect in DNA replication.

Proteins interacting with RHS2 and RHS6 point to roles in RNA metabolism

As mentioned above, several RHS sub-families coprecipitate with TbRRM1 (46) and RHS4 is part of the core complex of Pol II (44,45). To obtain more information about RHS protein complexes we tagged members of each of the three sub-families in situ. RHS2 and RHS6 tagged at the C-terminus with a haemagglutinin (HA) tag localized in the same way as the endogenous proteins (Supplementary Figure S2). RHS4 was consistently mislocalized to the cytoplasm, however, regardless of the tag or whether it was placed at the N- or C-terminus (Supplementary Figure S2). Similar problems with mislocalization were reported independently for RHS4 (Tryptag.org). The tagged versions of RHS2 and RHS6 were used to immunoprecipitate complexes in order to identify interacting partners. All major interactions were confirmed to be DNA- and RNA-independent. The predominant components co-purifying with RHS2 were ribosomal proteins, RNA binding proteins (e.g. puf6, DRBD2, polyadenylate binding proteins 1 and 2) and translation initiation factors (Tables 1A and Supplementary Table S1). The proteins present in the pulldown of RHS6 showed no consistent overlap with those obtained with RHS2. In addition to RHS6, members of the RHS1, 4 and 5 sub-families were present. The major category consisted of factors known to be involved in transcription elongation in other eukaryotes (Tables 1B and Supplementary Table S2), for example the two largest subunits of Pol II, Spt5 and 6, ISWI, FACTSpt16 and several components of the PAF complex. Proteins involved in RNA processing (e.g. PTB2, PRP19, splicing auxiliary factors and symplekin) and RNA export were also present. A third category consisted of different subunits of the replication factor C complex, which may explain why knockdown of RHS6 results in a defect in DNA replication. Given that many of the proteins interacting with RHS2 and RHS6 possess RNA-binding domains, we performed RNA-immunoprecipitations (RIP). Although nucleic acids were present in both complexes after pulldown, these were lost when the RHS6 complex was treated with DNAse, demonstrating that RHS6 predominantly binds to DNA and not RNA (Supplementary Figure S3A). In contrast, the RHS2 complex contained both RNA and DNA. These data suggest that both RHS2 and RHS6 are involved in RNA metabolism, but at different levels: RHS6 is in a complex with proteins connected to transcription elongation and RNA processing, while RHS2, despite binding both DNA and RNA, mainly interacts with the translation machinery.
Table 1.

RHS2 and RHS6 interact with proteins involved in RNA metabolism

Gene IDProductPMSS average (N = 3)
A. RHS2-associated proteins
Tb927.2.280/380/400RHS21985
Tb927.9.10770Polyadenylate-binding protein 2595.18
Tb927.3.505060S ribosomal protein L4460.87
Tb927.9.607040S ribosomal protein S3454.24
Tb927.2.591040S ribosomal protein S13448.72
Tb927.10.1471040S ribosomal protein S2,432.82
Tb927.7.6090Eukaryotic translation initiation factor 3, subunit A430.40
Tb927.10.394040S ribosomal protein S3A418.16
Tb927.11.630040S ribosomal protein S5415.28
Tb927.11.971060S ribosomal protein L10a392
Tb927.8.134060S ribosomal protein L7a365.25
Tb927.11.14020Nuclear RNA binding domain 2 (NRBD2)321.01
Tb927.5.2570Eukaryotic translation initiation factor 3, subunit B251.97
Tb927.10.8290Eukaryotic translation initiation factor 3 subunit C207.45
Tb927.10.11760Pumilio/PUF RNA binding protein 6206.97
Tb927.6.4370Eukaryotic translation initiation factor 3, subunit D179.68
Tb927.11.550SCD6179.21
Tb927.6.4440RNA-binding protein 42159.09
Tb927.9.13990DRBD2, RNA-binding protein150
Tb927.9.9290Polyadenylate-binding protein-1144.07
Tb927.11.11590Eukaryotic translation initiation factor 3, subunit E99.48
Tb927.8.1190Eukaryotic translation initiation factor 3, subunit H98.69
Tb927.11.13250Eukaryotic translation initiation factor 2 gamma84.86
Tb927.8.1500Hypothetical protein, mRNA binding70.43
Tb927.9.8130Nascent polypeptide associated complex61.98
B. RHS6-associated proteins
Tb11.v5.0713RHS64459
Tb927.1.420/2.240/2.1080RHS53218
Tb927.1.120/2.340RHS4676.5
Tb927.5.4420Nucleolar RNA helicase II570.1
Tb927.10.7440MTR4 RNA helicase519.8
Tb927.3.5620FACT complex subunit SPT16486.3
Tb927.11.13360AAA ATPase481.5
Tb927.2.5240PRP19-like protein382.3
Tb927.11.11550DNA topoisomerase II alpha313.83
Tb927.2.100/1.220/2.370RHS1287
Tb927.2.1810Chromatin-remodeling complex ATPase chain ISWI285.3
Tb927.11.5650/6.3980/10.7990/11.9550Replication factor C subunits213.4
Tb927.10.13720RNA binding protein 29174
Tb927.8.7490Symplekin172
Tb927.8.7400RNA polymerase IIA largest subunit (RPB1)160
Tb927.8.2640Ubiquitin-activating enzyme E1146.8
Tb927.4.3810DNA-directed RNA Polymerase II subunit (RPB2)116.4
Tb927.10.7060NUP9697.8
Tb927.2.5810Spt6 (transcription elongation)115.3
Tb927.11.14100Polypyrimidine tract-binding II (PTB2/DRBD4)268.64
Tb927.3.3220CTR9 (PAF complex)132.11
Tb927.10.3200U2 splicing auxiliary factor177.5
Tb927.2.5030Spt5 (transcription elongation)62.70
Tb927.11.10230CDC73 (PAF complex)47.3

List of proteins consistently identified in immunoprecipitations of (A) RHS2 and (B) RHS6 with ranking according to PMSS average. All proteins listed were identified in three biological replicates. For simplicity, we have not included the full set of ribosomal proteins. The complete lists of proteins are shown in Supplemental Tables S1 and S2.

RHS2 and RHS6 interact with proteins involved in RNA metabolism List of proteins consistently identified in immunoprecipitations of (A) RHS2 and (B) RHS6 with ranking according to PMSS average. All proteins listed were identified in three biological replicates. For simplicity, we have not included the full set of ribosomal proteins. The complete lists of proteins are shown in Supplemental Tables S1 and S2.

Chromatin immunoprecipitation reveals global association with RNA polymerase II transcription units

The results shown above strongly suggest that RHS proteins associate with chromatin. To identify DNA binding sites, we performed chromatin immunoprecipitation with antisera against RHS2, 4 and 6 followed by deep sequencing (ChIP-Seq). In parallel, we performed ChIP-Seq with antibodies directed against RPB1, the large subunit of Pol II. All three RHS sub-families showed the same distribution as Pol II (Figures 4A and B) with Pearson correlations ≥0.94, while correlations between the different RHS exceeded 0.99. Like Pol II, the RHS proteins covered almost the entire genome, but were depleted from regions known to be transcribed by Pol I such as the procyclin locus on chromosome 6 (Figure 4C). As a control, we performed ChIP with the anti-RBP1 antiserum or with a monoclonal antibody against the large subunit of Pol I, followed by qPCR. This confirmed the differential binding of the two polymerases to their respective transcription units (Supplementary Figure S3B). In addition, we found that RHS and Pol II were strongly depleted from silent regions of the genome. One example is a region on chromosome 4, encoding VSG and Expression Site Associated Genes (Figure 4D), which produces no steady state mRNA in this stage of the life cycle. The input genomic DNA gave good coverage of these regions, however, confirming that the lack of reads after ChIP was not due to a mapping artefact. Taken together, these data show that RHS are globally associated with active Pol II and suggest a potential role in transcription elongation. RHS2, RHS4 and RHS6 are globally associated with actively transcribing Pol II (A) Pairwise Spearman correlations of Pol II, RHS2, RHS4 and RHS6. Each dot represents the natural logarithm of the average read counts of a 10kb bin plotted pairwise between samples. The sub-figures over the diagonal (upper left to lower right corners) represent the distribution of Pol II, RHS2, RHS4 and RHS6, respectively. The correlation values corresponding to their mirror correlation graphs are above the diagonal. (B) Alignment of RHS2, 4 and 6 as a function of Pol II enrichment. On the x-axis the ‘0’ corresponds to the summits of the peaks of Pol II ±5 kb. The colour values correspond to the coverage/depth (reads per kb per million reads mapped) over a certain position in a 10kb window. (C) RHS2, RHS4, RHS6 and Pol II occupancy across a region of chromosome 6 that includes a procyclin locus that is transcribed by Pol I (3). The flanking genes are transcribed by Pol II. Spliced leader addition sites are from Nilsson and colleagues (100). See Supplemental Figure S3b for additional controls. (D) RHS2, RHS4, RHS6 and Pol II occupancy across a region of chromosome 4 reveals that RHS proteins are associated with actively transcribed regions. Reads obtained from input chromatin are shown in black. Spliced leader addition sites are from Nilsson and colleagues (100). RNA-seq reads are from early procyclic forms (67). Genes are depicted as yellow bars and annotated according to TriTrypDB. (E) RHS2, RHS4, RHS6 and Pol II occupancy across part of the SL locus on chromosome 9. Reads obtained from input chromatin are shown in black. The 5′ ends of the spliced leader precursors are from reference 100. As shown above, the RHS2 complex contains RNA. Two RHS4 sub-family members are part of the Pol II complex (44,45) and an association with RNA has been reported for one of them (71). To determine whether the binding to chromatin was mediated by RNA we subjected ChIP samples to RNase A treatment during the immunoprecipitation. This had little effect on the amount of DNA pulled down with RHS6, but reduced the amount associated with RHS2 and RHS4 to 30% and 25%, respectively (Supplementary Figure S3C). This implies that RHS6 is in close proximity to the DNA and that RNA partially bridges the interaction of RHS2 and RHS4 with DNA.

Depletion of RHS causes a block in spliced leader expression and RNA export

Under normal conditions the majority of RHS2 is in the cytoplasm (Figure 2A), but the finding that it is also associated with nuclear DNA suggests that it might be shuttling between the two compartments. In line with this possibility, bioinformatic analysis of the RHS2 family identified a putative nuclear localization signal and a putative nuclear export signal in each of the three family members. It has previously been reported that active transcription is required for nucleocytoplasmic shuttling of certain proteins and that blocking transcription with Actinomycin D (ActD) leads to their accumulation in the nucleus (72–74). Consistent with this, incubating trypanosomes with ActD resulted in an increase in nuclear RHS2 within 2 h (Figures 5A and B). An increase in nuclear RHS2 was also observed when RHS4 was knocked down, suggesting that RHS4 depletion and Act D have similar effects. This is again consistent with RHS4 playing a role in transcription.
Figure 5.

RHS depletion causes defects in RNA export and spliced leader expression (A) Redistribution of RHS2 in response to inhibition of Pol II or depletion of RHS4. Immunofluorescence with anti-RHS2 antisera shows that RHS2 is largely excluded from the nucleus of untreated control cells or the uninduced RHS4 RNAi cell line, but shows increased nuclear localization in cells treated with 10 μg ml−1 Actinomycin D (ActD) for 2 h or in cells from the RHS4 RNAi line 2 days post induction. Cells were co-stained with DAPI. Scale bar = 5μm. (B) Quantification of nuclear fluorescence signals with Fiji 2.0 using the cell lines described above. As an additional control, a cell line in which RHS2 is tagged with an HA epitope (RHS2HA) was either untreated, or treated with ActD for 2 h, and stained with an anti-HA antibody. P-values are shown for unpaired, two-tailed t-tests. Error bars = SD. (C) Fluorescence in situ hybridization (FISH) to detect spliced and polyadenylated RNAs. RNAi lines for Mex67, RHS2, RHS4 or RHS6 were induced with tetracycline for 2 days and hybridized with Cy3-oligo d(T) to detect poly(A) tails (PolyA) and a Cy5-labelled probe complementary to the spliced leader (SL). The latter also binds to the spliced leader precursor. Cells were stained in parallel with DAPI. Scale bar = 5 μm. The RHS4 RNAi cell line cultured in the absence of tetracycline served as the control. (D) Quantification of polyA signals with Fiji 2.0. Biological triplicates were performed for each cell line. Cyt: >70% of the signal is cytoplasmic. Nuc: >70% of the signal is nuclear. For each sample, n = 200 cells. P-values are shown for unpaired, two-tailed t-tests. Error bars = SD. White arrows point to examples of cells with nuclear accumulation of poly(A) but no detectable SL RNA. (E) Upper panel: Northern blot analysis of total RNA extracted from cells cultured without tetracycline (–Tet) or induced with tetracycline for 2 days (+Tet). An oligonucleotide complementary to the spliced leader, 5′-labelled with 32P-phosphate, was used as a probe. Signals were detected using a Typhoon FLA 7000 (GE Healthcare). Lower panel: quantification of 3 biological replicates. Signals for mRNA and the SL precursor RNA were normalized to 18S rRNA. Error bars = SD.

RHS depletion causes defects in RNA export and spliced leader expression (A) Redistribution of RHS2 in response to inhibition of Pol II or depletion of RHS4. Immunofluorescence with anti-RHS2 antisera shows that RHS2 is largely excluded from the nucleus of untreated control cells or the uninduced RHS4 RNAi cell line, but shows increased nuclear localization in cells treated with 10 μg ml−1 Actinomycin D (ActD) for 2 h or in cells from the RHS4 RNAi line 2 days post induction. Cells were co-stained with DAPI. Scale bar = 5μm. (B) Quantification of nuclear fluorescence signals with Fiji 2.0 using the cell lines described above. As an additional control, a cell line in which RHS2 is tagged with an HA epitope (RHS2HA) was either untreated, or treated with ActD for 2 h, and stained with an anti-HA antibody. P-values are shown for unpaired, two-tailed t-tests. Error bars = SD. (C) Fluorescence in situ hybridization (FISH) to detect spliced and polyadenylated RNAs. RNAi lines for Mex67, RHS2, RHS4 or RHS6 were induced with tetracycline for 2 days and hybridized with Cy3-oligo d(T) to detect poly(A) tails (PolyA) and a Cy5-labelled probe complementary to the spliced leader (SL). The latter also binds to the spliced leader precursor. Cells were stained in parallel with DAPI. Scale bar = 5 μm. The RHS4 RNAi cell line cultured in the absence of tetracycline served as the control. (D) Quantification of polyA signals with Fiji 2.0. Biological triplicates were performed for each cell line. Cyt: >70% of the signal is cytoplasmic. Nuc: >70% of the signal is nuclear. For each sample, n = 200 cells. P-values are shown for unpaired, two-tailed t-tests. Error bars = SD. White arrows point to examples of cells with nuclear accumulation of poly(A) but no detectable SL RNA. (E) Upper panel: Northern blot analysis of total RNA extracted from cells cultured without tetracycline (–Tet) or induced with tetracycline for 2 days (+Tet). An oligonucleotide complementary to the spliced leader, 5′-labelled with 32P-phosphate, was used as a probe. Signals were detected using a Typhoon FLA 7000 (GE Healthcare). Lower panel: quantification of 3 biological replicates. Signals for mRNA and the SL precursor RNA were normalized to 18S rRNA. Error bars = SD. To test if RHS are involved in transporting mRNAs to the cytoplasm, we analysed the RNAi cell lines by fluorescence in situ hybridization (FISH). For this we used two probes: an oligo d(T) probe that binds the poly(A) tails and a spliced leader (SL) probe that recognizes the conserved 39 base sequence at the 5′ end of all mRNAs. It has been shown previously that blocking export by down-regulating Mex67 causes poly(A) RNA to accumulate in the nucleus (33,75). We replicated these findings and also showed that RNAs detected with the SL probe accumulate in the same way (Figure 5C). Depletion of RHS2, 4 or 6 resulted in accumulation of poly(A) RNA in the nucleus (Figures 5C and D) indicating that RNA export was impaired. This was most pronounced for RHS4, which is down-regulated more efficiently than the other RHS. In contrast to the Mex67 RNAi line, however, RHS-depleted cells with nuclear poly(A) RNA had a strongly decreased signal for SL (Figure 5C). This suggests that the RNAs accumulating in the nucleus are not mature (i.e. have not been trans-spliced). The multiple copies of the SL precursor are transcribed by Pol II (18). Moreover, RBP1 and RHS proteins are associated with this region of the genome (Figure 4E), so one possibility is that knocking down RHS would lead to a defect in their synthesis. Northern blot analysis confirmed that cells depleted of RHS4 had lower levels of the SL precursor and of spliced mRNAs (Figure 5E). Since treatment with ActD leads to increased nuclear RHS2, we analysed its effect on the distribution of poly(A) RNA. Within 2 h there was a clear increase in the amount of poly(A) RNA in the nucleus (Supplementary Figure S4). Taken together, these results show that blocking transcription results in nuclear accumulation of mRNA and that the same phenotype is observed when RHS proteins are depleted. This suggests that active transcription and mRNA export are interconnected, which is consistent with recent results from Goos et al. (36). In this respect trypanosomes resemble yeast and metazoa, which coordinate these two processes via transcription elongation complexes (76), although they use alternative factors to accomplish this.

RHS depletion causes a global reduction in RNA polymerase II transcription

If RHS are required for transcription elongation, we would predict that knocking them down would affect the synthesis of nascent RNAs. To test this, cultures were pulsed with 5-ethynyl uridine (5-EU) for 10 min. Cells were subsequently separated into nuclear and cytoplasmic fractions before extracting RNA, biotinylating the ethynyl groups, and cleaving the RNA into fragments of ∼300 bases. Fragmentation is necessary to ensure that only newly synthesized RNAs, rather than entire polycistronic transcripts, are isolated. Biotinylated RNA was then captured with streptavidin beads. With this length of pulse, approximately equal amounts of incorporated 5-EU were in the nuclear and cytoplasmic fractions of wild type cells (Supplementary Figure S5A). When the RNAi lines were analysed by the same method, depletion of RHS resulted in reduced amounts of 5-EU-labelled nuclear RNA in all three cases (Figure 6A). To determine whether this effect was global or restricted to specific regions of the genome, the nascent RNAs were sequenced. Global run-on sequencing (GRO-Seq) has not been performed before in trypanosomes and at present there are no reliable methods for depleting ribosomal RNAs. To overcome this problem, we sequenced the entire biotinylated RNA obtained from the RNAi lines and mapped the reads to the coding regions of genes. In each case we obtained a minimum of 1.25 million mapped reads per sample. Biological replicates showed excellent correlations (>0.93; Supplementary Figure S5B). Moreover, the correlation between induced and uninduced RNAi lines within an experiment was >0.98 (Figure 6B). This indicates that the same regions are transcribed, albeit to a lesser extent when RHS were depleted. In summary, our results demonstrate that RHS proteins are globally involved in Pol II transcription in trypanosomes.
Figure 6.

RHS are involved in global mRNA synthesis (A) Quantification of 5-ethynyl uridine (5-EU) incorporation. Each RNAi line was cultured in the presence or absence of tetracycline for 2 days and pulsed for 10 min with 5-EU (n = 3). Values are given as a percentage of the uninduced control. P-values are shown for paired, one-tailed t-tests. Error bars = SD. (B) Correlation between reads per million (RPM) for nascent mRNAs from RNAi lines cultured in the presence or absence of tetracycline for 2 days. R: Pearson correlation coefficient. Biological replicates were performed. See also Supplemental Figure S5.

RHS are involved in global mRNA synthesis (A) Quantification of 5-ethynyl uridine (5-EU) incorporation. Each RNAi line was cultured in the presence or absence of tetracycline for 2 days and pulsed for 10 min with 5-EU (n = 3). Values are given as a percentage of the uninduced control. P-values are shown for paired, one-tailed t-tests. Error bars = SD. (B) Correlation between reads per million (RPM) for nascent mRNAs from RNAi lines cultured in the presence or absence of tetracycline for 2 days. R: Pearson correlation coefficient. Biological replicates were performed. See also Supplemental Figure S5.

DISCUSSION

Trypanosoma brucei transcriptional regulation is divergent from most other eukaryotes and may take advantage of some species-specific factors to overcome these differences. We hypothesized that RHS proteins, which were previously found to associate with chromatin and Pol II (44–46), might play such a role in trypanosomes. These proteins have been neglected since their discovery, in part because of the lack of similarity to any other known proteins outside the genus Trypanosoma and in part because of the absence of informative functional domains. RHS mRNAs and proteins are extremely abundant and have often been identified in transcriptomic and proteomic studies, but tend to be excluded from analyses, either as probable contaminants or because of their daunting complexity (17,77,78). We show that members of the RHS2, RHS4 and RHS6 sub-families are required for the growth of trypanosomes and that each sub-family plays non-redundant and essential roles in transcription and mRNA export. Together they form part of a larger network in which RHS6 is most closely associated with the DNA, RHS4 is part of the core complex of Pol II (44,45) and RHS2 connects transcription with the translation machinery. RHS2 and RHS6 interact with different sets of proteins, with the former associating mainly with cytosolic RNA-binding proteins and translation initiation factors, and the latter with all the other nuclear RHS (RHS1, RHS4 and RHS5), as well as factors involved in chromatin remodelling, and transcription elongation. Orthologues of known elongation factors in other eukaryotes co-precipitate with the RHS6 complex and show good overlap with proteins recently found to interact with Pol II (17). In addition to the factors described above, several subunits of Replication Factor C, a protein complex known to be of major importance in DNA replication, were associated with RHS6. This might explain why RHS6 depletion compromises the replication of nuclear DNA resulting in the accumulation of zoids. It is difficult, however, to know whether this effect is direct or a secondary consequence of a transcriptional defect. A connection between transcription and replication has been demonstrated in several organisms (79–81). These two processes may also be interlinked in trypanosomes, as two initiators of replication, ORC1 and ORC6, bind at transcription start sites (TSS) and their depletion causes changes in mRNA abundance (82). Further evidence that RHS are involved in transcription elongation comes from ChIP-seq, with excellent correlations between the binding sites for RHS and active Pol II. In other eukaryotes the histone modification H3K36me3 is a marker for elongation by Pol II (83–85). Unfortunately, this epitope is only partially conserved in trypanosomes and commercial antibodies directed against it are not specific (data not shown). Moreover, no recognisable orthologues of Set2 histone methyltransferase can be identified. Interestingly, we did not find evidence of Pol II accumulation at TSS. This may reflect the polycistronic nature of transcription, rather than the individual regulation of promoters operating in the majority of eukaryotes. It is also worth noting that trypanosomes, like yeasts (86,87), do not encode recognisable orthologues of NELF subunits which pause Pol II at promoters in metazoa (88,89). The ChIP-seq experiments also indicate that silent regions of the genome are silent in every respect, with no Pol II binding, no trans-splicing and no RHS. It is possible that these regions are covered with other proteins to prevent access to them, or that some regions of the genome are sequestered in a particular domain of the nucleus that is less accessible. Depletion of RHS caused a reduction in the incorporation of 5-EU into nascent RNA, prompting us to establish GRO-Seq to determine which RNAs were affected. In keeping with their distribution throughout the transcription units, RHS seem to be required for the production of all mRNAs transcribed by Pol II. Once again, this suggests that RHS are general facilitators of transcription. While we cannot exclude entirely that RHS might also be required for transcription initiation, their main roles appear to be in elongation. Messenger RNA export is another process that is tightly connected to transcription in other organisms (90–93). This would also appear to be true of T. brucei, as depletion of any of the RHS resulted in accumulation of nuclear RNAs that were polyadenylated, but not spliced. This differed from the phenotype caused by the depletion of the export factor Mex67, which resulted in accumulation of mRNAs that were trans-spliced and polyadenylated. These disparities might be explained by the finding that the SL precursor, which is also transcribed by Pol II, was reduced after knockdown of RHS. In other eukaryotes, actively transcribing Pol II is localized close to nuclear pores to facilitate mRNA export (94–96). Consistent with this, we found that RNA export factors and nucleoporins were associated with RHS6. There are numerous examples of proteins involved in transcription that shuttle between the nucleus and the cytoplasm (72–74). We postulate that this might be the case for RHS2, which is predominantly cytoplasmic, but is also associated with chromatin. Multiple copies of RHS genes have been annotated in all African trypanosomes (T. brucei, T. congolense, T. vivax) and in other members of the genus Trypanosoma, such as T. grayi, T. theileri and T. cruzi. T. cruzi RHS have been identified in the nuclear proteome and shown to be upregulated after heat shock, but nothing is known about their function (97,98). Leishmania spp., despite belonging to the trypanosomatids, have no RHS genes. A possible explanation could come from phylogenetic studies, which indicate that Leishmania and trypanosomes have quite distant evolutionary histories (99). One reason that the RHS family might have expanded is that they contain a hot spot for retrotransposon insertion, which creates nonsense mutations or frameshifts within the coding sequence and results in the production of truncated proteins. As a consequence, the parasites may use gene duplication as a means of preserving intact copies of the genes. The fact that RHS genes are largely sub-telomeric would aid recombination. Both gene duplication and retrotransposon insertion seem to happen independently in different trypanosome strains, as there is a lack of synteny in the distribution of RHS genes (TritrypDB). All in all, this suggests that retrotransposon insertion is an undesirable event that might still occur. In conclusion, our data show that RHS proteins are novel factors that are intimately associated with the production and export of Pol II transcripts in trypanosomes. This, in turn, demonstrates that the components of eukaryotic transcription are far from being universal, and indicates that studies of a diverse range of organisms would be beneficial to our understanding of how this process has evolved over time.

DATA AVAILABILITY

Raw read files are deposited at the European Nucleotide Archives (ENA) http://www.ebi.ac.uk/ena as study PRJEB29185. Click here for additional data file.
  100 in total

1.  Trypanosome spliced leader RNA genes contain the first identified RNA polymerase II gene promoter in these organisms.

Authors:  G Gilinger; V Bellofatto
Journal:  Nucleic Acids Res       Date:  2001-04-01       Impact factor: 16.971

Review 2.  The role of chromatin during transcription.

Authors:  Bing Li; Michael Carey; Jerry L Workman
Journal:  Cell       Date:  2007-02-23       Impact factor: 41.582

3.  Transcription of the Trypanosoma brucei spliced leader RNA gene is dependent only on the presence of upstream regulatory elements.

Authors:  A Günzl; E Ullu; M Dörner; S P Fragoso; K F Hoffmann; J D Milner; Y Morita; E K Nguu; S Vanacova; S Wünsch; A O Dare; H Kwon; C Tschudi
Journal:  Mol Biochem Parasitol       Date:  1997-03       Impact factor: 1.759

4.  Trypanosoma brucei repeated element with unusual structural and transcriptional properties.

Authors:  N B Murphy; A Pays; P Tebabi; H Coquelet; M Guyaux; M Steinert; E Pays
Journal:  J Mol Biol       Date:  1987-06-20       Impact factor: 5.469

5.  The role of deadenylation in the degradation of unstable mRNAs in trypanosomes.

Authors:  Angela Schwede; Theresa Manful; Bhaskar Anand Jha; Claudia Helbig; Natalia Bercovich; Mhairi Stewart; Christine Clayton
Journal:  Nucleic Acids Res       Date:  2009-07-13       Impact factor: 16.971

6.  Base J represses genes at the end of polycistronic gene clusters in Leishmania major by promoting RNAP II termination.

Authors:  David L Reynolds; Brigitte T Hofmeister; Laura Cliffe; T Nicolai Siegel; Britta A Anderson; Stephen M Beverley; Robert J Schmitz; Robert Sabatini
Journal:  Mol Microbiol       Date:  2016-06-01       Impact factor: 3.501

7.  SAGA and a novel Drosophila export complex anchor efficient transcription and mRNA export to NPC.

Authors:  Maria M Kurshakova; Alexey N Krasnov; Daria V Kopytova; Yulii V Shidlovskii; Julia V Nikolenko; Elena N Nabirochkina; Danièle Spehner; Patrick Schultz; Làszlò Tora; Sofia G Georgieva
Journal:  EMBO J       Date:  2007-11-22       Impact factor: 11.598

8.  Cold shock and regulation of surface protein trafficking convey sensitization to inducers of stage differentiation in Trypanosoma brucei.

Authors:  Markus Engstler; Michael Boshart
Journal:  Genes Dev       Date:  2004-11-15       Impact factor: 11.361

9.  Operon conservation and the evolution of trans-splicing in the phylum Nematoda.

Authors:  David B Guiliano; Mark L Blaxter
Journal:  PLoS Genet       Date:  2006-10-09       Impact factor: 5.917

10.  Unveiling the Trypanosoma cruzi Nuclear Proteome.

Authors:  Agenor de Castro Moreira dos Santos Júnior; Dário Eluan Kalume; Ricardo Camargo; Diana Paola Gómez-Mendoza; José Raimundo Correa; Sébastien Charneau; Marcelo Valle de Sousa; Beatriz Dolabela de Lima; Carlos André Ornelas Ricart
Journal:  PLoS One       Date:  2015-09-18       Impact factor: 3.240

View more
  7 in total

1.  Regulation of gene expression in trypanosomatids: living with polycistronic transcription.

Authors:  Christine Clayton
Journal:  Open Biol       Date:  2019-06-05       Impact factor: 6.411

2.  An Alba-domain protein required for proteome remodelling during trypanosome differentiation and host transition.

Authors:  Shubha Bevkal; Arunasalam Naguleswaran; Ruth Rehmann; Marcel Kaiser; Manfred Heller; Isabel Roditi
Journal:  PLoS Pathog       Date:  2021-01-25       Impact factor: 6.823

3.  Nuclear mRNA maturation and mRNA export control: from trypanosomes to opisthokonts.

Authors:  Susanne Kramer
Journal:  Parasitology       Date:  2021-01-19       Impact factor: 3.234

4.  Developmental changes and metabolic reprogramming during establishment of infection and progression of Trypanosoma brucei brucei through its insect host.

Authors:  Arunasalam Naguleswaran; Paula Fernandes; Shubha Bevkal; Ruth Rehmann; Pamela Nicholson; Isabel Roditi
Journal:  PLoS Negl Trop Dis       Date:  2021-09-20

5.  Genomic Organization and Generation of Genetic Variability in the RHS (Retrotransposon Hot Spot) Protein Multigene Family in Trypanosoma cruzi.

Authors:  Werica P Bernardo; Renata T Souza; André G Costa-Martins; Eden R Ferreira; Renato A Mortara; Marta M G Teixeira; José Luis Ramirez; José F Da Silveira
Journal:  Genes (Basel)       Date:  2020-09-17       Impact factor: 4.096

6.  Cyclic AMP signalling and glucose metabolism mediate pH taxis by African trypanosomes.

Authors:  Sebastian Knüsel; Daniel Abbühl; Arunasalam Naguleswaran; Sebastian Shaw; Ruth Etzensperger; Mattias Benninger; Isabel Roditi
Journal:  Nat Commun       Date:  2022-02-01       Impact factor: 17.694

7.  A hub-and-spoke nuclear lamina architecture in trypanosomes.

Authors:  Norma E Padilla-Mejia; Ludek Koreny; Jennifer Holden; Marie Vancová; Julius Lukeš; Martin Zoltner; Mark C Field
Journal:  J Cell Sci       Date:  2021-06-21       Impact factor: 5.285

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.