Literature DB >> 35312693

Several different sequences are implicated in bloodstream-form-specific gene expression in Trypanosoma brucei.

Tania Bishola Tshitenge1, Lena Reichert1, Bin Liu1, Christine Clayton1.   

Abstract

The parasite Trypanosoma brucei grows as bloodstream forms in mammalian hosts, and as procyclic forms in tsetse flies. In trypanosomes, gene expression regulation depends heavily on post-transcriptional mechanisms. Both the RNA-binding protein RBP10 and glycosomal phosphoglycerate kinase PGKC are expressed only in mammalian-infective forms. RBP10 targets procyclic-specific mRNAs for destruction, while PGKC is required for bloodstream-form glycolysis. Developmental regulation of both is essential: expression of either RBP10 or PGKC in procyclic forms inhibits their proliferation. We show that the 3'-untranslated region of the RBP10 mRNA is extraordinarily long-7.3kb-and were able to identify six different sequences, scattered across the untranslated region, which can independently cause bloodstream-form-specific expression. The 3'-untranslated region of the PGKC mRNA, although much shorter, still contains two different regions, of 125 and 153nt, that independently gave developmental regulation. No short consensus sequences were identified that were enriched either within these regulatory regions, or when compared with other mRNAs with similar regulation, suggesting that more than one regulatory RNA-binding protein is important for repression of mRNAs in procyclic forms. We also identified regions, including an AU repeat, that increased expression in bloodstream forms, or suppressed it in both forms. Trypanosome mRNAs that encode RNA-binding proteins often have extremely extended 3'-untranslated regions. We suggest that one function of this might be to act as a fail-safe mechanism to ensure correct regulation even if mRNA processing or expression of trans regulators is defective.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35312693      PMCID: PMC8982893          DOI: 10.1371/journal.pntd.0010030

Source DB:  PubMed          Journal:  PLoS Negl Trop Dis        ISSN: 1935-2727


Introduction

Kinetoplastids are unicellular flagellates, some of which are parasites of vertebrates, invertebrates and plants. The African trypanosome Trypanosoma brucei is a kinetoplastid that causes sleeping sickness in humans in Africa and infects livestock throughout the tropics, with a substantial economic impact [1]. T. brucei are transmitted by a definitive host, the Tsetse fly, or during passive blood transfer by biting flies. The parasites multiply extracellularly as long slender bloodstream forms in the mammalian blood and tissue fluids, escaping the host immune response by expressing different Variant Surface Glycoproteins (VSGs) [2]. High cell density triggers growth arrest and a quorum sensing response, prompting differentiation of the long slender bloodstream forms to stumpy forms [3]. The stumpy form is pre-adapted for differentiation into the procyclic form, which multiplies in the Tsetse midgut. The transition from the mammalian host to the Tsetse fly entails a decrease in temperature, from 37°C to between 20°C and 32°C; and a switch from glucose to amino acids as the main source of energy, with a concomitant change to dependence on mitochondrial metabolism for energy generation [4,5]. Meanwhile, the VSG coat is replaced by the procyclins [6,7]. Procyclic forms later undergo further differentiation steps before developing into epimastigotes, and then mammalian-infective metacyclic forms in the salivary glands [8]. The developmental transitions in the T. brucei life cycle are marked by extensive changes in mRNA and protein levels [9-17]. Differentiation of trypanosomes from the bloodstream form to the procyclic form can be achieved in vitro through addition of 6 mM cis-aconitate and a temperature switch from 37°C to 27°C, followed by a medium change [18,19]. Bloodstream-form trypanosomes rely on substrate-level phosphorylation for ATP generation, and the first seven enzymes of glycolysis and glycerol metabolism are located in a microbody, the glycosome [20,21]. Within the branch of the pathway that has pyruvate as the end-product, the last enzyme that is in the glycosome is phosphoglycerate kinase (PGK). In procyclic-form trypanosomes, although the glycosomal glycolytic pathway is retained, it has some other roles and most phosphoglycerate kinase activity is in the cytosol. The trypanosome genome encodes three PGK isozymes [22], all of which are enzymatically active [23-25]. PGKA (Tb927.1.720) is expressed at low levels and is in the glycosome [26]; PGKB (Tb927.1.710) is the cytosolic enzyme found in procyclic forms; and PGKC (Tb927.1.700) is the glycosomal isozyme that is found in mammalian-infective forms [27,28]. This regulated compartmentation is essential. Expression of cytosolic PGK is lethal to bloodstream-form trypanosomes [29], perhaps because it disrupts the ATP/ADP balance within the glycosome [30,31]; and expression of PGKC in procyclic forms is similarly lethal (at least in the presence of glucose), probably because it reduces cytosolic ATP production [32]. In Kinetoplastids, nearly all protein-coding genes are arranged in polycistronic transcription units. Mature mRNAs are generated from the primary transcript by 5’-trans-splicing of a 39nt capped leader sequence, and by 3’-polyadenylation (reviewed in [33,34]). The detection of intergenic RNA from the PGK locus was indeed one of the early pieces of evidence for polycistronic transcription [35]. Polyadenylation is not very precise: sites are found in a region that is about 140nt upstream of the trans-splice site that is used to process the following mRNA [11,36], and quite often, alternative splice acceptor sites (and therefore polyadenylation sites) are used. The parasite regulates mRNAs mainly by post-transcriptional mechanisms, supplemented, in the case of some constitutively abundant mRNAs, by the presence of multiple gene copies. Regulation of mRNA processing, degradation, and translation are therefore central to parasite homeostasis, and for changes in gene expression during differentiation [14-17]. The sequences required for regulation of mRNA stability and translation often lie in the 3’-UTRs of the mRNAs, and most regulation so far has been found to depend on RNA-binding proteins [37,38]. RBP10 (Tb927.8.2780) is an RNA-binding protein which has been detected only in bloodstream forms and metacyclic forms. The RBP10 mRNA is also at least four times more abundant in bloodstream forms than in procyclic forms [9,10,39-41]; it persists in stumpy forms [42-44], but decreases rapidly after induction of differentiation [45]. Within the Tsetse fly, RBP10 mRNA may be up-regulated in metacyclic forms [9,10,46] although this was not seen in all studies. Procyclic forms can be induced to differentiate to epimastigotes, and then mammalian-infective metacyclic forms, by induced expression of the RNA-binding protein RBP6 (Tb927.3.2930) [46,47]. RBP10 protein is detected in bloodstream forms [39] and RBP6-induced metacyclic forms [40,47]. Using mass spectrometry, the degree of RBP10 regulation is difficult to calculate because of poor detectability in procyclic forms, but was estimated in one study at 25-fold [48]; in another, RBP10 protein was undetectable in stumpy forms [13]. RBP10 specifically associates with procyclic-specific mRNAs that contain the motif UA (U)6 in their 3’-UTRs, targeting them for destruction [49]. Depletion of RBP10 from bloodstream forms gives cells that can only grow as procyclic forms [49] and RBP10 expression in procyclic forms makes them able to grow only as bloodstream forms, with expression of some metacyclic-form VSG mRNAs but no detectable formation of epimastigotes [49,50]. After RBP6 induction in procyclic forms, RBP10 was also required for expression of metacyclic VSG mRNAs [46]. Correct developmental regulation of RBP10 is therefore critical throughout the parasite life cycle. The distribution of 3’-UTR lengths in trypanosomes is remarkably broad. Initial estimates from high-throughput RNA sequencing (RNA-Seq) suggested median lengths of 400-500nt [11,36] and 6% over 2kb. Subsequent studies of individual genes have revealed that these are under-estimates: some trypanosome mRNAs have 3’-UTRs of 5kb or more. This is rather unusual compared with Opisthokont model organisms. For example, Saccharomyces cerevisiae 3’-UTRs are 0-1461nt long, with a median of 104nt [51], while Caenorhabditis elegans 3’-UTRs have a median length of 133nt; less than 2% of C. elegans mRNAs had 3’-UTRs longer than 2kb [52]. Interestingly, the longest trypanosome 3’-UTRs tend to be found in mRNAs encoding RNA-binding proteins and protein kinases [37]. Since these protein classes have vital regulatory functions, it is likely that their expression has to be particularly tightly controlled. For example, the mRNA encoding the procyclic-specific mRNA binding protein ZC3H22 (Tb927.7.2680) has a 3’-UTR that is over 5kb long, with 9 copies of the UA (U)6 motif [53]. Over 400 T. brucei mRNAs are at least 10-fold better expressed in bloodstream forms than in procyclic forms, as judged by the relative numbers of ribosome footprints, which measure of both mRNA abundance and translation [14]. To our knowledge, however, the only short RNA motif that has so far been implicated in such regulation is an 8mer (UGCUACUU) that is specific to the 3’-UTR of the VSG mRNA [54]. In a previous study, we used a reporter assay to examine the functions of various segments of the PGKC 3’-UTR [55]. Results from deletions indicated that the sequences that were required for developmental regulation were in the terminal 424nt of the PGKC 3’-UTR [55]—a region that is likely to be bound by several different proteins [56]. In this paper, we aimed to find shorter sequences that are responsible for the bloodstream-form-specific expression of both the PGKC and RBP10 mRNAs. We found that PGKC has at least two such sequences, while RBP10 has at least six which are scattered throughout the 7.3 kb 3’-UTR. Our results suggest that several different sequence motifs—and therefore, probably, a similar number of RNA-binding proteins—are implicated in controlling bloodstream-form-specific mRNA stability and translation.

Materials and methods

Trypanosome culture

The experiments in this study were carried out using the pleomorphic cell line EATRO 1125 [57], constitutively expressing the tetracycline repressor. The bloodstream form parasites were cultured as routinely in HMI-9 medium supplemented with 10% heat inactivated foetal bovine serum at 37°C with 5% CO2. During proliferation, the cells were diluted to 1x105 cells/ml and maintained in density between 0.2–1.5x106 [58]. To preserve the pleomorphic morphology between the experiments, the EATRO 1125 cells were maintained in HMI-9 medium containing 1.1% methylcellulose [59]. For generation of stable cell lines, ~1–2 x 107 cells were transfected by electroporation with 10 μg of linearized plasmid at 1.5 kV on an AMAXA Nucleofector. Selection of new transfectants was done after addition of appropriate antibiotic and serial dilution, usually about 6-8h after transfection. The differentiation of bloodstream forms to procyclic forms was induced by addition of 6mM cis-aconitate (Sigma) to 1x106 long slender trypanosomes; after 17-24h, the cells were transferred into procyclic form media (~ 8x105 cells/ml) and maintained at 27°C without CO2.

Plasmid constructs

To assess the role of the RBP10 3’-UTR at full length, a cell line expressing a chloramphenicol acetyltransferase (CAT) mRNA with the RBP10 3’-UTR was generated by replacing one allele of the RBP10 coding sequence with the coding region of the CAT reporter. The RBP10 5’-UTR was also replaced with the beta-tubulin (TUB) 5’-UTR (Fig 1E). To map the regulatory sequences, plasmids used for stable transfection were based on pHD2164, a dicistronic vector containing the CAT and neomycin phosphotransferase (NPT) resistance genes (Fig 2A). Downstream of the CAT gene, we cloned different fragments of the RBP10 3’-UTR in place of the actin (ACT) 3’-UTR, using Sal I and Xho I restriction sites (Fig 2A). The different fragments were obtained by PCR using genomic DNA from Lister 427 trypanosomes as the template. Mutations on smaller fragments of the RBP10 3’-UTR were done using either site directed mutagenesis (NEB, Q5 Site-Directed Mutagenesis Kit Quick Protocol, E0554) or by PCR mutagenesis with Q5 DNA polymerase.
Fig 1

Developmental regulation of the RBP10 mRNA.

A. RNA-sequencing data and ribosome profiling [14] reads for bloodstream forms (B) and procyclic forms (P), aligned to the relevant segment of the TREU927 reference genome. Unique reads are in blue and non-unique reads are in grey, on a log scale. The sequences in the RBP10 (Tb927.8.2780) 3’-UTR designated as non-unique in this image are also present in an TREU927 DNA contiguous sequence that has not been assigned to a specific chromosome; this segment is absent in the Lister427 (2018) genome. The positions of open reading frames (black) and untranslated regions have been re-drawn, with the initial "Tb927.8" removed for simplicity. Transcription is from left to right. Data from all remaining panels are for the EATRO1125 strain. B. Northern blot for two independent cultures showing the relative mRNA abundance of RBP10 mRNA in bloodstream forms (B) and in procyclic forms (P). A section of methylene blue staining is depicted to show the loading and the measured amounts in procyclic forms, relative to bloodstream forms, are also shown. C. Regulation of RBP10 protein expression during differentiation. Cells were incubated with or without 6 mM cis-aconitate at the temperatures and for the times indicated. The asterisk shows a band that cross-reacts with the antibody, and was used as a control. The graph shows quantitation (individual results) from three independent experiments, the horizontal line is the arithmetic mean. D. Half-life of RBP10 mRNA. Cells were incubated with Actinomycin D (10 μg/ml) and Sinefungin (2 μg/ml) to stop both mRNA processing and transcription. The amount of RBP10 mRNA in bloodstream forms was measured in three independent replicates. The initial delay or even increase in abundance seen in one replicate is commonly seen for relatively stable trypanosome mRNAs, and is of unknown origin. E. The RBP10 3’-UTR is sufficient for regulation. A dicistronic construct mediating puromycin resistance (PAC gene) and encoding chloramphenicol acetyltransferase (CAT gene) replaced one of the two RBP10 open reading frames (upper panel) in bloodstream forms (B). These then were differentiated to procyclic forms (P). The lower panel shows measurements of CAT activities and mRNA for three independent cell lines, normalised to the average for bloodstream forms. Data for this Figure are at: https://figshare.com/articles/figure/Full_Northern_and_Western_blots_RBP10_Fig_1_pdf/17099168.

Fig 2

The RBP10 3’-UTR contains several regulatory sequences.

A. Cartoon showing the CAT reporter construct (pHD2164) used in this study. The different fragments of the RBP10 3’-UTR were cloned downstream of the CAT reporter coding sequence. The polyadenylation site "p (A)" is specified by the NPT splice signal. SL indicates the spliced leader addition sites. ACT denotes actin. B. Schematic diagram of the RBP10 3’-UTR segments used to map the regulatory sequences. Different segments are in different colours and the diagram is to scale. Measurements of CAT activity and mRNA levels are on the right. Each dot is a result for an independent clone, with blue and red representing bloodstream (B) an procyclic (P) forms, respectively. Values were normalized to those for the ACT 3’-UTR control (pHD2164). The average result for the control BSF cell line was set to 1. For "expression", averages are: "-" = <0.25x, "+/-" = 0.25–0.5x, "+" = 0.5-2x, "++" = 2-3x and "+++" >3x. The complete Northern blots are available at: https://figshare.com/articles/figure/Northern_blots_RBP10_3_-UTR/17085989 and the measurements for this and Fig 4 are at: https://figshare.com/articles/dataset/CAT_activity_and_mRNA_measurements_RBP10/19188431.

To study the PGKC 3’-UTR, the starting plasmid used, pHD3261 (Fig 3A), was built by incorporating various fragments either from previous plasmids [54], or made by PCR amplification with plasmid templates. The starting plasmid contains 790nt of the PGKC 3’-UTR, re-amplified from [60]. Most of the sequence, apart from the plasmid backbone, was verified by sequencing and this is included as S2 Text. Deletions were done by specific PCR followed by cloning between the Bam HI and Sal I sites (Fig 3). Some deletions occurred during the PCR, so the 3’-UTR sequences used are included as supplementary S1 Fig. The reconstructed sequence of the derivative with the 3’-UTR of genes encoding actin (ACT) is S2 File.
Fig 3

The PGKC 3’-UTR contains at least two regulatory sequences.

A. Cartoon showing one of the CFP reporter constructs (pHD3261) used in this study. Different fragments of the PGKC 3’-UTR were cloned downstream of the CFP reporter coding sequence. The polyadenylation site "p (A)" is specified by the PAC splice signal. TUB denotes the tubulin locus. ACT denotes actin. B. Schematic diagram of the PGKC 3’-UTR segments used to map the regulatory sequences. Measurements are on the right. Each dot is a result for an independent clone, with red and blue representing bloodstream (B) and procyclic (P) forms, respectively. Protein levels were estimated from Western blot quantitation and mRNA levels were measured by real-time PCR, taking the average of three technical replicates for at least two independent populations. Values were normalized to the arithmetic mean for the PGKC 3’-UTR control populations, bloodstream forms with pHD3261. For "expression", averages are: "-" = <0.25x, "+/-" = 0.25–0.5x, "+" = 0.5-2x, "++" = 2-3x and "+++" >3x. All available values for actin, including those from the experiment in Fig 4, were included in the current Figure. The Western blots for this and Fig 4 are at: https://figshare.com/articles/figure/Complete_Western_blots_for_PGKC_3_-UTR_assays/19188455 The qPCR results are at: https://figshare.com/articles/dataset/qPCR_for_PGKC_3_UTR_analysis/19188248 C. RNA from the bloodstream-form samples used in (B) were examined on Northern blots to examine the size of CFP mRNAs bearing PGKC 3’-UTRs. The methylene blue-stained membrane and the tubulin mRNAs are shown as loading controls. The gels ran slightly differently and the identity of the faint band at about 2 Kb in the right-hand blot is unknown, though it could be cross-hybridisation with the largest LSU rRNA. Measured mRNA lengths are in S2 Table. The original blots are at: https://figshare.com/articles/figure/Complete_Northern_blots_PGKC/19181888.

The precise details of the different constructs and their associated primers used for cloning are included in S1 Table.

RNA analysis

Total RNA was isolated from approximately 1x108 bloodstream-form trypanosomes or 5x107 procyclic-form cells growing in logarithmic phase (less than about 8 x 105/ml for bloodstream forms, or 4 x 106 /ml for procyclic forms) using either peqGold Trifast (PeqLab) or RNAzol RT following the manufacturer’s instructions. To detect the CAT or CFP mRNAs by Northern blot, 5 or 10 μg of total RNA was resolved on formaldehyde agarose gels, transferred onto nylon membranes (GE Healthcare) by capillary blotting and fixed by UV-crosslinking. The membranes were pre-hybridized in 5x SSC, 0,5% SDS with 200 mg/ml of salmon sperm DNA (200 mg/ml) and 1x Denhardt’s solution, for an hour at 65°C. The probes were generated by PCR of the coding sequences of the targeted mRNAs, followed by incorporation of radiolabelled [α32P]-dCTP and purification using the QIAGEN nucleotide removal kit according to the manufacturer’s instructions. The purified probes were then added to the prehybridization solution and the membranes were hybridized with the respective probes at 65°C for overnight (while rotating). After rinsing the membranes in 2x SSC buffer/0.5% SDS twice for 15 minutes, the probes were washed out once with 1x SSC buffer/0.5% SDS at 65°C for 15 minutes and twice in 0.1x SSC buffer/0.5% SDS at 65°C each for 10 minutes. The blots were then exposed onto autoradiography films for 24–48 hours and the signals were detected with the phosphorimager (Fuji, FLA-7000, GE Healthcare). Care was taken to ensure that signals were not over-exposed so that quantitation would be in the linear range. The signal intensities of the bands were measured using ImageJ. CFP mRNA levels were measured by quantitative real-time PCR as previously described [61]. To measure mRNA half-lives, mRNA transcription and trans-splicing were simultaneously inhibited by addition to the growth culture medium of 10 μg/ml Actinomycin D and 2 μg/ml Sinefungin. The cells were collected at the indicated different time points and RNA was isolated by Trizol extraction [62]. The mRNA levels were assessed by Northern blotting.

CAT assay

To perform the CAT assays, approximately 2×107 cells were harvested at 2300 rpm for 8 minutes and washed three times with cold PBS. The pellet was re-suspended in 200 μl of CAT buffer (100mM Tris-HCl pH 7.8) and lysed by freeze-thawing three times using liquid nitrogen and a 37°C heating block. The supernatants were then collected by centrifugation at 15,000×g for 5 min and kept in ice. The protein concentrations were determined by Bradford assay (BioRad) according to the manufacturer’s protocol. For each setup, 0.5 μg of protein in 50 μl of CAT buffer, 10 μl of radioactive butyryl CoA (14C), 2 μl of chloramphenicol (stock: 40 mg/ml), 200 μl of CAT buffer and 4 ml of scintillation cocktail were mixed in a Wheaton scintillation tube HDPE (neoLab #9–0149) and the incorporation of radioactive acetyl group on chloramphenicol was measured using a Beckman LS 6000IC scintillation counter.

Western blots

Protein samples were collected from approximately 5x106 cells growing at logarithmic phase. Samples were run according to standard protein separation procedures using SDS-PAGE. The primary antibodies used in this study were mouse monoclonal IgG against GFP (Santa Cruz Biotechnology) and rat α-ribosomal protein S9 (own antibody). We used horseradish peroxidase coupled secondary antibodies (α-rat, 1:2000 and α-mouse, 1:1000). The blots were developed using an enhanced chemiluminescence kit (Amersham) according to the manufacturer’s instructions. The signal intensities of the images were quantified using Image J.

Results

The RBP10 3’-UTR is sufficient for developmental regulation

Our first experiments were designed to check high-throughput results concerning RBP10 regulation. Results from RNA-Seq and ribosome profiling [17] suggest that the 3’-UTR is about 7.3 kb long, giving a total mRNA length of about 8.5 kb (Fig 1A). The middle region of the 3’-UTR is shown as being present more than once in the TREU927 genome assembly, as indicated by the grey-coloured reads in the alignment in Fig 1A, but we have been unable to locate this other copy in either this or the other available genomes (e.g. [63]). In our experiments we used the EATRO1125 T. brucei strain, which is differentiation-competent but for which an assembled genome [64] became available only after this work was complete.

Developmental regulation of the RBP10 mRNA.

A. RNA-sequencing data and ribosome profiling [14] reads for bloodstream forms (B) and procyclic forms (P), aligned to the relevant segment of the TREU927 reference genome. Unique reads are in blue and non-unique reads are in grey, on a log scale. The sequences in the RBP10 (Tb927.8.2780) 3’-UTR designated as non-unique in this image are also present in an TREU927 DNA contiguous sequence that has not been assigned to a specific chromosome; this segment is absent in the Lister427 (2018) genome. The positions of open reading frames (black) and untranslated regions have been re-drawn, with the initial "Tb927.8" removed for simplicity. Transcription is from left to right. Data from all remaining panels are for the EATRO1125 strain. B. Northern blot for two independent cultures showing the relative mRNA abundance of RBP10 mRNA in bloodstream forms (B) and in procyclic forms (P). A section of methylene blue staining is depicted to show the loading and the measured amounts in procyclic forms, relative to bloodstream forms, are also shown. C. Regulation of RBP10 protein expression during differentiation. Cells were incubated with or without 6 mM cis-aconitate at the temperatures and for the times indicated. The asterisk shows a band that cross-reacts with the antibody, and was used as a control. The graph shows quantitation (individual results) from three independent experiments, the horizontal line is the arithmetic mean. D. Half-life of RBP10 mRNA. Cells were incubated with Actinomycin D (10 μg/ml) and Sinefungin (2 μg/ml) to stop both mRNA processing and transcription. The amount of RBP10 mRNA in bloodstream forms was measured in three independent replicates. The initial delay or even increase in abundance seen in one replicate is commonly seen for relatively stable trypanosome mRNAs, and is of unknown origin. E. The RBP10 3’-UTR is sufficient for regulation. A dicistronic construct mediating puromycin resistance (PAC gene) and encoding chloramphenicol acetyltransferase (CAT gene) replaced one of the two RBP10 open reading frames (upper panel) in bloodstream forms (B). These then were differentiated to procyclic forms (P). The lower panel shows measurements of CAT activities and mRNA for three independent cell lines, normalised to the average for bloodstream forms. Data for this Figure are at: https://figshare.com/articles/figure/Full_Northern_and_Western_blots_RBP10_Fig_1_pdf/17099168. Previous transcriptome and ribosome profiling results indicated that there are about 4 copies of RBP10 mRNA per cell in bloodstream forms, and slightly under 1 per cell in procyclic forms [15], but that the ribosome density on the coding region is 9 [16] or 100 [14] times higher in bloodstream forms. Northern blot results for the EATRO1125 strain showed an RBP10 mRNA that migrated slower than the 6kb marker (Fig 1B). When we extrapolated the marker curve, the results gave an RBP10 mRNA length consistent with the 8.5 kb from RNA-Seq. In this experiment, there was 8-fold more RBP10 mRNA in bloodstream forms than in procyclic forms (Fig 1B). The amount of RBP10 protein decreased about 3-fold after 24h incubation with 6mM cis-aconitate at either 20°C or 27°C, and both cis-aconitate and the temperature drop were required for the regulation (Fig 1C). Previous RNA-Seq measurements suggested an RBP10 mRNA half-life of just over 1h in Lister 427 bloodstream forms [15]. Three new individual measurements in EATRO1125 were difficult to interpret because of an initial apparent increase in the mRNA, but did suggest a half-life of 1–2 h (Fig 1D). The initial increase has previously been seen for other stable mRNAs and its cause is unknown. For procyclic forms, the RNA-Seq replicates for degradation measurements were very poor [15], but from Northern blotting the half-life was probably less than 30 min (Fig 1D). Finally, to find out whether the RBP10 3’-UTR was sufficient for regulation, we integrated a chloramphenicol acetyltransferase (CAT) gene into the genome of strain EATRO1125 bloodstream forms, directly replacing one RBP10 allele (Fig 1E). After differentiation to procyclic forms, CAT mRNA was about 3-fold down-regulated, but there was no detectable CAT activity (Fig 1E). This shows that the RBP10 3’-UTR is sufficient for developmental regulation.

The RBP10 3’-UTR contains numerous regulatory elements

In order to find sequences that contribute to the stability and translation of RBP10 mRNA in bloodstream forms, or to its instability and translational repression in procyclic forms, we made use of a reporter plasmid that integrates into the tubulin locus, resulting in read-through transcription by RNA polymerase II. The CAT reporter mRNA has a 5’-UTR and splice signal from an EP procyclin gene (Fig 2A). At the 3’-end between CAT and NPT is an intergenic region from between the two actin (ACT) genes, with a restriction site exactly at the mapped polyadenylation site (Fig 2A). Polyadenylation of the CAT mRNA is directed by the polypyrimidine tract that precedes the ACT 5’-UTR. Cell lines can be selected with G418 using the NPT (neomycin phosphotransferase) marker, the mRNA of which has an ACT 5’-UTR. The reporter produces a CAT mRNA bearing the ACT 3’-UTR, with polyadenylation driven by the downstream splice site for NPT. In all experiments, the reporter plasmid was transfected into EATRO1125 bloodstream forms and two or three independent clones were then differentiated into procyclic forms. CAT activities were measured enzymatically and mRNA levels were measured by Northern blotting, which simultaneously allowed us to check the sizes of the mRNAs. All values were normalised to arithmetic mean results from the ACT 3’-UTR control. The sizes of the mRNAs are tabulated in S2 Table and the sequences are in S1 Text. Most of the reporter mRNAs migrated either as expected, or slightly faster. The latter suggests polyadenylation upstream of the expected sites.

The RBP10 3’-UTR contains several regulatory sequences.

A. Cartoon showing the CAT reporter construct (pHD2164) used in this study. The different fragments of the RBP10 3’-UTR were cloned downstream of the CAT reporter coding sequence. The polyadenylation site "p (A)" is specified by the NPT splice signal. SL indicates the spliced leader addition sites. ACT denotes actin. B. Schematic diagram of the RBP10 3’-UTR segments used to map the regulatory sequences. Different segments are in different colours and the diagram is to scale. Measurements of CAT activity and mRNA levels are on the right. Each dot is a result for an independent clone, with blue and red representing bloodstream (B) an procyclic (P) forms, respectively. Values were normalized to those for the ACT 3’-UTR control (pHD2164). The average result for the control BSF cell line was set to 1. For "expression", averages are: "-" = <0.25x, "+/-" = 0.25–0.5x, "+" = 0.5-2x, "++" = 2-3x and "+++" >3x. The complete Northern blots are available at: https://figshare.com/articles/figure/Northern_blots_RBP10_3_-UTR/17085989 and the measurements for this and Fig 4 are at: https://figshare.com/articles/dataset/CAT_activity_and_mRNA_measurements_RBP10/19188431.
Fig 4

Analysis of specific sequences.

A. Results for a CAT reporter plasmid containing RBP10 3’-UTR fragment 1.2 bearing (AU)10 repeats and a mutated version without them. The average result for the fragment with the repeats is set to 1. B. As (A) but showing deletion of a poly (A) tract from fragment 3.1.1. C. Results for a CFP reporter plasmid containing the ACT 3’-UTR with either (AT)10 or the PGKC* fragment inserted after the CFP coding region. The measurements are the averages of duplicate technical replicates for two independent populations, analysed with two different blots.

First, we examined four different fragments (1–4 in Fig 2B), each roughly 2 kb long. Fragment 4 extends beyond the RBP10 polyadenylation site, including the intergenic region before downstream gene (Tb927.8.2790); and the size of the resulting RNA suggested use of the genomic processing signals (S2 Table). It was notable that the results for RNA were much more variable than for the CAT activity. Since RBP10 protein also shows more regulation than the RNA, the following discussion will consider mostly the CAT activity. We consider CAT activity to be "low" ("-" in Fig 2) if the median value was less than 25% of the activity measured using our standard, the 3’-UTR from an actin gene (ACT), and "high" ("++" in Fig 2) if the median CAT activity was more than twice the control; results in between are designated "+". The degree of developmental regulation is calculated by dividing the lowest value for bloodstream forms by the highest value in procyclic forms. We did not calculate P-values because with 3 replicates, it is impossible to tell whether the values are normally distributed; and we here discuss only the fragments that gave similar results for all replicates. To our surprise, all of the 2kb fragments gave low CAT activity in procyclic forms (Fig 2B). Fragment 2 also reproducibly gave high CAT activity in bloodstream forms (Fig 2B). Clearly, several regulatory sequences were present. We next further dissected the fragments. We attempted to examine sequences of equal sizes, but were constrained by the need to design PCR primers that avoided the numerous low-complexity sequences in the RBP10 3’-UTR. Our results showed that fragments 1.1, 1.2, 1.4 and 2.2.2 (Fig 2B) each gave low CAT activity in procyclic forms with at least 4-fold developmental regulation; fragment 1.2 also reproducibly gave more activity in bloodstream forms than the parent sequence. Fragment 1.4 (234nt) was the shortest that gave this pattern; when it was cut in half, regulation was lost, either because we accidentally cut within a relevant motif, or because the cleavage adversely affected a required secondary structure. Fragments 3.2 and 3.2.2 gave low CAT activity in both stages although the mRNA was readily detected, suggesting possible translation repression. Fragments 2.1 and 2.2.4 suppressed CAT activity in procyclics but with rather low CAT activity in bloodstream forms and little RNA regulation, suggesting a role in repression of translation in procyclic forms. It was notable that sometimes, fragmentation of a sequence revealed activity that had been absent previously: for example, fragment 2.2 gave no regulation but the sub-fragment 2.2.2 regulated like fragment 2. Fragment 2.2 may also include a repressive element whose removal results in high activity in bloodstream forms (fragment 2.2.3). Fragment 3.1 also gave high activity in bloodstream forms. (Fig 2B). These results show that regulation of RBP10 expression is achieved by numerous sequence elements.

At least two segments of the PGKC 3’-UTR contain regulatory elements

To study regulation by the PGKC 3’-UTR, we used the reporter plasmid shown in Fig 3A. After restriction enzyme cleavage and transfection, the plasmid integrates into the tandemly repeated alpha-beta tubulin array, cleanly replacing an alpha tubulin gene (Fig 3A). It is therefore, like the previous reporter, transcribed by RNA polymerase II. A cyan fluorescent protein (CFP) reporter open reading frame is followed by a 3’-UTR, then an intergenic region (IGR), splice signal and 5’-UTR from the actin (ACT) locus. Splicing of the mRNA encoding CFP is directed by the alpha-tubulin signal, resulting in a CFP mRNA with an alpha-tubulin 5’-UTR and the 3’-UTR of interest—in this case, the ~780nt 3’-UTR of PGKC. The puromycin resistance marker mRNA (PAC, puromycin acetyltransferase) is trans-spliced using the signal from the ACT gene. This splice signal also directs polyadenylation of the CFP reporter mRNA, approximately at the position of the Sal I site that divides the tested 3’-UTRs from the intergenic region (Fig 3A). A full sequence of the plasmid is included as S1 File. To design deletions (Fig 3B) we examined the predicted secondary structure of the 3’-UTR using the RNA fold web server (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi). The resulting predictions are not included here because they were not subsequently verified in any way. However, our deletions were designed to avoid disruption of predicted secondary structures. The full sequence of the PGKC 3’-UTR is in S2 Text. For each deletion, several plasmids were sequenced and the one that best matched the expected sequence was selected for further use. In some cases a few nucleotides were deleted or exchanged: an alignment that shows all of the 3’-UTR sequences tested is in S1 Fig. (Plasmids 17 and 18 are not included but had no mutations relative to plasmid 16.)

The PGKC 3’-UTR contains at least two regulatory sequences.

A. Cartoon showing one of the CFP reporter constructs (pHD3261) used in this study. Different fragments of the PGKC 3’-UTR were cloned downstream of the CFP reporter coding sequence. The polyadenylation site "p (A)" is specified by the PAC splice signal. TUB denotes the tubulin locus. ACT denotes actin. B. Schematic diagram of the PGKC 3’-UTR segments used to map the regulatory sequences. Measurements are on the right. Each dot is a result for an independent clone, with red and blue representing bloodstream (B) and procyclic (P) forms, respectively. Protein levels were estimated from Western blot quantitation and mRNA levels were measured by real-time PCR, taking the average of three technical replicates for at least two independent populations. Values were normalized to the arithmetic mean for the PGKC 3’-UTR control populations, bloodstream forms with pHD3261. For "expression", averages are: "-" = <0.25x, "+/-" = 0.25–0.5x, "+" = 0.5-2x, "++" = 2-3x and "+++" >3x. All available values for actin, including those from the experiment in Fig 4, were included in the current Figure. The Western blots for this and Fig 4 are at: https://figshare.com/articles/figure/Complete_Western_blots_for_PGKC_3_-UTR_assays/19188455 The qPCR results are at: https://figshare.com/articles/dataset/qPCR_for_PGKC_3_UTR_analysis/19188248 C. RNA from the bloodstream-form samples used in (B) were examined on Northern blots to examine the size of CFP mRNAs bearing PGKC 3’-UTRs. The methylene blue-stained membrane and the tubulin mRNAs are shown as loading controls. The gels ran slightly differently and the identity of the faint band at about 2 Kb in the right-hand blot is unknown, though it could be cross-hybridisation with the largest LSU rRNA. Measured mRNA lengths are in S2 Table. The original blots are at: https://figshare.com/articles/figure/Complete_Northern_blots_PGKC/19181888. To test the functions of the 3’-UTR segments, we cut each plasmid to remove the plasmid backbone, transfected the DNA into bloodstream-form trypanosomes, and selected independent populations for further analysis. The transfection should normally result in parasites with a single copy of the plasmid into the alpha-beta-tubulin tandem repeat, although insertion of two or more copies would be possible. To allow for this, two different populations for each plasmid were selected for protein preparation, RNA preparation, and differentiation into procyclic forms. Protein levels were assessed for each population by quantitative Western blotting, with ribosomal protein S9 as a loading control (Fig 3C). RNA levels were measured in triplicate technical replicates for each clone, by reverse transcription and real-time PCR (Fig 3D). All values were normalized to the average of expression in bloodstream forms for the 782nt full-length PGKC 3’-UTR. The lengths of the mRNAs, measured in bloodstream forms (Fig 3E) mostly appeared to be around 100nt longer than the expected lengths, which were calculated assuming a poly(A) tail length of 60nt (S2 Table). Either we under-estimated the poly (A) tail length, or the Northern measurements are consistently a little too long. Using CAT reporters, in bloodstream forms the PGKC 3’-UTR gave twice as much mRNA and CAT activity as the ACT 3’-UTR [55], and there was no developmental regulation of the ACT 3’-UTR reporter. Using CFP as the reporter, the difference between PGKC and ACT in bloodstream forms was less marked, and the ACT 3’-UTR gave about 30% less activity in procyclic forms than in bloodstream forms. Our results revealed that all fragments that included the final 278nt showed developmental regulation of protein levels: the amount of protein in procyclic forms was too low to measure accurately (Fragments 1–5; Fig 3C). For these 3’-UTR fragments, RNA in procyclic forms was 15%±6% of the bloodstream-form level. The difference in mRNA level was less than that seen previously using CAT, and also less than that seen with the mRNAs that include the PGK coding region. It was also rather more variable, as we had previously observed for RBP10 (Fig 3D). Perhaps the mRNA level is more sensitive to cell density than the protein level; and the coding region may also contribute [65,66]. We here therefore focus mainly on regulation of protein expression. Bloodstream-form levels of expression for fragments 4 and 5 were slightly lower than for the full-length plasmid (Fig 3C). Next, we deleted from the 3’-end (Fig 3B, fragments 6–9). Deletion of the final 278nt (Fig 3B, fragment 7) was sufficient to abolish developmental regulation (Fig 3C). Intriguingly, further deletion (Fragment 8) gave a plasmid which again showed a roughly 2-fold difference between bloodstream and procyclic forms, while an additional deletion (Fragment 9) resulted in no regulation. The results so far localized the regulatory sequences to the last 278 nt of the 3’-UTR. Testing of individual fragments, however, gave a more nuanced picture. Fragment 12 gave regulation like fragment 8. Fragment 13 contains a UA (U)6 motif, but fragment 13, if anything, gave slightly more expression in bloodstream forms than in procyclic forms. Fragments 14 and 15 independently suppressed expression in procyclic forms, indicating that they contain two separate regulatory elements, although fragment 15 was the most effective. Fragment 16—which includes fragments 13 and 14—behaved like fragment 14. We next made 3’- deletions of fragment 16. Intriguingly, an initial trucation (fragment 17) gave a 3’-UTR with no regulation at all, but further removal (plasmid 18) restored the regulation. Plasmid 18 contains just 61nt of fragment 14 (designated with an asterisk). The combined results suggest that the 61nt in plasmid 18 include a region that represses expression in procyclic forms, but whose function is affected by the surrounding sequence. Additional regulatory elements are in fragments 12 and 15.

AU-Rich elements affect expression

We next looked for regulatory motifs that could be tested in another sequence context. First, we searched for sequences that might specifically repress expression in procyclic forms. We compared different sets of sequences using MEME or DREME [67,68]. We initially searched for 6–12 nt motifs enriched in the RBP10 regulatory fragments relative to the those that lacked regulation. (The maximum was set because most RNA-binding proteins are specific for less than 12nt.) No motifs were found but the sample size was small, so we expanded the dataset by including 3’-UTRs from other co-regulated bloodstream-form specific mRNAs: the two regulatory regions for PGKC, Phosphofructokinase (Tb927.3.3270), pyruvate kinase (Tb927.10.14140), glycerophosphate isomerase (Tb927.1.3830, the hexose transporter THT1 (Tb927.10.8450) [69], and an aquaglyceroporin (Tb927.10.14160) - 11 sequences altogether. As a comparator we used 402 3’-UTRs from procyclic-specific mRNAs, of the same average length. No motifs with statistically significant enrichment were found in the bloodstream-form mRNAs. Next, we looked at low-complexity regions which might serve to bind multiple copies of sequence-specific RNA-binding proteins. One element consists of (AU) repeats, present in fragments 1.2 ((AU)11), 1.3 ((AU)9 and (AU)8), and 3.2.1 ((AU)10), and of course the larger fragments that include them. We had some previous preliminary data that suggested that this sequence was implicated in good expression of ZC3H11, so we tested its function in the context of the RBP10 3’-UTR. Deletion of the AU repeat from RBP10 fragment 1.2 indeed resulted in a drastic decrease of reporter expression levels in bloodstream forms (Fig 4A). Further, several fragments of the RBP10 3’-UTR contain poly (A) tracts (F1.2, 2.1, 2.2.1, 2.2.2, 2.2.3, 3.1.2 and 3.2.1), and we speculated that they might act by recruiting a poly(A) binding protein. However, deletion of the poly (A) tracts from one of these fragments unexpectedly resulted in an increase in reporter expression levels in bloodstream forms, rather than a decrease (Fig 4B).

Analysis of specific sequences.

A. Results for a CAT reporter plasmid containing RBP10 3’-UTR fragment 1.2 bearing (AU)10 repeats and a mutated version without them. The average result for the fragment with the repeats is set to 1. B. As (A) but showing deletion of a poly (A) tract from fragment 3.1.1. C. Results for a CFP reporter plasmid containing the ACT 3’-UTR with either (AT)10 or the PGKC* fragment inserted after the CFP coding region. The measurements are the averages of duplicate technical replicates for two independent populations, analysed with two different blots. A brief search of annotated 3’-UTRs revealed (AU)9 or longer in 580 other mRNAs; this is almost certainly an under-estimate because many annotated 3’-UTRs are truncated. In order to find out whether either the repeat could enhance expression in another context, we inserted it between the CFP coding region and the ACT 3’-UTR. We here analysed just two independent populations. There was no effect on expression in either form (Fig 4C). Similarly, the 61nt fragment from the PGKC 3’-UTR (PGK*) was unable to repress expression in procyclic forms when inserted upstream of the ACT 3’-UTR. In both cases, therefore, the functions of these sequences depend on the surrounding sequence context.

Discussion

The main aim of this study was to find sequences that can cause bloodstream-form specific expression in T. brucei, while a secondary aim was to investigate the general reason for the existence of very long 3’-UTRs. Regarding the first aim, we found that the RBP10 3’-UTR contains at least six sequences that specifically give low expression in procyclic forms (fragments 1.1, 1.2, 1.4, 2.2.2, 3 and 4) while the much shorter PGKC 3’-UTR contains at least two. There was also evidence, from the RBP10 3’-UTR, for sequences that stimulate expression in bloodstream forms (1.1, 1.2, 2.2.3, 3.1), or repress (3.2.2) or enhance (2.2.3) in both forms. We were unable to identify any specific short (<12nt) enriched motifs, either repeated within the PGKC and RBP10 3’-UTRs, or in comparison with other similarly regulated mRNAs. However, we were able to narrow down the regulatory regions considerably. The absence of a single enriched motif suggests that several different trans-acting factors, with different sequence specificities, are able to act independently to ensure appropriate expression of both PGKC and RBP10 (Fig 5). These factors probably each act on a subset of other similarly regulated mRNAs as well. Therefore, if our results are combined with further investigations of additional 3’-UTRs, it is likely that some short regulatory motifs will begin to emerge. Our results also suggested that the action of the regulatory motifs is context-dependent. For example, fragment 2.2.2 gave stronger regulation than the parent fragment 2.2, and fragment 3.1 gave higher activity in bloodstream forms than fragment 3 (Fig 2B). Perhaps the different fragments are bound by proteins that compete for binding on the RBP10 3’-UTR and this binding is influenced by secondary structures. Therefore, a dissection might allow some proteins to bind more efficiently while others are losing binding.
Fig 5

Model for developmental regulation of RBP10 and PGKC.

The two mRNAs are shown approximately to scale, using the colour coding from Figs 2 and 3. Regulatory RNA-binding proteins that bind the the 3’-UTRs are shown schematically with different shapes; it is possible that some of them bind in more than one position. Proteins that bind but do not give strong regulation, or which compete with other proteins for binding, are almost certainly also present but are not indicated.

Model for developmental regulation of RBP10 and PGKC.

The two mRNAs are shown approximately to scale, using the colour coding from Figs 2 and 3. Regulatory RNA-binding proteins that bind the the 3’-UTRs are shown schematically with different shapes; it is possible that some of them bind in more than one position. Proteins that bind but do not give strong regulation, or which compete with other proteins for binding, are almost certainly also present but are not indicated. One interesting observation was the decrease in expression caused by deletion of (AU) repeats from one of the RBP10 3’-UTR fragments. This might mean that the repeats enhance expression, but if so, this is context dependent: when (AU)10 was inserted at the 5’-end of the actin 3’-UTR it had no effect. Another possibility is that the (AU) repeats affect secondary structures of neighbouring regulatory elements. Our results nevertheless suggest that the functions of (AU) repeats should be considered during studies of other trypanosome mRNAs. Many studies of mRNA binding proteins compare the mRNAs that are bound to that protein with the effects on the transcriptome after depletion of that protein. It is notable that in most cases, some bound mRNAs show clear changes in abundance after the protein is depleted, whereas others are unaffected. Our results clearly show why this is the case: for at least some mRNAs, regulation can be sustained by more than one region of the mRNA and probably, more than one RNA-binding protein. In practical terms, our results mean that screens for RNA-binding proteins that regulate one specific mRNA are unlikely to succeed if the reporter that contains the whole 3’-UTR is used for selection. For example, elimination of the regulatory protein that binds to fragment 15 of the PGKC 3’-UTR would probably have no effect on PGKC expression because regulation could be maintained by the sequences in fragment 14 (Fig 5). Indeed, a screen that was designed to find trans acting factors that regulate the mRNA encoding glycosyl phosphatidylinositol phospholipase C resulted only in the selection of 3’ deletion mutants [70]. In order to search for proteins that regulate PGKC and RBP10 expression, it will probably be necessary to focus on the individual regulatory segments. Once such proteins are identified it is likely that they will be found to regulate other bloodstream-form-specific mRNAs as well. The 3’-UTRs of the RBP10 mRNA, and many other mRNAs encoding RNA-binding proteins, are extraordinarily long. Khong and Parker [56] have calculated that Opisthokont mRNAs are probably bound by at least 4–18 proteins/kb. If trypanosomes are similar, the RBP10 3’-UTR would be predicted to bind 28–126 proteins, while the PGKC 3’-UTR would bind between 3 and 14. Long mRNAs generally have relatively low abundances [15], which might be important for regulatory proteins. The results presented here, however, suggest that long 3’-UTRs may have evolved as a "fail-safe" mechanism that ensures correct regulation even if the 3’-UTR is truncated by alternative processing, or one of the controlling proteins is absent.

Plasmids and oligonucleotides.

(XLSX) Click here for additional data file.

mRNA lengths.

(XLSX) Click here for additional data file.

PGKC 3’-UTR sequence alignments.

The sequences were aligned using the Segman programme in the DNAStar package. (PDF) Click here for additional data file.

RBP10 3’-UTR sequences.

(DOCX) Click here for additional data file.

PGKC 3’-UTR sequence.

(DOCX) Click here for additional data file.

pHD3261 sequence in ApE format.

(APE) Click here for additional data file.

pHD3301 sequence in ApE format.

(APE) Click here for additional data file. 14 Feb 2022 Dear Prof. Clayton, Thank you very much for submitting your manuscript "Several different sequences are implicated in bloodstream-form-specific gene expression in Trypanosoma brucei" for consideration at PLOS Neglected Tropical Diseases. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations. I have evaluated the manuscript and the comments of the three expert reviewers. This work has clear merit and will contribute to our understanding of the mechanisms of gene expression regulation in trypanosomes, which is in line with the recommendations of the reviewers. In my view, the reviewers are clear and specific, and will hopefully present a resource towards providing a revised version of the paper for further evaluation. Only some minor changes are needed, with inclusion of supporting blot analysis data as supplemental information as requested by reviewer 2. Once you've had a chance to consider the reviewers' comments, please let me know if you have any questions. Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. When you are ready to resubmit, please upload the following: [1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out [2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file). Important additional instructions are given below your reviewer comments. Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments. Sincerely, Guy Caljon Associate Editor PLOS Neglected Tropical Diseases Anthony Papenfuss Deputy Editor PLOS Neglected Tropical Diseases *********************** I have evaluated the manuscript and the comments of the three expert reviewers. This work has clear merit and will contribute to our understanding of the mechanisms of gene expression regulation in trypanosomes, which is in line with the recommendations of the reviewers. In my view, the reviewers are clear and specific, and will hopefully present a resource towards providing a revised version of the paper for further evaluation. Only some minor changes are needed, with inclusion of supporting blot analysis data as supplemental information as requested by reviewer 2. Once you've had a chance to consider the reviewers' comments, please let me know if you have any questions. Reviewer's Responses to Questions Key Review Criteria Required for Acceptance? As you describe the new analyses required for acceptance, please consider the following: Methods -Are the objectives of the study clearly articulated with a clear testable hypothesis stated? -Is the study design appropriate to address the stated objectives? -Is the population clearly described and appropriate for the hypothesis being tested? -Is the sample size sufficient to ensure adequate power to address the hypothesis being tested? -Were correct statistical analysis used to support conclusions? -Are there concerns about ethical or regulatory requirements being met? Reviewer #1: The methods are orthodox and well described, there no problems as far as I can tell. Reviewer #2: The methodology is appropriate and the experiments are well designed. Reviewer #3: No major issues with methods generally, but a few concerns about some of the stats used/not used -------------------- Results -Does the analysis presented match the analysis plan? -Are the results clearly and completely presented? -Are the figures (Tables, Images) of sufficient quality for clarity? Reviewer #1: Yes to all questions. The results are presented clearly, reflect the aims and support the conclusions. Reviewer #2: The data are sound, however, a major concern is the lack of Western and Northern blots to illustrate the expression levels of the different cell lines in Figures 2, 3 and 4. These data should at least be included in the supplementary data. Reviewer #3: Yes, analysis is logical, figures mostly of suitable quality/clarity -------------------- Conclusions -Are the conclusions supported by the data presented? -Are the limitations of analysis clearly described? -Do the authors discuss how these data can be helpful to advance our understanding of the topic under study? -Is public health relevance addressed? Reviewer #1: The conclusions are appropriate and supported, as far as I can judge. There is no immediate public health relevance. Reviewer #2: The conclusions drawn by the authors are consistent with the data presented. Reviewer #3: conclusions are perhaps a little short and worthy of further exploration/explanation -------------------- Editorial and Data Presentation Modifications? Use this section for editorial suggestions as well as relatively minor modifications of existing data that would enhance clarity. If the only modifications needed are minor and/or editorial, you may wish to recommend “Minor Revision” or “Accept”. Reviewer #1: In places, I wasn't sure why various facts were being provided. I suspect the text could be clarified with a further redraft, with the authors asking themselves if they have explained everything they have written. Reviewer #2: A few minor points that could also be addressed by the authors: 1. Line 217, “in order to find” instead of “in other find” 2. Line 251, remove one “that” 3. The authors used cyan fluorescent protein (eCFP) as reporter, but wrote “GFP protein” and “GFP mRNA” in figures 3C and 3D, respectively. Similarly, GFP instead of CFP is mentioned in S2 Table. This needs to be corrected. In addition, the authors must choose between CFP and eCFP throughout the manuscript. 4. In line 287, it is mentioned that “protein levels were assessed for each population by quantitative Western blotting, with ribosomal protein S9 as a loading control”, however the Western blots are not shown in the manuscript and should be included in the supplementary data (see general comment). 5. In the legend of Figure 3B, reverse blue and red in the sentence “Each dot is a result for an independent clone, with blue and red representing bloodstream (B) and procyclic (P) forms, respectively” to be consistent with the colors used in the figure. 6. In Figure 3A, all lanes showed an additional band of 1.7-1.8 kb detected by the CPF probe, especially in the right blot. In addition, the position of the markers is different between the two panels (see for example the mRNA detected in lanes 1 that is not have the same size in the left and right blots). 7. Line 325, “dataset” instead of “datset”. 8. In the second last paragraph of the result section, as well as in the corresponding figure 4, it is not mentioned in which form of parasite the experiments were conducted, PCF or BSF. We can assume it’s in BSF, but it should be clarified. 9. In the first column of figure 4B, which may correspond to the actin control in Figure 3B, one may expect to see an expression level in PCF around 0.5. But no value for PCF is included in this graph. This important control should be included in the figure. Reviewer #3: One typo spotted I think, otherwise very clear. -------------------- Summary and General Comments Use this section to provide overall comments, discuss strengths/weaknesses of the study, novelty, significance, general execution and scholarship. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. If requesting major revision, please articulate the new experiments that are needed. Reviewer #1: This study has delineated the 3'UTRs of two transcripts in Trypanosoma brucei that are developmentally regulated and encode important bloodstream-stage proteins. They have succeeded in producing a more accurate map of the regulatory motifs within the 3'UTRs, showing that they are likely bound by many different regulatory proteins. This is a useful achievement that progresses our understanding of the interaction hierarchies that must control development within these parasites. I have no direct experience of the methods used in the study, but they are certainly orthodox for the field and well described in the manuscript. I have no reservations about the quality of the work or the veracity of the findings. I have just three comments to make about the text. First, it begins "Kinetoplastids are unicellular flagellated parasites that infect mammals and plants." I would suggest changing this since most Kinetoplastids are not parasites (better to use 'trypanosomatids') and trypanosomatids largely infect insects and vertebrates (although plants are included). "Mammals and plants" is too selective to be accurate. Second, ll194-6 describe an acetyl-coA synthetase coding region downstream of the RBP10 gene. The authors state that "[it is] present elsewhere in the genome, but the presence of read alignments over the region that surrounds the Tb927.8.2790 coding region suggests that the pseudogene mRNA is also present in both bloodstream and procyclic forms." I didn't understand why we were being told this. Is the pseudogene implicated in the RBP10 3'UTR? Please explain what we need to know about this. Third, in the section beginning on l319 the authors look for common sequence motifs among their regulatory sequences, but find none. Is a similar exercise not possible for secondary structures? Have the authors or others not looked for conserved loops or folds among these UTRs with similar expression profiles? Reviewer #2: The 3'-UTR lengths in trypanosomes is remarkably long compared to other Opisthokont model organisms, with mRNAs having 3'-UTRs of 5 kb or more. This is particularly true for mRNAs encoding RNA-binding proteins, which have vital regulatory functions. As their expression has to be particularly tightly controlled, the authors hypothesized that these long 3'-UTR may contain several regulatory sequences acting independently. To address this question, Tania Bishola Tshitenge and colleagues used two report systems (CAT and eCFP) to identify the regulatory sequences within the 3’-UTR of two mRNA that are responsible for the specific expression in the bloodstream forms of T. brucei (BSF), i.e. the 7.2 kb 3’-UTR of RBP10, an RNA binding protein previously characterized by the same research team, and the terminal 424 nt of the PGKC 3'-UTR that the authors previously identified as responsible for specific expression in BSF. They found that PGKC has at least two such sequences, while RBP10 has at least four which are scattered throughout the 7.2 kb 3’-UTR. The authors concluded that several different sequence motifs - and therefore, probably, a similar number of RNA-binding proteins - are implicated in controlling bloodstream-form-specific mRNA stability and translation. The data are sound and the conclusions drawn by the authors are consistent with the data presented. However, a major concern is the lack of Western and Northern blots to illustrate the expression levels of the different cell lines in Figures 2, 3 and 4. These data should at least be included in the supplementary data. Reviewer #3: I went through this line by line, comments and questions given below with their line numbers. 48/50/51/52 PGK, systematic gene names the first time the gene is mentioned please. 70: quantitate "much more" 75: RBP6 systematic name? 88: Opisthokont, is this not a rather diverse group about which to make comparative comments? 94: systematic name for ZC3H22? 96-97: what is meant by "better expressed"? Not quite clear here, no numbers either. 98: give the 8mer sequence 103-105: maybe actually give the sequences here for the reader!? 129: are the RBP10 3' UTR sequences identical in 427 and 1125? Was there a reason not to use 1125 genomic DNA as a template for the PCRs? 135-136: very ambiguous about how much of what was actually sequenced to confirm identity: this is important 188-192: I found this a bit confusing: are there TWO regions of UTR like sequence in separate locations in the genome, and if so, how can you be certain which one is responsible for the noted effects? Are there not PacBio sequence data available for 1125, which might help clear up this quandry, given it is perhaps central to how robust your conclusions/inferences might be? 206-207: see 188-192? Fig 1A, units for Y axis? Fig 1A, procyclics, wild/crazy observation, but it looks as if majority of transcripts may not have the long UTR sequence for 2780, perhaps polyadenylated much closer to the stop codon? Do you have any oligo-dT based 3' end sequence/PCR data for checking this out? Or ratio of read depths for the CDS and 5UTR in B or P on basis that if it has the CDS it should have the UTR. Might fit with your comment on line 232. Might depend on the characteristics of the RNAseq libraries whether that is meaningful; the sequence reads used in Seattle were pretty short and were aligned locally using bowtie2; are there no more recent data with larger insert, paired end reads? Fig 1D, lower panel: two? sets of blue dots, are they the three replicates?, no error bars 209-210: range, SD of measurements? Saying "probably" without stats is meaningless! 217: typo Fig 2: no indication what the different colours indicate? I think they are regions/constructs, but we dont see them all together until Fig 5 Fig 2: should show the low-complexity repeat regions Fig 2: For "expression", averages are: "-" = <0.25x, "+/-" = 0.25-0.5x, "+" = 0.5-2x, "++"=2-3x and "+++" >3x. I have "problems" with these categories, with values less than 1 being included in the "+" category, surely that would be either "-" or if redefined, "+/-" perhaps? 241-242: why did you not just use the means, would be much more appropriate? Currently a very curious choice of metric that artificially maximises the result divergence, particularly pronounced when n is such a small number... 243: "mostly not normal": by what measure did you reach this conclusion? 248: use nuclease digestion to make fragments? 261: what are the sequences of the regulatory elements proposed? 274: was this approach done for RBP10 at all? 278: was the RNAfold re-run on these deleted lines, to check that things were still as you expected/predicted? 293-294: this is possibly worthy of further explanation/exploration? 300-301: is the variability seen across any other mRNAs you measured (if you did) 303: which is...? 331-340: were RNAfolds done for all the various fragments to try and draw a link to the observed experimental results? 331-340: these are all longer than what the MEME search was done with, so you may be missing things 331-340: what other genes have similar AU repeats in their 3'UTR (or, for that matter, perhaps also the 5UTR?) 362-365: perhaps, but what are the sequences in the fragments? Are there any footprint data for these regions from other studies? References: I have not checked them. -------------------- PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No Reviewer #2: No Reviewer #3: No Figure Files: While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Data Requirements: Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5. Reproducibility: To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols References Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article's retracted status in the References list and also include a citation and full reference for the retraction notice. 23 Feb 2022 Submitted filename: _Reply to reviewers_21_2.docx Click here for additional data file. 3 Mar 2022 Dear Prof. Clayton, We are pleased to inform you that your manuscript 'Several different sequences are implicated in bloodstream-form-specific gene expression in Trypanosoma brucei' has been provisionally accepted for publication in PLOS Neglected Tropical Diseases. Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated. IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript. Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS. Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases. Best regards, Guy Caljon Associate Editor PLOS Neglected Tropical Diseases Anthony Papenfuss Deputy Editor PLOS Neglected Tropical Diseases *********************************************************** Thank you for addressing all comments of the reviewers and for providing the additional data in the figshare links. Congratulations with your work. Sincerely yours, Guy Caljon 17 Mar 2022 Dear Prof. Clayton, We are delighted to inform you that your manuscript, "Several different sequences are implicated in bloodstream-form-specific gene expression in  Trypanosoma brucei," has been formally accepted for publication in PLOS Neglected Tropical Diseases. We have now passed your article onto the PLOS Production Department who will complete the rest of the publication process. All authors will receive a confirmation email upon publication. The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any scientific or type-setting errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript. Note: Proofs for Front Matter articles (Editorial, Viewpoint, Symposium, Review, etc...) are generated on a different schedule and may not be made available as quickly. Soon after your final files are uploaded, the early version of your manuscript will be published online unless you opted out of this process. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers. Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Neglected Tropical Diseases. Best regards, Shaden Kamhawi co-Editor-in-Chief PLOS Neglected Tropical Diseases Paul Brindley co-Editor-in-Chief PLOS Neglected Tropical Diseases
  70 in total

1.  A novel strategy to identify the location of necessary and sufficient cis-acting regulatory mRNA elements in trypanosomes.

Authors:  Helena Webb; Roisin Burns; Nicola Kimblin; Louise Ellis; Mark Carrington
Journal:  RNA       Date:  2005-05-31       Impact factor: 4.942

2.  The zinc finger proteins ZC3H20 and ZC3H21 stabilise mRNAs encoding membrane proteins and mitochondrial proteins in insect-form Trypanosoma brucei.

Authors:  Bin Liu; Kevin Kamanyi Marucha; Christine Clayton
Journal:  Mol Microbiol       Date:  2019-12-11       Impact factor: 3.501

3.  Simultaneous purification of hexokinase, class-I fructose-bisphosphate aldolase, triosephosphate isomerase and phosphoglycerate kinase from Trypanosoma brucei.

Authors:  O Misset; F R Opperdoes
Journal:  Eur J Biochem       Date:  1984-11-02

4.  Roles of a Trypanosoma brucei 5'->3' exoribonuclease homolog in mRNA degradation.

Authors:  Chi-Ho Li; Henriette Irmer; Drifa Gudjonsdottir-Planck; Simone Freese; Heike Salm; Simon Haile; Antonio M Estévez; Christine Clayton
Journal:  RNA       Date:  2006-10-31       Impact factor: 4.942

5.  Characterization of a divergent glycosomal microbody phosphoglycerate kinase from Trypanosoma brucei.

Authors:  K Alexander; M Parsons
Journal:  Mol Biochem Parasitol       Date:  1993-08       Impact factor: 1.759

6.  The phosphoglycerate kinases from Trypanosoma brucei. A comparison of the glycosomal and the cytosolic isoenzymes and their sensitivity towards suramin.

Authors:  O Misset; F R Opperdoes
Journal:  Eur J Biochem       Date:  1987-02-02

7.  An improved purification of glycosomes from the procyclic trypomastigotes of Trypanosoma brucei.

Authors:  R A Aman; C C Wang
Journal:  Mol Biochem Parasitol       Date:  1986-12       Impact factor: 1.759

Review 8.  The landscape of eukaryotic mRNPs.

Authors:  Anthony Khong; Roy Parker
Journal:  RNA       Date:  2019-12-26       Impact factor: 4.942

Review 9.  Through the dark continent: African trypanosome development in the tsetse fly.

Authors:  Brice Rotureau; Jan Van Den Abbeele
Journal:  Front Cell Infect Microbiol       Date:  2013-09-18       Impact factor: 5.293

10.  Integrative analysis of the Trypanosoma brucei gene expression cascade predicts differential regulation of mRNA processing and unusual control of ribosomal protein expression.

Authors:  Enoch B Antwi; Jurgen R Haanstra; Gowthaman Ramasamy; Bryan Jensen; Dorothea Droll; Federico Rojas; Igor Minia; Monica Terrao; Clémentine Mercé; Keith Matthews; Peter J Myler; Marilyn Parsons; Christine Clayton
Journal:  BMC Genomics       Date:  2016-04-26       Impact factor: 3.969

View more
  1 in total

1.  The Trypanosoma brucei RNA-binding protein DRBD18 ensures correct mRNA trans splicing and polyadenylation patterns.

Authors:  Tania Bishola Tshitenge; Christine Clayton
Journal:  RNA       Date:  2022-07-06       Impact factor: 5.636

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.