| Literature DB >> 27118143 |
Enoch B Antwi1, Jurgen R Haanstra2,3, Gowthaman Ramasamy4, Bryan Jensen4, Dorothea Droll1,5, Federico Rojas6, Igor Minia1, Monica Terrao1, Clémentine Mercé1, Keith Matthews6, Peter J Myler4,7,8, Marilyn Parsons4,7, Christine Clayton9.
Abstract
BACKGROUND: Trypanosoma brucei is a unicellular parasite which multiplies in mammals (bloodstream form) and Tsetse flies (procyclic form). Trypanosome RNA polymerase II transcription is polycistronic, individual mRNAs being excised by trans splicing and polyadenylation. We previously made detailed measurements of mRNA half-lives in bloodstream and procyclic forms, and developed a mathematical model of gene expression for bloodstream forms. At the whole transcriptome level, many bloodstream-form mRNAs were less abundant than was predicted by the model.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27118143 PMCID: PMC4845500 DOI: 10.1186/s12864-016-2624-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Gene expression from DNA to mRNA in Trypanosoma brucei. The upper panel is a schematic time-lapse image of a polymerase II complex progressing along a chromosome. Coding regions are in dark colours and 3′ and 5′ untranslated regions are in lighter colours. The capped spliced leader is in orange. Kinetic constants for the different processes are indicated and the formulae that comprise the model are shown below the figure 5′ trans splicing (rate constant k1) and 3′ polyadenylation (rate constant k2) compete with nuclear degradation of the precursor (rate constant k3) and the 5′ spliced intermediate (rate constant k4). Assuming that the two processes are coupled, the rate constant for 3′ polyadenylation (k2) of mRNA A is expected to equal the rate constant for 5′ trans splicing (k1) of mRNA B, and the same applies for mRNA and mRNA C (k2 of mRNA B = k1 for mRNA C). Based on the observation that that long mRNAs had unexpectedly low abundances, we added a factor (α) that incorporates length-dependent precursor degradation into the model [16]. In the analysis described in this paper, we tested alternative versions of this and used a different factor for PC trypanosomes (see text and Table 1). Finally, there is degradation of the mature mRNA (rate constant k5). Values for k5 are based on transcriptome-wide decay measurements. In addition, the growth rate of the cells is included via the specific growth rate μ. The growth rate affects the abundance of every RNA species, as during growth the pre-existing RNA species get diluted
Parameters used in the different models
| BS-A | BS-B | BS-C | BS-D | BS-E | BS-F | PC-A | PC-B | ||
|---|---|---|---|---|---|---|---|---|---|
| a | No genes examined | 851 | 851 | 642 | 851 | 851 | 851 | 3776 | 3776 |
| b | μ (min−1) Division time | 0.0019 (6 h) | 0.0019 (6 h) | 0.0019 (6 h) | 0.0019 (6 h) | 0.0019 (6 h) | 0.0019 (6 h) | 0.0011 (10.5 h) | 0.0011 (10.5 h) |
| c | v (molecules.cell−1 .min−1) | 0.24 | 0.24 | 0.24 | 0.357 | 0.278 | 0.278 | 0.24 | 0.234 |
| d | k1 | 0.41 | data | data | 0.41 | 0.41 | data | 0.41 | 0.41 |
| e | k2 | 0.41 | 0.41 | data | 0.41 | 0.41 | 0.41 | 0.41 | 0.41 |
| f | Length adjustment α | z = 600 | z = 600 | z = 600 | z = 660 | § | § | z = 600 | § |
| g | Correlation coefficient (r) | 0.59 | 0.60 | 0.43 | 0.59 | 0.54 | 0.55 | 0.66 | 0.60 |
The Table shows the parameters used for the various models. See also Fig. 1
a) To optimise the model we chose genes for which we had reliable measurements of both half-life and mRNA abundance. For bloodstream forms, we had attempted to use RNASeq data to determine the rate at which the sequences immediately upstream of the splice site disappeared, with the hope of using that as a proxy for the splicing rate [16]. For this form we restricted the dataset to mRNAs that had measured precursor half-times of 1–5 min. For BS-C we narrowed this evern further, using only genes that also had a similar measurement for the gene immediately downstream. This means that the correlation coefficient for BS-C is not directly comparable to those for the other BS models
(b) Based on measured growth rates (see also [17])
(c) The transcription rate (v) is influenced by the initiation and elongation rates; since neither has been measured, the transcription rate was originally calculated for bloodstream forms to obtain the measured steady-state mRNA level for PGKC [17] and this value was used in models BS-A - BS-C. For models BS-D and BS-E the rate was adjusted to give a better fit for the dataset (BS-D) and for BS-E and BS-F, to give a better fit with the new length adjustment (f). For PC-A the rate was left the same as in BS-A. For PC-B we multiplied the adjusted rates from BS-E and BS-F by a factor of 0.84, based on the fact that the DNA replication rate in procyclic forms is 0.84 times that of bloodstream forms [56]
(d) k1 = 0.41 is equivalent to a splicing half-time of 1.7 min. This is based on reliable individual measurements of splicing for 3 mRNAs, which gave estimates of 1–3 min [14, 17, 31]. The “data” are the rates estimated by RNASeq [16] (see (a))
(e) Data for BS-C were from the gene immediately downstream (see (a))
(f) For models with a z value: α = (mRNA_length)/z as used in [16]. For models marked §: α = 1/(20.603 x (mRNA_length-0.461) for bloodstream forms; α = 1/(343.77 x (mRNA_length-0.858) for procyclics. The change in the spliced intermediate over time is:
d([5′ spliced intermediate]/dt) = k1 x [precursor] - (k2 + α k4 + μ) x [5′ spliced intermediate]
(g) Pearson’s correlation coefficient (r) between the abundances predicted for individual RNAs, and the measured abundances of those mRNAs
The mRNA degradation (k5) values for the mRNAs were those measured in [16]. The precursor degradation rates k3 and k4 were always 0.08 (half-life 8 min) which is the rate of disappearance of the PGKB precursor when splicing is inhibited by Sinefungin in bloodstream forms [17]
Fig. 2Correlation between predicted and measured mRNA amounts. The numbers of mRNAs per cell were predicted for the subset of mRNAs with reliable measurements of gene copy number, half-life, and abundance. The predictions were then compared with measured values of mRNA abundance for (a) procyclic forms (model PC-A) and (b) bloodstream forms (model BS-D). Each diamond represents a different open reading frame. Correlation coefficients are for log2-transformed values. The pink dotted lines are for y = 2x, y = x and 2y = x. Results for ribosomal protein genes are in cyan
Fig. 3Relationship between prediction and measurement for bloodstream and procyclic forms. For each unique gene considered, the amount of mRNA that was measured was divided by the predicted amount, and the log2 of the values for both stages plotted. BS - bloodstream form, model BS-D; PC - procyclic form, model PC-A. A value of 0 indicates that the prediction was perfect; this could however also be true if transcription was faster, and processing slower, than the values that were set in the model. Measures above 0 indicate either that a processing half-time is shorter than 1.7 min, or that the steady-state transcription rate is greater than the value that was used for the model. values below 0 indicate that the splicing or polyadenylation half-time is longer than 1.7 min. Spots above the line “PC = 2xBS” are mRNAs for which the measured/prediction results was at least 2-fold higher in procyclic than in bloodstream forms; those below the “BS = 2xPC” line have better processing in bloodstream forms than in procyclics
Optimised transcription rates for selected polycistronic units. Rates of transcription were optimised using mRNA half-life and abundance data to get the best fit, using model PC-B. For each pair of transcription units, divergent transcription initiates within a common region
| Polycistronic unit pair | Unit | Optimised Transcription rate | Pearson correlations (R) | Number of genes |
|---|---|---|---|---|
| Tb927_10_p1 | 1 | 0.51 | 0.51 | 28 |
| 2 | 0.20 | 0.55 | 36 | |
| Tb927_10_p2 | 1 | 0.26 | 0.73 | 31 |
| 2 | 0.15 | 0.66 | 36 | |
| Tb927_10_p3 | 1 | 0.26 | 0.66 | 30 |
| 2 | 0.15 | 0.67 | 34 | |
| Tb927_10_p4 | 1 | 0.28 | 0.73 | 36 |
| 2 | 0.07 | 0.36 | 52 | |
| Tb927_10_p5 | 1 | 0.24 | 0.66 | 40 |
| 2 | 0.13 | 0.67 | 26 | |
| Tb927_11_p1 | 1 | 0.05 | 0.73 | 27 |
| 2 | 0.20 | 0.65 | 41 | |
| Tb927_11_p2 | 1 | 0.12 | 0.66 | 36 |
| 2 | 0.45 | 0.3 | 18 | |
| Tb927_9_p1 | 1 | 0.14 | 0.81 | 35 |
| 2 | 0.08 | 0.59 | 56 | |
| Tb927_9_p2 | 1 | 0.43 | 0.59 | 37 |
| 2 | 0.03 | 0.74 | 13 | |
| Tb927_9_p3 | 1 | 0.61 | 0.36 | 18 |
| 2 | 0.34 | 0.71 | 14 |
Fig. 4Ribosome profiling results. a Ribosomes per coding sequence (CDS) per cell, procyclic forms (PC). Our previously published results (labelled “Jensen”) [20] are compared with the set reported here (labelled “Silicotryp”). b As in (a) but for bloodstream forms (BS). c Developmental regulation: the numbers of ribosomes per kilobase are compared from bloodstream and procyclic forms. The grey-shaded area indicates impossible densities of less than 30 nt per ribosome. d The relationship between mRNA half-life and ribosome density for procyclic forms. Only mRNAs with reliable measurements of abundance and half-life were considered, and impossible ribosome densities were excluded
Fig. 5Effect of methods on the measurement of mRNA abundance. a Poly(A) + mRNA from bloodstream forms, fragmented to 30–70 nt before sequencing; comparison of Silicotryp results vs Jensen results [20]. b Poly(A) + RNA from bloodstream forms, 30–70 nt fragments compared with published results for rRNA-depleted (ribo-minus) RNA [16], prepared from bloodstream forms cultured under identical conditions
Fig. 6Ribosome density measurements reflect variable loading of mRNA on polysomes. The number of ribosomes per kb of total mRNA (note log scale) is plotted against the proportion of the same mRNA that is in the polysomal fraction. Results for specific functional classes are highlighted in different colours