| Literature DB >> 27852012 |
Dongyan Zhao1,2, John P Hamilton2, Michael Hardigan2, Dongmei Yin1, Tao He3,4, Brieanne Vaillancourt2, Mauricio Reynoso5,6, Germain Pauluzzi5,6, Scott Funkhouser7, Yuehua Cui3, Julia Bailey-Serres5,6, Jiming Jiang8, C Robin Buell9, Ning Jiang10,7,11.
Abstract
Gene expression is controlled at transcriptional and post-transcriptional levels including decoding of messenger RNA (mRNA) into polypeptides via ribosome-mediated translation. Translational regulation has been intensively studied in the model dicot plant Arabidopsis thaliana, and in this study, we assessed the translational status [proportion of steady-state mRNA associated with ribosomes] of mRNAs by Translating Ribosome Affinity Purification followed by mRNA-sequencing (TRAP-seq) in rice (Oryza sativa), a model monocot plant and the most important food crop. A survey of three tissues found that most transcribed rice genes are translated whereas few transposable elements are associated with ribosomes. Genes with short and GC-rich coding regions are overrepresented in ribosome-associated mRNAs, suggesting that the GC-richness characteristic of coding sequences in grasses may be an adaptation that favors efficient translation. Transcripts with retained introns and extended 5' untranslated regions are underrepresented on ribosomes, and rice genes belonging to different evolutionary lineages exhibited differential enrichment on the ribosomes that was associated with GC content. Genes involved in photosynthesis and stress responses are preferentially associated with ribosomes, whereas genes in epigenetic regulation pathways are the least enriched on ribosomes. Such variation is more dramatic in rice than that in Arabidopsis and is correlated with the wide variation of GC content of transcripts in rice. Taken together, variation in the translation status of individual transcripts reflects important mechanisms of gene regulation, which may have a role in evolution and diversification.Entities:
Keywords: GC content; Oryza sativa; mRNA length; translating ribosome affinity purification sequencing
Mesh:
Substances:
Year: 2017 PMID: 27852012 PMCID: PMC5217110 DOI: 10.1534/g3.116.036020
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Epitope-tagged rice RPL18 assembles into functional ribosomes that can be purified by TRAP. (A) Confirmation of RPL18 assembly into ribosomes. The p35S:HF-OsRPL18 rice line was used as the source of ribosomal complexes, which were separated by ultracentrifugation on a 20–60% (w/v) sucrose density gradient. The absorbance at 254 nm was recorded to detect the ribosomal subunits of 40S and 60S, monosomes (80S), and polysomes. The gradient was fractionated and proteins in the 10 fractions were analyzed by SDS-PAGE separation and western blotting processed with anti-FLAG (α-FLAG) or anti-RPS6 (α-RPS6) antisera. Molecular mass markers are indicated on the left. (B) Purification of polysomes by TRAP. Equal weights of pulverized tissue from untransformed Nipponbare (control) and homozygous transgenic p35S:HF-OsRPL18 shoots were solubilized in polysome extraction buffer to obtain a clarified supernatant (total). The extract was incubated with anti-FLAG-bound Dynabeads coupled to Protein G to bind HF-RPL18. The supernatant (unbound fraction) was collected to evaluate the efficiency of the immunopurification. The magnetically captured protein–RNA complexes (TRAP fraction) was eluted from the beads using 3X-FLAG peptide. Each fraction was analyzed by western blot with α-FLAG and α-RPS6. The expected molecular mass of HF-RPL18 is 25 kDa. SDS-PAGE, sodium dodecyl sulfate-polyacrylamide gel electrophoresis; TRAP, translating ribosome affinity purification.
Gene expression evidence from transcriptomes and translatomes of three tissues of rice
| No. of Genes | Fraction of gene set | Callus | Panicle | Shoot | Total Unique Genes | |
|---|---|---|---|---|---|---|
| Nontransposon genes | 39,049 | mRNA-seq (%total) | 18,793 (48.13%) | 19,311 (49.45%) | 17,713 (45.36%) | 22,589 (57.85%) |
| TRAP-seq (%total) | 18,013 (46.13%) | 17,987 (46.06%) | 15,943 (40.83%) | 21,647 (55.44%) | ||
| Overlap (%mRNA-seqb) | 16,931 (90.09%) | 16,804 (87.02%) | 15,030 (84.85%) | 20,457 (90.56%) | ||
| Transposon genes | 15,461 | mRNA-seq (%total) | 293 (1.90%) | 255 (1.65%) | 202 (1.31%) | 381 (2.46%) |
| TRAP-seq (%total) | 148 (0.96%) | 149 (0.96%) | 85 (0.55%) | 211 (1.36%) | ||
| Overlap (%mRNA-seq) | 138 (47.10%) | 122 (48.03%) | 77 (38.31%) | 180 (47.24%) |
No., number; mRNA, messenger RNA; TRAP-seq, Translating Ribosome Affinity Purification followed by mRNA-sequencing; %total, the percent of annotated genes with mRNA-seq or TRAP-seq FPKM ≥ 1; %mRNA-seq, the percent that are translated among the genes with transcription evidence.
Translational status of transposable elements documented to be active in Nipponbare
| Element | Superfamily | Locus | Tissue | mRNA-seq (FPKM) | TRAP-seq (FPKM) | TEI | Translation Evidence |
|---|---|---|---|---|---|---|---|
| LOC_Os10g29650 | Callus | 15.03 | 0.58 | 0.04 | Maybe | ||
| LOC_Os06g38480 | Callus | 13.01 | 15.61 | 1.20 | Yes | ||
| LOC_Os04g32750 | Panicle | 0.01 | 0.00 | NA | Not expressed | ||
| LOC_Os03g43750 | Callus | 10.12 | 4.83 | 0.48 | Yes | ||
| LOC_Os03g36279 | Callus | 2.26 | 2.56 | 1.14 | Yes | ||
| LOC_Os06g32890 | Shoot | 0.13 | 0.08 | NA | Not expressed | ||
| LOC_Os02g24150 | Panicle | 1.92 | 2.02 | 1.05 | Yes | ||
| LOC_Os06g39624 | Panicle | 16.50 | 9.41 | 0.57 | Yes | ||
| LOC_Os11g19864 | Callus | 6.40 | 6.36 | 0.99 | Yes |
mRNA-seq, messenger RNA sequencing; FPKM, fragments per kilobase per exon model per million mapped reads; TRAP-seq, Translating Ribosome Affinity Purification followed by mRNA-sequencing; TEI, translatome enrichment index; NA, not applicable.
If the transposon is expressed in multiple tissues, the one with the highest TRAP-seq level is listed.
TRAP-seq FPKM ≥ 1 is considered to be translated; mRNA-seq FPKM ≥1 is considered to be expressed.
The effect of length and GC content of different portions of transcripts on TEI
| Feature | 5′ UTR (bp) | CDS (bp) | 3′ UTR (bp) | |
|---|---|---|---|---|
| Size (bp) | ||||
| Mean Median Correlation | 215 | 1234 | 433 | |
| 147 | 1095 | 354 | ||
| Negative | Negative | Optimal size 200–400 bp | ||
| GC content (%) | ||||
| Mean Median Correlation | 58.3 | 56.1 | 39.3 | |
| 59.2 | 54.0 | 39.1 | ||
| Weak negative | Positive | Weak positive | ||
UTR, untranslated region; CDS, coding sequence; TEI, translatome enrichment index.
Only genes with both UTRs meeting size criterion (see Materials and Methods for details) and with TEI in at least one tissue were considered for calculation of mean and medium values.
Figure 2Plot of expression abundance at transcription level (mRNA-seq) vs. translation level (TRAP-seq) in relation to gene length (A) and GC content (B) of genes expressed in shoots. CDS, coding sequence; FPKM, fragments per kilobase per exon model per million mapped reads; mRNA-seq, messenger RNA sequencing; TRAP-seq, translating ribosome affinity purification followed by mRNA-sequencing.
Figure 3Distribution of translatome enrichment index (TEI) by transcription specificity and transcription level in shoots. (A) TEI of genes with transcripts detected across 15 mRNA-seq (messenger RNA sequencing) samples. (B) TEI of genes with transcript abundance variation as estimated by fragments per kilobase of exon per million mapped reads (FPKM). Genes with extremely high TEIs (1% of the genes in each group) were not shown.
Figure 4The impact of retained intron on transcription and translation of the relevant transcripts in shoots. (A) Expression abundance at transcription level (log2 FPKM of mRNA-seq) of isoforms with or without retained intron in different portions (5′ UTR, CDS, and 3′ UTR) of genes. (B) Expression abundance at translation level (log2 FPKM of TRAP-seq) of isoforms with a retained intron in different portions of genes. (C) Size of the retained introns. (D) GC content of the retained introns. Extreme values (top 1%) in each group were not shown. CDS, coding sequence; FPKM, fragments per kilobase per exon model per million mapped reads; mRNA-seq, messenger RNA sequencing; RI, retained intron; TRAP-seq, translating ribosome affinity purification followed by mRNA-sequencing; UTR, untranslated region.
Figure 5Correlation between translatome enrichment index (TEI) and size of untranslated regions (UTRs) in shoots. (A) Variation of TEI with length of 5′ UTR. (B) Variation of TEI with length of 3′ UTR. Error bars correspond to 2 × SE. CDS, coding sequence.
Figure 6GC content and translatome enrichment index (TEI) of rice genes based on their evolutionary origin. (A) GC content of rice non-TE genes based on orthologous clustering with predicted proteomes from 12 angiosperm species. (B) TEI of nontransposon genes in rice shoots based on orthologous clustering with predicted proteomes from 12 angiosperm species.
Ten biological processes in which genes are associated with the highest and lowest TEI in rice shoots
| GO Term (Process) | TEI | mRNA-seq (FPKM) | TRAP-seq (FPKM) | CDS size (bp) | 5′ UTR Length | CDS GC Content (%) |
|---|---|---|---|---|---|---|
| GO processes with high TEI | ||||||
| Translation | 1.78 | 69.6 | 173.6 | 956 (−59) | 141 (34) | 54.5 (8.6) |
| DNA metabolic process | 1.47 | 44.2 | 87.8 | 1183 (−133) | 175 (47) | 53.6 (9.0) |
| Photosynthesis | 1.47 | 430.9 | 986.3 | 1019 (−46) | 138 (30) | 59.6 (12.9) |
| Generation of precursor metabolites and energy | 1.36 | 190.3 | 387.5 | 1174 (−33) | 168 (54) | 57.1 (10.8) |
| Cellular homeostasis | 1.34 | 68.1 | 248.4 | 1207 (−36) | 201 (60) | 55.9 (10.5) |
| Response to stress | 1.31 | 48.0 | 67.9 | 1305 (−21) | 200 (59) | 56.5 (11.0) |
| Response to abiotic stimulus | 1.29 | 77.6 | 126.8 | 1294 (−17) | 205 (59) | 57.4 (11.5) |
| Secondary metabolic process | 1.27 | 26.4 | 32.3 | 1215 (−59) | 160 (44) | 61.0 (15.7) |
| Biological process | 1.22 | 27.6 | 37.0 | 1171 (12) | 206 (60) | 55.5 (10.5) |
| Response to biotic stimulus | 1.19 | 47.4 | 56.9 | 1385 (−86) | 209 (66) | 56.9 (11.1) |
| Mean | 1.37 | 103.0 | 220.4 | 1191 (−48) | 180 (51) | 56.8 (11.2) |
| GO processes with low TEI | ||||||
| Regulation of gene expression, epigenetic | 0.53 | 16.9 | 10.2 | 2084 (−79) | 207 (58) | 47.7 (3.2) |
| Carbohydrate metabolic process | 0.73 | 43.3 | 42.6 | 1577 (−44) | 222 (73) | 54.8 (9.8) |
| Lipid metabolic process | 0.74 | 28.8 | 20.7 | 1383 (−32) | 208 (54) | 54.6 (9.9) |
| Cell differentiation | 0.86 | 20.8 | 18.1 | 1589 (−82) | 238 (51) | 54.5 (9.2) |
| Protein modification process | 0.86 | 28.8 | 42.1 | 1496 (−37) | 270 (81) | 52.3 (7.7) |
| Cell cycle | 0.87 | 16.7 | 13.7 | 1492 (−230) | 247 (78) | 51.0 (6.9) |
| Catabolic process | 0.90 | 44.5 | 45.2 | 1502 (−84) | 220 (77) | 52.6 (7.5) |
| Cell growth | 0.91 | 19.9 | 17.4 | 1560 (−92) | 191 (36) | 55.1 (9.5) |
| Growth | 0.92 | 22.5 | 21.2 | 1605 (−32) | 245 (102) | 53.6 (8.2) |
| Multicellular organismal development | 0.92 | 21.3 | 18.1 | 1556 (−91) | 235 (63) | 54.9 (9.8) |
| Mean | 0.82 | 26.3 | 24.9 | 1584 (−80) | 228 (67) | 53.1 (8.2) |
| Genome-wide mean | 1.21 | 33.4 | 56.0 | 1208 | 212 | 55.6 |
| Difference with high TEI processes | −0.55 | −76.7 | −195.5 | 394 (−33) | 48 (16) | −3.7 (−3.0) |
| % Variation | 67.10 | 291.6 | 785.1 | 33.1 | l26.7 | 7.0 |
GO, gene ontology; TEI, translatome enrichment index; mRNA-seq, messenger RNA sequencing; FPKM, fragments per kilobase per exon model per million mapped reads; TRAP-seq, Translating Ribosome Affinity Purification followed by mRNA-sequencing; CDS, coding sequence; UTR, untranslated region.
Only processes with 40 or more genes are considered. The values are mean values in each process.
Numbers in parentheses indicate difference compared with the corresponding Arabidopsis processes.
Genome-wide mean of 5′ UTR is derived from nontransposon genes with TEI in shoots and UTRs ≥ minimum (20 bp for 5′ UTR and 110 bp for 3′ UTR).
%Variation equals difference divided by the lower values of the two means, then times 100.
Figure 7Fraction (%) of rice and Arabidopsis genes with different TEIs and their average GC and GC3 content of CDS. CDS, coding sequence; TEI, translatome enrichment index.