| Literature DB >> 22467213 |
Simon Haenni1, Zhe Ji, Mainul Hoque, Nigel Rust, Helen Sharpe, Ralf Eberhard, Cathy Browne, Michael O Hengartner, Jane Mellor, Bin Tian, André Furger.
Abstract
Despite the many advantages of Caenorhabditis elegans, biochemical approaches to study tissue-specific gene expression in post-embryonic stages are challenging. Here, we report a novel experimental approach for efficient determination of tissue-specific transcriptomes involving the rapid release and purification of nuclei from major tissues of post-embryonic animals by fluorescence-activated nuclei sorting (FANS), followed by deep sequencing of linearly amplified 3'-end regions of transcripts (3'-end-seq). We employed these approaches to compile the transcriptome of the developed C. elegans intestine and used this to analyse tissue-specific cleavage and polyadenylation. In agreement with intestinal-specific gene expression, highly expressed genes have enriched GATA-elements in their promoter regions and their functional properties are associated with processes that are characteristic for the intestine. We systematically mapped pre-mRNA cleavage and polyadenylation sites, or polyA sites, including more than 3000 sites that have previously not been identified. The detailed analysis of the 3'-ends of the nuclear mRNA revealed widespread alternative polyA site use (APA) in intestinally expressed genes. Importantly, we found that intestinal polyA sites that undergo APA tend to have U-rich and/or A-rich upstream auxiliary elements that may contribute to the regulation of 3'-end formation in the intestine.Entities:
Mesh:
Year: 2012 PMID: 22467213 PMCID: PMC3401467 DOI: 10.1093/nar/gks282
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 3.3′-end-seq. (A) Schematic of 3′-end-seq. See Materials and Methods section for details. (B) Distribution of 3′-end-seq reads in the genome. 3′-most exons, introns and exons are defined by Wormbase (WS190); Extended refers to the region up to 3 kb downstream of the Wormbase annotated 3′-UTR. (C) An example gene (F44C4.3) showing that the reads are mapped to the 3′-end. Y-axis is the RPM value. Top, unsorted sample; bottom, sorted sample.
Figure 1.Release of transcriptionally active nuclei from postembryonic stages of C. elegans. (A) Representative pictures of isolated nuclei with different sizes and (B) of a wider field of nuclei to indicate the size distribution. DIC: differential interference contrast; D: DAPI staining; M: merged picture (DIC and D). Size bar: 5 µm. (C) Schematic of a 7-kb long ribosomal transcription unit. Antisense riboprobes are indicated by blue boxes; ets, external transcribed spacer; its1 and its2, internal transcribed spacers; 26S1 and 26S2, two different probes for the 26S rRNA. (D) Top panel (T7): control hybridization of filters with radiolabelled T7 sense transcripts; middle panel (−α): nuclear run-on analysis without α-amanitin; bottom panel (+α): run-on analysis with nuclei pre-incubated with α-amanitin. pGEM, control probe from the empty vector; Rps-6, v2 and v3, probes for the pol II transcribed rps-6 (rps-6) and vit-2 (v2 and v3) genes; 5S, probe for the pol III transcribed 5S rRNA (E) Nuclear run-on analysis for vit-2 detection plus (+α) and minus (−α) α-amanitin with a larger volume of nuclei preparation. (F) Quantitation of signals from (D). Values from each filter were normalized to 18S signals.
Figure 2.FANS purification of intestinal nuclei. (A) Strain JM149 with an intestinal nuclear marker (elt-2p::nls::gfp-h2b). Top panels: display of whole animal. Size bar: 100 µm. Bottom panels: representative pictures of isolated labelled nuclei. Size bar: 5 µm. DIC: differential interference contrast; G: green fluorescence; M: merged picture (DIC and G); D: DAPI staining. (B) Flow cytometry analysis of the isolated nuclei and the R3 gate used for sorting. Top: scatterplot of all events; bottom: summary of data. Count: number of events; % Hist: percentage of events displayed; % All: percentage of all events detected by the flow cytometer. (C) Representative pictures of nuclei prior to (unsorted) and after (sorted) sorting. DIC, G, and M are as in (A); PI, propidium iodide. (D) Purity assessment of FANS: strains JM149 (GFP::H2B marker) and CB4974 (MyoD::β-Gal marker) were equally mixed and subjected to FANS for GFP. RNA from unsorted and sorted material was isolated, amplified to aRNA and quantitatively analysed by real-time RT–PCR (see Materials and Methods section for details). The graph displays real-time PCR signals normalized against rpl-43 with unsorted values set to 1. The 21-fold enrichment of GFP::H2B mRNA over β-Gal mRNA indicates about 4.7% contamination.
Figure 4.Analysis of gene expression in the intestine using 3′-end-seq reads. (A) Scatterplot of reads in unsorted (x-axis) and sorted (y-axis) samples. Genes higher and lower expressed (P < 0.01, Fisher’s exact test; fold change > 2) in the sorted sample compared with the unsorted sample are shown in red and green, respectively. (B) Example genes having differential expression in the sorted and unsorted samples. K12C11.3 (top) and T05C12.10 (bottom) have higher and lower expression, respectively, in the sorted sample than the unsorted sample. Gene expression values (RPM) are indicated. (C)Top 8 hexamers significantly enriched for the promoter regions (−500 to +100 nt surrounding the TSS) of genes highly expressed in the intestine. P-values were based on the Fisher’s exact test comparing genes with higher expression in the sorted sample with those with higher expression in the unsorted, as showed in (A). (D) The fractions of genes containing GATA elements in different regions surrounding the TSS. Genes were divided into three groups: (i) with higher expression in the sorted (sorted >>unsorted); (ii) with higher expression in the unsorted (sorted <
Gene Ontology (GO) analysis of differentially expressed genes
| GO_ID, GO_term | -log( | |
|---|---|---|
| GO significantly associated with genes highly expressed in the intestine | GO:0060255, regulation of macromolecule metabolic process | 8.23 |
| GO:0016070, RNA metabolic process | 6.05 | |
| GO:0006629, lipid metabolic process | 5.73 | |
| GO:0055085, transmembrane transport | 4.40 | |
| GO:0006952, defence response | 2.63 | |
| GO:0042221, response to chemical stimulus | 2.61 | |
| GO:0005975, carbohydrate metabolic process | 2.42 | |
| GO:0055114, oxidation reduction | 2.20 | |
| GO significantly associated with genes lowly expressed in the intestine | GO:0003006, reproductive developmental process | 6.28 |
| GO:0010171, body morphogenesis | 5.63 | |
| GO:0007049, cell cycle | 4.46 | |
| GO:0000910, cytokinesis | 4.26 | |
| GO:0042692, muscle cell differentiation | 3.91 | |
| GO:0022607, cellular component assembly | 3.40 | |
| GO:0016192, vesicle-mediated transport | 3.19 | |
| GO:0006006, glucose metabolic process | 3.11 | |
| GO:0030036, actin cytoskeleton organization | 2.86 | |
| GO:0035188, hatching | 2.66 | |
| GO:0040012, regulation of locomotion | 2.54 | |
| GO:0019953, sexual reproduction | 2.21 | |
| GO:0005996, monosaccharide metabolic process | 2.13 |
GO terms were analysed using the Fisher’s exact test, based on highly and lowly expressed genes in the sorted sample versus unsorted (Figure 4A). GO terms associated with more than 1000 genes or more than 1000 child terms were discarded. To eliminate redundancy, we required that each reported GO term had at least 25% of associated genes not associated with any GO term with a more significant P-value.
Figure 5.Analysis of alternative polyadenylation by 3′-end-seq. (A) Venn diagram comparing polyA sites identified in this study and those reported by Mangone et al. and Jan et al. PolyA sites unique to this study were further separated based on detection in unsorted and/or sorted samples, as shown in the table next to the Venn diagram. PolyA sites from different studies that are located within 20 nt from one another were considered identical sites. pA, polyA site. (B) Percentage of genes having alternative polyA sites in different regions of genes. (C) Schematic of alternative polyA sites (top) and percentage of polyA sites with differential usage (bottom). PolyA sites were grouped into different types. Two cutoffs were used: ≥5% change and ≥10% change. ‘Single’ refers to genes that contain a single polyA site in the 3′-most exon and further polyA sites in upstream exons. (D) Percentage of polyA sites of different types with more usage in the unsorted sample (unsorted > sorted) or in the sorted sample (unsorted < sorted). Two cutoffs were used, i.e. 5 and 10%, as indicated on the top. (E) Percentage of APA genes showing significant difference in polyA site usage in sorted versus unsorted samples. Genes were grouped based on the trans-splicing structure. Operon genes were divided into first, middle and last genes, based on the gene location in operons. SL1 genes have the SL1 spliced leader, and genes without spliced leader annotation are shown as No SL genes. (F) Significant 5-mers associated with polyA sites more used in the sorted versus unsorted samples. Differentially used polyA sites were selected using P < 0.05 (Fisher’s exact test) and difference in usage >5%. Four regions around the polyA site were analysed. P-values were derived from the Fisher’s exact test.