Literature DB >> 21179090

The developmental transcriptome of Drosophila melanogaster.

Brenton R Graveley1, Angela N Brooks, Joseph W Carlson, Michael O Duff, Jane M Landolin, Li Yang, Carlo G Artieri, Marijke J van Baren, Nathan Boley, Benjamin W Booth, James B Brown, Lucy Cherbas, Carrie A Davis, Alex Dobin, Renhua Li, Wei Lin, John H Malone, Nicolas R Mattiuzzo, David Miller, David Sturgill, Brian B Tuch, Chris Zaleski, Dayu Zhang, Marco Blanchette, Sandrine Dudoit, Brian Eads, Richard E Green, Ann Hammonds, Lichun Jiang, Phil Kapranov, Laura Langton, Norbert Perrimon, Jeremy E Sandler, Kenneth H Wan, Aarron Willingham, Yu Zhang, Yi Zou, Justen Andrews, Peter J Bickel, Steven E Brenner, Michael R Brent, Peter Cherbas, Thomas R Gingeras, Roger A Hoskins, Thomas C Kaufman, Brian Oliver, Susan E Celniker.   

Abstract

Drosophila melanogaster is one of the most well studied genetic model organisms; nonetheless, its genome still contains unannotated coding and non-coding genes, transcripts, exons and RNA editing sites. Full discovery and annotation are pre-requisites for understanding how the regulation of transcription, splicing and RNA editing directs the development of this complex organism. Here we used RNA-Seq, tiling microarrays and cDNA sequencing to explore the transcriptome in 30 distinct developmental stages. We identified 111,195 new elements, including thousands of genes, coding and non-coding transcripts, exons, splicing and editing events, and inferred protein isoforms that previously eluded discovery using established experimental, prediction and conservation-based approaches. These data substantially expand the number of known transcribed elements in the Drosophila genome and provide a high-resolution view of transcriptome dynamics throughout development.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21179090      PMCID: PMC3075879          DOI: 10.1038/nature09715

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


INTRODUCTION

Drosophila melanogaster is an important non-mammalian model system that has played a critical role in basic biological discoveries, such as identifying chromosomes as the carriers of genetic information1 and uncovering the role of genes in development2,3. Because it shares a substantial genic content with humans4, Drosophila is increasingly used as a translational model for human development, homeostasis, and disease5. High quality maps are needed for all functional genomic elements. Previous studies demonstrated that a rich collection of genes is deployed during the life cycle of the fly6-8. While expression profiling using microarrays has revealed the expression of ~13K annotated genes, it is difficult to map splice junctions and individual base modifications generated by RNA editing9 using such approaches. Single-base-resolution is essential to precisely define the elements that comprise the Drosophila transcriptome. Estimates of the number of transcript isoforms are less accurate than estimates of the number of genes. While ~20% of Drosophila genes are annotated as encoding alternatively spliced pre-mRNAs, splice-junction microarray experiments suggest that this number is at least 40%7. Determining the diversity of mRNAs generated by alternative promoters, alternative splicing and RNA editing will substantially increase the inferred protein repertoire. Non-coding RNA genes (ncRNAs) including siRNAs and miRNAs (reviewed in 10), and longer ncRNAs such as bxd11 and rox12 play important roles in gene regulation, while others such as snoRNAs and snRNAs are important components of macromolecular machines such as the ribosome and spliceosome. The transcription and processing of these ncRNAs must also be fully documented and mapped. As part of the modENCODE project to annotate the functional elements of the D. melanogaster and C. elegans genomes13-15, we used RNA-Seq and tiling microarrays to sample the Drosophila transcriptome at unprecedented depth throughout development from early embryo to aging, male and female adults. We report on a high-resolution view of the discovery, structure and dynamic expression of the D. melanogaster transcriptome.

RESULTS

Strategy for Characterization of the Transcriptome

To discover new transcribed features (Supplementary Table 1) and comprehensively characterize their expression dynamics throughout development, we conducted complementary tiling microarray and RNA-Seq experiments using RNA isolated from 30 whole-animal samples representing 27 distinct stages of development (Supplementary Table 2). These included 12 embryonic samples collected at two-hour intervals for 24 hours, six larval, six pupal, and three sexed adult stages at 1, 5, and 30 days post-eclosion. We used 38 bp resolution genome tiling microarrays to analyze total RNA from all 30 biological samples and poly(A)+ mRNA from the 12 embryonic samples (Supplementary Fig. 1). To attain single nucleotide resolution and to facilitate the analysis of alternative splicing and RNA editing, we performed non-strand specific poly(A)+ RNA-seq from all 30 samples generating a combination of single and paired-end ~75 bp reads on the Illumina GAII platform (short poly (A)+ RNA-Seq) (Supplementary Table 3, Supplementary Fig. 2). To identify primary transcripts and non-coding RNAs, the 12 embryonic time points were also interrogated with strand-specific 50 bp sequence reads from partially rRNA-depleted total RNA on the Applied Biosystems SOLiD platform (Supplementary Table 4, Supplementary Fig. 3). To improve connectivity, mixed-stage embryos, adult males and adult females were used to generate ~250 bp reads on the Roche 454 platform (non-strand specific long poly(A)+ RNA-Seq) (Supplementary Table 5). In total, we generated 176,962,906,041 bp of mapped sequence representing 1,266-fold coverage of the genome and 5,902-fold coverage of the annotated D. melanogaster transcriptome.

Discovery of New Transcribed Regions

We identified 1,938 New Transcribed Regions (NTRs) not linked to any annotated gene models. Herein, “transcripts” refer to RNA molecules synthesized from a genomic locus while “genes” refer to one or more transcripts that share exons in their mature spliced form. modENCODE cDNAs fully support 13% of the NTRs (Supplementary Fig. 4) and partially support 23%. Most NTRs (84%) are detected by poly(A)+ RNA-seq, 44% by total RNA-Seq, and 42% by tiling array. Approximately half of the NTRs are conserved in the distantly related D. pseudoobscura and D. mojavensis (Supplementary Fig. 4b) and 30% of these are detected by poly(A)+ RNA-Seq data from D. pseudoobscura or D. mojavensis adult heads (Supplementary Fig. 4c,d, Supplementary Table 6). The NTRs most likely eluded prior detection because they are expressed at low levels, in temporally restricted patterns, and are enriched for single exon genes. The new multi-exon gene models (48%) have fewer, shorter, and less conserved exons than annotated genes. Nearly one-third of the NTRs have a predicted ORF greater than 100 aa. The remaining NTRs could encode small peptides but many are likely to be non-coding RNAs. A small fraction (9%) of NTRs are heterochromatic, the majority of these (232) have sequence similarity (greater than 100 nt match and greater than 60% identity) to transposable elements (TEs) and represent transcribed TEs or TE fragments. It remains to be determined if these regions have any function, although recent studies describe TE associated regions that have acquired functions16,17. Even in the well-studied bithorax complex2, we found an NTR. Known genetic breakpoints in the infra-abdominal regions iab-3 to iab-8, which lie between the homeotic genes abdominal-A (abd-A) and Abdominal-B (Abd-B), disrupt normal male development and affect fertility18,19. Within this region are regulatory elements20 and evidence for long non-coding RNAs that have eluded detection for over 20 years21-23. We used the RNA-seq data to infer the structures of at least three overlapping transcripts and validated one form (Fig. 1). The RNAs are expressed in embryos and adult males but not females. Based on the presumed role of this new gene and spatial expression in the embryonic gonad (data not shown), we have named it male specific abdominal (msa). The cDNA contains short open reading frames (ORFs) that are conserved in the melanogaster subgroup and could encode male specific peptides. Whether they function as regulatory and/or as peptide encoding RNAs is an important question for understanding development and segmental morphological diversity.
Figure 1

Discovery of new RNAs in the bithorax complex

Genomic organization and experimental evidence for new transcripts located between the HOX genes, abd-A and Abd-B based on short poly(A)+ RNA and total RNA-seq expression profiles. The numbers to the left of each track indicate the maximal number of reads for that sample. Three manually curated junction-based transcript models are shown, the green transcript model was fully validated by a cDNA, MIP06894.

Discovery of small non-coding RNAs

We identified 37 unannotated intron-encoded and two unannotated intergenic small ncRNAs (<300 nt) with an average fragments per kilobase of transcript per million fragments mapped (FPKM)24 >20 from total embryonic RNA-Seq (Fig. 2 and Supplementary Table 7). Most of these ncRNAs are highly conserved in Drosophila sibling species25. We found published but unannotated ncRNAs: a U4atac snRNA26 and four small Cajal body-specific RNAs (scaRNAs)27. Of the remaining 34 ncRNAs, three are box C/D-like snoRNAs, 28 are box H/ACA-like snRNAs, one is a scaRNA-like RNA, and two are unclassified. One-third are located in the introns of genes encoding RNA binding proteins, the majority involved in pre-mRNA splicing (xl6, SC35, tra2, dek, prp8, tudor-SN, and pUf68).
Figure 2

Discovery of small non-coding RNAs

a. Poly(A)+ (yellow) and total RNA (blue) data from 10-12 h embryos are shown for the gp210 gene which hosts a representative new snoRNA. The maximal number of reads in the poly(A)+ and total RNA-Seq data are shown on the left and right of the track, respectively. b. The predicted RNA secondary structure of snoRNA is characteristic of a H/ACA-box snoRNA. Nucleotides that are 100% conserved in sequence or base-pairing are indicated in green and blue, respectively. c. Embryonic expression of the new small RNAs. The scale bar indicates FPKM Z-scores.

Discovery of microRNA primary transcripts

MicroRNAs are processed from primary microRNA transcripts (pri-miRNAs) and are either independently transcribed or embedded in the introns of protein-coding genes. We identified 23 putative independently transcribed pri-miRNAs from the total embryonic RNA-Seq and tiling array data that encode 37 annotated miRNAs (Supplementary Table 8). Only two primary transcripts were previously annotated (bft and iab-4). The pri-miRNAs range from 1 to 18 kb and terminate at the mature miRNA (pre-mir-315, Supplementary Fig. 5a). Twelve of the 23 precursors have Cap Analysis of Gene Expression (CAGE) peaks that map at their initiation sites28. pri-miRNA expression is dynamic in embryonic development (Supplementary Fig. 5b).

Overview of the Drosophila Transcriptome

We calculated expression levels of annotated genes, transcripts, and NTRs (Supplementary Table 9) in the short poly(A)+ RNA-Seq and tiling array datasets. From the RNA-Seq data we detected expression of 14,862 genes (Supplementary Fig. 7a) and 36,274 transcripts (Fig. 3a) with an FPKM>1 (Supplementary Tables 9-18) of which 67% of genes and 58% of transcripts were also observed in the array data (score >300) (Supplementary Fig. 6, Supplementary Tables 19 & 20). This includes the confirmation of 87% of annotated genes and transcripts and the discovery of 17,745 new transcripts. In addition, from the total RNA-Seq data we detected expression of 12,854 genes and 32,139 transcripts with an FPKM>1 (Supplementary Tables 12,13,21 & 22) of which 77% of genes and 89% of transcripts were also observed in the array data. Of the genes and transcripts observed exclusively in the total RNA-Seq data, 519 genes and 1,005 transcripts (primarily noncoding) were previously annotated and 122 genes and 1,422 transcripts are new discoveries. The genes and transcripts not detected in any dataset include small genes (< 200 bp), members of multi-copy gene families such as ribosomal RNAs, paralogs (expected due to our mapping parameters), genes known to be expressed at low levels or in small numbers of cells (e.g. gustatory and odorant receptor genes), and non-polyadenylated transcripts.
Figure 3

Dynamics of Gene Expression

a. Transcripts expressed (FPKM>1) in the short poly(A)+ RNA-Seq data, FB5.12 (blue), modENCODE (purple). The bar graphs indicate the number of transcripts expressed in each sample (Supplementary Table 1), and the lines, the cumulative number of expressed transcripts. The lighter blue and purple lines indicate the cumulative number of transcripts expressed in the embryonic Total RNA-Seq samples. The horizontal dotted lines indicate the number of expressed previously annotated transcripts. b. Scatter plot of sex-biased gene expression. light red: female-biased annotated (n=960), dark red: female-biased NTRs (n=12), light blue: male-biased annotated (n=2,401), dark blue: male-biased NTRs (n=431), light grey: unbiased annotated (n=8,217), black: unbiased NTRs (n=136). c. Genome Coverage. For each developmental sample, the short poly(A)+ reads were used to estimate the percent of the genome covered using a cutoff of two reads. The mature and primary transcripts were inferred for the previously FB5.12 (dotted lines) and modENCODE (solid lines) gene models.

Expression Dynamics

We examined the dynamics of gene expression throughout development using the short poly(A)+ RNA-Seq data. The numbers of expressed genes (FPKM >1) (Supplementary Fig. 7a) and transcripts (Fig. 3a) gradually increases, from 7,045 (0-2 hr embryos) to 12,000 (adult males). Adult males express ~3,000 more genes than adult females, consistent with the known transcriptional complexity of the testis29. We observed that 40% of expressed genes are constitutively expressed in 30 samples (Supplementary Fig. 7b). We also observed developmentally regulated expression of transposable elements (Supplementary Materials, Supplementary Fig. 8). We observed pronounced expression changes in over 1,500 genes in the first two third instar larval samples (Supplementary Fig. 7a,c). Expression of 1,199 genes increased at least 10-fold, and 421 genes decreased at least 10-fold (Supplemental Table 23). Nearly all of the up-regulated genes are expressed for the first time during the third instar stage and most are poorly characterized genes. The earliest known event in metamorphosis is the “mid-3rd transition”30,. identified by the synchronous changes in the transcription of a number of well-studied genes, Ecdysone-induced protein 28/29kD and fat body protein 1 (reviewed in31) and the switch from proximal to distal promoters of Alcohol dehydrogenase32. These markers coincide with the surge reported here. The mid-3rd transition has no morphological or behavioral correlates and is associated with a pulse of the steroid hormone, ecdysone33 acting through a non-standard receptor34. Whether the onset of testis development is a consequence of the mid-3rd transition, or whether the two events are functionally related remains to be investigated. Over 29% of protein-coding genes showed significant sex-biased expression in adults (FDR<0.1%), with more male-biased (1,829) or male-specific genes (572) than female-biased (945) or female-specific genes (15) (Supplementary Tables 24 & 25) and Fig. 3b). Known female (ovo and otu) and male (dj) sex-biased genes were expressed as expected. We found that 74% of the NTRs expressed in adults were significantly male-biased whereas only 2.1% were significantly female-biased.

Genome Coverage

Mature mRNAs are encoded by 20% of the D. melanogaster genome and primary transcripts by 60% (Fig. 3c). An additional 15% of the genome (~75% total) is detected when considering all of the short poly(A)+ RNA-Seq data. However, as greater than 99% of the reads map within the bounds of the transcript models, the reads that map to intergenic regions constitute a small minority of our data. Thus, though pervasive transcription of mammalian genomes has been observed in microarray studies35, we found little evidence of such “dark matter”36.

Discovery and dynamics of alternative splicing

To characterize constitutive and alternative splicing, we identified 71,316 splice junctions, of which 22,965 were new discoveries. Of the new splice junctions, 26% were supported by multiple experimental data types and 74% by only one data type, (Supplementary Fig. 9a) primarily short poly(A)+ RNA-Seq. Of the 20,751 new junctions from the short poly(A)+ RNA-Seq data, 7,833 were incorporated into new transcript models or transcribed regions (NTRs). The remaining new junctions have yet to be incorporated into transcript models. We also identified a total of 102,026 exons (Supplementary Table 26). Of the 52,914 representing new and revised exons, 65% were validated by capture and sequencing of cDNAs and 2,586 were supported by RNA-Seq data from D. mojavensis and D. pseudoobscura. Of the new exons, 3,392 were identified from the new splice junctions but have yet to be incorporated into transcript models. To examine splicing dynamics throughout development, we categorized all splicing events into the common types of alternative splicing events (Table 1). We identified a total of 23,859 splicing events, of which 18,490 were new or recategorized, a three-fold increase from annotated splicing events. An additional 2,988 intron-retention events were identified from the short poly(A)+ RNA-seq data, and are yet to be supported by other experimental data. In all, 7,473 genes contain at least one alternative splicing event, 60.7% of the 12,295 expressed multi-exon genes – also a three-fold increase in the fraction of genes with alternatively spliced transcripts. While smaller than the fraction of human genes with alternatively spliced transcripts (95%)37,38, a larger proportion of Drosophila genes encode alternative transcripts than was previously known.
Table 1

Classification of Alternative Splicing Events

FlyBase r5.12
modENCODE
New Events
Short Poly(A)+ RNA-Seq
Significantly Changing
Cassette Exons7932,7172,0142,3691,539
Alternative 5’ Splice Sites8435,1924,5994,5833,142
Alternative 3’ Splice Sites8796,2535,5055,5793,242
Mutually Exclusive Exons229251123228226
Coordinate Cassette Exons3011,227979992467
Alternative First Exons1,7674,9363,4424,4733,996
Alternative Last Exons227604553553471
Retained/Unprocessed Introns1,4342,679 (5,667)1,275 (4,263)2,439 (35,641)868 (8,998)

6,47323,859 (26,847)18,490 (21,478)21,216 (54,418)13,951 (22,081)

The number of retained/unprocessed introns indicated in parentheses indicates the total number identified, while the number not in parentheses indicates the subset of identified events that have been validated by cDNA sequences or FlyBase 5.12 annotations.

Of the new alternative exons, 8,226 were previously annotated as constitutive. As observed39, annotated cassette exons, and their flanking introns are more highly conserved than annotated constitutive exons (Fig. 4a). The newly discovered cassette exons are more highly conserved than the new constitutive exons, though both classes are less conserved than the corresponding class of annotated exons. New cassette exons that were previously annotated as constitutive exons are the most highly conserved set of exons (Fig. 4a). Annotated and new cassette exons show a strong tendency to preserve reading-frame (Supplementary Fig. 9b) suggesting that these transcripts increase protein diversity. Both annotated and new cassette exons tend to be shorter than their constitutive counterparts, though both sets of new exons tend to be shorter than annotated exons.
Figure 4

Developmentally regulated splicing events

a. Conservation of internal constitutive and cassette exons >50 nt that were annotated or new discoveries. (Annotated Constitutive, n=26,127; Annotated Cassette, n=438; modENCODE Cassette n=173; modENCODE Constitutive n=306; 5.12 Constitutive to modENCODE Cassette n=304). b. Clusters of regulated cassette exon events during development. The scale bar indicates Z-scores of Ψ. c. Regulated alternative splicing in CadN during embryogenesis. The maximal number of reads in the poly(A)+ RNA-Seq data are indicated for each track

To assess the extent of splicing variation we calculated the “percent spliced in” or Ψ38 for each splicing event in each sample as well as the switch score (ΔΨ) by determining the difference between the highest and lowest Ψ values across development (ΔΨ = ΨMAX - ΨMIN). This revealed a very smooth distribution of ΔΨ among all events indicating that the splicing of most exons is fairly constant while only a minority change dramatically (Supplementary Fig. 9c, Supplemental Table (was old 31 now is Table 27). Only 831 splicing events have a ΔΨ value >90. Further statistical analyses (see Methods) identified 15,847 (67%) alternative splicing events that change significantly throughout development (Supplemental Table (was old 32 now is Table 28). Hierarchical clustering of cassette exon events revealed the dynamic nature of splicing throughout development (Fig. 4b) as exemplified by Cadherin-N (CadN), a gene with three sets of mutually exclusive exons (Fig. 4c). In each set, one exon is preferentially included in early embryos, the other in late embryos, with a smooth transition between the two. Our analysis also identified groups of exons that have coordinated splicing patterns (Fig. 4b). A set of 55 genes contains exons that are preferentially included in early embryos, late larvae, early pupae, and females but skipped in all other stages. GO analysis of these genes suggests that many encode proteins involved in epithelial cell-to-cell junctions. GO analysis of genes that contain exons preferentially included during late pupal and adult stages, suggests that many encode proteins that are part of neuronal synapses.

Sex-biased alternative splicing

Sex determination in Drosophila is mediated by a cascade of regulated alternative splicing events involving Sex lethal (Sxl), transformer (tra), male-specific lethal 2 (msl-2), doublesex (dsx), and fruitless (fru) that specify nearly all physical and behavioral dimorphisms between males and females as well as X chromosome dosage compensation40. Our RNA-Seq data confirm sex-biased splicing of Sxl (ΔΨ=89.6), tra (ΔΨ=39.2), dsx (ΔΨ=59.7), and fru (ΔΨ=100). In addition to the canonical sex-determination cascade, we identified 119 strongly sex-biased splicing events (ΔΨ>70) (Supplementary Fig. 9d). One striking example is Reps which was annotated as containing six constitutive exons. RNA-Seq data indicate that exon five is a sex-biased alternative cassette exon (ΔΨ=73.39) (Supplementary Fig. 10). This highly conserved exon is included in males and skipped in females. The intron upstream of this cassette exon contains conserved SXL binding sites suggesting it is regulated by SXL and is a candidate sex differentiation gene.

Discovery of RNA editing sites

Previous studies identified 127 sites in 55 Drosophila genes that undergo A-to-I RNA editing41. This post-transcriptional modification is catalyzed by dADAR, which is expressed at increasing levels throughout development and is thought to target products involved in nervous system function. We analyzed the poly(A)+ RNA-Seq data to identify exonic nucleotide positions consistent with A-to-I editing and defined 972 edited positions within transcripts of 597 genes, including previously described edited sites in the transcripts of 36 genes (Supplementary Table 29). These genes include those required for rapid neurotransmission and other widely ranging functions. For most sites, the frequency of editing increases throughout development and does not correlate with overall expression levels (Fig. 5a). Editing typically begins in late pupal stages, although we find transcripts that appear to be edited in late embryogenesis. Consistent with earlier studies42, exons containing editing sites are more highly conserved than unedited exons. The majority of the edited positions (630) alter amino acid coding, the others are either silent (201) or within UTRs (141). For example, the transcripts of quiver (qvr) are edited at six positions, four that result in amino acid changes (Fig. 5b). qvr encodes a potassium channel subunit that modulates the function of the voltage-gated Shaker (SH) potassium channel. Sh transcripts are also edited at multiple positions43. The combinatorial editing of both proteins likely plays an important role in modulating action potentials in the arthropod nervous system and may have implications for the regulation of sleep44. ESTs, long poly(A)+ RNA-seq and cDNAs cross validate nearly a quarter (214) of the newly discovered sites.
Figure 5

Discovery of RNA editing events

a. Rows represent an edited sites. Rank ordered expression levels (number of reads) are shown in green and the rank ordered editing ratios are shown in red. Pictogram representations of editing motifs A, B, and C are shown. b. RNA editing of qvr. Male and female expression and conservation tracks are shown above RNA-Seq reads from adult females that align to the edited positions (orange). Conceptual translation of the unedited and fully transcripts result in four amino acid changes (red) at the C-terminus of QVR.

Computational analysis identified three potential editing associated sequence motifs (Fig. 5a). We observe 381 sites with one or more motifs in close proximity to the edited nucleotide (Supplementary Table 30). Motif ‘C’, while less common than motifs ‘A’ and ‘B’, is more strongly associated with the editing site. Most (93%) instances of motif ‘C’ occur on the sense strand of the transcript and the A at the 3’ end of the motif is the edited nucleotide. This motif is overrepresented in editing events that occur early in development.

DISCUSSION

Our interrogation of the transcriptome of D. melanogaster throughout development has considerably expanded the number of building blocks used to make a fly. Specifically, we identified nearly 2,000 NTRs, increased the number of alternative splicing events by three-fold and the number of RNA editing sites by an order of magnitude. The resulting view of the transcriptome at single-base resolution dramatically improves our understanding of expression dynamics throughout the Drosophila life cycle and has substantial biological implications. The D. melanogaster, C. elegans and human genomes are organized quite differently. Specifically, 20%, 45% and 2.5% of the D. melanogaster, C. elegans and human genomes, respectively, encode exons or mature transcripts. Primary transcripts comprise a larger fraction of each genome – 60%, 82% and 37%. This highlights the facts that primary transcripts and introns are much shorter in D. melanogaster and C. elegans than in human and that the D. melanogaster and C. elegans genomes are more compact than the human genome. The existence of unannotated genes was suggested by microarray studies8,45 and conservation among Drosophilid genomes25. However, the NTRs we identified were not identified by comparative sequence analysis46 as they are less conserved than most previously known genes. This emphasizes the importance of using both comparative analyses and transcriptome profiling for genome annotation. Despite the depth of our sequencing, the annotation of the D. melanogaster transcriptome is not finished. We failed to detect expression of 1,488 annotated genes including members of gene families to which short reads can not be uniquely mapped and genes expressed at low levels or in spatially and temporally restricted patterns. Moreover, though we substantially increased the fraction of genes that encode alternatively spliced or edited transcripts, we again failed to detect several annotated RNA processing events. Study of more temporally and spatially restricted samples will allow deeper exploration of the Drosophila transcriptome, and almost certainly result in the discovery of yet additional features. Furthermore, functional studies of the new and previously unstudied elements, will provide valuable insight into metazoan development.

METHODS SUMMARY

Animal Staging, Collection and RNA extraction

Isogenic (y) embryos were collected at two-hour intervals for 24 hours. Collection of later staged animals started with synchronized embryos and included resynchronizing with appropriate age indicators. Six larval, six pupal, and three adult sexed stages, 1, 5 and 30 days were collected. RNA was isolated using TRIzol (Invitrogen), DNased, and purified on a RNAeasy column (Qiagen). poly(A)+ RNA was prepared from an aliquot of each total RNA sample using an Oligotex kit (Qiagen).

Tiling Arrays

RNAs from three biological replicates of each sample were independently hybridized on 38-bp arrays (Affymetrix GeneChip® Drosophila Tiling 2.0R Array) as described47.

RNA-Seq

Libraries were generated and sequenced on an Illumina Genome Analyzer IIx using single or paired-end chemistry and 76 bp cycles. SOLiD sequencing used total RNA treated with the RiboMinusTM Eukaryote Kit (Invitrogen). Samples were fragmented, adaptors ligated (Ambion) and sequenced for 50 bases using SOLiD V3 chemistry. 454 sequencing used poly(A)+ RNA from Oregon R adult males and females and mixed-staged y embryos. Sequences are available from the Short Read Archive and the modENCODE website (http://www.modencode.org/).

Targeted RT-PCR and cDNA Isolation and Sequencing

Standard procedures were used for RT-PCR and targeted cDNA isolation and sequencing.

Analysis

Cufflinks24 was used to identify new transcript models and to calculate expression levels for annotated and predicted transcript models. MFold48 was used to predict secondary structures from the new snoRNA-like RNAs. JuncBASE 49 identified alternative splicing events and calculated percent spliced in (Ψ)38. Editing sites were identified by comparing aligned reads to the reference genome. Full Methods and any associated references are available in the online version of the paper at www.nature.com/nature.
  46 in total

1.  A revised model for U4atac/U6atac snRNA base pairing.

Authors:  Richard A Padgett; Girish C Shukla
Journal:  RNA       Date:  2002-02       Impact factor: 4.942

2.  Conservation of an RNA regulatory map between Drosophila and mammals.

Authors:  Angela N Brooks; Li Yang; Michael O Duff; Kasper D Hansen; Jung W Park; Sandrine Dudoit; Steven E Brenner; Brenton R Graveley
Journal:  Genome Res       Date:  2010-10-04       Impact factor: 9.043

3.  Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures.

Authors:  Alexander Stark; Michael F Lin; Pouya Kheradpour; Jakob S Pedersen; Leopold Parts; Joseph W Carlson; Madeline A Crosby; Matthew D Rasmussen; Sushmita Roy; Ameya N Deoras; J Graham Ruby; Julius Brennecke; Emily Hodges; Angie S Hinrichs; Anat Caspi; Benedict Paten; Seung-Won Park; Mira V Han; Morgan L Maeder; Benjamin J Polansky; Bryanne E Robson; Stein Aerts; Jacques van Helden; Bassem Hassan; Donald G Gilbert; Deborah A Eastman; Michael Rice; Michael Weir; Matthew W Hahn; Yongkyu Park; Colin N Dewey; Lior Pachter; W James Kent; David Haussler; Eric C Lai; David P Bartel; Gregory J Hannon; Thomas C Kaufman; Michael B Eisen; Andrew G Clark; Douglas Smith; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

4.  RNA editing in Drosophila melanogaster: New targets and functional consequences.

Authors:  Mark Stapleton; Joseph W Carlson; Susan E Celniker
Journal:  RNA       Date:  2006-10-03       Impact factor: 4.942

5.  Molecular analysis of the initiation of insect metamorphosis: a comparative study of Drosophila ecdysteroid-regulated transcription.

Authors:  A J Andres; J C Fletcher; F D Karim; C S Thummel
Journal:  Dev Biol       Date:  1993-12       Impact factor: 3.582

6.  Modulation of the frequency response of Shaker potassium channels by the quiver peptide suggesting a novel extracellular interaction mechanism.

Authors:  Jing W Wang; Chun-Fang Wu
Journal:  J Neurogenet       Date:  2010-07       Impact factor: 1.250

7.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project.

Authors:  Ewan Birney; John A Stamatoyannopoulos; Anindya Dutta; Roderic Guigó; Thomas R Gingeras; Elliott H Margulies; Zhiping Weng; Michael Snyder; Emmanouil T Dermitzakis; Robert E Thurman; Michael S Kuehn; Christopher M Taylor; Shane Neph; Christoph M Koch; Saurabh Asthana; Ankit Malhotra; Ivan Adzhubei; Jason A Greenbaum; Robert M Andrews; Paul Flicek; Patrick J Boyle; Hua Cao; Nigel P Carter; Gayle K Clelland; Sean Davis; Nathan Day; Pawandeep Dhami; Shane C Dillon; Michael O Dorschner; Heike Fiegler; Paul G Giresi; Jeff Goldy; Michael Hawrylycz; Andrew Haydock; Richard Humbert; Keith D James; Brett E Johnson; Ericka M Johnson; Tristan T Frum; Elizabeth R Rosenzweig; Neerja Karnani; Kirsten Lee; Gregory C Lefebvre; Patrick A Navas; Fidencio Neri; Stephen C J Parker; Peter J Sabo; Richard Sandstrom; Anthony Shafer; David Vetrie; Molly Weaver; Sarah Wilcox; Man Yu; Francis S Collins; Job Dekker; Jason D Lieb; Thomas D Tullius; Gregory E Crawford; Shamil Sunyaev; William S Noble; Ian Dunham; France Denoeud; Alexandre Reymond; Philipp Kapranov; Joel Rozowsky; Deyou Zheng; Robert Castelo; Adam Frankish; Jennifer Harrow; Srinka Ghosh; Albin Sandelin; Ivo L Hofacker; Robert Baertsch; Damian Keefe; Sujit Dike; Jill Cheng; Heather A Hirsch; Edward A Sekinger; Julien Lagarde; Josep F Abril; Atif Shahab; Christoph Flamm; Claudia Fried; Jörg Hackermüller; Jana Hertel; Manja Lindemeyer; Kristin Missal; Andrea Tanzer; Stefan Washietl; Jan Korbel; Olof Emanuelsson; Jakob S Pedersen; Nancy Holroyd; Ruth Taylor; David Swarbreck; Nicholas Matthews; Mark C Dickson; Daryl J Thomas; Matthew T Weirauch; James Gilbert; Jorg Drenkow; Ian Bell; XiaoDong Zhao; K G Srinivasan; Wing-Kin Sung; Hong Sain Ooi; Kuo Ping Chiu; Sylvain Foissac; Tyler Alioto; Michael Brent; Lior Pachter; Michael L Tress; Alfonso Valencia; Siew Woh Choo; Chiou Yu Choo; Catherine Ucla; Caroline Manzano; Carine Wyss; Evelyn Cheung; Taane G Clark; James B Brown; Madhavan Ganesh; Sandeep Patel; Hari Tammana; Jacqueline Chrast; Charlotte N Henrichsen; Chikatoshi Kai; Jun Kawai; Ugrappa Nagalakshmi; Jiaqian Wu; Zheng Lian; Jin Lian; Peter Newburger; Xueqing Zhang; Peter Bickel; John S Mattick; Piero Carninci; Yoshihide Hayashizaki; Sherman Weissman; Tim Hubbard; Richard M Myers; Jane Rogers; Peter F Stadler; Todd M Lowe; Chia-Lin Wei; Yijun Ruan; Kevin Struhl; Mark Gerstein; Stylianos E Antonarakis; Yutao Fu; Eric D Green; Ulaş Karaöz; Adam Siepel; James Taylor; Laura A Liefer; Kris A Wetterstrand; Peter J Good; Elise A Feingold; Mark S Guyer; Gregory M Cooper; George Asimenos; Colin N Dewey; Minmei Hou; Sergey Nikolaev; Juan I Montoya-Burgos; Ari Löytynoja; Simon Whelan; Fabio Pardi; Tim Massingham; Haiyan Huang; Nancy R Zhang; Ian Holmes; James C Mullikin; Abel Ureta-Vidal; Benedict Paten; Michael Seringhaus; Deanna Church; Kate Rosenbloom; W James Kent; Eric A Stone; Serafim Batzoglou; Nick Goldman; Ross C Hardison; David Haussler; Webb Miller; Arend Sidow; Nathan D Trinklein; Zhengdong D Zhang; Leah Barrera; Rhona Stuart; David C King; Adam Ameur; Stefan Enroth; Mark C Bieda; Jonghwan Kim; Akshay A Bhinge; Nan Jiang; Jun Liu; Fei Yao; Vinsensius B Vega; Charlie W H Lee; Patrick Ng; Atif Shahab; Annie Yang; Zarmik Moqtaderi; Zhou Zhu; Xiaoqin Xu; Sharon Squazzo; Matthew J Oberley; David Inman; Michael A Singer; Todd A Richmond; Kyle J Munn; Alvaro Rada-Iglesias; Ola Wallerman; Jan Komorowski; Joanna C Fowler; Phillippe Couttet; Alexander W Bruce; Oliver M Dovey; Peter D Ellis; Cordelia F Langford; David A Nix; Ghia Euskirchen; Stephen Hartman; Alexander E Urban; Peter Kraus; Sara Van Calcar; Nate Heintzman; Tae Hoon Kim; Kun Wang; Chunxu Qu; Gary Hon; Rosa Luna; Christopher K Glass; M Geoff Rosenfeld; Shelley Force Aldred; Sara J Cooper; Anason Halees; Jane M Lin; Hennady P Shulha; Xiaoling Zhang; Mousheng Xu; Jaafar N S Haidar; Yong Yu; Yijun Ruan; Vishwanath R Iyer; Roland D Green; Claes Wadelius; Peggy J Farnham; Bing Ren; Rachel A Harte; Angie S Hinrichs; Heather Trumbower; Hiram Clawson; Jennifer Hillman-Jackson; Ann S Zweig; Kayla Smith; Archana Thakkapallayil; Galt Barber; Robert M Kuhn; Donna Karolchik; Lluis Armengol; Christine P Bird; Paul I W de Bakker; Andrew D Kern; Nuria Lopez-Bigas; Joel D Martin; Barbara E Stranger; Abigail Woodroffe; Eugene Davydov; Antigone Dimas; Eduardo Eyras; Ingileif B Hallgrímsdóttir; Julian Huppert; Michael C Zody; Gonçalo R Abecasis; Xavier Estivill; Gerard G Bouffard; Xiaobin Guan; Nancy F Hansen; Jacquelyn R Idol; Valerie V B Maduro; Baishali Maskeri; Jennifer C McDowell; Morgan Park; Pamela J Thomas; Alice C Young; Robert W Blakesley; Donna M Muzny; Erica Sodergren; David A Wheeler; Kim C Worley; Huaiyang Jiang; George M Weinstock; Richard A Gibbs; Tina Graves; Robert Fulton; Elaine R Mardis; Richard K Wilson; Michele Clamp; James Cuff; Sante Gnerre; David B Jaffe; Jean L Chang; Kerstin Lindblad-Toh; Eric S Lander; Maxim Koriabine; Mikhail Nefedov; Kazutoyo Osoegawa; Yuko Yoshinaga; Baoli Zhu; Pieter J de Jong
Journal:  Nature       Date:  2007-06-14       Impact factor: 49.962

8.  Most "dark matter" transcripts are associated with known genes.

Authors:  Harm van Bakel; Corey Nislow; Benjamin J Blencowe; Timothy R Hughes
Journal:  PLoS Biol       Date:  2010-05-18       Impact factor: 8.029

9.  Learning the common language of genetics.

Authors:  Allan C Spradling
Journal:  Genetics       Date:  2006-09       Impact factor: 4.562

10.  Gene expression during the life cycle of Drosophila melanogaster.

Authors:  Michelle N Arbeitman; Eileen E M Furlong; Farhad Imam; Eric Johnson; Brian H Null; Bruce S Baker; Mark A Krasnow; Matthew P Scott; Ronald W Davis; Kevin P White
Journal:  Science       Date:  2002-09-27       Impact factor: 47.728

View more
  804 in total

1.  In search of beneficial coding RNA editing.

Authors:  Guixia Xu; Jianzhi Zhang
Journal:  Mol Biol Evol       Date:  2014-11-12       Impact factor: 16.240

2.  Widespread generation of alternative UTRs contributes to sex-specific RNA binding by UNR.

Authors:  Marija Mihailovic; Marija Mihailovich; Laurence Wurth; Federico Zambelli; Irina Abaza; Cristina Militti; Francesco M Mancuso; Guglielmo Roma; Giulio Pavesi; Fátima Gebauer
Journal:  RNA       Date:  2011-11-18       Impact factor: 4.942

3.  Binary regulation of Hippo pathway by Merlin/NF2, Kibra, Lgl, and Melted specifies and maintains postmitotic neuronal fate.

Authors:  David Jukam; Claude Desplan
Journal:  Dev Cell       Date:  2011-11-03       Impact factor: 12.270

4.  Extra double-stranded RNA binding domain (dsRBD) in a squid RNA editing enzyme confers resistance to high salt environment.

Authors:  Juan Pablo Palavicini; Rodrigo A Correa-Rojas; Joshua J C Rosenthal
Journal:  J Biol Chem       Date:  2012-03-28       Impact factor: 5.157

5.  Evidence of abundant stop codon readthrough in Drosophila and other metazoa.

Authors:  Irwin Jungreis; Michael F Lin; Rebecca Spokony; Clara S Chan; Nicolas Negre; Alec Victorsen; Kevin P White; Manolis Kellis
Journal:  Genome Res       Date:  2011-10-12       Impact factor: 9.043

6.  Functionalization of a protosynaptic gene expression network.

Authors:  Cecilia Conaco; Danielle S Bassett; Hongjun Zhou; Mary Luz Arcila; Sandie M Degnan; Bernard M Degnan; Kenneth S Kosik
Journal:  Proc Natl Acad Sci U S A       Date:  2012-06-20       Impact factor: 11.205

7.  Polymorphism and Divergence of Novel Gene Expression Patterns in Drosophila melanogaster.

Authors:  Julie M Cridland; Alex C Majane; Hayley K Sheehy; David J Begun
Journal:  Genetics       Date:  2020-07-31       Impact factor: 4.562

8.  Extensive Differential Splicing Underlies Phenotypically Plastic Aphid Morphs.

Authors:  Mary E Grantham; Jennifer A Brisson
Journal:  Mol Biol Evol       Date:  2018-08-01       Impact factor: 16.240

9.  Evolution of gene structural complexity: an alternative-splicing-based model accounts for intron-containing retrogenes.

Authors:  Chengjun Zhang; Andrea R Gschwend; Yidan Ouyang; Manyuan Long
Journal:  Plant Physiol       Date:  2014-02-11       Impact factor: 8.340

10.  Xio is a component of the Drosophila sex determination pathway and RNA N6-methyladenosine methyltransferase complex.

Authors:  Jian Guo; Hong-Wen Tang; Jing Li; Norbert Perrimon; Dong Yan
Journal:  Proc Natl Acad Sci U S A       Date:  2018-03-19       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.