Literature DB >> 25273863

Revised annotations, sex-biased expression, and lineage-specific genes in the Drosophila melanogaster group.

Rebekah L Rogers1, Ling Shao2, Jaleal S Sanjak2, Peter Andolfatto3, Kevin R Thornton2.   

Abstract

Here, we provide revised gene models for D. ananassae, D. yakuba, and D. simulans, which include untranslated regions and empirically verified intron-exon boundaries, as well as ortholog groups identified using a fuzzy reciprocal-best-hit blast comparison. Using these revised annotations, we perform differential expression testing using the cufflinks suite to provide a broad overview of differential expression between reproductive tissues and the carcass. We identify thousands of genes that are differentially expressed across tissues in D. yakuba and D. simulans, with roughly 60% agreement in expression patterns of orthologs in D. yakuba and D. simulans. We identify several cases of putative polycistronic transcripts, pointing to a combination of transcriptional read-through in the genome as well as putative gene fusion and fission events across taxa. We furthermore identify hundreds of lineage specific genes in each species with no blast hits among transcripts of any other Drosophila species, which are candidates for neofunctionalized proteins and a potential source of genetic novelty.
Copyright © 2014 Rogers et al.

Entities:  

Keywords:  D. ananassae; D. simulans; D. yakuba; gene annotation; gene fusion; polycistronic genes; sex-specific gene expression

Mesh:

Year:  2014        PMID: 25273863      PMCID: PMC4267930          DOI: 10.1534/g3.114.013532

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Accurate models of gene structure including untranslated regions (UTRs), intron-exon boundaries, as well as coding sequences are essential for proper interpretation of molecular genetics (Fire , Jinek ), demographic inference (Halligan and Keightley 2006, Parsch , Clemente and Vogl 2012), tests of selection (Mcdonald and Kreitman 1991), and comparative genomics (Chen ). The Drosophila offer an excellent model for comparative genomics, with high-quality sequenced genomes for 12 species(Drosophila Twelve Genomes Consortium 2007) as well as draft genomes for an additional eight species (Chen ) spanning a total of 63 million years (Tamura ). Previous gene models provided for the 12 Drosophila genomes focused on gene prediction with the aid of homology to establish putative annotations of coding sequences across taxa with 15,000−16,000 genes for most species (Drosophila Twelve Genomes Consortium 2007). These gene models produce reliable annotations for conserved genes as well as genes that are present in multiple species but will lack lineage specific genes where sources of genetic novelty add to the genome, will misidentify rapidly evolving sequences, and will neglect isoforms that may offer alternative molecular functions. Recent work has validated gene models in D. melanogaster through cross-species comparisons (Chen ). Although aligned CDS sequences often display patterns of expression consistent with conservation, gene structure varies across taxa at UTRs, introns, and noncoding DNA (Chen ). Furthermore, any gene families and functional classes subject to rapid evolution in gene structure that is unlikely to be reflected in homology-based annotations (Wasbrough ) and de novo genes also will be absent, despite their role in developing functional diversity (Zhao ). Beyond the evolution of gene structure, expression patterns of genes across tissues is expected to influence their evolutionary constraints and rates of sequence evolution (Van Dyken and Wade 2010) as well as their functional roles within the organism. Sex-biased genes are often rapidly evolving (Ranz ; Ellegren and Parsch 2007), especially among male reproductive proteins (Haerty ; Zhang ; Meisel 2011). Moreover, sex-specific expression patterns can influence the extent to which genes contribute to sexual antagonism, influencing their role in the evolution of sexual dimorphism Mank . Correctly identifying sex-specific and tissue specific expression is therefore essential for studying the molecular and evolutionary impacts of genes within species. Changes in tissue specific or sex-specific expression may point to rapid evolution of gene functions and candidate loci to search for signals of evolutionary change and differential selective effects between the sexes. Here, we describe RNA-seq based gene annotations for Drosophila outgroup species D. simulans, D. yakuba, and D. ananassae based on whole transcriptome sequencing of male and female adult tissues. These revised gene models capture hundreds of lineage-specific genes on major chromosomal arms in D. yakuba and D. simulans. We also describe 5′ and 3′ UTRs, and intron-exon structure for genes throughout the D. melanogaster group. Finally, we describe sex-biased expression across species, identifying thousands of genes that are differentially expressed across tissues. These revised gene models as well as results of sex-biased and tissue-biased differential expression testing should serve as a resource for the Drosophila evolutionary and molecular genetics community interested in evolution, conservation, and gene expression.

Materials and Methods

Sample preparation

Fly stocks were incubated under controlled conditions at 25° and ~40% humidity. Virgin flies were collected within 2 hr of eclosion, then aged 2−5 d posteclosion before dissection. We dissected samples in isotonic Ringers solution using female ovaries and headless gonadectomized carcass from two adult flies as well as testes plus glands and male headless gonadectomized carcass for four adult flies for each sample RNA prep. We collected three biological replicates of the D. yakuba reference, three biological replicates of the D. simulans w501 reference, and one replicate of the D. ananassae reference and one replicate of the D. melanogaster reference (stock numbers in Table 1). Samples were flash frozen in liquid nitrogen immediately after dissection, and and stored in 0.2 mL of Trizol at −80°. All samples were homogenized in 0.5 mL of Trizol Reagent (Invitrogen) with plastic pestle in a 1.5-mL tube, mixed with 0.1 mL of chloroform, and centrifuged 12,000g for 15 min at 4°C, as Trizol RNA extraction protocol. The RNAs in the supernatant about 0.4 mL were then collected and purified with Direct-Zol RNA MiniPrep Kit (Zymo), following the standard protocol. The total RNAs were eluted in 65 μL of RNase-Free H2O. Approximately 1 μg of purified RNAs were treated with 2 μL of Turbo DNase (Invitrogen) in a 65-μL reaction, incubated for 15 min at room temperature with gentle shaking. These RNAs were further purified with RNA Clean and Concentrator-5 (Zymo). One extra wash with fresh 80% ethanol after the final wash step was added into the original protocol. The treated RNAs were eluted with 15 μL of RNAse-Free H2O, and stored at −80°.
Table 1

Fly stocks used for RNA-seq

SpeciesStrain
D. melanogaster14021-0231.36
D. simulansw501
D. yakuba14021-0261.01
D. ananassae14024-0371.13
The amplified cDNAs were prepared from 100 ng of DNase treated RNA with Ovation RNA-Seq System V2 (Nugen) and modified protocol. The preparations followed the protocol to the step of SPIA Amplification (Single Primer Isothermal Amplification). The amplified cDNAs were first purified with Purelink PCR Purification Kit (Invitrogen, HC Binding Buffer) and eluted in 100 μL of EB (Invitrogen). These cDNAs were purified again to 25 μL of Buffer EB with DNA Clean and Concentrator -5 Kit (Zymo) for Nextera library preparation. About 43 ng of cDNA was used to construct libraries with Nextera DNA Sample Preparation Kit (Illumina) and modified protocol. After tagmentation, a Purelink PCR Purification Kit with HC Binding Buffer was used for purification and eluted with 30μL of EB or H2O. The products (libraries) of final polymerase chain reaction (PCR) amplification were purified with DNA Clean and Concentractor-5 and eluted in 20 μL of EB. The average library lengths of approximately 500 bp were estimated from profiles on a Bioanalyzer 2100 (Agilent) with DNA HS Assay. All libraries were normalized to 2−10 nM based on the real-time PCR method with Kapa Library Quant Kits (Kapa Biosystems). The qualities and quantities of these RNAs, cDNAs and final libraries were measured from Bioanalyzer with the Qubit RNA HS or DNA HS Assay Kit (Invitrogen) with RNA HS or DNA HS Reagents, respectively. Samples were barcoded and sequenced using 76-bp reads in 4-plex on an Illumina HiSequation 2500 using standard Illumina barcodes, resulting in high coverage (Figure S1, Figure S2, Figure S3, Figure S4, Figure S5, Figure S6). Coverage in RNA-seq data is dependent on expression level, but the majority of sites have coverage less than 50 reads. One replicate of female tissues was sequenced using single end reads. All other libraries were sequenced using paired end reads and dual indexing. Samples were sequenced as they became available, across the equivalent of one eight lane flow cell for the 32 samples in total.

Trinity and Augustus gene annotation

RNA sequencing data for ovary, female carcass, testes, and male carcass of D. yakuba, D. simulans, and D. ananassae were concatenated into a single fastq file and digitally normalized to remove excess redundant reads for highly expressed transcripts. This step results in tractable runtimes with no loss of information in transcript annotations. We ran Trinity (http://trinityrnaseq.sourceforge.net/; Grabherr ; Haas ). For a single sample: FASTQ files for left and right reads were concatenated. The concatanated files were subject to “digital normalization” using the following command in the Trinity package: normalize_by_kmer_coverage.pl --seqType fa --JM 100G --max_cov 30 --left left.fa --right right.fa --pairs_together --PARALLEL_STATS --JELLY_CPU $CORES where $CORES is how many CPU we had available The resulting normalized left and right fastq files where then used in the following command: Trinity.pl --seqType fa --bflyHeapSpaceInit 1G --bflyHeapSpaceMax 8G --JM 7G --left $LEFTFILE --right $RIGHTFILE --output trinity_output where the variables are the normalized fastq files and the number of cores available. Further detail is at https://github.com/ThorntonLab/annotation_methods The resulting annotations were used as input in the Augustus v2.5.5 gene prediction software http://bioinf.uni-greifswald.de/augustus/ with command line options --species=fly --hintsfile=$INFILE_BASE.hints.E.gff --extrinsicCfgFile=augustus.2.5.5/config/extrinsic/extrinsic.ME.cfg $INFILE --gff3=on --uniqueGeneId=true > $INFILE_BASE.gff3

Alignment and annotation

We matched Augustus gene models to previous annotations from FlyBase r1.3 for D. ananassae, and D. yakuba, or to the D. simulans w501 reference annotations. CDS sequences were required to physically overlap with the location of a current FlyBase or w501 gene model and were required to have matches to 85% or more CDS sequence with 90% or greater amino acid similarity in an all-by-all BLASTp of translated sequences at a cutoff of E ≤ 10−10 with low-complexity filters turned off (-F F). We mapped RNA-seq data to known gene models annotated in FlyBase, D. yakuba r1.3, D. ananassae r1.3, D. melanogaster r.5.45, and the D. simulans w501 gene models annotated by Hu et al. (2012). GFF Files were reformatted using gffread from Cufflinks suite. Sequences were mapped to the genome using Tophat v.2.0.6 (Trapnell , Kim ) and Bowtie2 v.2.0.2 (Langmead ), using reference annotations as a guide, with no attempt to identify novel transcripts using reads which fell outside reference annotations (-G) and all other parameters set to default. We used Cufflinks 2.0.2 (Trapnell ) to calculate expression levels across genes and transcripts, normalizing expression by the upper quantile (-N) and ignoring reads which fall outside known gene models (-G) with all other options set to default. Orphaned gene models from FlyBase or from Hu et al. (2012) which had fragments per kilo bases of exons for per million mapped read (FPKM) ≥2 but which had no assembled gene model match from Augustus were included in the final annotations used for differential expression testing. Some annotations contain polycistronic transcripts encompassing multiple independent open reading frames. A portion of these polycistronic transcripts may reflect only low-level polycistronic transcription, rather than polycistronic transcripts serving as the dominant isoforms but the rate of polycistronic transcription cannot be readily determined with available data. For genes with polycistronic transcripts but no 1:1 transcript match with FlyBase gene models, we included annotations for both the polycistronic Augustus gene models and the gene models from FlyBase supporting independent transcripts.

Sex-bias and tissue bias

The union of gene models from Augustus and orphaned gene models from FlyBase were combined into a single GFF containing transcript and CDS annotations for each species. We then re-mapped RNA-seq reads to gene models in the reannotated GFF for D. yakuba, D. ananassae, and D. simulans, as well as unmodified gene models for D. melanogaster r.5.45 with Tophat and performed differential expression testing at a false discovery rate ≤0.1 normalizing expression by the upper quantile (-N) and ignoring reads that fall outside known gene models (-G) with all other options set to default using the Cufflinks suite according to the same criteria described previously. We compared female carcass with female ovaries, male carcass with male testes, female carcass with male carcass, and female ovaries with male testes for each species, grouping replicates for reference genomes.

Orthologs and lineage specific genes

Orthologs were identified using fuzzy reciprocal best hit BLASTp comparisons of all translations across reference genomes for gene model predictions of D. ananassae, D. yakuba, D. simulans, as well as D. melanogaster. Orthologs are similar to those previously used to annotate gene families in Drosophila (Drosophila Twelve Genomes Consortium 2007, Hahn ). Putative orthologs must be putative reciprocal hits of the same rank order, where genes with an E-value within a single log-unit of one another are assigned the same rank, using the best E-Value for a gene with a cutoff of E ≤ 10−10. Lineage-specific genes were defined as genes with no hit in an all-by-all BLASTp of translations against translations of the other outgroups (e.g., D. yakuba, D. ananassae, and D. melanogaster for D. simulans) at a cutoff of E ≤ 10−10 with low-complexity filters turned off (-F F).

Gene ontology

We used DAVID gene ontology analysis software http://david.abcc.ncifcrf.gov/ to determine whether any functional categories were overrepresented among genes with sex specific or tissue specific expression. Functional data for D. ananassae, D. yakuba, and D. simulans are not readily available in many cases, and thus we identified functional classes in the D. melanogaster orthologs as classified in Flybase. Gene ontology clustering threshold was set to Low. Genes with tissue specific expression were based on genes with differential expression from cufflinks for comparisons of female carcass vs. female ovaries and male carcass vs. male testes at a genomewide false discovery rate ≤0.10, according to cufflinks default settings.

Data access

Gene annotations, ortholog calls, gene ontology calls, and Cuffdiff differential expression testing output files for all samples are available at http://github.com/ThorntonLab/GFF. RNA-seq−based annotations as well as first=order orthologs in comparison with D. melanogaster can be viewed in the UCSC browser on the Thornton Lab public track hub at http://genome.ucsc.edu. Sequencing fastq files were deposited in SRA with accession numbers PRJNA196536, PRJNA193071, PRJNA257286, and PRJNA257287.

Results

Annotations

For moderately to highly expressed genes, we recover gene structure with intron-exon boundaries and UTR sequences for full-length transcripts including novel exons that were previously unannotated based on comparative genomics (Figure 1 and Figure 2). Many genes are part of polycistronic transcripts including one from D. melanogaster. We identify 2529 putative polycistronic transcripts in D. yakuba, 2379 in D. ananassae, and 561 in D. simulans. For such genes, we offer gene models from FlyBase as well as fused models from Augustus. The extent to which such transcription of multiple genes is functional as opposed to a stochastic byproduct of transcriptional errors is unclear. We also include FlyBase gene models expressed in the reference genomes with no 1:1 match in gene models from Augustus. The addition of FlyBase gene models results in an additional 4265 annotations in D. yakuba, 5367 in D. ananassae, and 7419 in D. simulans. We identify a total of 22,989 transcripts for 16,278 genes in D. simulans, 20,315 transcripts for 17,579 genes in D. yakuba, and 22,420 transcripts for 20,580 genes in D. ananassae (Table 2). Compared with D. yakuba, for D. ananassae an additional 1173 FlyBase gene models failed to match sufficiently well with RNA-seq supported gene models and these were added to the annotations, explaining some of the excess in the number of genes and also highlighting the difficulties of annotation through comparative genomics across large phylogenetic distances. In D. ananassae, 72% of gene models have RNA-seq data supporting 60% or more gene features (exons, UTRs) compared with 79.4% in D. yakuba and 80.1% in D. simulans (Table 3).
Figure 1

Quantile-normalized coverage for reference strains at the Adh locus in D. yakuba. Coverage shows clear distinctions between introns and exons, and coverage that spans both 5′ and 3′ UTRs in ovaries and carcass. Low coverage of intron sequence points to partial sequencing of low levels of unprocessed transcripts. Top, carcass; bottom, ovaries.

Figure 2

Quantile-normalized RNA-seq data for three replicates of the D. yakuba reference with an example of FlyBase gene model (white) and revised gene model (gray).

Table 2

Number of transcripts and genes identified

SpeciesPrevious ReleaseTranscripts
Genes
RevisedFlyBaseRevisedFlyBase
D. simulansr1.318,78115,41516,27815,413
D. yakubar1.320,23916,08217,57916,077
D. ananassaer1.322,41815,07020,58015,069
Table 3

Percent of revised gene models with ≥60% of features supported by RNA-seq data

SpeciesPercent Suported
D. ananassae72.3
D. yakuba79.4
D. simulans80.05
Quantile-normalized coverage for reference strains at the Adh locus in D. yakuba. Coverage shows clear distinctions between introns and exons, and coverage that spans both 5′ and 3′ UTRs in ovaries and carcass. Low coverage of intron sequence points to partial sequencing of low levels of unprocessed transcripts. Top, carcass; bottom, ovaries. Quantile-normalized RNA-seq data for three replicates of the D. yakuba reference with an example of FlyBase gene model (white) and revised gene model (gray). As defined by a fuzzy reciprocal best-hit blast (Drosophila Twelve Genomes Consortium 2007, Hahn ) we identify 12,127 genes in D. melanogaster with first-order orthologs in D. simulans, 11,425 with first-order orthologs in D. yakuba, and 11,348 with first-order orthologs in D. ananassae (Table 4). The increase in the number of genes with orthologs in D. simulans is the product of improved annotations as well as the improved assembly of the w501 D. simulans reference (Hu et al. 2012). We observe a 1:1 concordance for 48% of FlyBase gene models in D. yakuba and 46% of FlyBase gene models for D. ananassae. These annotations typically include UTR sequences in addition to empirically supported intron-exon and coding sequence boundaries, an improvement over previous gene models from release r1.3 which lack UTRs (Drosophila Twelve Genomes Consortium 2007). We further identify thousands of lineage specific genes in each species of Drosophila with no matching gene model in other outgroup species. Although lineage-specific genes identified on minor chromosomes could result from assembly issues, we identify hundreds on major chromosomal arms (Table 5), suggesting that many of these are in fact cases of lineage-specific gene formation.
Table 4

Genes with a first-order ortholog identified

Genes inWith an Ortholog inRevisedFlyBase
D. melanogasterD. simulans12,19910,705
D. melanogasterD. yakuba11,47211,556
D. melanogasterD. ananassae11,45110,938
D. simulansD. melanogaster13,295
D. simulansD. yakuba12,299
D. simulansD. ananassae11,994
D. yakubaD. melanogaster12,868
D. yakubaD. simulans12,831
D. yakubaD. ananassae12,337
D. ananassaeD. melanogaster12,897
D. ananassaeD. simulans12,612
D. ananassaeD. yakuba12,723
Table 5

Putative Lineage Specific Genes on Major Chromosomes

SpeciesMajor chromosomesTotalDiff Exp
D. ananassae2977118
D. yakuba2301340334
D. simulans3691314222

Sex-biased and tissue-biased expression

We observe thousands of genes with sex-biased or tissue-biased expression in D. yakuba and D. simulans but hundreds of genes with sex-biased or tissue-biased expression in D. melanogaster and D. ananassae, a direct product of increasing power to detect differences in RNA levels with biological replicates. Gene ontology categories overrepresented between ovary and female carcass reflect differences in genes involved in reproduction, chromosome segregation, and DNA synthesis or repair, whereas genes differentially expressed between testes and carcass reflect sperm development, cell division, and energy production (Supporting Information, File S1). We used reciprocal best hit orthologs to identify genes with similar regulation across species, focusing on D. yakuba and D. simulans where biological replicates increase power for differential expression testing (Table 6). A total of 10,369 genes in D. yakuba have reciprocal best-hit orthologs in D. simulans and were retained to compare differential expression across the two species. We have collected replicates for both D. yakuba and D. simulans, contributing to the greater power in differential expression testing. Roughly 60% of genes with tissue biased expression in D. yakuba that have a reciprocal best hit ortholog in D. simulans exhibit the same tissue-specific bias in D. simulans, with marginally greater agreement in genes biased toward the carcass than the reproductive tissues in both males and females (Figure 3). We additionally observe evidence of differential expression in at least one of the four comparisons of male and female germline or somatic tissues for 118 lineage specific genes in D. ananassae, 334 in D. yakuba, and 222 in D. simulans (Table 5), suggesting that they are not solely artifacts of gene annotation software.
Table 6

Differentially Expressed Genes by Tissue and Species at FDR ≤0.1

SpeciesTissueTissueSignificantTested
D. ananassaeFemale ovaryFemale carcass2037537
Female ovaryMale testes2008282
Male carcassMale testes10138349
Male carcassFemale carcass1758417
D. yakubaFemale ovaryFemale carcass54208689
Female ovaryMale testes586810,202
Male carcassMale testes306510,412
Male carcassFemale carcass7249430
D. simulansFemale ovaryFemale carcass50538967
Female ovaryMale testes574110,222
Male carcassMale testes462810,679
Male carcassFemale carcass6119566
D. melanogasterFemale ovaryFemale carcass11211,326
Female ovaryMale testes37012,890
Male carcassMale testes22013,268
Male carcassFemale carcass28612,502
Figure 3

Genes with tissue biased expression in both D. yakuba and D. simulans in (A) female ovary, (B) female carcass, (C) male testes, and (D) male carcass. Numbers shown include only genes with a reciprocal best-hit ortholog in the sister species.

Genes with tissue biased expression in both D. yakuba and D. simulans in (A) female ovary, (B) female carcass, (C) male testes, and (D) male carcass. Numbers shown include only genes with a reciprocal best-hit ortholog in the sister species.

Discussion

Gene models and ortholog calls

Correct interpretations of gene expression changes and gene family evolution depend on accurate gene models. Among the Drosophila, D. melanogaster has received the most attention with empirically verified gene annotations through high-throughput expressed sequence tags and RNA-seq data as well as detailed manual or molecular curation of single genes. Until present, gene models for outgroup species offer only CDS sequences, with no information concerning 5′ or 3′ UTRs or alternative isoforms even for well-studied genes such as . Establishing more complete gene models based on RNA-seq data will allow us to correctly identify coding and noncoding sequences during functional assays, correctly identify putatively neutral vs. non-neutral mutations, and correctly define new mutations including the origins of new genes, expansion of gene families, and gross modification of coding sequences through rearrangement, duplication, and deletion. Moreover, identifying putative isoforms provides a more complete portrait of gene structure and function across species. Here, we provide updated gene annotations based on high coverage RNA-seq data for the reference genomes of three species of Drosophila: D. ananassae, D. yakuba, and D. simulans, which should serve as an excellent springboard and initial resource to the Drosophila molecular and evolutionary genetic community. Additionally, lineage specific genes are unlikely to be identified in previous annotation efforts that focused largely on conserved amino acid sequences. With RNA-seq annotations we can identify lineage specific genes, which are expected to be important in the evolution of genomes and emergence of genetic novelty. The increased power to identify genes independently from conservation is a major step toward studying the evolution of genome content and rapidly evolving gene sequences. The ability to identify transcribed genes independently from conservation will facilitate evolutionary and functional analyses of the most rapidly changing segments of the genome. There may be additional gene models and isoforms in other tissue types or time points, which deserves to be explored. Particularly, these annotations are unlikely to reflect the full diversity of sequences and isoforms expressed during embryonic development postfertilization, pupal development, or in larvae. These alternative time points deserve future exploration and the gene models offered here will serve as a lower bound on the full diversity of sequences that are transcribed within each species.

Polycistronic genes

We observe thousands of putatively polycistronic annotations in D. ananassae and D. yakuba, as well as hundreds in D. simulans w501 where fewer gene model annotations were previously aligned and power to identify polycistronic genes is limited. In D. melanogaster, hundreds of genes are known to show signs of polycistronic transcription (Lin ), offering a means of co-regulation across genes with similar functions (Blumenthal 1998; Slone ). With very high sequencing coverage, we are able to recover a greater number of polycistronic transcripts, though some of these may be false positives resulting from annotation algorithms. Some genes may differ in the frequency with which they are transcribed as polycistronic vs. independent transcripts, but these results imply that at least low levels of polycistronic transcription may be common for many genes in the genome and future validation may explore the extent of their functionality. and show evidence of differing polycistronic status across Drosophila species (Betran and Ashburner 2000) and plant genomes are thought to split and fuse genes at rates of roughly 10−11−10−10 per gene per year (Nakamura ). Switching the rates at which genes are cotranscribed has the potential to alter regulatory patterns across species (Blumenthal 1998; Slone ) and thereby produce novel phenotypes. These differences in polycistronic transcription therefore represent potential sources of genetic change that may be important in evolutionary change.

Tissue-specific expression

We identify thousands of genes that are differentially expressed across tissues in D. yakuba and D. simulans, where biological replicate samples for each tissue are available. A large fraction of the genome appears to display tissue-biased expression between germiline and carcass, with many fewer genes showing differential expression between male and female gonadectomized carcass, consistent with early microarry-based assays in D. melanogaster (Parisi ; 2004). We have sequenced samples to extremely high coverage, generating transcript annotations with 5′ and 3′ UTRs with empirically supported intron-exon structures, improving accuracy in differential expression testing. Biological replicates for D. yakuba and D. simulans result in much greater power to identify differentially expressed genes in comparison with D. ananassae and D. melanogaster. However, even in D. ananassae and D. melanogaster, where only one replicate was available per tissue, we are still able to identify hundreds of genes that are differentially expressed across tissues with high coverage. A comparison of orthologs between D. yakuba and D. simulans where biological replicates should result in sufficient power to detect differential expression reliably across both species reveals that roughly 60% of genes with reciprocal best hit orthologs exhibit similar tissue biased expression between the two species. The remaining 30% represent either genes that are differentially regulated between tissues but with effect sizes beyond the limits of detection, or genes that have evolved independent expression patterns between the two species. We also observe tissue biased expression in hundreds of lineage specific genes, which may represent candidates for neofunctionalization and new gene origination. These genes that display changes in sex-biased or tissue-biased expression across taxa, as well as lineage specific genes that exhibit tissue-biased and sex-biased expression are important candidate loci for evolutionary change in genome content and function that will be useful in exploring the functional and selective impacts of genomic changes.
  33 in total

1.  Ubiquitous selective constraints in the Drosophila genome revealed by a genome-wide interspecies comparison.

Authors:  Daniel L Halligan; Peter D Keightley
Journal:  Genome Res       Date:  2006-06-02       Impact factor: 9.043

Review 2.  The evolution of sex-biased genes and sex-biased gene expression.

Authors:  Hans Ellegren; John Parsch
Journal:  Nat Rev Genet       Date:  2007-08-07       Impact factor: 53.242

3.  Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans.

Authors:  A Fire; S Xu; M K Montgomery; S A Kostas; S E Driver; C C Mello
Journal:  Nature       Date:  1998-02-19       Impact factor: 49.962

4.  Rate and polarity of gene fusion and fission in Oryza sativa and Arabidopsis thaliana.

Authors:  Yoji Nakamura; Takeshi Itoh; William Martin
Journal:  Mol Biol Evol       Date:  2006-10-11       Impact factor: 16.240

5.  Constraint and turnover in sex-biased gene expression in the genus Drosophila.

Authors:  Yu Zhang; David Sturgill; Michael Parisi; Sudhir Kumar; Brian Oliver
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

6.  Revisiting the protein-coding gene catalog of Drosophila melanogaster using 12 fly genomes.

Authors:  Michael F Lin; Joseph W Carlson; Madeline A Crosby; Beverley B Matthews; Charles Yu; Soo Park; Kenneth H Wan; Andrew J Schroeder; L Sian Gramates; Susan E St Pierre; Margaret Roark; Kenneth L Wiley; Rob J Kulathinal; Peili Zhang; Kyl V Myrick; Jerry V Antone; Susan E Celniker; William M Gelbart; Manolis Kellis
Journal:  Genome Res       Date:  2007-11-07       Impact factor: 9.043

7.  Adaptive protein evolution at the Adh locus in Drosophila.

Authors:  J H McDonald; M Kreitman
Journal:  Nature       Date:  1991-06-20       Impact factor: 49.962

8.  Evolution in the fast lane: rapidly evolving sex-related genes in Drosophila.

Authors:  Wilfried Haerty; Santosh Jagadeeshan; Rob J Kulathinal; Alex Wong; Kristipati Ravi Ram; Laura K Sirot; Lisa Levesque; Carlo G Artieri; Mariana F Wolfner; Alberto Civetta; Rama S Singh
Journal:  Genetics       Date:  2007-11       Impact factor: 4.562

9.  Sugar receptors in Drosophila.

Authors:  Jesse Slone; Joseph Daniels; Hubert Amrein
Journal:  Curr Biol       Date:  2007-10-04       Impact factor: 10.834

10.  Evolution of genes and genomes on the Drosophila phylogeny.

Authors:  Andrew G Clark; Michael B Eisen; Douglas R Smith; Casey M Bergman; Brian Oliver; Therese A Markow; Thomas C Kaufman; Manolis Kellis; William Gelbart; Venky N Iyer; Daniel A Pollard; Timothy B Sackton; Amanda M Larracuente; Nadia D Singh; Jose P Abad; Dawn N Abt; Boris Adryan; Montserrat Aguade; Hiroshi Akashi; Wyatt W Anderson; Charles F Aquadro; David H Ardell; Roman Arguello; Carlo G Artieri; Daniel A Barbash; Daniel Barker; Paolo Barsanti; Phil Batterham; Serafim Batzoglou; Dave Begun; Arjun Bhutkar; Enrico Blanco; Stephanie A Bosak; Robert K Bradley; Adrianne D Brand; Michael R Brent; Angela N Brooks; Randall H Brown; Roger K Butlin; Corrado Caggese; Brian R Calvi; A Bernardo de Carvalho; Anat Caspi; Sergio Castrezana; Susan E Celniker; Jean L Chang; Charles Chapple; Sourav Chatterji; Asif Chinwalla; Alberto Civetta; Sandra W Clifton; Josep M Comeron; James C Costello; Jerry A Coyne; Jennifer Daub; Robert G David; Arthur L Delcher; Kim Delehaunty; Chuong B Do; Heather Ebling; Kevin Edwards; Thomas Eickbush; Jay D Evans; Alan Filipski; Sven Findeiss; Eva Freyhult; Lucinda Fulton; Robert Fulton; Ana C L Garcia; Anastasia Gardiner; David A Garfield; Barry E Garvin; Greg Gibson; Don Gilbert; Sante Gnerre; Jennifer Godfrey; Robert Good; Valer Gotea; Brenton Gravely; Anthony J Greenberg; Sam Griffiths-Jones; Samuel Gross; Roderic Guigo; Erik A Gustafson; Wilfried Haerty; Matthew W Hahn; Daniel L Halligan; Aaron L Halpern; Gillian M Halter; Mira V Han; Andreas Heger; LaDeana Hillier; Angie S Hinrichs; Ian Holmes; Roger A Hoskins; Melissa J Hubisz; Dan Hultmark; Melanie A Huntley; David B Jaffe; Santosh Jagadeeshan; William R Jeck; Justin Johnson; Corbin D Jones; William C Jordan; Gary H Karpen; Eiko Kataoka; Peter D Keightley; Pouya Kheradpour; Ewen F Kirkness; Leonardo B Koerich; Karsten Kristiansen; Dave Kudrna; Rob J Kulathinal; Sudhir Kumar; Roberta Kwok; Eric Lander; Charles H Langley; Richard Lapoint; Brian P Lazzaro; So-Jeong Lee; Lisa Levesque; Ruiqiang Li; Chiao-Feng Lin; Michael F Lin; Kerstin Lindblad-Toh; Ana Llopart; Manyuan Long; Lloyd Low; Elena Lozovsky; Jian Lu; Meizhong Luo; Carlos A Machado; Wojciech Makalowski; Mar Marzo; Muneo Matsuda; Luciano Matzkin; Bryant McAllister; Carolyn S McBride; Brendan McKernan; Kevin McKernan; Maria Mendez-Lago; Patrick Minx; Michael U Mollenhauer; Kristi Montooth; Stephen M Mount; Xu Mu; Eugene Myers; Barbara Negre; Stuart Newfeld; Rasmus Nielsen; Mohamed A F Noor; Patrick O'Grady; Lior Pachter; Montserrat Papaceit; Matthew J Parisi; Michael Parisi; Leopold Parts; Jakob S Pedersen; Graziano Pesole; Adam M Phillippy; Chris P Ponting; Mihai Pop; Damiano Porcelli; Jeffrey R Powell; Sonja Prohaska; Kim Pruitt; Marta Puig; Hadi Quesneville; Kristipati Ravi Ram; David Rand; Matthew D Rasmussen; Laura K Reed; Robert Reenan; Amy Reily; Karin A Remington; Tania T Rieger; Michael G Ritchie; Charles Robin; Yu-Hui Rogers; Claudia Rohde; Julio Rozas; Marc J Rubenfield; Alfredo Ruiz; Susan Russo; Steven L Salzberg; Alejandro Sanchez-Gracia; David J Saranga; Hajime Sato; Stephen W Schaeffer; Michael C Schatz; Todd Schlenke; Russell Schwartz; Carmen Segarra; Rama S Singh; Laura Sirot; Marina Sirota; Nicholas B Sisneros; Chris D Smith; Temple F Smith; John Spieth; Deborah E Stage; Alexander Stark; Wolfgang Stephan; Robert L Strausberg; Sebastian Strempel; David Sturgill; Granger Sutton; Granger G Sutton; Wei Tao; Sarah Teichmann; Yoshiko N Tobari; Yoshihiko Tomimura; Jason M Tsolas; Vera L S Valente; Eli Venter; J Craig Venter; Saverio Vicario; Filipe G Vieira; Albert J Vilella; Alfredo Villasante; Brian Walenz; Jun Wang; Marvin Wasserman; Thomas Watts; Derek Wilson; Richard K Wilson; Rod A Wing; Mariana F Wolfner; Alex Wong; Gane Ka-Shu Wong; Chung-I Wu; Gabriel Wu; Daisuke Yamamoto; Hsiao-Pei Yang; Shiaw-Pyng Yang; James A Yorke; Kiyohito Yoshida; Evgeny Zdobnov; Peili Zhang; Yu Zhang; Aleksey V Zimin; Jennifer Baldwin; Amr Abdouelleil; Jamal Abdulkadir; Adal Abebe; Brikti Abera; Justin Abreu; St Christophe Acer; Lynne Aftuck; Allen Alexander; Peter An; Erica Anderson; Scott Anderson; Harindra Arachi; Marc Azer; Pasang Bachantsang; Andrew Barry; Tashi Bayul; Aaron Berlin; Daniel Bessette; Toby Bloom; Jason Blye; Leonid Boguslavskiy; Claude Bonnet; Boris Boukhgalter; Imane Bourzgui; Adam Brown; Patrick Cahill; Sheridon Channer; Yama Cheshatsang; Lisa Chuda; Mieke Citroen; Alville Collymore; Patrick Cooke; Maura Costello; Katie D'Aco; Riza Daza; Georgius De Haan; Stuart DeGray; Christina DeMaso; Norbu Dhargay; Kimberly Dooley; Erin Dooley; Missole Doricent; Passang Dorje; Kunsang Dorjee; Alan Dupes; Richard Elong; Jill Falk; Abderrahim Farina; Susan Faro; Diallo Ferguson; Sheila Fisher; Chelsea D Foley; Alicia Franke; Dennis Friedrich; Loryn Gadbois; Gary Gearin; Christina R Gearin; Georgia Giannoukos; Tina Goode; Joseph Graham; Edward Grandbois; Sharleen Grewal; Kunsang Gyaltsen; Nabil Hafez; Birhane Hagos; Jennifer Hall; Charlotte Henson; Andrew Hollinger; Tracey Honan; Monika D Huard; Leanne Hughes; Brian Hurhula; M Erii Husby; Asha Kamat; Ben Kanga; Seva Kashin; Dmitry Khazanovich; Peter Kisner; Krista Lance; Marcia Lara; William Lee; Niall Lennon; Frances Letendre; Rosie LeVine; Alex Lipovsky; Xiaohong Liu; Jinlei Liu; Shangtao Liu; Tashi Lokyitsang; Yeshi Lokyitsang; Rakela Lubonja; Annie Lui; Pen MacDonald; Vasilia Magnisalis; Kebede Maru; Charles Matthews; William McCusker; Susan McDonough; Teena Mehta; James Meldrim; Louis Meneus; Oana Mihai; Atanas Mihalev; Tanya Mihova; Rachel Mittelman; Valentine Mlenga; Anna Montmayeur; Leonidas Mulrain; Adam Navidi; Jerome Naylor; Tamrat Negash; Thu Nguyen; Nga Nguyen; Robert Nicol; Choe Norbu; Nyima Norbu; Nathaniel Novod; Barry O'Neill; Sahal Osman; Eva Markiewicz; Otero L Oyono; Christopher Patti; Pema Phunkhang; Fritz Pierre; Margaret Priest; Sujaa Raghuraman; Filip Rege; Rebecca Reyes; Cecil Rise; Peter Rogov; Keenan Ross; Elizabeth Ryan; Sampath Settipalli; Terry Shea; Ngawang Sherpa; Lu Shi; Diana Shih; Todd Sparrow; Jessica Spaulding; John Stalker; Nicole Stange-Thomann; Sharon Stavropoulos; Catherine Stone; Christopher Strader; Senait Tesfaye; Talene Thomson; Yama Thoulutsang; Dawa Thoulutsang; Kerri Topham; Ira Topping; Tsamla Tsamla; Helen Vassiliev; Andy Vo; Tsering Wangchuk; Tsering Wangdi; Michael Weiand; Jane Wilkinson; Adam Wilson; Shailendra Yadav; Geneva Young; Qing Yu; Lisa Zembek; Danni Zhong; Andrew Zimmer; Zac Zwirko; David B Jaffe; Pablo Alvarez; Will Brockman; Jonathan Butler; CheeWhye Chin; Sante Gnerre; Manfred Grabherr; Michael Kleber; Evan Mauceli; Iain MacCallum
Journal:  Nature       Date:  2007-11-08       Impact factor: 49.962

View more
  10 in total

1.  Accumulation of endosymbiont genomes in an insect autosome followed by endosymbiont replacement.

Authors:  Eric S Tvedte; Mark Gasser; Xuechu Zhao; Luke J Tallon; Lisa Sadzewicz; Robin E Bromley; Matthew Chung; John Mattick; Benjamin C Sparklin; Julie C Dunning Hotopp
Journal:  Curr Biol       Date:  2022-06-06       Impact factor: 10.900

2.  Genomic Patterns of Geographic Differentiation in Drosophila simulans.

Authors:  Alisa Sedghifar; Perot Saelao; David J Begun
Journal:  Genetics       Date:  2016-01-22       Impact factor: 4.562

3.  Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster.

Authors:  Heather E Machado; Alan O Bergland; Katherine R O'Brien; Emily L Behrman; Paul S Schmidt; Dmitri A Petrov
Journal:  Mol Ecol       Date:  2016-01-18       Impact factor: 6.185

4.  An actin-related protein that is most highly expressed in Drosophila testes is critical for embryonic development.

Authors:  Courtney M Schroeder; Sarah A Tomlin; Isabel Mejia Natividad; John R Valenzuela; Janet M Young; Harmit S Malik
Journal:  Elife       Date:  2021-07-20       Impact factor: 8.140

Review 5.  Transcriptome Analysis in Domesticated Species: Challenges and Strategies.

Authors:  Jessica P Hekman; Jennifer L Johnson; Anna V Kukekova
Journal:  Bioinform Biol Insights       Date:  2016-02-16

6.  Retrotransposons Are the Major Contributors to the Expansion of the Drosophila ananassae Muller F Element.

Authors:  Wilson Leung; Christopher D Shaffer; Elizabeth J Chen; Thomas J Quisenberry; Kevin Ko; John M Braverman; Thomas C Giarla; Nathan T Mortimer; Laura K Reed; Sheryl T Smith; Srebrenka Robic; Shannon R McCartha; Danielle R Perry; Lindsay M Prescod; Zenyth A Sheppard; Ken J Saville; Allison McClish; Emily A Morlock; Victoria R Sochor; Brittney Stanton; Isaac C Veysey-White; Dennis Revie; Luis A Jimenez; Jennifer J Palomino; Melissa D Patao; Shane M Patao; Edward T Himelblau; Jaclyn D Campbell; Alexandra L Hertz; Maddison F McEvilly; Allison R Wagner; James Youngblom; Baljit Bedi; Jeffery Bettincourt; Erin Duso; Maiye Her; William Hilton; Samantha House; Masud Karimi; Kevin Kumimoto; Rebekah Lee; Darryl Lopez; George Odisho; Ricky Prasad; Holly Lyn Robbins; Tanveer Sandhu; Tracy Selfridge; Kara Tsukashima; Hani Yosif; Nighat P Kokan; Latia Britt; Alycia Zoellner; Eric P Spana; Ben T Chlebina; Insun Chong; Harrison Friedman; Danny A Mammo; Chun L Ng; Vinayak S Nikam; Nicholas U Schwartz; Thomas Q Xu; Martin G Burg; Spencer M Batten; Lindsay M Corbeill; Erica Enoch; Jesse J Ensign; Mary E Franks; Breanna Haiker; Judith A Ingles; Lyndsay D Kirkland; Joshua M Lorenz-Guertin; Jordan Matthews; Cody M Mittig; Nicholaus Monsma; Katherine J Olson; Guillermo Perez-Aragon; Alen Ramic; Jordan R Ramirez; Christopher Scheiber; Patrick A Schneider; Devon E Schultz; Matthew Simon; Eric Spencer; Adam C Wernette; Maxine E Wykle; Elizabeth Zavala-Arellano; Mitchell J McDonald; Kristine Ostby; Peter Wendland; Justin R DiAngelo; Alexis M Ceasrine; Amanda H Cox; James E B Docherty; Robert M Gingras; Stephanie M Grieb; Michael J Pavia; Casey L Personius; Grzegorz L Polak; Dale L Beach; Heaven L Cerritos; Edward A Horansky; Karim A Sharif; Ryan Moran; Susan Parrish; Kirsten Bickford; Jennifer Bland; Juliana Broussard; Kerry Campbell; Katelynn E Deibel; Richard Forka; Monika C Lemke; Marlee B Nelson; Catherine O'Keeffe; S Mariel Ramey; Luke Schmidt; Paola Villegas; Christopher J Jones; Stephanie L Christ; Sami Mamari; Adam S Rinaldi; Ghazal Stity; Amy T Hark; Mark Scheuerman; S Catherine Silver Key; Briana D McRae; Adam S Haberman; Sam Asinof; Harriette Carrington; Kelly Drumm; Terrance Embry; Richard McGuire; Drew Miller-Foreman; Stella Rosen; Nadia Safa; Darrin Schultz; Matt Segal; Yakov Shevin; Petros Svoronos; Tam Vuong; Gary Skuse; Don W Paetkau; Rachael K Bridgman; Charlotte M Brown; Alicia R Carroll; Francesca M Gifford; Julie Beth Gillespie; Susan E Herman; Krystal L Holtcamp; Misha A Host; Gabrielle Hussey; Danielle M Kramer; Joan Q Lawrence; Madeline M Martin; Ellen N Niemiec; Ashleigh P O'Reilly; Olivia A Pahl; Guadalupe Quintana; Elizabeth A S Rettie; Torie L Richardson; Arianne E Rodriguez; Mona O Rodriguez; Laura Schiraldi; Joanna J Smith; Kelsey F Sugrue; Lindsey J Suriano; Kaitlyn E Takach; Arielle M Vasquez; Ximena Velez; Elizabeth J Villafuerte; Laura T Vives; Victoria R Zellmer; Jeanette Hauke; Charles R Hauser; Karolyn Barker; Laurie Cannon; Perouza Parsamian; Samantha Parsons; Zachariah Wichman; Christopher W Bazinet; Diana E Johnson; Abubakarr Bangura; Jordan A Black; Victoria Chevee; Sarah A Einsteen; Sarah K Hilton; Max Kollmer; Rahul Nadendla; Joyce Stamm; Antoinette E Fafara-Thompson; Amber M Gygi; Emmy E Ogawa; Matt Van Camp; Zuzana Kocsisova; Judith L Leatherman; Cassie M Modahl; Michael R Rubin; Susana S Apiz-Saab; Suzette M Arias-Mejias; Carlos F Carrion-Ortiz; Patricia N Claudio-Vazquez; Debbie M Espada-Green; Marium Feliciano-Camacho; Karina M Gonzalez-Bonilla; Mariela Taboas-Arroyo; Dorianmarie Vargas-Franco; Raquel Montañez-Gonzalez; Joseph Perez-Otero; Myrielis Rivera-Burgos; Francisco J Rivera-Rosario; Heather L Eisler; Jackie Alexander; Samatha K Begley; Deana Gabbard; Robert J Allen; Wint Yan Aung; William D Barshop; Amanda Boozalis; Vanessa P Chu; Jeremy S Davis; Ryan N Duggal; Robert Franklin; Katherine Gavinski; Heran Gebreyesus; Henry Z Gong; Rachel A Greenstein; Averill D Guo; Casey Hanson; Kaitlin E Homa; Simon C Hsu; Yi Huang; Lucy Huo; Sarah Jacobs; Sasha Jia; Kyle L Jung; Sarah Wai-Chee Kong; Matthew R Kroll; Brandon M Lee; Paul F Lee; Kevin M Levine; Amy S Li; Chengyu Liu; Max Mian Liu; Adam P Lousararian; Peter B Lowery; Allyson P Mallya; Joseph E Marcus; Patrick C Ng; Hien P Nguyen; Ruchik Patel; Hashini Precht; Suchita Rastogi; Jonathan M Sarezky; Adam Schefkind; Michael B Schultz; Delia Shen; Tara Skorupa; Nicholas C Spies; Gabriel Stancu; Hiu Man Vivian Tsang; Alice L Turski; Rohit Venkat; Leah E Waldman; Kaidi Wang; Tracy Wang; Jeffrey W Wei; Dennis Y Wu; David D Xiong; Jack Yu; Karen Zhou; Gerard P McNeil; Robert W Fernandez; Patrick Gomez Menzies; Tingting Gu; Jeremy Buhler; Elaine R Mardis; Sarah C R Elgin
Journal:  G3 (Bethesda)       Date:  2017-08-07       Impact factor: 3.154

7.  Tandem duplications lead to novel expression patterns through exon shuffling in Drosophila yakuba.

Authors:  Rebekah L Rogers; Ling Shao; Kevin R Thornton
Journal:  PLoS Genet       Date:  2017-05-22       Impact factor: 5.917

8.  Selection shapes turnover and magnitude of sex-biased expression in Drosophila gonads.

Authors:  Carrie A Whittle; Cassandra G Extavour
Journal:  BMC Evol Biol       Date:  2019-02-20       Impact factor: 3.260

9.  High rate of translocation-based gene birth on the Drosophila Y chromosome.

Authors:  Ray Tobler; Viola Nolte; Christian Schlötterer
Journal:  Proc Natl Acad Sci U S A       Date:  2017-10-19       Impact factor: 11.205

10.  Chromosomal rearrangements as a source of new gene formation in Drosophila yakuba.

Authors:  Nicholas B Stewart; Rebekah L Rogers
Journal:  PLoS Genet       Date:  2019-09-23       Impact factor: 5.917

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.