| Literature DB >> 25552534 |
Yolanda Guillén1, Núria Rius1, Alejandra Delprat1, Anna Williford2, Francesc Muyas1, Marta Puig1, Sònia Casillas3, Miquel Ràmia3, Raquel Egea3, Barbara Negre4, Gisela Mir5, Jordi Camps6, Valentí Moncunill7, Francisco J Ruiz-Ruano8, Josefa Cabrero8, Leonardo G de Lima9, Guilherme B Dias9, Jeronimo C Ruiz10, Aurélie Kapusta11, Jordi Garcia-Mas12, Marta Gut6, Ivo G Gut6, David Torrents7, Juan P Camacho8, Gustavo C S Kuhn9, Cédric Feschotte11, Andrew G Clark13, Esther Betrán2, Antonio Barbadilla3, Alfredo Ruiz14.
Abstract
Cactophilic Drosophila species provide a valuable model to study gene-environment interactions and ecological adaptation. Drosophila buzzatii and Drosophila mojavensis are two cactophilic species that belong to the repleta group, but have very different geographical distributions and primary host plants. To investigate the genomic basis of ecological adaptation, we sequenced the genome and developmental transcriptome of D. buzzatii and compared its gene content with that of D. mojavensis and two other noncactophilic Drosophila species in the same subgenus. The newly sequenced D. buzzatii genome (161.5 Mb) comprises 826 scaffolds (>3 kb) and contains 13,657 annotated protein-coding genes. Using RNA sequencing data of five life-stages we found expression of 15,026 genes, 80% protein-coding genes, and 20% noncoding RNA genes. In total, we detected 1,294 genes putatively under positive selection. Interestingly, among genes under positive selection in the D. mojavensis lineage, there is an excess of genes involved in metabolism of heterocyclic compounds that are abundant in Stenocereus cacti and toxic to nonresident Drosophila species. We found 117 orphan genes in the shared D. buzzatii-D. mojavensis lineage. In addition, gene duplication analysis identified lineage-specific expanded families with functional annotations associated with proteolysis, zinc ion binding, chitin binding, sensory perception, ethanol tolerance, immunity, physiology, and reproduction. In summary, we identified genetic signatures of adaptation in the shared D. buzzatii-D. mojavensis lineage, and in the two separate D. buzzatii and D. mojavensis lineages. Many of the novel lineage-specific genomic features are promising candidates for explaining the adaptation of these species to their distinct ecological niches.Entities:
Keywords: cactophilic Drosophila; ecological adaptation; gene duplication; genome sequence; orphan genes; positive selection
Mesh:
Year: 2014 PMID: 25552534 PMCID: PMC4316639 DOI: 10.1093/gbe/evu291
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
F(a) Phylogenetic relationship of fruit fly species considered in our comparative analysis and their host preference. (b) Geographical distribution of cactophilic species D. buzzatii (red) and D. mojavensis (green) in America.
Summary of Assembly Statistics for the Genome of Drosophila buzzatii
| Assembly | Freeze 1 | SOAPdenovo |
|---|---|---|
| Number of scaffolds (>3 kb) | 826 | 10,949 |
| Coverage | ∼22× | ∼76× |
| Assembly size (bp) | 161,490,851 | 144,184,967 |
| Scaffold N50 index | 30 | 2,035 |
| Scaffold N50 length (bp) | 1,380,942 | 18,900 |
| Scaffold N90 index | 158 | 7,509 |
| Scaffold N90 length (bp) | 161,757 | 5,703 |
| Contig N50 index | 1,895 | 2,820 |
| Contig N50 length (bp) | 17,678 | 3,101 |
Transposable Element Content of Drosophila buzzatii Genome
| Class | Order | Annotated Base Pair | Genome Coverage (%) |
|---|---|---|---|
| I (retrotransposons) | LTR | 2,366,439 | 1.47 |
| DIRS | 55 | 0.00 | |
| LINE | 2,541,645 | 1.57 | |
| II (DNA transposons) | TIR | 2,017,167 | 1.25 |
| Helitron | 5,531,009 | 3.42 | |
| Maverick | 189,267 | 0.12 | |
| Unknown | 973,759 | 0.60 | |
| Total | 13,619,341 | 8.43 |
Note.—The classification follows Wicker et al. (2007).
Satellite DNAs Identified in the Drosophila buzzatii Genome
| Tandem repeat Family | Repeat Length | GC Content (%) | Genome Coverage (%) | Consensus Sequence | Distribution |
|---|---|---|---|---|---|
| pBuM189 | 189 | 29 | 0.039 | GCAAAAGACTCCGTCAATTA GAAAACAAAAAATGTTATAGTTTTGAGGATTAACC GGCAAAAACCGTATTATTTGTTATAT GATTTCTGTATGGAATACCGTTTTAGAA GCGTCTTTTATCGTATTACTCAGATATATCT TAAGATTTAGCATAATCTAAGAACTTTT TGAAATATTCACATTTGTCCA | |
| DbuTR198 | 198 | 34 | 0.027 | AAGGTAGAAAGGTAGTTGGTGAGATAAACCAGAAAAA GAGCTAAAAACGGCTAAAAACGGCTAGAAAATAGCCA GAAAGGTAGATTGAACATTAATGGGCAAATGG ATGGATAAATAAGACTGGTCATCATCCAA TGAACAGAATCATGATTAAGAGATAGAAATA TGATTAGAAAGTAGGATAGAAAGGTTAGAAAG |
aGenome fraction was calculated assuming a genome size of 163,547,398 bp (version 1 freeze of all contigs).
bConsensus sequence generated after clustering TRF results (see Materials and Methods).
FDevelopmental expression profile of D. buzzatii genes. (a) Number of expressed PCGs (red) and ncRNA genes (blue) along five developmental stages. (b) Classification of PCGs and ncRNA genes according to the number of stages where they are expressed.
FVenn diagram showing the number of genes putatively under positive selection detected by two different methods, SM and BSM using three different lineages as foreground branches.
GO Analysis of Putative Genes under Positive Selection Detected by Both SM and BSM
| Codon substitution Models | Lineage (Branch Number) | Number of Candidates | GO enrichment | |||||
|---|---|---|---|---|---|---|---|---|
| Molecular Function | Biological Process | Interpro Domain | ||||||
| ID | Fold Enrichment | ID | Fold Enrichment | ID | Fold Enrichment | |||
| SM | 772 | Antiporter activity | 1.77 | Regulation of transcription | 4.90 | Src homology-3 domain | 1.60 | |
| Transcription factor activity | 1.56 | |||||||
| BSM | 350 | DNA binding | 1.36 | Regulation of transcription DNA dependent | 1.36 | Immunoglobulin-like | 1.33 | |
| Phosphate metabolic process | 0.72 | |||||||
| 172 | Dopamine beta-monooxigenase activity | 2.35 | Heterocycle catabolic process | 2.35 | DOMON (DOpamine beta-MOnooxygenase N-terminal domain) | 2.35 | ||
| Cation transport | 0.98 | |||||||
| Histidine family amino acid catabolic process | 2.35 | |||||||
| Cactophilic #3 | 458 | Zinc ion binding | 2.01 | Cytoeskeleton organization | 1.67 | Zinc finger, PHD-type | 1.93 | |
| Transition metal ion binding | 2.01 | Regulation of transcription DNA dependent | 1.06 | Proteinase inhibitor I1 kazal | 2.20 | |||
| DNA binding | 1.66 | |||||||
Note.—Only categories showing an enrichment with a P value less than 1.0e-03 are included.
FPatterns of divergence in orphan and nonorphan genes. Orphan genes (blue) have significantly higher dn and ω values compared with that of nonorphan genes (red). Nonorphan genes show significantly higher ds.