| Literature DB >> 28854641 |
Katina I Spanier1,2,3, Mieke Jansen1, Ellen Decaestecker4, Gert Hulselmans2,3, Dörthe Becker5,6, John K Colbourne5, Luisa Orsini5, Luc De Meester1, Stein Aerts2,3.
Abstract
Ecological genomics aims to understand the functional association between environmental gradients and the genes underlying adaptive traits. Many genes that are identified by genome-wide screening in ecologically relevant species lack functional annotations. Although gene functions can be inferred from sequence homology, such approaches have limited power. Here, we introduce ecological regulatory genomics by presenting an ontology-free gene prioritization method. Specifically, our method combines transcriptome profiling with high-throughput cis-regulatory sequence analysis in the water fleas Daphnia pulex and Daphnia magna. It screens coexpressed genes for overrepresented DNA motifs that serve as transcription factor binding sites, thereby providing insight into conserved transcription factors and gene regulatory networks shaping the expression profile. We first validated our method, called Daphnia-cisTarget, on a D. pulex heat shock data set, which revealed a network driven by the heat shock factor. Next, we performed RNA-Seq in D. magna exposed to the cyanobacterium Microcystis aeruginosa. Daphnia-cisTarget identified coregulated gene networks that associate with the moulting cycle and potentially regulate life history changes in growth rate and age at maturity. These networks are predicted to be regulated by evolutionary conserved transcription factors such as the homologues of Drosophila Shavenbaby and Grainyhead, nuclear receptors, and a GATA family member. In conclusion, our approach allows prioritising candidate genes in Daphnia without bias towards prior knowledge about functional gene annotation and represents an important step towards exploring the molecular mechanisms of ecological responses in organisms with poorly annotated genomes.Entities:
Keywords: cis-regulation; crustacea endocrine signaling; functional enrichment; motif discovery; omics data integration; prioritization
Mesh:
Substances:
Year: 2017 PMID: 28854641 PMCID: PMC5569996 DOI: 10.1093/gbe/evx127
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Daphnia-cisTarget. (A) Transcription factors (TFs) bind to specific motifs in cis-regulatory regions to control gene expression. (B) This in vivo process can be inferred in silico by combining motif discovery and gene expression data, since genes that are coexpressed and share the same motif are likely to be regulated by the same transcription factor. (C) For a given input gene set, Daphnia-cisTarget generates a cumulative recovery curve for each motif ranking in the database (blue line left panel). The area under this curve (AUC) is calculated as a measure of enrichment (grey area), and those motifs that surpass the AUC cut-off (dotted line right panel) are listed in the output. The red line in the left panel represents the average recovery over the entire motif ranking database. This average plus two standard deviations yields the green line.
. 3.—Chronic cyanobacteria treatment in Daphnia magna. The upregulated genes (top, red network) contain nuclear receptor (NR) and ovo/Shavenbaby (ovo/Svb) motifs, along with transcription factors that potentially target those motifs (red boxes). The downregulated gene sets (bottom, blue network) are enriched for GATA motifs. (A) The heatmap shows median-centered expression levels (red: upregulation, blue: downregulation). For illustration purpose, only a selection of differentially expressed genes is displayed. The gene names refer to D. melanogaster homologies (“-”: no homology) or manual annotation (Dam*). (B) Sequence logos of Daphnia-cisTarget top scoring motifs, enrichment curve and normalized enrichment score (NES) of those motifs. The curly brackets indicate which genes in panel A are enriched for the respective motif in panel B. (C) Gene regulatory network including all putative target genes of one or several nuclear receptors, ovo/Svb and GATA factors. The node color intensity corresponds to the log2-fold change in the BX treatment (node center) and BN treatment (node border). The labels at gene subsets indicate gene ontology enrichment results. In subsets without gene ontology labels, genes arranged in left semi-circles do not have annotations based on homologies to D. melanogaster.
. 4.—Motif recovery, sequence homology and literature curation uncover links to moulting and growth regulation. Grey edges connect Daphnia-cisTarget motifs with their predicted targets, dotted purple edges indicate literature-curated interactions and solid purple edges literature connections that were confirmed by Daphnia-cisTarget. (A) Daphnia-cisTarget establishes a connection between ecdysone-signaling and the formation of cuticle and cuticular structures in D. magna. The curated network contains genes involved in trichome formation, developmental timing and cuticle formation. References to fruit fly literature: 1) Yoshiyama et al. (2006); 2) Gauhar et al. (2009); 3) Agawa et al. (2007); 4) White et al. (1997); 5) Chanut-Delalande et al. (2014); 6) Kondo et al. (2010); 7) Menoret et al. (2013); 8) Lee and Adler (2004); 9) Gangishetti et al. (2012); 10) Jang et al. (2009); 11) Ono et al. (2006). (B) The heatmap demonstrates coexpression of the genes in the network. Note the anticorrelation of ftz-f1 and its transcriptional repressor Blimp-1 (red box). (C) D. magna might react to poor food quality of cyanobacteria by downregulation of Insulin/IGF-signaling (IIS). Among the downregulated genes are many neuropeptides, including the insulin-related peptide homologue DamIrp2, a neuropeptide receptor, and proteases that activate neuropeptides through cleavage. In insects and crayfish, those genes regulate feeding motivation, nutrient storage and starvation resistance. Homologues of targets of the D. melanogaster midgut differentiation factor GATAe (Okumura et al. 2007) (arrows labeled “13”) are enriched. DamJHE and DamJHBP hint at the action of juvenile hormone (JH). (D) The heatmap demonstrates coexpression of DamGATAe (red box) with its putative target genes, although DamGATAe itself is not significantly differentially expressed. References to known interactions: 1) Honegger et al. (2008) (fruit fly); 2) Huang et al. (2015) (fruit fly); 3) Hwang et al. (2000) (fruit fly); 4) Buch et al. (2008) (fruit fly); 5) Ida et al. (2011) (fruit fly); Maeda et al. (2015) (blowfly); 6) Reiher et al. (2011) (fruit fly); 7) Britton et al. (2002) (fruit fly); 8) Veenstra (2015) (crayfish); 9) Veenstra et al. (2008) (fruit fly); Fu et al. (2007) (crab); 10) Chen et al. (2014) (crab); 11) Lorenz et al. (1995) (cricket); 12) Hua et al. (1999); Davis (2003); Yamanaka et al. (2010) (moths); 13) Okumura et al. (2007) (fruit fly); 14) Kataoka et al. (1989) (moth); 15) Kaneko and Hiruma (2015) (moth); 16) Verlinden et al. (2015) (insects); 17) Natzle et al. (1988) (fruit fly); 18) Nijhout et al. (2014) (insects); 19) Mirth et al. (2014) (insects); 20) Kethidi et al. (2005) (fruit fly); 21) Zhao and Campos (2012) (fruit fly).
. 2.—The heat shock factor as a common regulator in heat shock response. We here show Daphnia-cisTarget results for the upregulated genes in a thermal stress experiment (D. pulex exposed for 2 h to a 10 °C temperature increase, whole animal mRNA measured with microarrays). (A) Daphnia-cisTarget returns a heat shock factor (HSF) motif as the top scoring motif (the mouse motif M01244 of the TRANSFAC© Professional database [Matys et al. 2006]), followed by 14 HSF motifs derived from different species such as yeast, fruit fly and human. Those motifs are clustered as highly similar (indicated through same background color). The Daphnia-cisTarget results also show the motif logo (“enriched motif”), the Normalized Enrichment Score (NES), the recovery curve, the D. pulex HSF homologue JGI_V11_213340 derived through sequence homology as candidate transcription factor (TF) and the predicted target genes (“#targets”). The second and third best motif clusters are assigned to a basic-leucine zipper TF CrebB and a range of homeobox TFs, respectively. The remaining 122 recovered motifs that surpass the NES threshold of 2.5 are not shown. (B) The gene regulatory network controlled by the HSF in D. pulex is reconstructed by merging the 95 predicted targets of all HSF motifs recovered by Daphnia-cisTarget. Gene names are derived through homology to D. melanogaster genes; D. pulex genes mapping to the same D. melanogaster gene are numbered (e.g., slbo#1, slbo#2, etc.).
Members of Gene Regulatory Networks That are Known to be Involved in Growth-Related Processes in Arthropods, are Conserved in Daphnia magna
| Gene Symbol | Gene ID | Gene Symbol homologue | Predicted Function | |
|---|---|---|---|---|
| 20E signaling, developmental timing | mu8AUGep24bs01592g203 | 20E synthesis ( | ||
| mu8PASAgasmbl_42864 | ||||
| mu8AUGep24bs00930g63_ mu8AUGapi5s01568g77 | 20E-responsive NR ( | |||
| mu8AUGepir7p2s02190g15 | 20E-responsive NR ( | |||
| mu8AUGapi5s00642g188 | 20E-responsive NR ( | |||
| mu8AUGepir2p1s00944g59 | 20E-responsive Zn-finger TF, repressor of ftz-f1 ( | |||
| mu8AUGepir7s03025g22 | orphan NR; developmental timing, “competence factor” ( | |||
| mu8AUGepir7s00662g70 | attenuation of 20E-signaling ( | |||
| Trichome formation | mu8AUGep24b_p1s02190g182_ mu8AUGepir2s02346g268 | TF; epidermis differentiation and trichome development ( | ||
| mu8AUGepir3s00311g138 | Svb activation ( | |||
| mu8AUGepir2s00007g44 | wing hair formation ( | |||
| mu8AUGapi5s01092g326 | cuticle pattern formation ( | |||
| mu8AUGepir7p2s01581g65 | unknown | |||
| mu8AUGapi5s01036g31 | cuticular structure formation ( | |||
| mu8AUGapi5s02861g133 | ||||
| mu8AUGepir2s00005g17 | epithelial cell rearrangement ( | |||
| mu8AUGep24b_p1s00687g272 | unknown | |||
| mu8AUGepir7s03057g295 | unknown | |||
| mu8AUGapi5s03057g294 | unknown | |||
| mu8PASAgasmbl_15376 | unknown | |||
| mu8AUGapi5p1s00024g218 | wing hair formation ( | |||
| mu8AUGapi5p1s02190g301 | unknown | |||
| mu8AUGapi5s00190g30t1_ m8AUGepir3s00190g27 | cuticular structure formation ( | |||
| mu8AUGepir2s03326g79 | cuticle pattern formation ( | |||
| mu8AUGepir3s00642g197 | ||||
| mu8AUGep24bs00781g82 | cuticle development ( | |||
| mu8AUGepir7s00311g165 | wing hair formation ( | |||
| Cuticle formation | mu8AUGepir6s00018g56; Dapma7bEVm018464 | adult epidermis differentiation; cuticle organization; wound healing ( | ||
| mu8AUGepir7s01005g246 | wing hair formation ( | |||
| mu8AUGepir7s02385g127 | wing hair orientation ( | |||
| mu8AUGepir2s01005g28_ mu8AUGepir7s01005g30 | wing hair formation ( | |||
| Midgut differentiation | mu8AUGep24b_ p1s02190g273t1_ m8AUGepir7p1s02190g312 | midgut differentiation ( | ||
| mu8AUGepir7s03311g142t1_ m8PASAgasmbl_97928 | Carboxypeptidase A ( | |||
| mu8AUGepir7s02861g45 | serine-protease (Ross et al. 2003) | |||
| mu8PASAgasmbl_13209 | serine-protease homologue (Ross et al. 2003) | |||
| mu8AUGepir7s00868g268 | gut-specific trypsin ( | |||
| mu8AUGepir7s00868g265 | gut-specific trypsin ( | |||
| mu8AUGepir7s00868g262 | gut-specific trypsin ( | |||
| mu8AUGapi5s00868g254 | gut-specific trypsin ( | |||
| mu8AUGepir7s00868g263 | gut-specific trypsin ( | |||
| mu8PASAgasmbl_35335 | ||||
| mu8AUGepir7p2s00512g264 | adhesion/signaling protein regulating cellular adhesion, migration and survival ( | |||
| mu8PASAgasmbl_83547 | ||||
| Food digestion | mu8PASAgasmbl_87235 | gut-specific trypsin ( | ||
| mu8AUGepir7s03102g104 | gut-specific chymotrypsin ( | |||
| mu8AUGepir7p1s00944g14 | ||||
| mu8PASAgasmbl_39448 | ||||
| mu8PASAgasmbl_2973 | starch digestion ( | |||
| mu8PASAgasmbl_45416 | starch digestion ( | |||
| mu8AUGapi5s00725g187 | starch digestion ( | |||
| mu8PASAgasmbl_45408 | starch digestion ( | |||
| mu8AUGapi5s00261g171 | starch digestion ( | |||
| mu8AUGapi5p1s00944g273 | starch digestion ( | |||
| Neuropeptide signaling | mu8PASAgasmbl_73983 | insulin-related neuropeptide ( | ||
| mu8AUGep24bs00005g33 | target of insulin-signaling; glucosidase; influences life span, growth and viability ( | |||
| mu8AUGep24b_p1s02190g119 | suppressor of insulin signaling ( | |||
| mu8PASAgasmbl_9028 | inhibition of JH and 20E biosynthesis ( | |||
| mu8PASAgasmbl_50343 | inhibition of JH and 20E biosynthesis ( | |||
| mu8PASAgasmbl_94734 | attenuation of feeding motivation ( | |||
| mu8AUGapi5s00770g109 | RYamide receptor ( | |||
| mu8PASAgasmbl_40324 | regulates insulin-like signaling and life span; proteolytic peptide hormone activation ( | |||
| mu8PASAgasmbl_17947 | proteolytic peptide hormone activation ( | |||
| mu8PASAgasmbl_33811 | required by amon for maturation ( | |||
| JH signaling | mu8PASAgasmbl_45217 | pfam06585 | JH titre promotion ( | |
| mu8AUGepir3s03135g39 | JH degradation ( |
Drosophila melanogaster.
Daphnia magna.
Daphnia pulex.
Contains haemolymph JH binding protein domain pfam06585 (BLAST CDD).
. 5.—The moulting- and growth-related networks respond to acute as well as to chronic cyanobacteria stress. (A) The gene regulatory networks are based entirely on the acute experiment. We found two additional motifs of the transcription factors Blimp-1 and Grh. Genes encoding those factors were differentially expressed in the chronic data, but their motifs were not enriched, indicating a higher motif discover resolution in the acute data set. The midgut- and ecdysone/cuticle-related subnetworks display the same dichotomy as in the chronic data set. The subnetworks are connected by only three genes, including the egg yolk gene DamVtg2, which is known to be regulated by ecdysone and a GATA factor in mosquitoes (Martín et al. 2001; Park et al. 2006). The network displays genes that belong to the most differentially expressed genes in any of the within-cohort treatment/control comparisons and have a motif instance in at least 3 (of 12) comparisons. Edge color intensity reflects the number of comparisons in which the motif-gene connection was found, node color intensity the absolute value of log2-fold change averaged across all comparisons. Squared nodes represent genes that are significantly differentially expressed in the chronic data set. (B) The gene regulatory networks from the chronic experiment were significantly enriched in the acute experiment. The bar plot is based on gene set enrichment analysis (GSEA) enrichments in supplementary fig. S5, Supplementary Material online. It depicts how often the NR/Svb (red) and GATA (blue) modules derived from the chronic experiment were significantly up- or downregulated (up/down) or not (nonsign.) in the 12 treatment/control comparisons of the acute experiment. The control gene sets, heat shock factor (HSF) targets from the Daphnia-cisTarget validation and ribosomal proteins (ribos), were mostly not significantly enriched (grey bars).
. 6.—The interplay between ecdysone and insulin signaling might coordinate growth and nutritional input. Ecdysone, juvenile hormone, and insulin/IGF signaling are known to be tightly linked and to regulate growth and development in insects (purple arrows). Colored nodes represent genes from the chronic data set that can be mapped onto the insect interactions and connect the midgut GATA network containing neuropeptides and juvenile hormone-related genes, and the cuticle-related ecdysone network. References bold interactions: 1) Mirth et al. (2014) (insects); 2) Mu and Leblanc (2004) (D. magna). References for thin arrows: see figure 4. Please note that this figure depicts a highly simplified model of complex stage-, tissue-, and species-specific interactions. For a comprehensive review we refer the reader to recent literature (Gruntenko and Rauschenbach 2009; Nijhout 2013; Yamanaka et al. 2013; Vafopoulou 2014; Jindra et al. 2013; Dubrovsky and Bernardo 2014).