Literature DB >> 31763380

Estradiol and genistein effects on the sea bass (Dicentrarchus labrax) scales: Transcriptome dataset.

Patricia I S Pinto1, André R Andrade1, Michael A S Thorne2, M Dulce Estêvão1,3, Adelino V M Canario1, Deborah M Power1.   

Abstract

Fish scales are mineralized structures that play important roles in protection and mineral homeostasis. This tissue expresses multiple estrogen receptor subtypes and can be targeted by estrogens or estrogenic endocrine-disrupting compounds, but their effects are poorly explored. The transcriptome data here presented support the findings reported in the research article "Genistein and estradiol have common and specific impacts on the sea bass (Dicentrarchus labrax) skin-scale barrier" [1]. Juvenile sea bass were exposed to estradiol and the phytoestrogen genistein for 1 and 5 days, by intraperitoneal injections, and the effects on scale transcript expression were analysed by RNA-seq using an Illumina Hi-seq 1500. The raw reads of the 30 libraries produced have been deposited in the NCBI-SRA database with the project accession number SRP102504. Mapping of RNA-seq reads against the sea bass reference genome using the Cufflinks/TopHat package identified 371 genes that had significant (FDR<0.05) differential expression with the estradiol or genistein treatments in relation to the control scales at each exposure time, 254 of which presented more than a 2-fold change in expression. The identity of the differentially expressed genes was obtained using both automatic and manual annotations against multiple public sequence databases and they were grouped according to their patterns of expression using hierarchical clustering and heat-maps. The biological processes and KEGG pathways most significantly affected by the estradiol and/or genistein treatments were identified using Cytoscape/ClueGO enrichment analyses. Crown
Copyright © 2019 Published by Elsevier Inc.

Entities:  

Keywords:  Estrogen-responsive genes; Fish scale; Genistein-responsive genes; RNA-seq; Transcriptome

Year:  2019        PMID: 31763380      PMCID: PMC6861651          DOI: 10.1016/j.dib.2019.104587

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table This is the first comprehensive study of the transcriptome of fish scales, an estrogen-target tissue for which information is still limited and a promising practical model for environmental pollution screening and chemical risk assessment. The dataset has relevance for aquaculture and for toxicology/ecotoxicology/environmental studies, since altered levels of hormones (including synthetic, anthropogenic hormones) could affect the development and homeostasis of multiple marine species. The dataset has broad applicability as it is from the European sea bass, which is a representative of the species-rich order Perciformes and an important species for marine fisheries and aquaculture. Global transcriptomes of scales from the European sea bass, exposed for 1 and 5 days to estradiol (a natural estrogen) or genistein (a phytoestrogen) is of use for aquaculture, ichthyologists, toxicologists and comparative endocrinologists. Since estrogenic compounds and genistein can be found in aquatic environments the dataset is of use for aquaculture (e.g increased ingestion of phytoestrogens from plant-based ingredients used in fish feeds) and the comparison of genes and pathways regulated or disrupted by estradiol and genistein is of great interest for studies related to physiology, ecotoxicology/environmental studies or aquaculture in fish or other marine species. The identified sets of differentially expressed genes in sea bass scales after exposure to estradiol or genistein provide a source of potential biomarkers for assessing exposure to estrogens, phytoestrogens or other estrogenic pollutants, and this non-invasive approach complies with the 3R principal and is under validation for environmental pollution.

Data

Table 1 presents the detailed RNA-seq sequence statistics for each of the thirty replicate libraries that were constructed from sea bass scale RNA, grouped according to treatment: fish exposed to estradiol, E2 (E1d and E5d libraries, corresponding to 1 or 5 days of exposure respectively); fish exposed to genistein, Gen (Gen1d and Gen5d) and control fish (C1d and C5d). The transcriptome data released at the NCBI SRA database (Project SRP102504) contains the raw data files of the 30 RNA-seq libraries that were produced from the scales of E2 or Gen-exposed sea bass for each of the six experimental conditions: C1d (experiment with accession SRX2673957, n = 6 replicate libraries), E1d (SRX2673962, n = 5 libraries), Gen1d (SRX2674307, n = 5 libraries), C5d (SRX2674405, n = 5 libraries), E5d (SRX2674467, n = 5 libraries) and Gen5d (SRX2675383, n = 4 libraries).
Table 1

Detailed RNA-seq statistics for each library, grouped by treatment. Individual libraries (n = 4–6 from individual fish) were prepared from RNA extracted from scales of fish sampled 1 day after treatment with the vehicle (C1d), estradiol (E1d) or genistein (Gen1d) and scales sampled five days after each treatment (C5d, E5d and Gen5d, respectively). The number of raw and filtered reads (in millions), percentage of mapped reads (for each individual library or on average) and the total numbers of reads produced in the study are presented.

TreatmentLib nameRaw reads (millions)Filtered reads (millions)Mapped reads (%)
C1dC1d_136.233.089.3%
C1d_232.632.684.2%
C1d_335.735.785.6%
C1d_432.032.085.1%
C1d_532.132.186.4%
C1d_636.136.186.2%
E1dE1d_131.531.587.2%
E1d_237.637.684.6%
E1d_337.637.684.8%
E1d_439.239.284.1%
E1d_531.628.383.1%
Gen1dGen1d_141.641.684.0%
Gen1d_238.535.185.4%
Gen1d_344.444.486.1%
Gen1d_430.430.483.7%
Gen1d_539.238.084.5%
C5dC5d_140.940.984.6%
C5d_230.730.787.3%
C5d_332.132.186.6%
C5d_429.329.388.3%
C5d_545.845.885.6%
E5dE5d_135.435.485.6%
E5d_236.136.185.9%
E5d_338.826.287.0%
E5d_451.049.088.9%
E5d_544.040.891.4%
Gen5dGen5d_144.044.084.5%
Gen5d_240.940.986.1%
Gen5d_342.042.085.9%
Gen5d_444.344.385.7%
Average37.736.8
Total1131.71102.8
Detailed RNA-seq statistics for each library, grouped by treatment. Individual libraries (n = 4–6 from individual fish) were prepared from RNA extracted from scales of fish sampled 1 day after treatment with the vehicle (C1d), estradiol (E1d) or genistein (Gen1d) and scales sampled five days after each treatment (C5d, E5d and Gen5d, respectively). The number of raw and filtered reads (in millions), percentage of mapped reads (for each individual library or on average) and the total numbers of reads produced in the study are presented. A total of 749 differentially expressed (DE) genes were identified. DE genes in the scale transcriptomes of E2-treated or Gen-treated sea bass were identified by comparison with the corresponding controls at each sampling time and genes changing expression in the control groups when comparing 1 day vs 5 days of exposure. The expression levels, fold changes and identities of the 332 DE genes in sea bass scales, that had more than a two-fold change in expression (“DE ≥ 2 FC”), are listed in Supplementary Table 1. A short version of this table (containing only the first 20 genes) is displayed in Table 2. Supplementary Table 2 contains the remaining 417 DE genes (FDR < 0.05 and < 2-fold changes) and an abbreviated version is presented in Table 3. Table 4 presents the detailed results from the annotation of the total list of 749 DE genes (that were used for global enrichment analyses), as well as for the selection of the 332 DE genes with ≥ 2-fold change. Fig. 1 summarizes the preference order strategy used to automatically annotate the 332 selected DE genes using multiple databases, which was followed by careful manual curation. More details about the annotation strategy can be found in the methods below and in the methods and results of the associated JSBMB manuscript [1]. Fig. 2 summarizes the number of genes that were DE in response to the treatments (E2 or Gen) or changed expression between sampling times, when considering different stringency levels: False Discovery Rate (FDR) < 0.05 and < or ≥ 2-fold change in expression. Fig. 3 presents a heatmap showing the grouping of the six treatment groups, according to the identified transcriptome changes in sea bass scales.
Table 2

Selected list (first 20 genes) of the genes differentially expressed with a ≥2-fold difference in sea bass scales from treated versus control animals. The complete list with the expression and annotation details for the 332 genes with significant differential expression (FDR < 0.05 and FC ≥ 2) in each comparison can be consulted in Supplementary Table 1. Normalized expression levels in each experimental group are presented in fragments per kilobase of exon model per million reads mapped (fpkm) and significant differential expression for each comparison is presented in Log2 fold change (FC). The final annotation reached using the preference order strategy (Fig. 1 and Table 4) and revised by manual curation is presented, with the accession numbers from Swiss-Prot, Genbank or from the sea bass genome.

Gene ID
Relative expression levels (fpkm)
Log2FC of E2 or Gen effects vs control at 1d
Log2FC of E2 or Gen effects vs control at 5d
Log2FC of C5d vs C1d
Final gene annotation
Cufflinks#C1dE1dGen1dC5dE5dGen5dup in E2Down in E2Up in GenDown in GenE2 upE2 downGen upGen downC5d upC5d downAccessionHit descriptionSymbol
XLOC_0195555.414.837.733.75.334.01.42.8−2.72.6O75443Alpha-tectorinTECTA
XLOC_01912934.2225.7761.31735.622.01398.82.74.5−6.35.7DLAgn_00248120Mhc class ii antigen beta chainHLADPB1
XLOC_0200580.53.84.03.16.26.02.92.92.6Q5W7F1Neutral ceramidase or N-acylsphingosine amidohydrolase 2ASAH2
XLOC_01922525.263.250.825.017.322.91.31.0Q02817Mucin-2MUC2
XLOC_01944710.822.622.412.013.611.71.11.0XP_019109670.1Integumentary mucin C.1-likeMUCC1
XLOC_01908420.647.438.115.914.540.01.21.3G8HTB6ZP domain-containing protein / CUB and zona pellucida like domains 1CUZD1
XLOC_019425437.41926.91162.01718.52043.42161.02.1P28064Proteasome subunit beta type-8PSMB8
XLOC_02184692.1215.6136.4206.0144.3113.21.21.2No hit found
XLOC_0208665.019.15.90.60.41.11.9DLAgn_00214300Protein fam111a-likeFAM111A
XLOC_00929849.5135.960.473.655.870.01.5DLAgn_00110020Uncharacterized protein loc101483146hyp_loc101483146
XLOC_020865260.91274.3269.312.95.619.72.0DLAgn_00214300Protein fam111a-likeFAM111A
XLOC_0183021.63.92.21.63.51.51.3XP_006805317.1Uncharacterized protein LOC102781223hyp_LOC102781223
XLOC_0016383.94.29.23.26.87.91.21.11.3P35448Thrombospondin-1THBS1
XLOC_0099851.10.62.80.61.92.91.42.2P43300Early growth response protein 3EGR3
XLOC_0094592.82.55.82.44.46.11.01.4Q9ET55NocturninCCRN4L
XLOC_0152026.16.119.33.55.08.41.71.3Q16690Dual specificity protein phosphatase 5DUSP5
XLOC_0096468.05.619.03.07.613.21.22.2−1.4Q20A00DNA damage-inducible transcript 4-like proteinDDIT4L
XLOC_018946261.3451.9567.7482.9396.7190.11.1−1.3DLAgn_00241010Complement c1q-like protein 4 precursorC1QL4
XLOC_0119026.68.213.420.825.49.11.0−1.21.7XP_005946242.1RNA-directed DNA polymerase from mobile element jockey-likepred_pol413
XLOC_0025740.81.42.00.80.91.01.3Q61391Neprilysin / membrane metallo-endopeptidaseMME
Table 3

Selected list (first 20 genes) of the genes differentially expressed at FDR < 0.05 and at < 2-fold change difference, in sea bass scales from treated versus control animals. The complete list with the expression and annotation for the 417 genes with differential expression under these limits (FDR < 0.05 and FC < 2) can be consulted in Supplementary Table 2. Normalized expression levels in each experimental group are presented in fragments per kilobase of exon model per million reads mapped (fpkm) and significant differential expression for each comparison is presented in Log2 fold change (FC). The final annotation reached using the preference order strategy (Fig. 1 and Table 4) is presented, with the accession numbers from Swiss-Prot, Genbank or from the sea bass genome.

Gene ID
Relative expression levels (fpkm)
Log2FC of E2 or Gen effects vs control at 1d
Log2FC of E2 or Gen effects vs control at 5d
Log2FC of C5d vs C1d
Final gene annotation
Cufflinks#C1dE1dGen1dC5dE5dGen5dE2 upE2 downGen upGen downE2 upE2 downGen upGen downC5d upC5d downHit descriptionSymbol
XLOC_01788620.532.335.221.728.226.60.70.8sp|P51890|LUM_CHICK Lumican OS=Gallus gallusLUM
XLOC_00906620.132.223.614.918.816.60.7sp|Q6NVM0|H10_XENTR Histone H1.0H1F0
XLOC_00005739.963.954.851.866.152.30.7sp|Q04857|CO6A1_MOUSE Collagen alpha-1(VI) chainCOL6A1
XLOC_01448417.528.225.820.414.917.20.7sp|Q53RD9|FBLN7_HUMAN Fibulin-7FBLN7
XLOC_01999542.468.541.032.942.922.80.7sp|P55918|MFAP4_BOVIN Microfibril-associated glycoprotein 4MFAP4
XLOC_0062217.311.87.07.310.56.60.7sp|Q4R6P7|SESN1_MACFA Sestrin-1SESN1
XLOC_00214558.595.282.9108.594.273.30.70.9sp|O95428|PPN_HUMAN PapilinPAPLN
XLOC_02230958.496.062.045.381.235.50.70.8sp|Q9U8W8|TL5A_TACTR Techylectin-5A
XLOC_00358110.818.611.212.68.86.90.8−0.9sp|Q802Y8|ZB16A_DANRE Zinc finger and BTB domain-containing protein 16-AZBTB16
XLOC_0111421033.01793.0979.61063.61387.2709.60.8sp|Q66S03|LECG_THANI Galactose-specific lectin nattectinLADD
XLOC_01333340.470.459.750.950.748.90.8sp|Q90611|MMP2_CHICK 72 kDa type IV collagenaseMMP2
XLOC_020568323.2571.3392.8275.6447.7280.90.8gap junction epsilon-1
XLOC_0162205.08.96.16.55.65.40.8sp|Q568Y7|NOE2_RAT Noelin-2OLFM2
XLOC_006639175.0315.2258.5221.0268.7245.30.8sp|Q01584|LIPO_BUFMA LipocalinLCN1
XLOC_01515946.988.467.084.664.295.10.90.9sp|P27590|UROM_RAT UromodulinUMOD
XLOC_01968926.049.426.617.318.017.40.9No hit found
XLOC_0141673.26.15.13.42.62.71.0sp|Q5E9P5|PAMR1_BOVIN Inactive serine protease PAMR1 OS=Bos taur... 413 e-114PAMR1
XLOC_022211101.7197.8148.285.276.3155.61.0No hit found
XLOC_0087323.66.96.75.94.95.31.00.9sp|Q9BQB4|SOST_HUMAN SclerostinSOST
XLOC_018728132.5258.4160.0121.5235.399.01.01.0No hit found
Table 4

Detailed results from the annotation. Annotation results (in number of genes or percentage from total) are shown for the 749 genes differentially expressed with q < 0.05 (columns “DE all”) and for the selection of 332 genes differentially expressed at q < 0.05 with a minimum 2-fold change (columns “DE ≥ 2 FC”).

DE all
DE ≥ 2 FC
Number%Number%
Annotation to different databases:
Genes with BlastX hit to SwissProt5937922968
Genes assigned to predicted genes in genome6008027583
Genes with BlastX hit to GenBank6769028385
Final annotation using preference order:
Genes annotated via SwissProt5937922968
Genes annotated via the sea bass genome92126119
Genes annotated via GenBank345247
Annotated7199631495
Non-annotated304185
Total number of DE genes749100332100
Summary of annotation:
Annotation to known proteins6678928084
Annotation to predicted proteins213124
Annotation to hypothetical/uncharacterized proteins203144
Mapping to non-annotated genes11182
Fig. 1

Annotation to different databases of the 332 genes found to be ≥ 2-fold differentially expressed. Venn diagrams indicate the number of genes with a significant match to the sea bass genome, Swiss-Prot protein database or GeneBank protein database, the number of genes annotated by more than one database are inside the intersecting areas. The annotation was carried out in order of preference; 1) matches to Swiss-Prot, 2) matches to the sea bass genome and 3) Genbank matches as indicated by the colour shading. The areas of each sphere are proportional to the number of genes annotated.

Fig. 2

Proportion of differentially expressed (DE) genes identified in the present study. A. Venn diagrams representing common (Com.) and specific genes differentially expressed in sea bass scales in response to the treatments (17β-estradiol, E2, and/or genistein, Gen, compared to the corresponding controls at each sampling time 1 day and 5 days), compared to the differential expression in the control groups over time. The number of genes by treatment or time and their respective percentage are shown for two levels of stringency. “DE genes” above the diagram corresponds to the 749 genes differentially expressed at an FDR <0.05. Below the diagram the DE genes (332) differentially expressed with a minimum of 2-fold change are considered (“DE ≥ 2 FC” and FDR < 0.05). 51% of the “DE genes” changed expression only in control scales over time but these changes were of low magnitude (average fold change of 1.9-fold between C1d and C5d) and when the analysis stringency was increased to a minimum of 2-fold change, only 24% of these 332 genes changed expression in the control scales over time. The 254 genes (76%) significantly regulated by E2 or Gen were selected for further analyses. B. Shows the comparison of common/specific DE genes between E2 and Gen at the two stringency levels, FDR <0.05 or ≥2 FC and FDR <0.05 (irrespective of the sampling times). For the number of genes regulated by E2 and/or Gen at each sampling time see Fig. 2B of the associated paper in JSBMB [1].

Fig. 3

Heatmap of clustered DE genes identified in sea bass scales after treatment with E2 or Gen. The tree in the upper panel shows the hierarchical clustering of the DE genes (one gene/line) identified in the scales of the six treatment groups [injections with estradiol (E), genistein (G) or vehicle only (control, C) at 1 or 5 days (1d or 5d)]. The red gradient indicates high abundance, the green gradient indicates low abundance and black indicates equal abundance for each gene and condition relative to the average. 1 day after treatment the DE genes of the E2 group clustered more closely to the control than Gen1d. 5 days after treatment both E2-and Gen-treated scales clearly separated from the control and clustered together, suggesting a similar response at this time point.

Selected list (first 20 genes) of the genes differentially expressed with a ≥2-fold difference in sea bass scales from treated versus control animals. The complete list with the expression and annotation details for the 332 genes with significant differential expression (FDR < 0.05 and FC ≥ 2) in each comparison can be consulted in Supplementary Table 1. Normalized expression levels in each experimental group are presented in fragments per kilobase of exon model per million reads mapped (fpkm) and significant differential expression for each comparison is presented in Log2 fold change (FC). The final annotation reached using the preference order strategy (Fig. 1 and Table 4) and revised by manual curation is presented, with the accession numbers from Swiss-Prot, Genbank or from the sea bass genome. Selected list (first 20 genes) of the genes differentially expressed at FDR < 0.05 and at < 2-fold change difference, in sea bass scales from treated versus control animals. The complete list with the expression and annotation for the 417 genes with differential expression under these limits (FDR < 0.05 and FC < 2) can be consulted in Supplementary Table 2. Normalized expression levels in each experimental group are presented in fragments per kilobase of exon model per million reads mapped (fpkm) and significant differential expression for each comparison is presented in Log2 fold change (FC). The final annotation reached using the preference order strategy (Fig. 1 and Table 4) is presented, with the accession numbers from Swiss-Prot, Genbank or from the sea bass genome. Detailed results from the annotation. Annotation results (in number of genes or percentage from total) are shown for the 749 genes differentially expressed with q < 0.05 (columns “DE all”) and for the selection of 332 genes differentially expressed at q < 0.05 with a minimum 2-fold change (columns “DE ≥ 2 FC”). Annotation to different databases of the 332 genes found to be ≥ 2-fold differentially expressed. Venn diagrams indicate the number of genes with a significant match to the sea bass genome, Swiss-Prot protein database or GeneBank protein database, the number of genes annotated by more than one database are inside the intersecting areas. The annotation was carried out in order of preference; 1) matches to Swiss-Prot, 2) matches to the sea bass genome and 3) Genbank matches as indicated by the colour shading. The areas of each sphere are proportional to the number of genes annotated. Proportion of differentially expressed (DE) genes identified in the present study. A. Venn diagrams representing common (Com.) and specific genes differentially expressed in sea bass scales in response to the treatments (17β-estradiol, E2, and/or genistein, Gen, compared to the corresponding controls at each sampling time 1 day and 5 days), compared to the differential expression in the control groups over time. The number of genes by treatment or time and their respective percentage are shown for two levels of stringency. “DE genes” above the diagram corresponds to the 749 genes differentially expressed at an FDR <0.05. Below the diagram the DE genes (332) differentially expressed with a minimum of 2-fold change are considered (“DE ≥ 2 FC” and FDR < 0.05). 51% of the “DE genes” changed expression only in control scales over time but these changes were of low magnitude (average fold change of 1.9-fold between C1d and C5d) and when the analysis stringency was increased to a minimum of 2-fold change, only 24% of these 332 genes changed expression in the control scales over time. The 254 genes (76%) significantly regulated by E2 or Gen were selected for further analyses. B. Shows the comparison of common/specific DE genes between E2 and Gen at the two stringency levels, FDR <0.05 or ≥2 FC and FDR <0.05 (irrespective of the sampling times). For the number of genes regulated by E2 and/or Gen at each sampling time see Fig. 2B of the associated paper in JSBMB [1]. Heatmap of clustered DE genes identified in sea bass scales after treatment with E2 or Gen. The tree in the upper panel shows the hierarchical clustering of the DE genes (one gene/line) identified in the scales of the six treatment groups [injections with estradiol (E), genistein (G) or vehicle only (control, C) at 1 or 5 days (1d or 5d)]. The red gradient indicates high abundance, the green gradient indicates low abundance and black indicates equal abundance for each gene and condition relative to the average. 1 day after treatment the DE genes of the E2 group clustered more closely to the control than Gen1d. 5 days after treatment both E2-and Gen-treated scales clearly separated from the control and clustered together, suggesting a similar response at this time point. Table 5, Table 6 present the significantly enriched GO Biological Processes (GO-BP) and KEGG pathways, respectively, of all genes that presented significant changes in expression in response to the treatments E2 and/or Gen (analysis “All”); Table 7, Table 8 present the significantly enriched GO-BP and KEGGs when directly comparing the E2-or Gen-responsive genes, irrespective of the sampling time (analyses “E2” vs “Gen”); Table 9, Table 10 list the significantly enriched GO-BP and KEGGs when comparing responsive gene lists between 1 day and 5 days (analyses “1d” vs “5d”).
Table 5

Enrichment of GO Biological Processes (GO-BP) using all genes found to be differentially expressed in response to E2 and/or Gen (analysis “All”). Significantly enriched biological processes (FDR < 0.05) were identified by ID and term description and grouped into 23 functionally related networks (GO group) obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR) and highlighted in bold, which was chosen for GOTerm representation in Fig. 3 of the associated MS in JSBMB [1]. Functionally related groups are sorted by highest enrichment score, calculated as [ -Log2 (group FDR)].

GOIDGOTermTerm FDRGroup FDREnrichment ScoreGO group% Associated GenesNr. Genes
GO:0043207response to external biotic stimulus7.2E-101.8E-1032.4188.5421.00
GO:0051707response to other organism7.2E-101.8E-1032.4188.5421.00
GO:0009615response to virus3.8E-051.8E-1032.41812.967.00
GO:0009617response to bacterium1.5E-031.8E-1032.4185.209.00
GO:0051607defense response to virus2.1E-021.8E-1032.4187.143.00
GO:0043207response to external biotic stimulus7.2E-105.9E-0927.3208.5421.00
GO:0051707response to other organism7.2E-105.9E-0927.3208.5421.00
GO:0051591response to cAMP4.8E-055.9E-0927.32075.003.00
GO:0009617response to bacterium1.5E-035.9E-0927.3205.209.00
GO:0046683response to organophosphorus2.3E-035.9E-0927.32020.003.00
GO:0014074response to purine-containing compound2.3E-035.9E-0927.32020.003.00
GO:0032496response to lipopolysaccharide2.7E-035.9E-0927.3208.335.00
GO:0002237response to molecule of bacterial origin3.1E-035.9E-0927.3207.945.00
GO:0034097response to cytokine3.5E-035.9E-0927.3204.299.00
GO:0006954inflammatory response8.2E-035.9E-0927.3204.327.00
GO:0042493response to drug1.2E-025.9E-0927.3209.383.00
GO:0009612response to mechanical stimulus2.5E-025.9E-0927.3206.673.00
GO:0016126sterol biosynthetic process2.9E-093.0E-0721.72231.039.00
GO:0006694steroid biosynthetic process4.0E-093.0E-0721.72219.3011.00
GO:0016125sterol metabolic process9.6E-093.0E-0721.72220.8310.00
GO:0008202steroid metabolic process1.0E-083.0E-0721.72214.6312.00
GO:1901617organic hydroxy compound biosynthetic process8.7E-073.0E-0721.72215.259.00
GO:0008203cholesterol metabolic process2.6E-063.0E-0721.72220.597.00
GO:1902652secondary alcohol metabolic process4.3E-063.0E-0721.72218.927.00
GO:0006695cholesterol biosynthetic process1.1E-053.0E-0721.72231.255.00
GO:1902653secondary alcohol biosynthetic process1.8E-053.0E-0721.72227.785.00
GO:1901615organic hydroxy compound metabolic process4.4E-053.0E-0721.7227.6910.00
GO:0044283small molecule biosynthetic process8.0E-053.0E-0721.7225.8512.00
GO:0046165alcohol biosynthetic process1.3E-043.0E-0721.72217.865.00
GO:0008610lipid biosynthetic process4.1E-043.0E-0721.7224.8612.00
GO:0006066alcohol metabolic process4.5E-043.0E-0721.7228.337.00
GO:0016053organic acid biosynthetic process1.3E-033.0E-0721.7225.978.00
GO:0046394carboxylic acid biosynthetic process1.3E-033.0E-0721.7225.978.00
GO:1901607alpha-amino acid biosynthetic process1.4E-033.0E-0721.72210.425.00
GO:0008652cellular amino acid biosynthetic process1.6E-033.0E-0721.7229.805.00
GO:0031099regeneration3.0E-033.0E-0721.7224.948.00
GO:0031329regulation of cellular catabolic process3.0E-033.0E-0721.7225.517.00
GO:0009894regulation of catabolic process3.5E-033.0E-0721.7225.227.00
GO:0006520cellular amino acid metabolic process6.5E-033.0E-0721.7224.158.00
GO:0022600digestive system process6.9E-033.0E-0721.72212.003.00
GO:1901605alpha-amino acid metabolic process7.2E-033.0E-0721.7225.086.00
GO:0007586digestion1.2E-023.0E-0721.7229.383.00
GO:0072330monocarboxylic acid biosynthetic process1.5E-023.0E-0721.7225.974.00
GO:0031331positive regulation of cellular catabolic process1.7E-023.0E-0721.7225.714.00
GO:0009896positive regulation of catabolic process1.9E-023.0E-0721.7225.484.00
GO:0097164ammonium ion metabolic process2.9E-023.0E-0721.7226.253.00
GO:0015748organophosphate ester transport3.5E-023.0E-0721.7225.773.00
GO:0006633fatty acid biosynthetic process3.5E-023.0E-0721.7225.773.00
GO:0001878response to yeast4.6E-068.8E-0616.81225.006.00
GO:0009620response to fungus1.2E-058.8E-0616.81220.696.00
GO:0031099regeneration3.0E-032.7E-0411.9194.948.00
GO:0009611response to wounding3.3E-032.7E-0411.9194.339.00
GO:0042060wound healing3.4E-032.7E-0411.9194.688.00
GO:0022600digestive system process6.9E-032.7E-0411.91912.003.00
GO:0007586digestion1.2E-022.7E-0411.9199.383.00
GO:0042246tissue regeneration4.3E-022.7E-0411.9194.084.00
GO:0007599hemostasis4.5E-022.7E-0411.9195.083.00
GO:0007596blood coagulation4.5E-022.7E-0411.9195.083.00
GO:0050817coagulation4.9E-022.7E-0411.9194.843.00
GO:0030162regulation of proteolysis3.4E-035.6E-037.5114.0210.00
GO:0045861negative regulation of proteolysis1.7E-025.6E-037.5114.036.00
GO:0006270DNA replication initiation6.9E-031.2E-026.4012.003.00
GO:0048593camera-type eye morphogenesis6.4E-031.4E-026.2174.647.00
GO:0048592eye morphogenesis7.3E-031.4E-026.2174.048.00
GO:0042462eye photoreceptor cell development1.9E-021.4E-026.2177.503.00
GO:0031076embryonic camera-type eye development3.1E-021.4E-026.2176.123.00
GO:0001754eye photoreceptor cell differentiation4.8E-021.4E-026.2174.923.00
GO:0006979response to oxidative stress3.7E-031.4E-026.2167.255.00
GO:1990748cellular detoxification3.1E-021.4E-026.2166.123.00
GO:0098869cellular oxidant detoxification3.1E-021.4E-026.2166.123.00
GO:0098754detoxification3.5E-021.4E-026.2165.773.00
GO:0031589cell-substrate adhesion9.6E-031.5E-026.1107.024.00
GO:0007160cell-matrix adhesion3.2E-021.5E-026.1106.003.00
GO:0006730one-carbon metabolic process1.4E-021.8E-025.848.573.00
GO:0051241negative regulation of multicellular organismal process1.3E-021.8E-025.834.326.00
GO:0009266response to temperature stimulus1.7E-022.1E-025.687.893.00
GO:0072527pyrimidine-containing compound metabolic process1.9E-022.3E-025.457.503.00
GO:1901136carbohydrate derivative catabolic process3.2E-023.9E-024.766.003.00
GO:0009410response to xenobiotic stimulus3.5E-024.0E-024.695.773.00
GO:0001945lymph vessel development3.8E-024.2E-024.625.563.00
GO:0048864stem cell development4.1E-024.3E-024.5134.174.00
GO:0014031mesenchymal cell development4.1E-024.3E-024.5134.174.00
GO:0014032neural crest cell development4.1E-024.3E-024.5134.174.00
GO:0000188inactivation of MAPK activity3.4E-044.6E-024.42122.224.00
GO:0043407negative regulation of MAP kinase activity5.6E-044.6E-024.42119.054.00
GO:0043409negative regulation of MAPK cascade1.4E-034.6E-024.42114.294.00
GO:0043405regulation of MAP kinase activity2.0E-034.6E-024.4217.326.00
GO:0071901negative regulation of protein serine/threonine kinase activity2.1E-034.6E-024.42112.504.00
GO:0033673negative regulation of kinase activity2.7E-034.6E-024.4216.676.00
GO:0006469negative regulation of protein kinase activity2.7E-034.6E-024.4216.746.00
GO:0051348negative regulation of transferase activity3.3E-034.6E-024.4216.256.00
GO:0001933negative regulation of protein phosphorylation3.4E-034.6E-024.4216.126.00
GO:0042326negative regulation of phosphorylation3.6E-034.6E-024.4216.006.00
GO:1902532negative regulation of intracellular signal transduction8.0E-034.6E-024.4214.966.00
GO:0071900regulation of protein serine/threonine kinase activity8.2E-034.6E-024.4214.926.00
GO:0010563negative regulation of phosphorus metabolic process9.6E-034.6E-024.4214.726.00
GO:0045936negative regulation of phosphate metabolic process9.6E-034.6E-024.4214.726.00
GO:0031400negative regulation of protein modification process9.6E-034.6E-024.4214.726.00
GO:0001706endoderm formation1.6E-024.6E-024.4218.113.00
GO:0001704formation of primary germ layer1.9E-024.6E-024.4215.484.00
GO:0001666response to hypoxia4.4E-024.6E-024.4155.173.00
GO:0070482response to oxygen levels4.6E-024.6E-024.4155.003.00
GO:0036293response to decreased oxygen levels4.6E-024.6E-024.4155.003.00
GO:0009142nucleoside triphosphate biosynthetic process4.5E-024.7E-024.475.083.00
GO:0033334fin morphogenesis4.5E-024.7E-024.415.083.00
GO:0042541hemoglobin biosynthetic process1.7E-036.2E-024.01423.083.00
GO:0020027hemoglobin metabolic process2.0E-036.2E-024.01421.433.00
GO:0055076transition metal ion homeostasis4.9E-026.2E-024.0144.843.00
Table 6

Enrichment of KEGG pathways using all genes found to be differentially expressed in response to E2 and/or Gen (analysis “All”). Significantly enriched pathways (FDR < 0.05) were identified by their KEGG identifier and have been grouped according to 7 functionally related groups obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR), which is indicated in bold. Functionally related groups are sorted by highest enrichment score, calculated as [- Log2 (group FDR)].

KEGG IdentifierKEGG ID FDRGroup FDREnrichment ScoreGroups% Associated GenesNr. Genes
Steroid biosynthesis240.0E-9210.0E-922.2033.337.00
ECM-receptor interaction1.5E-31.3E-39.658.647.00
Alanine, aspartate and glutamate metabolism8.7E-33.0E-38.4610.004.00
Arginine biosynthesis10.0E-33.0E-38.4612.003.00
DNA replication9.6E-36.3E-37.3210.534.00
p53 signaling pathway12.0E-311.0E-36.546.765.00
Arachidonic acid metabolism11.0E-312.0E-36.417.844.00
Proteasome14.0E-314.0E-36.237.144.00
Table 7

Enrichment of GO Biological Processes (GO-BP) by the genes differentially expressed in response to E2 or to Gen (analyses “E2” vs “Gen”). Significantly enriched biological processes (FDR < 0.05), identified by their ID and term description, are grouped into 12 functionally related networks (GO group) obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR), highlighted in bold, which was chosen for group representation in Fig. 3 of the associated MS in JSBMB [1]. Functionally related groups are sorted by the highest enrichment score, calculated as [- Log2 (group FDR)]. The classification of each term as specifically enriched in response to the E2 or Gen treatments (or both), obtained by ClueGO cluster analysis, is also shown. For groups including terms enriched in more than one treatment, the classification of the leading term was adopted for the group.

GOIDGOTermTerm FDRGroup FDREnrichment ScoreGOGroups% Associated GenesNr. GenesTreatment%Genes Specific for E2%Genes Specific for Gen
GO:0043207Response to external biotic stimulus540.0E-12100.0E-1233.278.5421.00Specific for Gen51.6773.19
GO:0051707response to other organism540.0E-12100.0E-1233.278.5421.00Specific for Gen51.6773.19
GO:0009615response to virus28.0E-6100.0E-1233.2712.967.00Specific for Gen50.6163.26
GO:0009617response to bacterium1.1E-3100.0E-1233.275.209.00Specific for Gen55.5174.01
GO:0043207Response to external biotic stimulus540.0E-128.9E-926.798.5421.00Specific for Gen51.6773.19
GO:0051707response to other organism540.0E-128.9E-926.798.5421.00Specific for Gen51.6773.19
GO:0051591response to cAMP36.0E-68.9E-926.7975.003.00Specific for Gen45.5168.26
GO:0009617response to bacterium1.1E-38.9E-926.795.209.00Specific for Gen55.5174.01
GO:0046683response to organophosphorus1.8E-38.9E-926.7920.003.00Specific for Gen45.5168.26
GO:0014074response to purine-containing compound1.8E-38.9E-926.7920.003.00Specific for Gen45.5168.26
GO:0032496response to lipopolysaccharide2.0E-38.9E-926.798.335.00Specific for Gen49.6366.17
GO:0002237response to molecule of bacterial origin2.3E-38.9E-926.797.945.00Specific for Gen49.6366.17
GO:0042493response to drug10.0E-38.9E-926.799.383.00Specific for Gen45.5168.26
GO:0009612response to mechanical stimulus23.0E-38.9E-926.796.673.00Specific for Gen45.5168.26
GO:0016126Sterol biosynthetic process2.2E-91.3E-619.61131.039.00Specific for Gen20.3691.63
GO:0006694steroid biosynthetic process3.0E-91.3E-619.61119.3011.00Specific for Gen17.0093.49
GO:0016125sterol metabolic process7.3E-91.3E-619.61120.8310.00Specific for Gen18.5392.66
GO:0008202steroid metabolic process7.7E-91.3E-619.61114.6312.00Specific for Gen15.6994.16
GO:1901617organic hydroxy compound biosynthetic process660.0E-91.3E-619.61115.259.00Specific for Gen20.3691.63
GO:0008203cholesterol metabolic process2.0E-61.3E-619.61120.597.00Specific for Gen25.3088.56
GO:1902652secondary alcohol metabolic process3.2E-61.3E-619.61118.927.00Specific for Gen25.3088.56
GO:0006695cholesterol biosynthetic process8.7E-61.3E-619.61131.255.00Specific for Gen17.9689.82
GO:1902653secondary alcohol biosynthetic process13.0E-61.3E-619.61127.785.00Specific for Gen17.9689.82
GO:1901615organic hydroxy compound metabolic process33.0E-61.3E-619.6117.6910.00Specific for Gen18.5392.66
GO:0044283small molecule biosynthetic process60.0E-61.3E-619.6115.8512.00Specific for Gen29.8789.62
GO:0046165alcohol biosynthetic process100.0E-61.3E-619.61117.865.00Specific for Gen17.9689.82
GO:0008610lipid biosynthetic process310.0E-61.3E-619.6114.8612.00Specific for Gen15.6994.16
GO:0006066alcohol metabolic process340.0E-61.3E-619.6118.337.00Specific for Gen25.3088.56
GO:0016053organic acid biosynthetic process1.0E-31.3E-619.6115.978.00Specific for Gen32.5286.72
GO:0046394carboxylic acid biosynthetic process1.0E-31.3E-619.6115.978.00Specific for Gen32.5286.72
GO:1901607alpha-amino acid biosynthetic process1.0E-31.3E-619.61110.425.00Specific for Gen33.0882.71
GO:0008652cellular amino acid biosynthetic process1.2E-31.3E-619.6119.805.00Specific for Gen33.0882.71
GO:0031099Regeneration2.2E-31.3E-619.6114.948.00Specific for Gen32.5286.72
GO:0006520cellular amino acid metabolic process5.6E-31.3E-619.6114.158.00Specific for Gen22.5890.31
GO:0022600digestive system process6.0E-31.3E-619.61112.003.00Specific for Gen26.4279.25
GO:1901605alpha-amino acid metabolic process6.3E-31.3E-619.6115.086.00Specific for Gen28.7286.17
GO:0007586Digestion10.0E-31.3E-619.6119.383.00Specific for Gen26.4279.25
GO:0072330monocarboxylic acid biosynthetic process14.0E-31.3E-619.6115.974.00Specific for Gen38.6977.37
GO:0031331positive regulation of cellular catabolic process15.0E-31.3E-619.6115.714.00Both treatments58.0358.03
GO:0009896positive regulation of catabolic process17.0E-31.3E-619.6115.484.00Both treatments58.0358.03
GO:0097164ammonium ion metabolic process27.0E-31.3E-619.6116.253.00Specific for Gen26.4279.25
GO:0015748organophosphate ester transport33.0E-31.3E-619.6115.773.00Specific for Gen26.4279.25
GO:0006633fatty acid biosynthetic process33.0E-31.3E-619.6115.773.00Specific for Gen26.4279.25
GO:0001878Response to yeast3.4E-64.8E-617.7425.006.00Specific for Gen43.0871.80
GO:0009620response to fungus9.5E-64.8E-617.7420.696.00Specific for Gen43.0871.80
GO:0031099Regeneration2.2E-3350.0E-611.584.948.00Specific for Gen32.5286.72
GO:0042060wound healing2.8E-3350.0E-611.584.688.00Specific for Gen52.3073.22
GO:0022600digestive system process6.0E-3350.0E-611.5812.003.00Specific for Gen26.4279.25
GO:0007586Digestion10.0E-3350.0E-611.589.383.00Specific for Gen26.4279.25
GO:0042246tissue regeneration41.0E-3350.0E-611.584.084.00Specific for Gen38.6977.37
GO:0007599Hemostasis43.0E-3350.0E-611.585.083.00Specific for E268.2645.51
GO:0007596blood coagulation43.0E-3350.0E-611.585.083.00Specific for E268.2645.51
GO:0050817Coagulation48.0E-3350.0E-611.584.843.00Specific for E268.2645.51
GO:0006979Response to oxidative stress3.1E-33.9E-38.017.255.00Both treatments53.8953.89
GO:0031589cell-substrate adhesion8.6E-310.0E-36.657.024.00Specific for E264.6043.07
GO:0007160cell-matrix adhesion30.0E-310.0E-36.656.003.00Specific for E279.2526.42
GO:0006730One-carbon metabolic process13.0E-313.0E-36.308.573.00Specific for Gen0.00100.00
GO:0051241negative regulation of multicellular organismal process12.0E-314.0E-36.234.326.00Specific for Gen51.8877.82
GO:0001945Lymph vessel development35.0E-335.0E-34.825.563.00Specific for E268.2645.51
GO:0000188Inactivation of MAPK activity260.0E-639.0E-34.71022.224.00Specific for Gen53.4371.24
GO:0043407negative regulation of MAP kinase activity420.0E-639.0E-34.71019.054.00Specific for Gen53.4371.24
GO:0043409negative regulation of MAPK cascade1.1E-339.0E-34.71014.294.00Specific for Gen53.4371.24
GO:0043405regulation of MAP kinase activity1.5E-339.0E-34.7107.326.00Both treatments64.8564.85
GO:0071901negative regulation of protein serine/threonine kinase activity1.5E-339.0E-34.71012.504.00Specific for Gen53.4371.24
GO:0033673negative regulation of kinase activity2.1E-339.0E-34.7106.676.00Both treatments64.8564.85
GO:0006469negative regulation of protein kinase activity2.1E-339.0E-34.7106.746.00Both treatments64.8564.85
GO:0051348negative regulation of transferase activity2.6E-339.0E-34.7106.256.00Both treatments64.8564.85
GO:0001933negative regulation of protein phosphorylation2.9E-339.0E-34.7106.126.00Both treatments64.8564.85
GO:0042326negative regulation of phosphorylation3.0E-339.0E-34.7106.006.00Both treatments64.8564.85
GO:1902532negative regulation of intracellular signal transduction7.0E-339.0E-34.7104.966.00Both treatments64.8564.85
GO:0071900regulation of protein serine/threonine kinase activity7.1E-339.0E-34.7104.926.00Both treatments64.8564.85
GO:0001706endoderm formation14.0E-339.0E-34.7108.113.00Specific for Gen45.5168.26
GO:0001704formation of primary germ layer17.0E-339.0E-34.7105.484.00Specific for Gen53.4371.24
GO:0007492endoderm development49.0E-339.0E-34.7104.763.00Specific for Gen45.5168.26
GO:0042541Hemoglobin biosynthetic process1.3E-362.0E-34.0623.083.00Specific for Gen26.4279.25
GO:0020027hemoglobin metabolic process1.5E-362.0E-34.0621.433.00Specific for Gen26.4279.25
GO:0055076transition metal ion homeostasis48.0E-362.0E-34.064.843.00Specific for Gen26.4279.25
Table 8

Enrichment of KEGG pathways by the genes differentially expressed in response to E2 or to Gen (Analyses “E2” vs “Gen”). Significantly enriched pathways (FDR < 0.05), identified by their KEGG identifier, are grouped according to 6 functionally related groups obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR) and is highlighted in bold. Functionally related groups are sorted by highest enrichment score, calculated as [- Log2 (group FDR)].

KEGG IdentifierKEGG ID FDRGroup FDREnrichment ScoreGroups% Associated GenesNr. GenesTreatment%Genes Specific for E2%Genes Specific for Gen
Steroid biosynthesis210.0E-9180.0E-922.4033.337.00Specific for Gen0.00100.00
ECM-receptor interaction1.3E-31.1E-39.848.647.00Specific for E266.8440.11
Alanine, aspartate and glutamate metabolism7.6E-32.6E-38.6510.004.00Specific for Gen21.5386.14
Arginine biosynthesis9.5E-32.6E-38.6512.003.00Specific for Gen0.00100.00
DNA replication8.4E-35.4E-37.5110.534.00Specific for Gen25.0075.00
p53 signaling pathway11.0E-39.5E-36.736.765.00Both treatments53.8953.89
Proteasome14.0E-314.0E-36.227.144.00Specific for E286.1421.53
Table 9

Enrichment of GO Biological Processes by the genes differentially expressed after 1 day or 5 days (Analyses “1d” vs “5d”). Significantly enriched biological processes (FDR < 0.05), identified by their ID and term description, are grouped according to 11 functionally related networks (GO group) obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR), highlighted in bold, which was chosen for GOTerm representation in Fig. 3 of the associated MS in JSBMB [1]. Functionally related groups are sorted by highest enrichment score, calculated as [- Log2 (group FDR)]. The classification of each term as specifically enriched in the responses after 1 day or 5 days (or both), obtained by ClueGO cluster analysis, is also shown. For groups including terms enriched in more than one list the classification of the leading term was adopted for the whole group.

GOIDGOTermTerm FDRGroup FDREnrichment ScoreGOGroups% Associated GenesNr. GenesTime%Genes Specific for 1d%Genes Specific for 5d
GO:0043207Response to external biotic stimulus490.0E-12100.0E-1233.278.5421.00Specific for 5 days36.4972.99
GO:0051707response to other organism490.0E-12100.0E-1233.278.5421.00Specific for 5 days36.4972.99
GO:0009615response to virus26.0E-6100.0E-1233.2712.967.00Specific for 5 days14.2985.71
GO:0009617response to bacterium1.0E-3100.0E-1233.275.209.00Specific for 5 days39.3078.60
GO:0043207Response to external biotic stimulus490.0E-128.9E-926.798.5421.00Specific for 5 days36.4972.99
GO:0051707response to other organism490.0E-128.9E-926.798.5421.00Specific for 5 days36.4972.99
GO:0051591response to cAMP33.0E-68.9E-926.7975.003.00Specific for 5 days45.5168.26
GO:0009617response to bacterium1.0E-38.9E-926.795.209.00Specific for 5 days39.3078.60
GO:0046683response to organophosphorus1.7E-38.9E-926.7920.003.00Specific for 5 days45.5168.26
GO:0014074response to purine-containing compound1.7E-38.9E-926.7920.003.00Specific for 5 days45.5168.26
GO:0032496response to lipopolysaccharide1.9E-38.9E-926.798.335.00Specific for 5 days33.0882.71
GO:0002237response to molecule of bacterial origin2.2E-38.9E-926.797.945.00Specific for 5 days33.0882.71
GO:0042493response to drug10.0E-38.9E-926.799.383.00Specific for 5 days45.5168.26
GO:0009612response to mechanical stimulus23.0E-38.9E-926.796.673.00Specific for 5 days45.5168.26
GO:0016126Sterol biosynthetic process2.0E-976.0E-923.61131.039.00Specific for 1 day84.8221.21
GO:0006694steroid biosynthetic process2.8E-976.0E-923.61119.3011.00Specific for 1 day78.9526.32
GO:0016125sterol metabolic process6.6E-976.0E-923.61120.8310.00Specific for 1 day86.4219.21
GO:0008202steroid metabolic process7.0E-976.0E-923.61114.6312.00Specific for 1 day80.7324.22
GO:1901617organic hydroxy compound biosynthetic process600.0E-976.0E-923.61115.259.00Specific for 1 day84.8221.21
GO:0008203cholesterol metabolic process1.8E-676.0E-923.61120.597.00Specific for 1 day80.2126.74
GO:1902652secondary alcohol metabolic process2.9E-676.0E-923.61118.927.00Specific for 1 day80.2126.74
GO:0006695cholesterol biosynthetic process7.9E-676.0E-923.61131.255.00Specific for 1 day89.8217.96
GO:1902653secondary alcohol biosynthetic process12.0E-676.0E-923.61127.785.00Specific for 1 day89.8217.96
GO:1901615organic hydroxy compound metabolic process30.0E-676.0E-923.6117.6910.00Specific for 1 day86.4219.21
GO:0044283small molecule biosynthetic process54.0E-676.0E-923.6115.8512.00Specific for 1 day68.8245.88
GO:0046165alcohol biosynthetic process92.0E-676.0E-923.61117.865.00Specific for 1 day89.8217.96
GO:0006066alcohol metabolic process330.0E-676.0E-923.6118.337.00Specific for 1 day80.2126.74
GO:0016053organic acid biosynthetic process990.0E-676.0E-923.6115.978.00Specific for 5 days54.2065.04
GO:0046394carboxylic acid biosynthetic process990.0E-676.0E-923.6115.978.00Specific for 5 days54.2065.04
GO:1901607alpha-amino acid biosynthetic process1.0E-376.0E-923.61110.425.00Specific for 5 days49.6366.17
GO:0008652cellular amino acid biosynthetic process1.1E-376.0E-923.6119.805.00Specific for 5 days49.6366.17
GO:0072330monocarboxylic acid biosynthetic process14.0E-376.0E-923.6115.974.00Both times58.0358.03
GO:0031331positive regulation of cellular catabolic process16.0E-376.0E-923.6115.714.00Specific for 5 days43.0764.60
GO:0009896positive regulation of catabolic process17.0E-376.0E-923.6115.484.00Specific for 5 days43.0764.60
GO:0097164ammonium ion metabolic process27.0E-376.0E-923.6116.253.00Specific for 1 day79.2526.42
GO:0001878Response to yeast3.1E-64.8E-617.7525.006.00Both times50.0050.00
GO:0009620response to fungus8.6E-64.8E-617.7520.696.00Both times50.0050.00
GO:0031099Regeneration2.1E-3150.0E-612.784.948.00Specific for 1 day75.8843.36
GO:0042060wound healing2.7E-3150.0E-612.784.688.00Specific for 1 day73.2252.30
GO:0042246tissue regeneration42.0E-3150.0E-612.784.084.00Specific for 1 day77.3738.69
GO:0007599hemostasis44.0E-3150.0E-612.785.083.00Specific for 5 days45.5168.26
GO:0007596blood coagulation44.0E-3150.0E-612.785.083.00Specific for 5 days45.5168.26
GO:0050817coagulation49.0E-3150.0E-612.784.843.00Specific for 5 days45.5168.26
GO:0006979Response to oxidative stress2.9E-33.9E-38.007.255.00Specific for 1 day60.0040.00
GO:0031589Cell-substrate adhesion8.6E-310.0E-36.637.024.00Specific for 5 days43.0764.60
GO:0009266Response to temperature stimulus15.0E-319.0E-35.727.893.00Specific for 1 day100.000.00
GO:0000188Inactivation of MAPK activity230.0E-629.0E-35.11022.224.00Specific for 5 days38.6977.37
GO:0043407negative regulation of MAP kinase activity400.0E-629.0E-35.11019.054.00Specific for 5 days38.6977.37
GO:0043409negative regulation of MAPK cascade1.0E-329.0E-35.11014.294.00Specific for 5 days38.6977.37
GO:0043405regulation of MAP kinase activity1.4E-329.0E-35.1107.326.00Specific for 5 days28.7286.17
GO:0071901negative regulation of protein serine/threonine kinase activity1.4E-329.0E-35.11012.504.00Specific for 5 days38.6977.37
GO:0033673negative regulation of kinase activity1.9E-329.0E-35.1106.676.00Specific for 5 days54.3667.96
GO:0006469negative regulation of protein kinase activity1.9E-329.0E-35.1106.746.00Specific for 5 days54.3667.96
GO:0051348negative regulation of transferase activity2.5E-329.0E-35.1106.256.00Specific for 5 days54.3667.96
GO:0001933negative regulation of protein phosphorylation2.7E-329.0E-35.1106.126.00Specific for 5 days54.3667.96
GO:0042326negative regulation of phosphorylation2.8E-329.0E-35.1106.006.00Specific for 5 days54.3667.96
GO:1902532negative regulation of intracellular signal transduction7.0E-329.0E-35.1104.966.00Specific for 5 days54.3667.96
GO:0071900regulation of protein serine/threonine kinase activity7.1E-329.0E-35.1104.926.00Specific for 5 days28.7286.17
GO:0001706endoderm formation15.0E-329.0E-35.1108.113.00Specific for 5 days26.4279.25
GO:0001704formation of primary germ layer17.0E-329.0E-35.1105.484.00Specific for 5 days21.5386.14
GO:0001945Lymph vessel development37.0E-339.0E-34.745.563.00Specific for 1 day68.2645.51
GO:0009142Nucleoside triphosphate biosynthetic process44.0E-344.0E-34.515.083.00Specific for 5 days0.00100.00
GO:0042541Hemoglobin biosynthetic process1.2E-362.0E-34.0623.083.00Specific for 5 days26.4279.25
GO:0020027hemoglobin metabolic process1.4E-362.0E-34.0621.433.00Specific for 5 days26.4279.25
GO:0055076transition metal ion homeostasis49.0E-362.0E-34.064.843.00Specific for 5 days26.4279.25
Table 10

Enrichment of KEGG pathways by the genes differentially expressed after 1 day or 5 days (analyses “1d” vs “5d”). Significantly enriched pathways (FDR < 0.05), identified by their KEGG identifier, are grouped according to 7 functionally related groups obtained by ClueGO analysis.

KEGG IdentifierKEGG ID FDRGroup FDREnrichment ScoreGroups% Associated GenesNr. GenesTime%Genes Specific for 1d%Genes Specific for 5d
p53 signaling pathway11.0E-311.0E-36.556.765.00Both times53.8953.89
Steroid biosynthesis210.0E-9210.0E-922.2033.337.00Specific for 1d100.000.00
ECM-receptor interaction1.3E-31.3E-39.668.647.00Specific for 1d66.8440.11
Alanine, aspartate and glutamate metabolism7.6E-37.6E-37.0110.004.00Specific for 1d64.6043.07
DNA replication8.4E-38.4E-36.9310.534.00Specific for 5 d25.0075.00
Arachidonic acid metabolism12.0E-312.0E-36.427.844.00Specific for 5 d25.0075.00
Proteasome14.0E-314.0E-36.247.144.00Specific for 5 d21.5386.14
Enrichment of GO Biological Processes (GO-BP) using all genes found to be differentially expressed in response to E2 and/or Gen (analysis “All”). Significantly enriched biological processes (FDR < 0.05) were identified by ID and term description and grouped into 23 functionally related networks (GO group) obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR) and highlighted in bold, which was chosen for GOTerm representation in Fig. 3 of the associated MS in JSBMB [1]. Functionally related groups are sorted by highest enrichment score, calculated as [ -Log2 (group FDR)]. Enrichment of KEGG pathways using all genes found to be differentially expressed in response to E2 and/or Gen (analysis “All”). Significantly enriched pathways (FDR < 0.05) were identified by their KEGG identifier and have been grouped according to 7 functionally related groups obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR), which is indicated in bold. Functionally related groups are sorted by highest enrichment score, calculated as [- Log2 (group FDR)]. Enrichment of GO Biological Processes (GO-BP) by the genes differentially expressed in response to E2 or to Gen (analyses “E2” vs “Gen”). Significantly enriched biological processes (FDR < 0.05), identified by their ID and term description, are grouped into 12 functionally related networks (GO group) obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR), highlighted in bold, which was chosen for group representation in Fig. 3 of the associated MS in JSBMB [1]. Functionally related groups are sorted by the highest enrichment score, calculated as [- Log2 (group FDR)]. The classification of each term as specifically enriched in response to the E2 or Gen treatments (or both), obtained by ClueGO cluster analysis, is also shown. For groups including terms enriched in more than one treatment, the classification of the leading term was adopted for the group. Enrichment of KEGG pathways by the genes differentially expressed in response to E2 or to Gen (Analyses “E2” vs “Gen”). Significantly enriched pathways (FDR < 0.05), identified by their KEGG identifier, are grouped according to 6 functionally related groups obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR) and is highlighted in bold. Functionally related groups are sorted by highest enrichment score, calculated as [- Log2 (group FDR)]. Enrichment of GO Biological Processes by the genes differentially expressed after 1 day or 5 days (Analyses “1d” vs “5d”). Significantly enriched biological processes (FDR < 0.05), identified by their ID and term description, are grouped according to 11 functionally related networks (GO group) obtained by ClueGO analysis. Each group is named after its most significant term (lowest FDR), highlighted in bold, which was chosen for GOTerm representation in Fig. 3 of the associated MS in JSBMB [1]. Functionally related groups are sorted by highest enrichment score, calculated as [- Log2 (group FDR)]. The classification of each term as specifically enriched in the responses after 1 day or 5 days (or both), obtained by ClueGO cluster analysis, is also shown. For groups including terms enriched in more than one list the classification of the leading term was adopted for the whole group. Enrichment of KEGG pathways by the genes differentially expressed after 1 day or 5 days (analyses “1d” vs “5d”). Significantly enriched pathways (FDR < 0.05), identified by their KEGG identifier, are grouped according to 7 functionally related groups obtained by ClueGO analysis.

Experimental design, materials, and methods

Experimental set-up and sampling

The experimental set-up generating the analysed RNAs has been previously described by Pinto et al. [1,2]. Immature sea bass (n = 10/experimental group) received intraperitoneal injections of 5 mg/kg E2 or 5 mg/kg Gen in coconut oil or injection of coconut oil alone (control groups). Individual scales were plucked with forceps from the same region of the skin (below the dorsal fin) in each fish, frozen in liquid nitrogen and stored at −80 °C until total RNA extraction.

Total RNA extraction

An automated Maxwell 16 Instrument and the SEV (standard elution volume) total RNA purification kit (Promega, Madison, Wisconsin, USA) were used for the extraction of total RNA from n = 15 scales/individual sea bass. Scales in lysis buffer were mechanically disrupted with an Ultra Turrax homogenizer (IKA, Staufen, Germany) and a dispersing element specialized for fibrous tissues. The extracted total RNA was concentrated by precipitation with 2 vol (V) of 100% ethanol and 1/10 V of 3 M sodium acetate pH 5.2 and resuspended in 20–30 μl of Milli-Q filtered water. RNA quantity and quality were measured in a NanoDrop 1000 spectrophotometer (Thermo Fisher Scientific, Waltham, Massachusetts, USA). Between 3 and 10 μg of RNA were digested with DNase, using a rigorous treatment regime by carrying out two sequential digestions of 30 min with 2U of TURBO DNAse, as recommended in the instructions of the DNA-free kit (Ambion, Thermo Fisher Scientific, Waltham, Massachusetts, USA).

RNA-seq library preparation

The prepared DNase treated RNAs were precipitated with 5 vol of 100% ethanol and shipped in refrigerated conditions to the Shanghai Ocean University Sequencing Service, Shanghai, China. Their quality was assessed using a Bioanalyser 2100 (Agilent Technologies, Santa Clara, California, USA), to confirm that all RNAs had an RNA integrity number (RIN) greater than 8. Library preparation was conducted using a TruSeq mRNA library prep kit (Illumina, San Diego, California, USA), following the suppliers' instructions. For each library, 0.5 μg of each of the 30 individual RNAs were used (n = 4–6 individual libraries per treatment, see Table 1). Sequencing of paired-end (100 bp) reads was carried out using an Illumina Hi-Seq 1500, at the Shanghai Ocean University Sequencing Service.

Sequencing and data analysis

Quality control and trimming of the produced reads was carried out with FastQC and Cutadapt [3,4], using a Phred quality score cut-off of 20. Good quality (filtered) reads (Table 1) were then mapped and assembled using the sea bass reference genome assembly from June 2012 (dicLab v1.0c) and the annotation from July 2013 [5,6], by running TopHat and Cufflinks packages with the data and using the default parameters [7,8]. Relative expression levels were obtained in fpkm (fragments per kilobase of transcript per million fragments mapped) for each sea bass gene and differential expression between experimental conditions was evaluated using Cuffdiff. Differential expression was evaluated using pairwise comparisons between a) the scales of E2-or Gen-treated fish compared to control fish at the same sampling time (E1d vs C1d, Gen1d vs C1d, E5d vs C5d or Gen5d vs C5d) or b) between the control groups over time (C1d vs C5d). Two different stringency conditions were used to identify differentially expressed genes: those passing the condition FDR <0.05 and those satisfying both conditions FDR <0.05 and a ≥ 2-fold change in expression between the two compared groups. The common or specific genes identified between different lists of E2 or Gen DE genes were represented by area proportional Venn diagrams generated with BioVenn [9,10], using their Cufflinks XLOC identifiers (see Supplementary Table 1). Similarities between global transcriptome changes identified for the six experimental conditions (C1d, E1d, Gen1d, C5d, E5d and Gen5d) were evaluated by hierarchical clustering with Cluster 3.0 [11,12], using median centered expression data after Log2 transformation of normalized gene expression (fpkm) levels. The uncentered correlation option and complete linkage options were used to cluster group arrays (experimental conditions).

Gene and functional annotations

The 749 genes for which differential expression was detected with FDR <0.05 were subjected to a multistep automatic gene annotation strategy. Stand-alone Blastx analyses of the corresponding Cufflinks transcripts (individual sequences extracted from the sea bass annotated genes coding sequences, between the mapping positions) were run against the Swiss-Prot curated protein database [13,14] and the unreviewed GenBank protein database [15,16], with expect values (E) set at a maximum of 10−10. These Blastx results were then combined and compared with the Cufflinks mappings to the annotated sea bass genome [5,6]. For genome mapping, the annotation of the closest gene was used as long as a maximum distance of 1000 bp existed between this gene and the mapping position of the Cufflinks transcript. A hierarchical preference order was defined to assign annotation matches to the differentially expressed genes, which was Swiss-Prot hits > sea bass genome hits > GenBank hits, as represented in Fig. 1. Genes were annotated using Swiss-Prot hits for all genes that gave significant hits with this curated database; when no Swiss-Prot hits were found genes were annotated using genome mapping and genes with no significant annotation using the two previous databases were annotated using the non-curated Genbank protein database. In addition, 54% of the 332 DE genes with ≥ 2-fold change were manually curated to verify the accuracy of the annotations. For this process, individual Cufflinks transcript sequences extracted from the mapping positions were re-annotated using individual BLAT versus the sea bass genome [5,6], compared with the sea bass annotations of these and close by genes. Their predicted coding sequences were blasted against the Swiss-Prot and GenBank protein databases, followed by a careful verification by multisequence alignments carried out with MultAlin [17,18]. Gene ontology (GO) and pathway (KEGG) enrichment analyses were carried out using Cytoscape v3.5.1 [19] and ClueGO plug-in v2.3.2 [20], with Cluepedia v1.3.2. The zebrafish (Danio rerio) orthologues for the 371 genes found to be DE with E2 or Gen treatments at FDR<0.05 were identified using stand-alone BlastX (with E value < 10−10) against the Ensembl zebrafish protein predictions (GRC Zebrafish Build 10, INSDC Assembly GCA_000002035.3 from Sep 2014, downloaded in Feb 2017 at https://www.ensembl.org/ [21]). D. rerio Ensembl protein IDs corresponding to each list of DE genes (“All” = all genes regulated by E2 and/or Gen; “E2” or “Gen” = genes regulated by each treatment irrespective of the sampling time and “1d” or “5d” = genes regulated after 1 or 5 days by either E2 or Gen) were then submitted to the Cytoscape/ClueGO plug-in. This was run using the following settings: enrichment analysis (right-sided hypergeometric test) using GO Biological Process (GO BP, levels 3–8) terms for D. rerio updated on 13/05/2017; Benjamini–Hochberg false discovery rate (FDR) correction with only terms with FDR ≤ 0.05 and a minimum of three genes/4% being considered significant; Initial group size (for grouping into functionally related networks of enriched terms) set as 1 and group merging at 50%, with a Kappa-statistics score threshold set at 0.4. The enrichment score for each functionally related network group was calculated as -Log2 (group FDR). The leading terms for each group were selected based on their highest enrichment score (lowest term FDR) and were used for group naming in tables and condensed bar plot representations and for evaluation of treatment specific enrichment. Parallel enrichment analyses were carried out for the Kyoto Encyclopaedia of Genes and Genomes (KEGG) pathways for each DE list, using the same parameters and strategy as described for GO BP.

Specifications Table

Subject areaBiology
More specific subject areaAquaculture, Ecotoxicology, Environment, Marine fisheries
Type of dataTables, figures
How data was acquiredIllumina Hi-Seq 1500
Data formatRaw, metadata
Experimental factorsMarine cultured sea bass were exposed to estradiol and genistein by intraperitoneal injection and their scales collected after 1 or 5 days
Experimental featuresRNA extraction, quality evaluation, RNA-seq library preparation, sequencing and bioinformatics analysis
Data source locationFaro, Algarve, Portugal (37° 1′ 0″ N;7° 56′ 0″W)
Data accessibilityData is available in this article and at the NCBI Sequence Read Archive (SRA), project SRP102504 (https://www.ncbi.nlm.nih.gov/sra/SRP102504). The links for each dataset of replicates for the six experimental groups are:https://www.ncbi.nlm.nih.gov/sra/SRX2673957[accn] for C1d,https://www.ncbi.nlm.nih.gov/sra/SRX2673962[accn] for E1d,https://www.ncbi.nlm.nih.gov/sra/SRX2674307[accn] for Gen1d,https://www.ncbi.nlm.nih.gov/sra/SRX2674405[accn] for C5d,https://www.ncbi.nlm.nih.gov/sra/SRX2674467[accn]for E5d andhttps://www.ncbi.nlm.nih.gov/sra/SRX2675383[accn]for Gen5d.
Related research article“Genistein and estradiol have common and specific impacts on the sea bass (Dicentrarchus labrax) skin-scale barrier” [1].
Value of the Data

This is the first comprehensive study of the transcriptome of fish scales, an estrogen-target tissue for which information is still limited and a promising practical model for environmental pollution screening and chemical risk assessment.

The dataset has relevance for aquaculture and for toxicology/ecotoxicology/environmental studies, since altered levels of hormones (including synthetic, anthropogenic hormones) could affect the development and homeostasis of multiple marine species.

The dataset has broad applicability as it is from the European sea bass, which is a representative of the species-rich order Perciformes and an important species for marine fisheries and aquaculture.

Global transcriptomes of scales from the European sea bass, exposed for 1 and 5 days to estradiol (a natural estrogen) or genistein (a phytoestrogen) is of use for aquaculture, ichthyologists, toxicologists and comparative endocrinologists. Since estrogenic compounds and genistein can be found in aquatic environments the dataset is of use for aquaculture (e.g increased ingestion of phytoestrogens from plant-based ingredients used in fish feeds) and the comparison of genes and pathways regulated or disrupted by estradiol and genistein is of great interest for studies related to physiology, ecotoxicology/environmental studies or aquaculture in fish or other marine species.

The identified sets of differentially expressed genes in sea bass scales after exposure to estradiol or genistein provide a source of potential biomarkers for assessing exposure to estrogens, phytoestrogens or other estrogenic pollutants, and this non-invasive approach complies with the 3R principal and is under validation for environmental pollution.

  12 in total

1.  The Ensembl automatic gene annotation system.

Authors:  Val Curwen; Eduardo Eyras; T Daniel Andrews; Laura Clarke; Emmanuel Mongin; Steven M J Searle; Michele Clamp
Journal:  Genome Res       Date:  2004-05       Impact factor: 9.043

2.  Open source clustering software.

Authors:  M J L de Hoon; S Imoto; J Nolan; S Miyano
Journal:  Bioinformatics       Date:  2004-02-10       Impact factor: 6.937

3.  Tissue responsiveness to estradiol and genistein in the sea bass liver and scale.

Authors:  Patrícia I S Pinto; M Dulce Estêvão; André Andrade; Soraia Santos; Deborah M Power
Journal:  J Steroid Biochem Mol Biol       Date:  2015-12-21       Impact factor: 4.292

4.  Genistein and estradiol have common and specific impacts on the sea bass (Dicentrarchus labrax) skin-scale barrier.

Authors:  Patricia I S Pinto; André R Andrade; Catarina Moreira; Cinta Zapater; Michael A S Thorne; Soraia Santos; M Dulce Estêvão; Ana Gomez; Adelino V M Canario; Deborah M Power
Journal:  J Steroid Biochem Mol Biol       Date:  2019-08-14       Impact factor: 4.292

5.  Multiple sequence alignment with hierarchical clustering.

Authors:  F Corpet
Journal:  Nucleic Acids Res       Date:  1988-11-25       Impact factor: 16.971

6.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.

Authors:  Cole Trapnell; Adam Roberts; Loyal Goff; Geo Pertea; Daehwan Kim; David R Kelley; Harold Pimentel; Steven L Salzberg; John L Rinn; Lior Pachter
Journal:  Nat Protoc       Date:  2012-03-01       Impact factor: 13.491

7.  BioVenn - a web application for the comparison and visualization of biological lists using area-proportional Venn diagrams.

Authors:  Tim Hulsen; Jacob de Vlieg; Wynand Alkema
Journal:  BMC Genomics       Date:  2008-10-16       Impact factor: 3.969

8.  European sea bass genome and its variation provide insights into adaptation to euryhalinity and speciation.

Authors:  Mbaye Tine; Heiner Kuhl; Pierre-Alexandre Gagnaire; Bruno Louro; Erick Desmarais; Rute S T Martins; Jochen Hecht; Florian Knaust; Khalid Belkhir; Sven Klages; Roland Dieterich; Kurt Stueber; Francesc Piferrer; Bruno Guinand; Nicolas Bierne; Filip A M Volckaert; Luca Bargelloni; Deborah M Power; François Bonhomme; Adelino V M Canario; Richard Reinhardt
Journal:  Nat Commun       Date:  2014-12-23       Impact factor: 14.919

9.  UniProt: the universal protein knowledgebase.

Authors: 
Journal:  Nucleic Acids Res       Date:  2016-11-29       Impact factor: 16.971

10.  ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks.

Authors:  Gabriela Bindea; Bernhard Mlecnik; Hubert Hackl; Pornpimol Charoentong; Marie Tosolini; Amos Kirilovsky; Wolf-Herman Fridman; Franck Pagès; Zlatko Trajanoski; Jérôme Galon
Journal:  Bioinformatics       Date:  2009-02-23       Impact factor: 6.937

View more
  1 in total

1.  Disclosing proteins in the leaves of cork oak plants associated with the immune response to Phytophthora cinnamomi inoculation in the roots: A long-term proteomics approach.

Authors:  Ana Cristina Coelho; Rosa Pires; Gabriela Schütz; Cátia Santa; Bruno Manadas; Patrícia Pinto
Journal:  PLoS One       Date:  2021-01-22       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.