| Literature DB >> 25493933 |
Rong-Lin Wang1, David C Bencic2, Natàlia Garcia-Reyero3, Edward J Perkins4, Daniel L Villeneuve5, Gerald T Ankley5, Adam D Biales2.
Abstract
Fathead minnow and zebrafish are among the most intensively studied fish species in environmental toxicogenomics. To aid the assessment and interpretation of subtle transcriptomic effects from treatment conditions of interest, better characterization and understanding are needed for natural variation in gene expression among fish individuals from lab cultures. Leveraging the transcriptomics data from a number of our toxicogenomics studies conducted over the years, we conducted a meta-analysis of nearly 600 microarrays generated from the ovary tissue of untreated, reproductively mature fathead minnow and zebrafish samples. As expected, there was considerable batch-to-batch transcriptomic variation; this "batch-effect" appeared to differentially impact subsets of fish transcriptomes in a nonsystematic way. Temporally more closely spaced batches tended to share a greater transcriptomic similarity among one another. The overall level of within-batch variation was quite low in fish ovary tissue, making it a suitable system for studying chemical stressors with subtle biological effects. The observed differences in the within-batch variability of gene expression, at the levels of both individual genes and pathways, were probably both technical and biological. This suggests that biological interpretation and prioritization of genes and pathways targeted by experimental conditions should take into account both their intrinsic variability and the size of induced transcriptional changes. There was significant conservation of both the genomes and transcriptomes between fathead minnow and zebrafish. The high degree of conservation offers promising opportunities in not only studying fish molecular responses to environmental stressors by a comparative biology approach, but also effective sharing of a large amount of existing public transcriptomics data for developing toxicogenomics applications.Entities:
Mesh:
Year: 2014 PMID: 25493933 PMCID: PMC4262388 DOI: 10.1371/journal.pone.0114178
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Factors and batches (N) by which fathead minnow (PPR) and zebrafish (DRE) microarray samples were organized.
| Species | Experiment | Sampling Date | RNA Date | RNA Person | Scan Date |
| N = 9 | N = 9 | N = 4 | N = 2 | N = 5 | |
| DRE (80) | CTL_FAD24 (5) | CTL_2004_12 (10) | CTL_2005_1 (10) | CTL_A (65) | CTL_2007_12 (47) |
| CTL_FIP48_96 (10) | CTL_2005_1 (5) | CTL_2005_3 (5) | CTL_B (15) | CTL_2007_2 (4) | |
| CTL_FLU48_96 (10) | CTL_2006_10 (10) | CTL_2006_11 (30) | CTL_2007_3 (24) | ||
| CTL_KTC24_48_96 (15) | CTL_2006_4 (10) | CTL_2007_6 (35) | CTL_2007_4 (2) | ||
| CTL_MUSC96 (5) | CTL_2006_5 (10) | CTL_2008_4 (3) | |||
| CTL_PRO48_96 (10) | CTL_2006_9 (10) | ||||
| CTL_TRB24_48 (10) | CTL_2007_2 (5) | ||||
| CTL_TRI96 (5) | CTL_2007_4 (5) | ||||
| CTL_VIN48_96 (10) | CTL_2007_5 (15) | ||||
| N = 23 | N = 16 | N = 17 | N = 10 | N = 18 | |
| PPR (511) | BPA_NOTEL (6) | 2007_12 (4) | 11111 (12) | DDD (20) | 2008_10 (10) |
| FAD_I_Acute (12) | 2007_2 (12) | 2007_12 (58) | A (32) | 2008_11 (22) | |
| FAD_III_Acute (20) | 2007_3 (72) | 2007_4 (12) | B (124) | 2008_12 (62) | |
| FAD_Phase3 (60) | 2007_6 (54) | 2007_6 (60) | B_ROBOT (54) | 2008_4 (12) | |
| FLU_II_Acute (20) | 2007_7 (24) | 2008_11 (19) | C (24) | 2008_5 (10) | |
| FLU_Phase3 (38) | 2008_1 (54) | 2008_2 (12) | J (23) | 2008_6 (132) | |
| GEM (5) | 2008_2 (12) | 2008_6 (124) | L (126) | 2008_7 (38) | |
| KTC_I_Acute (12) | 2008_4 (51) | 2008_7 (44) | R (60) | 2008_8 (26) | |
| KTC_IV_Acute (28) | 2008_5 (78) | 2008_9 (32) | X (36) | 2009_1 (40) | |
| KTCv2_Phase3 (39) | 2008_8 (52) | 2009_1 (20) | YYY (12) | 2009_10 (6) | |
| PRO_I_Acute (12) | 2009_1 (5) | 2009_10 (44) | 2009_11 (39) | ||
| PRO_Phase3 (54) | 2009_10 (67) | 2009_11 (29) | 2009_2 (30) | ||
| RDX_Repro (4) | 2009_6 (4) | 2009_7 (9) | 2009_6 (23) | ||
| TNT_KTC_Acute (24) | 2009_7 (10) | 2009_9 (5) | 2009_7 (8) | ||
| TRB_BPA (6) | 2009_9 (6) | 2010_02 (6) | 2009_9 (14) | ||
| TRB_EE2 (4) | 2010_1 (6) | 2010_1 (5) | 2010_1 (28) | ||
| TRB_Phase3 (54) | 55555 (20) | 2010_2 (5) | |||
| TRB_TCC (5) | 2010_7 (6) | ||||
| TRI_II_Acute (20) | |||||
| TRI_Phase3 (32) | |||||
| VIN_II_Acute (19) | |||||
| VIN_Phase3 (32) | |||||
| WLSSD (5) |
Sample sizes were indicated in parenthesis.
BPA, bisphenol-A; EE2, 17a-ethynyl estradiol; FAD, fadrozole; FIP, fipronil; FLU, flutamide; GEM, Gemfibrozil; KTC, ketoconazole; PRO, prochloraz; RDX, hexahydro-1,3,5-trinitro-1,3,5-triazine; TRB, 17 -trenbolone; TNT, 2,4,6-trinitrotoluene; TRI, trilostane; VIN, vinclozolin; WLSSD, effluent from Western Lake Superior Sanitary District; RNA Date not determined: 11111, 55555; RNA Person not determined: DDD, YYY.
Between-batch variation as measured by average number of DEGs (standard deviations where N>3).
| Species/sample condition | Experiment | Sampling Date | Scan Date | RNA Date | RNA Person |
| PPR | 2360 (1033) | 1498 (1032) | 1123 (1102) | 1907 (756) | 434 (525) |
| DRE | 2754 (1684) | 2754 (1684) | 2845 (2320) | 4403 (2435) | 5593 |
The DEGs were identified in paired comparisons of N batches against a simulated reference made up of the batch means of their respective factors. DRE, zebrafish; PPR, fathead minnow.
Between-batch variation as measured by average number of DEGs (standard deviations where N>3).
| Species/sample condition | Experiment | Sampling Date | Scan Date | RNA Date | RNA Person |
| PPR | 4517 (889) | 4312 (1009) | 4640 (1616) | 4667 (941) | 5872 (2045) |
| DRE | 2523 (1651) | 2841 (2141) | 6321 | 5503 | 3669 |
The DEGs were identified in paired comparisons of N–1 batches against a batch designated as a common reference. N is the total number of batches in a factor. The comparisons were made in conjunction with the analysis of within-batch effects involving 250 permutations. There were little variations in between-batch effects among permutations so their calculations were made only from the first permutation. DRE, zebrafish; PPR, fathead minnow.
Within-batch variation as measured by the maximum (% of transcriptome) and minimum number of DEGs per permutation.
| DEGs perpermutation | Experiment | Experiment | SamplingDate | SamplingDate | ScanDate | ScanDate | RNADate | RNADate | RNAPerson | RNAPerson |
| PPR | DRE | PPR | DRE | PPR | DRE | PPR | DRE | PPR | DRE | |
| Maximum | 267 (1.8) | 52 (0.2) | 393 (2.6) | 10 (0.05) | 25 (0.2) | 1120 (5.2) | 156 (1.0) | 192 (0.9) | 883 (5.8) | 3613 (16.8) |
| Minimum | 0 | 0 | 0 | 0 | 0 | 216 | 0 | 0 | 0 | 1153 |
There were 250 permutations conducted under individual factors. Between-batch variation was controlled statistically in these analyses. PPR, fathead minnow; DRE, zebrafish.
Figure 1Estimation of within-batch variation.
Coefficients of variation (CV) were computed at various intensities of 15208 fathead minnow probes (A) and 21495 zebrafish probes (B).
Within-batch variation as measured by coefficient of variation (CV).
| CV range | DRE probes (% total) | PPR probes (% total) | DRE probes orthologous to PPR | PPR probes orthologous to DRE |
| <0.01 | 409 (1.9) | 3 (0.02) | 344 (3.7) | 1 (0) |
| 0.01–0.049 | 10721 (49.9) | 7625 (50.1) | 5956 (64.0) | 3825 (55) |
| 0.05–0.059 | 3651 (17.0) | 2738 (18) | 922 (9.9) | 1166 (16.8) |
| 0.06–0.069 | 2728 (12.7) | 1661 (10.9) | 738 (7.9) | 662 (9.5) |
| 0.07–0.079 | 1573 (7.3) | 1065 (7.0) | 440 (4.7) | 429 (6.2) |
| 0.08–0.089 | 856 (4.0) | 664 (4.4) | 276 (3.0) | 278 (4.0) |
| 0.09–0.099 | 491 (2.3) | 466 (3.1) | 168 (1.8) | 183 (2.6) |
| 0.1–0.199 | 896 (4.2) | 909 (6.0) | 406 (4.4) | 379 (5.5) |
| ≥0.2 | 170 (0.8) | 77 (0.5) | 61 (0.7) | 27 (0.4) |
| Total | 21495 | 15208 | 9311 | 6950 |
The CVs were calculated for each probe by individual batches under the Experiment factor and averaged over all batches. Further averaging these CVs across the entire transcriptome yielded an overall CV of 0.056 for fathead minnow (PPR) and 0.051 for zebrafish (DRE). The total number of orthologous genes identified between Agilent 015064 and 019597 was 6617, represented by 9311 and 6950 unique probes respectively. The PPR probes mapped to their EST sequences and ESTs to NCBI databases by BLAST all had a minimum E-value of E-06.
Figure 2Intraspecific correlation between the average CV and average intensity by KEGG pathways.
A total of 136 pathways (eight outliers excluded) were included for fathead minnow (A) and 144 pathways for zebrafish (B). The CCs were −0.68 and −0.70 respectively, with both p-values = 0. The p-values of normality test of error distribution for linear regressions were 0.094 (no significant departure from normality) and 0 (significant non-normality) respectively.
Figure 3Interspecific correlation by within-batch intensity and coefficient of variation of orthologs.
The within-batch intensities (A) and coefficients of variation (CV; B) were based on 6617 orthologous genes. The orthologs were represented by 9311 zebrafish (DRE) and 6950 fathead minnow (PPR) probes. The intensity and CV of an ortholog with duplicated probes were probe means. The correlation coefficients over the orthologs for the two metrics were 0.49 and 0.33 respectively, with the both p-values = 0.
Figure 4Interspecific correlation by average intensity and average coefficient of variation of individual pathways.
A total of 84 (three outliers excluded) KEGG pathways were calculated for their average intensities (A), and 53 (two outliers excluded) pathways for their average CVs (B), based on a combined total of 6617 orthologous genes. To be included, each pathway must have at least five orthologs and a p-value of ≤0.1 for the correlation of the intensities or CVs of its member genes as estimated within a batch. The CCs were 0.86 and 0.80 for the average intensity and average CV by pathway respectively, with the both p-values = 0. The p-values of normality test of error distribution for linear regressions were 0.045 and 0.585 respectively.