| Literature DB >> 17925033 |
William O Ward1, Carol D Swartz, Steffen Porwollik, Sarah H Warren, Nancy M Hanley, Geremy W Knapp, Michael McClelland, David M DeMarini.
Abstract
BACKGROUND: Deficiencies in microarray technology cause unwanted variation in the hybridization signal, obscuring the true measurements of intracellular transcript levels. Here we describe a general method that can improve microarray analysis of toxicant-exposed cells that uses the intrinsic power of transcriptional coupling and toxicant concentration-expression response data. To illustrate this approach, we characterized changes in global gene expression induced in Salmonella typhimurium TA100 by 3-chloro-4-(dichloromethyl)-5-hydroxy-2(5H)-furanone (MX), the primary mutagen in chlorinated drinking water. We used the co-expression of genes within an operon and the monotonic increases or decreases in gene expression relative to increasing toxicant concentration to augment our identification of differentially expressed genes beyond Bayesian-t analysis.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17925033 PMCID: PMC2225428 DOI: 10.1186/1471-2105-8-378
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Average mutagenicity of MX in Salmonella TA100 from four experiments. The average % survival of log-phase cells was 95, 97, and 73% at 1.15-, 2.3-, and 4.6-μM MX, respectively.
Figure 2Principal component analysis of differentially expressed genes in MX-treated Salmonella TA100. PCA#1 accounts for 31% of the variation in the data and PCA#2 accounts for 16%.
Distribution of genes among a range of p-values identified by various analyses
| Percent (%) of genes distributed across the range of | ||||
| Range of | All genes in the genome* | All genes in selected operons† | Non-significant genes in selected operons‡ | Genes identified by monotonic analysis§ |
| 0.0 – 0.2 | 15 | 72 | 37 | 94 |
| 0.2 – 0.4 | 17 | 11 | 24 | 6 |
| 0.4 – 0.6 | 20 | 6 | 14 | 0 |
| 0.6 – 0.8 | 23 | 5 | 12 | 0 |
| 0.8 – 1.0 | 25 | 6 | 13 | 0 |
*p-values were calculated by CyberT comparisons of gene expression between the control cells and those treated with the highest concentration of MX (4.6 μM) for all 4253 genes analyzed. Thus, 4253 was the denominator used to calculate the percentages in this column. CyberT analysis identified 169 genes having p-values < 0.05.
†Selected operons were those that had at least one gene with a consistent CyberT-calculated p-value < 0.05. Of the 169 genes identified by CyberT analysis, 86 resided in a total of 54 multi-gene operons, and the remaining 83 genes were in single-gene operons. Adding the additional genes on the 54 multi-gene operons (143) to those identified solely by CyberT (169) brought the total genes considered for analysis by the operon method to 312, which was the denominator used to calculate the percentages in this column. A Monte Carlo simulation, repeated 1500 times using a sample size of 312, randomly selected from 4253 p-values, and sorted into the p-value bins shown in column 1, produced a p-value distribution the same to the nearest percentage point as that shown in the second column, i.e., a distribution essentially identical to that found for all genes in the genome.
‡There were 161 genes residing in the selected operons that did not have p-values < 0.05. Eleven percent of these genes had missing data and, thus, did not appear in the table. Thus, 143 was the denominator used to calculate the percentages in this column. A Monte Carlo simulation repeated 1500 times using a sample size of 143 produced a p-value distribution the same to the nearest percentage point as that shown in the second column, i.e., a distribution essentially identical to that found for all genes in the genome.
§The monotonic analysis alone identified 309 genes as being differentially expressed. Of these, 153 were uniquely identified by the monotonic analysis; the other 156 had been identified by either the CyberT and/or operon analyses. Thus, the monotonic analysis added an additional 153 genes for analysis, and this number was the denominator used to calculate the percentages shown in this column.
Analysis of genes considered differentially expressed on the 54 operons identified by the operon analysis
| Operon number* | Number of genes in operon | Smallest | FC‡ | Largest | FC¶ | % In same direction¥ | % with | % Missing genes# |
| 1123 | 6 | 4.94E-02 | -1.9 | 0.21 | 1.5 | 17 | 17 | 17 |
| 865 | 4 | 2.83E-02 | -1.9 | 0.73 | 1.1 | 25 | 25 | 25 |
| 369 | 3 | 4.65E-02 | -1.6 | 33 | 33 | 67 | ||
| 1604 | 4 | 2.57E-02 | 2.0 | 0.18 | 1.7 | 50 | 25 | 50 |
| 617 | 2 | 3.59E-02 | 2.2 | 0.85 | -1.1 | 50 | 50 | 0 |
| 638 | 2 | 1.19E-02 | 2.4 | 50 | 50 | 50 | ||
| 747 | 2 | 4.13E-02 | 1.7 | 0.65 | -1.1 | 50 | 50 | 0 |
| 1603 | 4 | 1.83E-02 | 3.2 | 0.02 | 3.2 | 50 | 50 | 50 |
| 2136 | 2 | 4.94E-02 | -1.6 | 0.41 | 1.2 | 50 | 50 | 0 |
| 435 | 6 | 9.90E-03 | -2.1 | 0.80 | 1.1 | 67 | 17 | 17 |
| 562 | 3 | 2.20E-03 | 7.5 | 0.97 | 1.0 | 67 | 33 | 33 |
| 1586 | 3 | 5.47E-03 | 3.1 | 0.39 | -1.3 | 67 | 33 | 0 |
| 2535 | 3 | 2.38E-03 | 2.4 | 0.13 | 1.5 | 67 | 33 | 33 |
| 2213 | 7 | 1.65E-02 | -3.7 | 0.02 | -3.7 | 71 | 29 | 0 |
| 1301 | 11 | 4.01E-02 | 2.7 | 0.86 | -1.1 | 73 | 9 | 18 |
| 35 | 4 | 4.92E-02 | 1.9 | 0.80 | -1.1 | 75 | 25 | 0 |
| 1014 | 4 | 3.48E-02 | -1.7 | 0.48 | 1.3 | 75 | 25 | 0 |
| 619 | 4 | 2.28E-02 | 2.3 | 0.57 | -1.1 | 75 | 75 | 0 |
| 867 | 13 | 1.05E-02 | -2.2 | 0.20 | 1.4 | 77 | 15 | 8 |
| 1279 | 20 | 4.36E-02 | 1.6 | 0.59 | -1.2 | 80 | 5 | 5 |
| 374 | 5 | 4.26E-02 | 2.3 | 0.64 | -1.1 | 80 | 20 | 0 |
| 615 | 5 | 3.13E-03 | 5.1 | 0.68 | 1.1 | 80 | 60 | 20 |
| 119 | 6 | 1.77E-02 | 3.0 | 0.81 | -1.1 | 83 | 17 | 0 |
| 1754 | 6 | 4.62E-02 | 1.7 | 0.88 | 1.1 | 83 | 17 | 17 |
| 866 | 9 | 1.45E-02 | -1.9 | 0.84 | 1.1 | 89 | 22 | 0 |
| 1275 | 20 | 3.33E-02 | -1.6 | 0.54 | -1.1 | 100 | 10 | 0 |
| 2495 | 6 | 4.39E-02 | 2.9 | 1.00 | 1.0 | 100 | 17 | 0 |
| 858 | 4 | 3.87E-02 | -1.8 | 0.18 | -1.4 | 100 | 25 | 0 |
| 868 | 4 | 4.73E-02 | -1.7 | 0.65 | -1.2 | 100 | 25 | 0 |
| 336 | 3 | 2.46E-02 | -1.8 | 0.52 | -1.1 | 100 | 33 | 0 |
| 1090 | 3 | 4.84E-02 | -1.6 | 0.11 | -1.5 | 100 | 33 | 0 |
| 1508 | 6 | 7.64E-03 | -2.0 | 0.50 | -1.2 | 100 | 33 | 0 |
| 2178 | 3 | 3.87E-02 | -1.8 | 0.72 | -1.1 | 100 | 33 | 0 |
| 2384 | 3 | 3.69E-02 | 1.7 | 0.08 | 1.5 | 100 | 33 | 0 |
| 62 | 2 | 1.03E-03 | 2.8 | 0.31 | 1.3 | 100 | 50 | 0 |
| 441 | 2 | 6.63E-04 | 4.8 | 0.53 | 1.2 | 100 | 50 | 0 |
| 543 | 2 | 5.49E-03 | 3.5 | 0.50 | 1.2 | 100 | 50 | 0 |
| 1201 | 2 | 3.66E-03 | 2.2 | 0.11 | 1.4 | 100 | 50 | 0 |
| 1376 | 2 | 4.88E-02 | 46.2 | 0.24 | 1.6 | 100 | 50 | 0 |
| 2181 | 4 | 2.91E-02 | -1.7 | 0.08 | -1.5 | 100 | 50 | 0 |
| 2389 | 2 | 5.75E-03 | 2. | 0.35 | 1.4 | 100 | 50 | 0 |
| 722 | 9 | 1.65E-02 | -2.0 | 0.12 | -1.5 | 100 | 67 | 0 |
| 1284 | 3 | 3.83E-02 | -2.7 | 0.05 | -2.8 | 100 | 67 | 0 |
| 1499 | 4 | 2.70E-02 | 2.4 | 0.07 | 1.8 | 100 | 75 | 0 |
| 1761 | 4 | 1.33E-03 | 4.2 | 0.63 | 1.1 | 100 | 75 | 0 |
| 550 | 4 | 7.51E-04 | 4.1 | 0.05 | 2.1 | 100 | 100 | 0 |
| 789 | 2 | 4.21E-02 | -1.6 | 0.45 | -1.5 | 100 | 100 | 0 |
| 881 | 2 | 2.67E-04 | 3.1 | 0.01 | 2.0 | 100 | 100 | 0 |
| 911 | 2 | 4.46E-02 | -2.3 | 0.05 | -1.9 | 100 | 100 | 0 |
| 1260 | 2 | 8.08E-06 | 8.6 | 0.00 | 14.3 | 100 | 100 | 0 |
| 1652 | 3 | 2.79E-05 | 14.8 | 0.01 | 2.6 | 100 | 100 | 0 |
| 1705 | 2 | 3.15E-02 | -1.7 | 0.04 | -1.6 | 100 | 100 | 0 |
| 1766 | 2 | 2.73E-03 | 2.7 | 0.01 | 2.6 | 100 | 100 | 0 |
| 2411 | 2 | 4.47E-03 | 2.4 | 0.01 | 1.8 | 100 | 100 | 0 |
*Table 3 contains a list of the genes in each of these operons.
†The lowest p-value from the CyberT analysis in each operon comparing the high concentration of MX (4.6 μM) to the control.
‡The fold change for the transcript with the lowest p-value in the operon.
§ The p-value for the transcript with the highest p-value in the operon.
¶ The fold change for the transcript with the highest p-value in the operon.
¥The percent of transcripts in the operon with the same direction of change as the transcript with the lowest p-value in the operon.
#The percent of genes where the intensity measurements were equal to or less than background.
These operons had the majority of their genes expressing in the direction opposite that of a gene with a p-value < 0.05. Operon #2213 had two genes with p-values < 0.05 that changed in opposite directions.
Genes in 54 operons identified by operon analysis
| Operon number | Genes in operon |
| 35 | yaaY-ribF-ileS-lspA |
| 62 | polB-STM0098 |
| 119 | stfC-stfD-stfE-stfF-stfG-STM0201 |
| 336 | ylbA-allC-allD |
| 369 | fes-ybdZ-entF |
| 374 | entC-entE-entB-entA-ybdB |
| 435 | kdpE-kdpD-kdpC-kdpB-kdpA-STM0707 |
| 441 | STM0717-STM0718 |
| 543 | STM0893-STM0894 |
| 550 | STM0900-STM0901-STM0902-STM0903 |
| 562 | STM0925-STM0926-STM0927 |
| 615 | STM1005-STM1006-STM1007-STM1008-STM1009 |
| 617 | STM1011-STM1012 |
| 619 | STM1014-STM1015-STM1016-STM1017 |
| 638 | STM1049-STM1050 |
| 722 | flgB-flgC-flgD-flgE-flgF-flgG-flgH-flgI-flgJ |
| 747 | pepT-STM1228 |
| 789 | yeaK-yeaJ |
| 858 | orf48-orf32-orf245-orf408 |
| 865 | ssaB-ssaC-ssaD-ssaE |
| 866 | sseA-sseB-sscA-sseC-sseD-sseE-sscB-sseF-sseG |
| 867 | ssaG-ssaH-ssaI-ssaJ-STM1410-ssaK-ssaL-ssaM-ssaV-ssaN-ssaO-ssaP-ssaQ |
| 868 | ssaR-ssaS-ssaT-ssaU |
| 881 | nemA-ydhM |
| 911 | ydgF-ydgE |
| 1014 | STM1633-STM1634-STM1635-STM1636 |
| 1090 | STM1733-yciC-yciB |
| 1123 | STM1786-STM1787-STM1788-STM1789-STM1790-STM1791 |
| 1201 | ruvB-ruvA |
| 1260 | umuC-umuD |
| 1275 | cobT-cobS-cobU-cbiP-cbiO-cboQ-cbiN-cbiM-cbiL-cbiK-cbiJ-cbiH-cbiG-cbiF-cbiT-cbiE-cbiD-cbiC-cibB-cbiA |
| 1279 | pudB-pduC-pduD-pduE-pduG-pduH-pduJ-pduK-pduL-pduM-pduN-pduO-pduP-pduQ-pduS-pduT-pduU-pduV-pduW-pduX |
| 1284 | phsC-phsB-phsA |
| 1301 | manC-wcaI-wcaH-wcaG-gmd-wcaF-wcaE-wcaD-wcaC-wcaB-wcaA |
| 1376 | STM2236-STM2237 |
| 1499 | cysA-cysW-cysU-cysP |
| 1508 | eutM-eutD-eutT-eutQ-eutP-eutS |
| 1586 | STM2586-STM2587-STM2588 |
| 1603 | STM2628-STM2629-STM2630-STM2631 |
| 1604 | STM2632-STM2633-STM2634-STM2635 |
| 1652 | STM2726-STM2727-STM2728 |
| 1705 | proV-proW |
| 1754 | STM2913-STM2914-ygbM-ygbL-ygbK-ygbJ |
| 1761 | ygbE-cysC-cysN-cysD |
| 1766 | cysI-cysJ |
| 2136 | yhhK-STM3566 |
| 2178 | yhjS-yhjT-yhjU |
| 2181 | dppF-dppD-dppC-dppB |
| 2213 | yiaM-yiaN-yiaO-lyxK-sgbH-sgbU-sgbE |
| 2384 | ubiE-yigP-aarF |
| 2389 | STM3980-STM3981 |
| 2411 | STM4030-STM4031 |
| 2495 | thiH-thiG-STM4161-thiF-thiE-thiC |
| 2535 | dinF-STM4239-yjbJ |
Figure 3Monotonically increasing gene expression in MX-treated Salmonella TA100. The x-axis represents the concentration of MX at 4 concentrations, 0-, 1.15-, 2.3-, and 4.6-μM MX, respectively. The log-scaled y-axis represents expression values that are the mean of 12 background-corrected intensities (4 biological replicates and 3 technical replicates for each biological replicate) normalized to the DNA reference. Double asterisks represent those genes that are known members of the LexA regulon.
Figure 4Monotonically decreasing gene expression in MX-treated Salmonella TA100. The x-axis represents the concentration of MX at 4 concentrations, 0-, 1.15-, 2.3-, and 4.6-μM MX, respectively. The log-scaled y-axis represents expression values that are the mean of 12 background-corrected intensities (4 biological replicates and 3 technical replicates for each biological replicate) normalized to the DNA reference.
Fold change in expression of genes in LexA regulon due to MX treatment and normalized to control
| μM MX | ||||
| Gene | 0.25 | 0.5 | 1 | |
| 3.2 | 5.8 | 8.6 | ||
| 0.00003 | 5.5 | 11.1 | 14.8 | |
| 0.00003 | 3.7 | 4.8 | 4.8 | |
| 0.00004 | 4.4 | 7.6 | 12.6 | |
| 0.00004 | 5.2 | 8.3 | 14.3 | |
| 0.00005 | 4.8 | 8.4 | 8.9 | |
| 0.00016 | 8.7 | 14 | 16.9 | |
| 0.00019 | 5.5 | 8.8 | 13.5 | |
| 0.00035 | 2.9 | 3.5 | 4 | |
| 0.00102 | -1.3 | 1.9 | 2.5 | |
| 0.00103 | 2.3 | 3.1 | 2.8 | |
| 0.00118 | 2.2 | 2.7 | 3 | |
| 0.00129 | 1.7 | 2.1 | 2.5 | |
| 0.00219 | 4 | 6.3 | 7.8 | |
| 0.00241 | 1.4 | 1.8 | 2.2 | |
| 0.00366 | 1.7 | 2.3 | 2.2 | |
| 0.00612 | 4.1 | 7.2 | 7.4 | |
| 0.00629 | 1.6 | 1.9 | 2.2 | |
| 0.02699 | 8 | 11.4 | 16 | |
| 0.05804 | 1.1 | 0.8 | 9.6 | |
| 0.18374 | 1.3 | 1.2 | 1.4 | |
| 0.20943 | 0.9 | 1.3 | 1.5 | |
| 0.4182 | 1 | 0.7 | 1.2 | |
| 0.43548 | 3.7 | 0.7 | 1.3 | |
| 0.46889 | 1 | 1 | 0.9 | |
| 0.7057 | 0.9 | 1 | 0.9 | |
| 0.70953 | 1 | 1.1 | 1.1 | |
| 0.80643 | 0.1 | 0.9 | 0.9 | |
| 0.91485 | 1 | 0.9 | 1 | |
| 0.97301 | 1 | 0.9 | 1 | |
| No Data | ||||
| No Data | ||||
| 1.8 | 1.4 | 1.9 | ||
*The p-value was determined by a Cyber-T t-test of control to the high concentration of MX.
Augmentation of differentially expressed genes in KEGG pathways by addition of operon and monotonic analyses to CyberT analysis
| CyberT | CyberT + operon + monotonic | |||
| KEGG Pathway* | No. genes | No. genes | ||
| Porphyrin metabolism | 2 | 0.230 | 19 | 4.75 × 10-9 |
| Flagellar assembly | 7 | 0.027 | 15 | 3.27 × 10-5 |
| Type III secretion system | 4 | 0.132 | 12 | 8.35 × 10-5 |
| Sulfur metabolism | 6 | 0.0002 | 7 | 0.0002 |
| Thiamine metabolism | 1 | 0.382 | 6 | 0.003 |
| Nitrogen metabolism | 0 | NA | 1 | 0.004 |
| Biosynthesis of siderophore group peptides | 1 | 0.338 | 4 | 0.005 |
| Pyruvate metabolism | 0 | NA | 1 | 0.008 |
| Two-component system | 0 | NA | 1 | 0.010 |
| ABC transporters, prokaryotic | 11 | 0.120 | 27 | 0.011 |
| Selenoamino acid metabolism | 5 | 0.010 | 6 | 0.014 |
| Oxidative phosphorylation | 0 | NA | 1 | 0.018 |
| Other ion-coupled transporters | 2 | 0.013 | 6 | 0.035 |
| Ascorbate and aldarate metabolism | 1 | 0.338 | 3 | 0.035 |
| Arginine and proline metabolism | 0 | NA | 1 | 0.037 |
| HTH family transcriptional regulators | 1 | 0.072 | 2 | 0.043 |
*Functions are ordered according to increasing p-value associated with operon and monotonic analyses.
Augmentation of differentially expressed genes in TIGR functional groups by addition of operon and monotonic analyses to CyberT analysis
| CyberT | CyberT + operon + monotonic | |||
| TIGR function* | No. genes | No. genes | ||
| Protein and peptide secretion and trafficking | 6 | 0.013 | 21 | 1.86 × 10-9 |
| Heme, porphyrin, and cobalamin biosynthesis | 1 | 0.335 | 15 | 4.83 × 10-7 |
| Thiamine biosynthesis | 2 | 0.106 | 8 | 2.41 × 10-5 |
| Transposon functions | 14 | 0 | 16 | 0.0001 |
| Regulatory functions | 3 | 0.011 | 11 | 0.003 |
| Unknown | 41 | 0.003 | 115 | 0.010 |
| Cell division; Prophage functions | 2 | 0.002 | 2 | 0.011 |
| Chemotaxis and motility | 3 | 0.170 | 9 | 0.016 |
| Anions transport and binding | 4 | 0.013 | 6 | 0.018 |
| Central intermediary metabolism | 4 | 0.199 | 16 | 0.022 |
| Biosynthesis of murein sacculus and peptidoglycan | 1 | 0.266 | 1 | 0.033 |
| DNA replication, recombination, and repair | 13 | 0.0003 | 15 | 0.040 |
| Sulfur metabolism | 4 | 0.003 | 4 | 0.042 |
| Amino acids and amines metabolism | 0 | NA | 1 | 0.044 |
| Surface structures | 1 | 0.229 | 9 | 0.047 |
| Transport and binding proteins | 0 | NA | 1 | 0.048 |
*Functions are ordered according to increasing p-value associated with operon and monotonic analyses.
Figure 5Structures of furan, MX, and pyrrole.