| Literature DB >> 16738548 |
James A Brown1, Gavin Sherlock, Chad L Myers, Nicola M Burrows, Changchun Deng, H Irene Wu, Kelly E McCann, Olga G Troyanskaya, J Martin Brown.
Abstract
We present a method for the global analysis of the function of genes in budding yeast based on hierarchical clustering of the quantitative sensitivity profiles of the 4756 strains with individual homozygous deletion of nonessential genes to a broad range of cytotoxic or cytostatic agents. This method is superior to other global methods of identifying the function of genes involved in the various DNA repair and damage checkpoint pathways as well as other interrogated functions. Analysis of the phenotypic profiles of the 51 diverse treatments places a total of 860 genes of unknown function in clusters with genes of known function. We demonstrate that this can not only identify the function of unknown genes but can also suggest the mechanism of action of the agents used. This method will be useful when used alone and in conjunction with other global approaches to identify gene function in yeast.Entities:
Mesh:
Year: 2006 PMID: 16738548 PMCID: PMC1681475 DOI: 10.1038/msb4100043
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
List of agents used in clustering analysis
| Name | Rep | Treatment | Control | Ref |
|---|---|---|---|---|
| AAPO | 2 | Antimycin A 1 μg/ml, 1 mM hydrogen peroxide chronic 16 h | Mock | a |
| ACTD | 3 | Actinomycin D 400 μM 4 h, YPD 17 h | Mock | a |
| ALK-15G | 2 | pH 8.0 15 generations | Historic | b |
| ALK-5G | 2 | pH 8.0 five generations | Historic | b |
| ANTA | 3 | Antimycin A 1 μg/ml in YPD chronic 16 h | Mock | a |
| ARAC | 2 | AraC 400 μM 4 h, YPD 16 h | Mock | a |
| ARN | 4 | Arsenite 1 mM (2), 2.5 mM (1), 5 mM (1) 1 h, YPD 16 h | Mock | f |
| ARS | 3 | Arsenic 20 μM (1) or 100 μM (2) 2 h, YPD 16 h | Mock | a |
| BEN | 3 | Benomyl 10 μM (2) or 15 μM (1) 2 h, YPD 16 h | Mock | a |
| BLEO | 4 | Bleomycin 0.01 U/ml 4 h, YPD 16 h | Mock | a |
| CAFF | 3 | Caffeine 6 mM chronic 16 h | Mock | a |
| CALC | 3 | Calcofluor white 3 μg/ml chronic 16 h | Mock | a |
| CIS1 | 6 | Cisplatin 1.0 mM 1 h, YPD 16 h | Mock | e |
| CIS4 | 6 | Cisplatin 0.23 mM 4 h, YPD 16 h | Mock | e |
| CPTA | 3 | Camptothecin 250 μM (2) or 300 μM (1) 2 h, YPD 16 h | Mock | a |
| CPTC | 3 | Camptothecin 5 μg/ml chronic 16 h | Mock | a |
| DOX | 6 | Doxorubicin 0.2 mM 4 h, YPD 16 h | Mock | a |
| GAL-15G | 2 | YPGalactose 15 generations | Historic | b |
| GAL-5G | 2 | YPGalactose five generations | Historic | b |
| GLYE | 3 | YEP 2% glycerol 2% ethanol chronic 16 h | Mock | a |
| H2O2 | 4 | Hydrogen peroxide 3 mM chronic 16 h | Mock | a |
| HU | 3 | Hydroxyurea 100 mM chronic 16 h | Mock | a |
| HYGB | 3 | Hygromycin B 7 μg/ml chronic 16 h | Mock | a |
| IDA | 3 | Idarubicin 50 μM (2) or 100 μM (1) 2 h, YPD 16 h | Mock | a |
| IR | 3 | IR 200Gy Cs137, YPD 18 h | Mock | d |
| LOVA | 3 | Lovastatin 100 μg/ml (0.75% EtOH) chronic 16 h | Mock | a |
| LYS | 2 | Lys minus five generations | Mock | b |
| MECH | 1 | Mechlorethamine 20 μM 3 h, YPD 16 h | Mock | a |
| MEL | 3 | Melphalan 800 μM 4 h, YPD 16 h | Mock | a |
| MIN-15G | 2 | Minimal+his/leu/ura 15 generations | Historic | b |
| MIN-5G | 2 | Minimal+his/leu/ura five generations | Historic | b |
| MMC | 5 | MitomycinC 0.5 mM 4 h, YPD 16 h | Mock | e |
| MMS | 3 | Methyl methanesulfonate 0.03% chronic 16 h | Mock | a |
| NACL-15G | 2 | NaCl 1 M 15 generations | Historic | b |
| NACL-5G | 2 | NaCl 1 M five generations | Historic | b |
| NYS-15G | 2 | Nystatin 10 μM 15 generations | Historic | b |
| NYS-5G | 2 | Nystatin 10 μM five generations | Historic | b |
| OXA | 3 | Oxaloplatin 4 h 10 mM, YPD 16 h | Mock | e |
| RAFA | 3 | Raffinose 6% with 1 μg/ml antimycin A chronic 16 h | Mock | a |
| SC | 2 | Minimal complete five generations | Historic | b |
| SORB-15G | 2 | Sorbitol 1.5 M 15 generations | Historic | b |
| SORB-5G | 2 | Sorbitol 1.5 M five generations | Historic | b |
| THR | 1 | Thr minus five generations | Historic | b |
| TPT | 1 | Topotecan 20 μM 3 h, YPD 16 h | Mock | a |
| TPZ | 4 | Tirapazamine 250 μM (3) or 300 μM (1) 2 h, YPD 16 h | Mock | a |
| TRP | 2 | Trp minus five generations | Historic | b |
| UVA | 4 | UVA 36 J/cm2 (1), 288 J/cm2 (3), 16 h YPD | Mock | a |
| UVB | 5 | UVB 3400 J/m2, YPD 16 h | Mock | c |
| UVC | 5 | UVC 200 J/m2, YPD 16 h | Mock | c |
| WORT | 3 | Wortmannin 1.5 μM (DMSO 1 μg/ml SC) chronic 16 h | Mock | a |
| YPD | 3 | Growth in YPD media 16 h | Time 0 | a |
Name given to each type of treatment followed by the number of repetitions that make up the geometric mean ratio of treated over the control. Treatment is a brief description of the treatment parameters: drug, concentration, and time. Chronic exposure is batch growth in continuous presence of the treatment. Acute exposures are for a defined time period followed by a recovery phase in YPD media. The type of control is indicated as a matched ‘mock' control, a ‘time 0' control used for change over time, and the ‘historic' controls taken from Giaever in which a highly replicated control condition was tested for the given number of generations. The references cited are (a) this work; (b) Giaever ; (c) Birrell ; (d) Game ; (e) Wu ; and (f) Haugen .
Figure 1Two-way unsupervised uncentered unnormalized hierarchical clustering using a Pearson's correlation of the phenotypic profiles of 4281 nonessential genes to 51 different treatments. The expanded region shows the DNA-damage cluster, which contains the components of the DNA-damage checkpoint function, nucleotide excision repair, and homologous recombination.
Interacting proteins, synthetic lethal interactions, and coordinated gene expression for the nonessential genes in the DNA-damage checkpoint and NER pathways
| All nonessential genes in pathway | Clustered by phenotypic profiling? | Interacting proteins by two-hybrid, co-IP and mass spec. analysis | Common synthetic lethality | Coordinated expression to stress | Coordinated expression to DNA damage | Cluster no. by integration analysis |
|---|---|---|---|---|---|---|
|
| ||||||
| | Yes (0) | Mec3, Rad17 (3) | 4 (17) | None (0) | None (0) | 8 |
| | Yes (0) | Ddc1, Rad17 (84) | 4 (1) | None (0) | None (0) | 14 |
| | Yes (0) | None (4) | 4 (11) | None (0) | None (0) | 14 |
| | Yes (0) | Ddc1, Mec3 (3) | 4 (1) | None (0) | None (0) | 14 |
| | Yes (0) | None (11) | 4 (6) | None (0) | None (0) | 14 |
|
| ||||||
| | Yes (0) | Rad10, Rad14 (33) | 0 (0) | None (0) | None (0) | NC |
| | Yes (0) | Rad1 (13) | 0 (1) | None (0) | None (2) | 9 |
| | Yes (0) | Rad1, Rad4, Rad16 (3) | 0 (0) | None (0) | None (8) | 9 |
| | Yes (0) | Rad14, Rad23 (1) | 0 (3) | Rad16 (>20) | None (0) | 9 |
| | Yes (0) | Rad4 (9) | 0 (4) | None (0) | None (0) | NC |
| | Yes (0) | None (1) | 0 (0) | None (>20) | None (1) | 9 |
| | Yes (0) | Rad16, Elc1 (1) | 0 (0) | None (0) | None (4) | 9 |
| | No (0) | Rad14, Rad7 (20) | 0 (2) | Rad4 (>20) | None (10) | 9 |
| | Yes (0) | Rad7 (0) | 0 (0) | None (0) | None (0) | NC |
aInteraction data show the gene names, intrapathway interactions as well as the number of additional nonpathway interactions in parenthesis obtained from Yeast Grid as well as the number of synthetically lethal interactions found at (http://biodata.mshri.on.ca/yeast_grid/servlet/SearchPage).
bThe number of genes that are coordinately regulated using either response to DNA damage (Gasch ) or to stress (Gasch ) with a Pearson correlation of >0.8 to the query gene from http://db.yeastgenome.org/cgi-bin/expression/expressionConnection.pl.
cCoclusters identified by probabilistic functional analysis by Lee . Cluster number is given or NC for genes which failed to cluster.
dGenes that are coclustered by Tong .
Figure 2Precision–recall evaluation of phenotype data on GO biological processes. The predictive power of phenotype profile correlations was evaluated against a gold standard based on six biological processes as defined by the GO: DNA repair, amino-acid biosynthesis, cell cycle checkpoint, response to osmotic stress, aerobic respiration, and galactose metabolism (A). The fraction of known functionally related gene pairs to total predictions (precision) at a range of thresholds is plotted versus the percentage of the number of known gene relationships recovered (recall) (). The characteristics of other high-throughput experimental data, affinity precipitation (▪), yeast two hybrid (), synthetic lethality (), transcription factor binding site data (), microarray correlation (), and functional data derived from Hughes () are shown for comparison. Two supervised feature selection methods were used to select the relevant features from the diverse collection of microarray data, one selecting single data set features independently and the other including or excluding entire data sets. The phenotype data is both more sensitive and precise than other high-throughput data on this set of processes. The phenotype profiles were also evaluated against a more general set of GO terms for comparison against existing data including (B) and excluding (C) the ribosome biogenesis GO term (GO:0007046), which tends to dominate gene pairs implicated by coexpression. The phenotype profiles implicate gene relationships over a broad range of biological processes.