| Literature DB >> 18194531 |
Daniel Schöner1, Markus Kalisch, Christian Leisner, Lukas Meier, Marc Sohrmann, Mahamadou Faty, Yves Barral, Matthias Peter, Wilhelm Gruissem, Peter Bühlmann.
Abstract
BACKGROUND: Large scale screening for synthetic lethality serves as a common tool in yeast genetics to systematically search for genes that play a role in specific biological processes. Often the amounts of data resulting from a single large scale screen far exceed the capacities of experimental characterization of every identified target. Thus, there is need for computational tools that select promising candidate genes in order to reduce the number of follow-up experiments to a manageable size.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18194531 PMCID: PMC2258006 DOI: 10.1186/1752-0509-2-3
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Genes in the data sets known to be related to spindle migration. For the statistical assessment (hypergeometric test) of the Gaussian Mixture Modeling results we used a reference set of all genes involved in spindle migration present in the data set analyzed. For some of the genes, their involvement in spindle migration has been published. The rest is known to be involved from the unpublished results of a previous Kar9-localization screen (Methods). Some key regulatory genes, such as kar9 and bim1, though being in the original synthetic lethality data set, are missing in the data set used for the analysis because of missing data for some observations (Additional File 5).
| Gene name | ORF | Evidence |
| ELP6 | YMR312W | unpublished data |
| BIM1 | YER016W | [40] |
| YPT7 | YML001W | unpublished data |
| PAT1 | YCR077C | unpublished data |
| CCZ1 | YBR131W | unpublished data |
| ASE1 | YOR058C | [41] |
| TVP38 | YKR088C | unpublished data |
Figure 1Workflow chart of mixture modeling approach. The figure depicts a general scheme of our mixture modeling method. First, one selects the query gene(s) from a biological process of interest. The respective synthetic lethal data set, retrieved either from a database, such as the GRID, or from own data then defines the list of target genes to consider [39]. Integration of different genomic information sources and generation of genomic features, that characterize the relationship of the query to its synthetic lethal targets, results in a multivariate data set of pairwise scores. Application of a Gaussian Mixture Model identifies a small group of target genes. Varying the posterior probability of the model refines the partitioning, such that the small group shows significant enrichment with known genes in the biological process of the query gene(s), if possible. As a last step, follow-up screens characterize the candidate genes contained in the small group in the biological context that is given (e.g. involvement in spindle migration).
Figure 2Best subset of data sources. Scatterplot for the best subset of features, which consists of hughes.corr and spellman.corr. The default separation is shown with a cutoff posterior probability of 0.5. The small group is shown in red triangles and the big group in black disks.
P-values for best model for spindle migration. Statistical assessment of the best subset of features {hughes.corr, spellman.corr}. The p-values based on the hypergeometric test are shown for two different group sizes. In addition the numbers of known genes and the total number of genes in the data set.
| Genes in small group | Known genes | Known genes in data set | Genes in data set | P-value |
| 6 | 2 | 7 | 129 | 0.0342 |
| 50 | 6 | 7 | 129 | 0.0136 |
Experimental Validation. Phenotypes of the 6 members of the small group.
| Gene name | ORF name | Experimental Evidence |
| ASE1 | YOR058C | compromised anaphase spindles |
| TVP38 | YKR088C | perturbed Kar9-asymmetry |
| SHE1 | YBL031W | broken spindle; perturbed Kar9-asymmetry |
| UBA4 | YHR111W | perturbed Kar9-asymmetry |
| YHR127W | YHR127W | weakly perturbed Kar9-asymmetry |
| GPD1 | YDL022W | none detected |
Figure 3Experimental validation of genes predicted to be involved in spindle migration. A) The she1Δ, uba4Δ and YHR127WΔ-strains show perturbed Kar9 localization. gpd1Δ does not show a phenotypic effect. Wild type and tvp38Δ are shown as positive and negative controls, respectively. The differences in proportions of defective cells are compared to the wild type based on the binomial distribution. This results in p-values that are significant for she1Δ and uba4Δ (p-values 0.0008% and 0.28% respectively), marginally significant for YHR127WΔ (p = 7.4%) and not significant for gpd1Δ (p = 70%). B) Some she1Δ cells have broken or bent anaphase spindles suggesting compromised spindle integrity.
P-values for best model for TOR2 signaling. Statistical assessment of the best subset of features (Additional File 17 and 18). The p-value based on the hypergeometric test is shown for group size 18. The table presents the numbers of known genes and the total number of genes in the data set.
| Genes in small group | Known genes in small group | Known genes in data set | Genes in data set | P-value |
| 18 | 3 | 4 | 70 | 0.0496 |