| Literature DB >> 23368702 |
Minho Lee1, Sangjo Han, Hyeshik Chang, Youn-Sig Kwak, David M Weller, Dongsup Kim.
Abstract
BACKGROUND: Yeast deletion-mutant collections have been successfully used to infer the mode-of-action of drugs especially by profiling chemical-genetic and genetic-genetic interactions on a genome-wide scale. Although tens of thousands of those profiles are publicly available, a lack of an accurate method for mining such data has been a major bottleneck for more widespread use of these useful resources.Entities:
Mesh:
Year: 2013 PMID: 23368702 PMCID: PMC3549813 DOI: 10.1186/1471-2164-14-S1-S6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overall scheme of FitSearch. Although researchers have only one or two yeast fitness profiles to their drugs of interest that have unknown toxicity mechanisms, they can easily perform data-mining against tens of thousands of public fitness profiles in order to obtain insight into the mechanism through the FitSearch website (http://fitsearch.kaist.ac.kr). When any type of yeast fitness profile is submitted as a query in the website, a similarity search to other public resources is performed by rank-cutoff optimizer through the FitSearch engine, which is a newly developed method using rank-based overlapping statistics (see the details in the Methods). Since available public resources are deposited in FitRankDB as a general repository for the FitSearch engine (see the details in the Methods), the similarity search can be performed more efficiently, thoroughly, and rapidly in the FitSearch website. Finally, users scrutinize characteristics of a list of drugs similar to their drug of interest and obtain clues or plausible hypotheses, which could also help them to design further bioassays.
Different types of yeast fitness profiles deposited in FitRankDB.
| Type of treatment | Type of genome-wide deletion library | Type of fitness profile | Profile # |
|---|---|---|---|
| Chemical effect1 | Homozygous deletion strains3 | Chemical-genetic (Hom) | 918 |
| Chemical effect | Heterozygous deletion strains3 | Chemical-genetic (Het) | 1,530 |
| Genetic effect2 | Homozygous deletion strains | Genetic-genetic (Hom) | 12,419 |
See the details at http://pombe.kaist.ac.kr/fitsearch/statistics/
1 For example, drug, bioactive compounds or natural crude extracts
2 For example, knock-out, over-expression or mutation of a gene
3 Homozygous (or haploid) and heterozygous (or diploid) deletion collections of S. cerevisiae and S. pombe are commercially available at Open Biosystems (http://www.openbiosystems.com) and Bioneer (http://pombe.bioneer.co.kr), respectively.
Figure 2Toy example showing how the rank-cutoff optimizer works. (A) Ranks of each strain in virtual two query and target yeast fitness profiles to be compared are supposed to be deposited in Fit-RankDB. These profiles are also supposed to be generated using a virtual yeast deletion library comprising strain a to j. (B) Efficient calculation of a match number (or an overlapped strain number) accumulated under all possible rank-cutoffs of the query and the target by Dynamic programming (see the details in the Methods). For this calculation, first, rank matches of each strain should be expressed as the match matrix (M). In the M matrix, its row represents 'ranks in the query', its column 'ranks in the target', and its value 'the strain number with same rank in the query and the target'. Then, the current accumulated match number (in red-colored cell in the A matrix) is calculated by adding the current match number (in the orange-colored cell in the M matrix) to the previous accumulated match number (sky-colored cell plus purple-colored cell minus gray-colored cell in the A matrix). In this way, the accumulated match numbers regarding to all possible rank-cutoffs are efficiently calculated and stored in the A matrix. (C) The matrix of cumulative hyper-geometric p-values (P) is filled by calculating the equation (2) as the objective function (Hp) regarding to all possible rank-cutoffs, and used to find the rank-cutoffs with the minimized p-value as described in the equation (3), called optimal rank-cutoffs. The A matrix provides all of the parameters needed for equations (2) and (3) as follows: Its values represent the overlapped strain number in the equation (2); its row-names, the query strain number; its column-names, the target strain number in their respective rank-cutoffs; and its column or row length, the size of population. When the maximal rank-cutoff is set to 10 in the toy example, the query rank-cutoff 5 and the target rank-cutoff 5 shows the minimal p-value, 0.004. At those optimal rank-cutoffs, overlapping significance (hyper-geometric p-value) and overlapping score (Tanimoto coefficients) can be expressed as the similarity between the query and the target.
Available frontends in FitSearch web site.
| Option | Description |
|---|---|
| FitSearchp | Search pre-compiled fitness rank database (FitRankDB) with a fitness profile of user. |
| FitSearchd | Search FitRankDB with the profile specified in FitRankDB. |
There are more details in 'help' page in the web site.
Biological interpretation about similarity between two fitness profiles
| Query fitness profile | Target fitness profile | Biological interpretation of similar target treatment |
|---|---|---|
| Chemical effect (i.e. drug toxicity) with similar mode-of-action | ||
| Chemical effect with similar mode-of-action; Finding common direct drug target protein | ||
| Genetic effect (i.e. knock-out and mutations) on direct drug target protein gene | ||
| Biological functions related to chemical or genetic effect |
Figure 3Plot of an overlapping score and an overlapping significance as two-way cutoffs to show the most similar chemical or genetic effects to a query's effect. (A) Two-way cutoff plot of the most similar chemical effects to the 5-Fluorouracil's effect. (B) Two-way cutoff plot of similar chemical effects to clotrimazole's effect. (C) Two-way cutoff plot of the most similar genetic effects to clotrimazole's effect. (D) Two-way cutoff plot of the most similar chemical effects to DAPG's effect. Target sources mean public chemical-genetic or genetic-genetic yeast profiles.
FitSearch can detect genetic interactions that cannot be detected by SGA analysis.
| Rank | Gene1 | Gene2 | Tc | P-value | Note |
|---|---|---|---|---|---|
| 1 | YPL022W RAD1 | YML095C RAD10 | 1 | 3.44E-29 | Single-stranded DNA endonucleases (with each other) |
| 2 | YDL040C NAT1 | YHR013C ARD1 | 0.94 | 1.18E-317 | Subunit of the N-terminal acetyltransferase NatA (Nat1p, Ard1p, Nat5p) |
| 3 | YCR009C RVS161 | YDR388W RVS167 | 0.91 | 3.37E-114 | Manually curated by [ |
| 4 | YPL020C ULP1 | YKR082W NUP133 | 0.91 | 3.79E-28 | Overexpression of ULP1 rescues a nup133 rad27 or nup60 rad27 double mutant [ |
| 5 | YJL194W CDC6 | YHR118C ORC6 | 0.91 | 1.42E-72 | ORC6-rxl and chromosomal deletion of the Cdc6 leads to slow growth phenotype [ |
| 6 | YMR125W STO1 | YPL178W CBC2 | 0.88 | 2.62E-40 | Both are subunits of cap-binding protein complex |
| 7 | YMR224C MRE11 | YNL250W RAD50 | 0.88 | 1.34E-171 | MRE11 is a subunit of a complex with Rad50p and Xrs2p |
| 8 | YBR175W SWD3 | YAR003W SWD1 | 0.88 | 4.12E-54 | Both are subunits of the COMPASS (Set1C) complex |
| 9 | YDR166C SEC5 | YLR166C SEC10 | 0.86 | 2.30E-32 | Both are subunits of the exocyst complex |
| 10 | YNL041C COG6 | YNL051W COG5 | 0.8 | 1.56E-96 | Both are components of the conserved oligomeric Golgi complex |
The table shows the top 10 results that are not included in the genetic interaction list by SGA analysis [22]. Notes without references are retrieved from the Saccharomyces Genome Database (SGD) [26]. P-values, here, are corrected considering multiple tests.