| Literature DB >> 26356760 |
Qing Wen, Paul O'Reilly, Philip D Dunne, Mark Lawler, Sandra Van Schaeybroeck, Manuel Salto-Tellez, Peter Hamilton, Shu-Dong Zhang.
Abstract
BACKGROUND: While the discovery of new drugs is a complex, lengthy and costly process, identifying new uses for existing drugs is a cost-effective approach to therapeutic discovery. Connectivity mapping integrates gene expression profiling with advanced algorithms to connect genes, diseases and small molecule compounds and has been applied in a large number of studies to identify potential drugs, particularly to facilitate drug repurposing. Colorectal cancer (CRC) is a commonly diagnosed cancer with high mortality rates, presenting a worldwide health problem. With the advancement of high throughput omics technologies, a number of large scale gene expression profiling studies have been conducted on CRCs, providing multiple datasets in gene expression data repositories. In this work, we systematically apply gene expression connectivity mapping to multiple CRC datasets to identify candidate therapeutics to this disease.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26356760 PMCID: PMC4565135 DOI: 10.1186/1752-0509-9-S5-S4
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Information of datasets used.
| Dataset | Stage | Samples | Platform | TotalGenes | SignificantGenes |
|---|---|---|---|---|---|
| GSE21510 | 2-3 | 30(15pairs) | HG-U133Plus2 | 54675 | 4025 |
| GSE41258 | 2-3 | 38(19pairs) | HG-U133A | 22283 | 929 |
| GSE21510 | 4 | 8(4pairs) | HG-U133Plus2 | 54675 | 7 |
| GSE41258 | 4 | 36(18pairs) | HG-U133A | 22283 | 663 |
| GSE49355 | 4 | 30(15pairs) | HG-U133A | 22283 | 1323 |
| Combined Dataset | 2-4 | 142(71pairs) | 22277 | 4757 |
Top 1 gene of each dataset and its scores in all other datasets.
| ProbeID | GeneSymbol | GSE21510S2*3 | GSE41258S2*3 | GSE21510S4 | GSE41258S4 | GSE49355S4 | TotalScore | OverallRank |
|---|---|---|---|---|---|---|---|---|
| 201637 s at | FXR1 | 1 | 0 | 0 | 0 | 0 | 1 | 1000 |
| 203256 at | CDH3 | 0.6174 | 1 | 0 | 0.9400 | 0.9983 | 3.5557 | 25 |
| 37892 at | COL11A1 | 0.9990 | 0.8303 | 0 | 1 | 0.4707 | 3.3000 | 64 |
| 206976 s at | HSPH1 | 0.3651 | 0.9524 | 0 | 0 | 1 | 2.3175 | 262 |
| 216230 × at | SMPD1 | -0.6184 | -0.8424 | -1 | 0 | 0 | -2.4608 | 222 |
Top 10 genes of the combined signature and their scores in all datasets.
| ProbeID | GeneSymbol | GSE21510S2*3 | GSE41258S2*3 | GSE21510S4 | GSE41258S4 | GSE49355S4 | TotalScore | OverallRank |
|---|---|---|---|---|---|---|---|---|
| 203908 at | SLC4A4 | -0.9933 | -0.9847 | 0 | -0.9895 | -0.9883 | -3.9557 | 1 |
| 207502 at | GUCA2B | -0.9904 | -0.9973 | 0 | -0.9797 | -0.9826 | -3.9500 | 2 |
| 207003 at | GUCA2A | -0.9627 | -0.9989 | 0 | -0.9822 | -0.9513 | -3.8951 | 3 |
| 205480 s at | UGP2 | -0.9912 | -0.9732 | 0 | -0.9197 | -0.9996 | -3.8836 | 4 |
| 205950 s at | CA1 | -0.9974 | -0.8872 | 0 | -0.9959 | -0.9352 | -3.8159 | 5 |
| 212942 s at | KIAA1199 | 0.9472 | 0.9962 | 0 | 0.8775 | 0.9926 | 3.8135 | 6 |
| 203961 at | NEBL | 0.9788 | 0.9639 | 0 | 0.8508 | 0.9887 | 3.7821 | 7 |
| 219909 at | MMP28 | -0.9332 | -0.9995 | 0 | -0.8856 | -0.9613 | -3.7796 | 8 |
| 213766 × at | GNA11 | -0.9973 | -0.9037 | 0 | -0.9059 | -0.9609 | -3.7677 | 9 |
| 202370 s at | CBFB | 0.9886 | 0.8933 | 0 | 0.9335 | 0.9522 | 3.7676 | 10 |
Figure 1Results of sscMapping using the 148-gene signature determined from the gene signature progression procedure. Each data point represents a compound, with the raw connection score (cscore) or the normalized score (zscore) shown in (A) and (B) respectively. The blue line in the plot corresponds to the position of threshold p-value. Any data points above the blue line are drugs that have significant connections (solid red circles) to CRC gene signature.
The connections and perturbation stabilities of the 10 significant drugs obtained for the 148-gene signature.
| Compound | Replicate | cscore | pvalue | zscore | PerturbStability |
|---|---|---|---|---|---|
| trichostatin A | 182 | -0.144 | 5.0E-06 | -5.87 | 1.00 |
| vorinostat | 12 | -0.146 | 1.0E-05 | -4.63 | 1.00 |
| HC toxin | 1 | -0.217 | 5.0E-06 | -4.57 | 1.00 |
| sodium phenylbutyrate | 7 | -0.072 | 5.0E-06 | -4.56 | 1.00 |
| mycophenolic acid | 3 | -0.145 | 2.0E-05 | -4.41 | 1.00 |
| irinotecan | 3 | -0.146 | 1.0E-05 | -4.18 | 1.00 |
| etoposide | 4 | -0.087 | 3.9E-04 | -3.34 | 0.90 |
| valproic acid | 57 | -0.036 | 5.9E-04 | -3.19 | 0.78 |
| arachidonic acid | 3 | -0.094 | 5.8E-04 | -3.18 | 0.77 |
| rifabutin | 3 | -0.100 | 6.3E-04 | -3.23 | 0.74 |
Figure 2The numbers of drug hits returned using gene signatures from individual datasets and from the combined 148-gene signature. Note that only the combined gene signature drug list contains both irinotecan and etoposide, which are two existing CRC therapeutic drugs.
Drug results from combined and individual datasets.
| CombinedSig | GSE41258S2*3 | GSE21510S2*3 | GSE49355S4 | GSE41258S4 |
|---|---|---|---|---|
| trichostatin A | irinotecan | letrozole | methylergometrine | trichostatin A |
| vorinostat | sirolimus | nystatin | vorinostat | HC toxin |
| HC toxin | trichostatin A | quipazine | 0316684-0000 | vorinostat |
| sodium phenylbutyrate | scriptaid | josamycin | nocodazole | rifabutin |
| mycophenolic acid | vorinostat | benzocaine | scriptaid | sodium phenylbutyrate |
| irinotecan | camptothecin | isocorydine | simvastatin | scriptaid |
| etoposide | dexamethasone | metampicillin | morantel | monorden |
| valproic acid | LY-294002 | ethisterone | edrophonium chloride | alvespimycin |
| arachidonic acid | fludroxycortide | fluticasone | trichostatin A | dexamethasone |
| rifabutin | methylbenzethonium chloride | hycanthone | trazodone | tanespimycin |
| sodium phenylbutyrate | tenoxicam | butein | ||
| adenosine phosphate |