| Literature DB >> 26817825 |
Chien-Hung Huang1, Peter Mu-Hsin Chang2, Chia-Wei Hsu3, Chi-Ying F Huang4, Ka-Lok Ng5,6.
Abstract
BACKGROUND: Non-small cell lung cancer (NSCLC) is one of the leading causes of death globally, and research into NSCLC has been accumulating steadily over several years. Drug repositioning is the current trend in the pharmaceutical industry for identifying potential new uses for existing drugs and accelerating the development process of drugs, as well as reducing side effects.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26817825 PMCID: PMC4895785 DOI: 10.1186/s12859-015-0845-0
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Workflow of this study, which consists of (1) identification of DEGs, (2) machine learning approach, (3) topological parameter-based classification, (4) common pathway analysis, (5) common drug analysis and (6) effectiveness verification
Summary of microarray datasets
| GEO ID | Organization name | Number of samples (Early-stage) | Number of samples (Late-stage) |
|---|---|---|---|
| GSE7670 | Taipei Veterans General Hospital | 8 | 11 |
| GSE10072 | National Cancer Institute, NIH | 15 | 9 |
| GSE19804 | National Taiwan University | 35 | 13 |
| GSE27262 | National Yang Ming University | 25 | n/a |
The number of DEGs derived from the machine learning method for each microarray dataset
| Stage | GEO ID | Number of DEGs | Excluded | Net | Number of predicted key genes | Common genes |
|---|---|---|---|---|---|---|
| Early | GSE7670 | 801 | 350 | 451 | 259 | 136 |
| GSE10072 | 2835 | 890 | 1945 | 1173 | ||
| GSE19804 | 4614 | 1924 | 2690 | 1697 | ||
| GSE27262 | 8476 | 3161 | 5315 | 3310 | ||
| Late | GSE7670 | 1674 | 608 | 1066 | 511 | 182 |
| GSE10072 | 1656 | 574 | 1082 | 691 | ||
| GSE19804 | 3391 | 1545 | 1846 | 1181 |
The common pathways by using DAVID and CPDB for early-stage NSCLC (the p -value and p -value represent the corresponding p-value obtained by machine learning algorithms and topological parameter-based classification)
| DAVID | |||||
| KEGG | REACTOME | ||||
| pathname |
|
| pathname |
|
|
| NULL | NULL | ||||
| CPDB | |||||
| KEGG | REACTOME | ||||
| pathname |
|
| pathname |
|
|
| Endocytosis | 0.01340 | 0.00045 | Cell-Cell communication | 0.02810 | 0.00432 |
| Glycolysis/Gluconeogenesis | 0.02330 | 0.00249 | Glucose metabolism | 0.02420 | 0.02000 |
| Hematopoietic cell lineage | 0.04060 | 0.03380 | Regulation of PLK1 Activity at G2/M Transition | 0.03700 | 0.03070 |
| Gap junction | 0.04700 | 0.03930 | Metabolism of nucleotides | 0.03940 | 0.03280 |
| Cell junction organization | 0.00765 | 0.03590 | |||
| Platelet activation, signaling and aggregation | 0.04720 | 0.03630 | |||
The common paths using DAVID and CPDB for late-stage NSCLC (the p -value and p -value represent the corresponding p-value obtained by machine learning algorithms and topological parameter-based classification)
| DAVID | |||||
| KEGG | REACTOME | ||||
| pathname |
|
| pathname |
|
|
| Cell cycle | 0.03800 | 0.00140 | Cell Cycle Checkpoints | 0.00760 | 0.00872 |
| Cell Cycle, Mitotic | 0.00100 | 0.02186 | |||
| CPDB | |||||
| KEGG | REACTOME | ||||
| pathname |
|
| pathname |
|
|
| Cell cycle | 0.00632 | 0.00004 | Regulation of mitotic cell cycle | 0.01500 | 0.00000 |
| Inflammatory mediator regulation of TRP channels | 0.03960 | 0.00048 | APC/C:Cdc20 mediated degradation of mitotic proteins | 0.01500 | 0.00000 |
| Endocytosis | 0.00051 | 0.00153 | Activation of APC/C and APC/C: Cdc20 mediated degradation of mitotic proteins | 0.00663 | 0.00001 |
| Thyroid hormone synthesis | 0.01560 | 0.00662 | Cell Cycle Checkpoints | 0.00729 | 0.00001 |
| Salivary secretion | 0.03120 | 0.01370 | Cell Cycle | 0.00007 | 0.00001 |
| Long-term depression | 0.04420 | 0.02330 | Regulation of APC/C activators between G1/S and early anaphase | 0.00177 | 0.00001 |
| cGMP-PKG signaling pathway | 0.02210 | 0.02740 | G1/S Transition | 0.01030 | 0.00002 |
| Vascular smooth muscle contraction | 0.00497 | 0.03440 | Cdc20:Phospho-APC/C mediated degradation of Cyclin A | 0.00837 | 0.00004 |
| Cell Cycle, Mitotic | 0.00485 | 0.00014 | |||
| APC:Cdc20 mediated degradation of cell cycle proteins prior to satisfaction of the cell cycle checkpoint | 0.00182 | 0.00014 | |||
| Mitotic G1-G1/S phases | 0.00541 | 0.00016 | |||
| G2/M Checkpoints | 0.02590 | 0.00023 | |||
| M Phase | 0.00298 | 0.00119 | |||
| DNA Replication | 0.00813 | 0.00153 | |||
| Resolution of Sister Chromatid Cohesion | 0.00103 | 0.00297 | |||
| Mitotic Prometaphase | 0.00230 | 0.00399 | |||
| Apoptotic cleavage of cellular proteins | 0.00067 | 0.00572 | |||
| S Phase | 0.01500 | 0.00762 | |||
| Mitotic Anaphase | 0.02040 | 0.00875 | |||
| Mitotic Metaphase and Anaphase | 0.00035 | 0.01140 | |||
| Apoptotic execution phase | 0.00036 | 0.01170 | |||
| Synthesis of DNA | 0.03370 | 0.01760 | |||
| Separation of Sister Chromatids | 0.00540 | 0.01760 | |||
The number of potential drugs filtered by meta-analysis for early- and late- stage NSCLC using the enrichment score (ES) and cMap p-value (less than 0.1) for meta-analysis
| Early-stage | Late-stage | |||
|---|---|---|---|---|
| Potential drugs | IC50 verified | Potential drugs | IC50 verified | |
| ES < 0 & cMap | 9 | 4 | 31 | 5 |
| ES < 0 & cMap | 12 | 4 | 81 | 8 |
| ES < 0 & | 25 | 2 | 23 | 1 |
| ES < 0 & | 60 | 8 | 49 | 5 |
The number of common drugs and JI score for early- and late-stage using the enrichment score (ES) and cMap p-value (less than 0.1) for meta-analysis
| Effect size | ES < 0 & | ES < 0 & cMap | |||
|---|---|---|---|---|---|
| Effect size | Early-stage | Late-stage | Early-stage | Late-stage | |
| ES < 0 & | Early-stage | 0.557 | 0.078 | 0.152 | |
| Late-stage | 39 | 0.074 | 0.194 | ||
| ES < 0 & cMap | Early-stage | 5 | 4 | 0.143 | |
| Late-stage | 12 | 13 | 5 | ||
IC50 values of potential drugs for early-stage NSCLC
| Machine learning algorithms | ||
| cMap drug name | MTT (μM) | Clonogenic (μM) |
| mebendazole | <1 | |
| vorinostat | <1 | |
| pyrvinium | <0.1 | |
| niclosamide | >5 | |
| nortriptyline | <10 | |
| piperlongumine | >5 | |
| trichostatin A | ||
| trioxysalen | >5 | |
| Topological parameter-based classification | ||
| cMap drug name | MTT (μM) | Clonogenic (μM) |
| trichostatin A | <1 | |
| vorinostat | <1 | |
IC50 values of potential drugs for late-stage NSCLC
| Machine learning algorithms | ||
| cMap drug name | MTT (μM) | Clonogenic (μM) |
| trichostatin A | <1 | |
| Vorinostat | <1 | |
| withaferin A | <1 | |
| mebendazole | <1 | |
| piperlongumine | >5 | |
| Topological parameter classification | ||
| cMap drug name | MTT (μM) | Clonogenic (μM) |
| acepromazine | <10 | |
| nortriptyline | <10 | |
| propafenone | <10 | |
| trichostatin A | <1 | |
| vorinostat | <1 | |
The common drugs identified by both two methods
| Early-stage NSCLC | |||
| 4,5-dianilinophthalimide | mepacrine (quinacrine) | meptazinol | perhexiline |
| puromycin | trichostatin A | vorinostat | |
| Late-stage NSCLC | |||
| (-)-MK-801 | 4,5-dianilinophthalimide | MS-275 (Entinostat) | perhexiline |
| puromycin | quinostatin | rifabutin | scriptaid |
| trichostatin A | vorinostat | Y-27632 | |
The targeted genes identified by the common drugs derived from both two methods (the parentheses represent the number of associated cMap drugs)
| Early-stage NSCLC | |||
| ADRB2 (4) | CASP1 (3) | KAT2A (2) | SNCA (1) |
| ARRB1(1) | PSIP1 (1) | PAFAH1B3 (1) | GAPDH (1) |
| Late-stage NSCLC | |||
| ADRB2 (6) | ARRB1 (3) | NCOA1 (3) | PSIP1 (3) |
| SMARCA2 (3) | GAPDH (3) | CPT1A (1) | AURKB (1) |
| IRAK1 (1) | GRK5 (1) | SRPK1 (1) | AURKA (1) |
Fig. 2The top three genes (squares) which connect to the largest, the second largest and the third largest degree of targeted genes (circles) for a early-stage; b late-stage network