| Literature DB >> 28820893 |
Lei Chen1,2, Jing Lu3, Tao Huang4, Yu-Dong Cai1.
Abstract
Lung cancer causes a large number of deaths per year. Until now, a cure for this disease has not been found or developed. Finding an effective drug through traditional experimental methods invariably costs millions of dollars and takes several years. It is imperative that computational methods be developed to integrate several types of existing information to identify candidate drugs for further study, which could reduce the cost and time of development. In this study, we tried to advance this effort by proposing a computational method to identify candidate drugs for non-small cell lung cancer (NSCLC), a major type of lung cancer. The method used three steps: (1) preliminary screening, (2) screening compounds by an association test and a permutation test, (3) screening compounds using an EM clustering algorithm. In the first step, based on the chemical-chemical interaction information reported in STITCH, a well-known database that reports interactions between chemicals and proteins, and approved NSCLC drugs, compounds that can interact with at least one approved NSCLC drug were picked. In the second step, the association test selected compounds that can interact with at least one NSCLC-related chemical and at least one NSCLC-related gene, and subsequently, the permutation test was used to discard nonspecific compounds from the remaining compounds. In the final step, core compounds were selected using a powerful clustering algorithm, the EM algorithm. Six putative compounds, protoporphyrin IX, hematoporphyrin, canertinib, lapatinib, pelitinib, and dacomitinib, were identified by this method. Previously published data show that all of the selected compounds have been reported to possess anti-NSCLC activity, indicating high probabilities of these compounds being novel candidate drugs for NSCLC.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28820893 PMCID: PMC5562320 DOI: 10.1371/journal.pone.0183411
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sixteen approved NSCLC drugs.
| PubChem ID | Name | Mechanism | NSCLC-related Genes |
|---|---|---|---|
| CID4033 | Mechlorethamine | Agents directly acting on DNA | |
| CID36314 | Paclitaxel | Antimitotic agent | |
| CID38904 | Carboplatin | Agents directly acting on DNA | |
| CID57166 | Porfimer Sodium | Photosensitizing agent | |
| CID60750 | Gemcitabine | Agents interfering with DNA synthesis | |
| CID60780 | Vinorelbine | Antimitotic agent | |
| CID60843 | Pemetrexed | Agents interfering with DNA synthesis | |
| CID123631 | Gefitinib | EGFR inhibitor | |
| CID126941 | Methotrexate | Agents interfering with DNA synthesis | |
| CID148124 | Docetaxel | Antimitotic agent | |
| CID176870 | Erlotinib | EGFR inhibitor | |
| CID441203 | Cisplatin | Agents directly acting on DNA | |
| CID5360373 | Bleomycin | Agents directly acting on DNA | |
| CID10184653 | Afatinib | EGFR/HER2/HER4 inhibitor | |
| CID11626560 | Crizotinib | ALK/HGFR inhibitor | |
| CID57379345 | Ceritinib | ALK antagonist |
The pseudo codes of the method for identification of novel candidate drugs for NSCLC.
| 1. Preliminary screening |
| 1.1 Using the approved NSCLC drugs and chemical-chemical interactions, compounds that can interact with at least one approved NSCLC drug were obtained and comprised the set |
| 2. Screening compounds by an association test and a permutation test |
| 2.1 For each compound |
| 2.2 For each compound |
| 2.3 If a compound |
| 3. Screening compounds by the EM clustering algorithm |
| 3.1 For each compound obtained in step 2, it was represented by fifteen features based on the interaction scores of protein/chemical-chemical interactions. |
| 3.2 For each approved drug, it was also represented by fifteen features used in step 3.1. |
| 3.3 The remaining compounds and approved drugs were fed into the EM clustering algorithm. |
| 3.4 According to the clustering results, compounds in the same category as approved drugs were picked and outputted. |
The procedures of the EM algorithm.
| EM algorithm |
|---|
| (1) Set an initial parameter |
| (2) E-step: for the current estimated value |
| (3) M-step: the new estimated value of parameter |
| (4) Check the condition ‖ |
The measurements of ten approved NSCLC drugs yielded by the computational method.
| PubChem ID | Name | Rating score on NSCLC-related chemicals | Rating score on NSCLC-related genes | P-value on NSCLC-related chemicals | P-value on NSCLC-related genes |
|---|---|---|---|---|---|
| CID4033 | Mechlorethamine | 296.299 | 358.250 | 0.248 | 0.115 |
| CID36314 | Paclitaxel | 306.485 | 462.889 | 0.346 | 0.221 |
| CID38904 | Carboplatin | 307.897 | 332.267 | 0.196 | 0.238 |
| CID57166 | Porfimer Sodium | 307.231 | 218.000 | 0.312 | 0.032 |
| CID60843 | Pemetrexed | 278.798 | 233.750 | 0.337 | 0.124 |
| CID123631 | Gefitinib | 303.047 | 493.275 | 0.289 | 0.292 |
| CID126941 | Methotrexate | 330.201 | 362.250 | 0.469 | 0.150 |
| CID176870 | Erlotinib | 320.090 | 579.929 | 0.273 | 0.175 |
| CID10184653 | Afatinib | 365.038 | 451.091 | 0.128 | 0.063 |
| CID11626560 | Crizotinib | 293.407 | 579.500 | 0.263 | 0.187 |
Detailed information of six putative compounds.
| PubChem ID | Name | Rating score on NSCLC-related chemicals | P-value on NSCLC-related chemicals | Rating score on NSCLC-related genes | P-value on NSCLC-related genes | NSCLC-related Genes |
|---|---|---|---|---|---|---|
| CID4971 | Protoporphyrin IX | 347.889 | 0.326 | 437.000 | 0.115 | |
| CID11103 | Hematoporphyrin | 309.920 | 0.225 | 224.333 | 0.029 | |
| CID156413 | Canertinib | 331.293 | 0.155 | 781.000 | 0.054 | |
| CID208908 | Lapatinib | 307.412 | 0.303 | 504.750 | 0.119 | |
| CID6445562 | Pelitinib | 322.081 | 0.153 | 605.833 | 0.112 | |
| CID11511120 | Dacomitinib | 302.176 | 0.131 | 576.000 | 0.025 |