Literature DB >> 14502500

Data mining the NCI cancer cell line compound GI(50) values: identifying quinone subtypes effective against melanoma and leukemia cell classes.

Kenneth A Marx1, Philip O'Neil, Patrick Hoffman, M L Ujwal.   

Abstract

Using data mining techniques, we have studied a subset (1400) of compounds from the large public National Cancer Institute (NCI) compounds data repository. We first carried out a functional class identity assignment for the 60 NCI cancer testing cell lines via hierarchical clustering of gene expression data. Comprised of nine clinical tissue types, the 60 cell lines were placed into six classes-melanoma, leukemia, renal, lung, and colorectal, and the sixth class was comprised of mixed tissue cell lines not found in any of the other five classes. We then carried out supervised machine learning, using the GI(50) values tested on a panel of 60 NCI cancer cell lines. For separate 3-class and 2-class problem clustering, we successfully carried out clear cell line class separation at high stringency, p < 0.01 (Bonferroni corrected t-statistic), using feature reduction clustering algorithms embedded in RadViz, an integrated high dimensional analytic and visualization tool. We started with the 1400 compound GI(50) values as input and selected only those compounds, or features, significant in carrying out the classification. With this approach, we identified two small sets of compounds that were most effective in carrying out complete class separation of the melanoma, non-melanoma classes and leukemia, non-leukemia classes. To validate these results, we showed that these two compound sets' GI(50) values were highly accurate classifiers using five standard analytical algorithms. One compound set was most effective against the melanoma class cell lines (14 compounds), and the other set was most effective against the leukemia class cell lines (30 compounds). The two compound classes were both significantly enriched in two different types of substituted p-quinones. The melanoma cell line class of 14 compounds was comprised of 11 compounds that were internal substituted p-quinones, and the leukemia cell line class of 30 compounds was comprised of 6 compounds that were external substituted p-quinones. Attempts to subclassify melanoma or leukemia cell lines based upon their clinical cancer subtype met with limited success. For example, using GI(50) values for the 30 compounds we identified as effective against all leukemia cell lines, we could subclassify acute lymphoblastic leukemia (ALL) origin cell lines from non-ALL leukemia origin cell lines without significant overlap from non-leukemia cell lines. Based upon clustering using GI(50) values for the 60 cancer cell lines laid out by the RadViz algorithm, these two compound subsets did not overlap with clusters containing any of the NCI's 92 compounds of known mechanism of action, a few of which are quinones. Given their structural patterns, the two p-quinone subtypes we identified would clearly be expected to possess different redox potentials/substrate specificities for enzymatic reduction in vivo. These two p-quinone subtypes represent valuable information that may be used in the elucidation of pharmacophores for the design of compounds to treat these two cancer tissue types in the clinic.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 14502500     DOI: 10.1021/ci034050+

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  8 in total

1.  Gene expression associations with the growth inhibitory effects of small molecules on live cells: specificity of effects and uniformity of mechanisms.

Authors:  Kerby Shedden; Yang Yang; Gus Rosania
Journal:  Stat Anal Data Min       Date:  2009-09-01       Impact factor: 1.051

2.  Discovery of novel types of inhibitors of S-adenosylmethionine synthesis by virtual screening.

Authors:  John C Taylor; Charles W Bock; Fusao Takusagawa; George D Markham
Journal:  J Med Chem       Date:  2009-10-08       Impact factor: 7.446

Review 3.  Intelligent Techniques Using Molecular Data Analysis in Leukaemia: An Opportunity for Personalized Medicine Support System.

Authors:  Haneen Banjar; David Adelson; Fred Brown; Naeem Chaudhri
Journal:  Biomed Res Int       Date:  2017-07-25       Impact factor: 3.411

4.  Pharmacogenomics of in vitro response of the NCI-60 cancer cell line panel to Indian natural products.

Authors:  Hari Sankaran; Simarjeet Negi; Lisa M McShane; Yingdong Zhao; Julia Krushkal
Journal:  BMC Cancer       Date:  2022-05-07       Impact factor: 4.638

5.  Drug toxicity assessment: cell proliferation versus cell death.

Authors:  Elena V Sazonova; Mikhail S Chesnokov; Boris Zhivotovsky; Gelina S Kopeina
Journal:  Cell Death Discov       Date:  2022-10-14

6.  Discovering associations in biomedical datasets by link-based associative classifier (LAC).

Authors:  Pulan Yu; David J Wild
Journal:  PLoS One       Date:  2012-12-05       Impact factor: 3.240

7.  Multi-tissue microarray analysis identifies a molecular signature of regeneration.

Authors:  Sarah E Mercer; Chia-Ho Cheng; Donald L Atkinson; Jennifer Krcmery; Claudia E Guzman; David T Kent; Katherine Zukor; Kenneth A Marx; Shannon J Odelberg; Hans-Georg Simon
Journal:  PLoS One       Date:  2012-12-26       Impact factor: 3.240

8.  In silico identification of anti-cancer compounds and plants from traditional Chinese medicine database.

Authors:  Shao-Xing Dai; Wen-Xing Li; Fei-Fei Han; Yi-Cheng Guo; Jun-Juan Zheng; Jia-Qian Liu; Qian Wang; Yue-Dong Gao; Gong-Hua Li; Jing-Fei Huang
Journal:  Sci Rep       Date:  2016-05-05       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.