Literature DB >> 22848369

Concordance of gene expression and functional correlation patterns across the NCI-60 cell lines and the Cancer Genome Atlas glioblastoma samples.

Barry R Zeeberg1, Kurt W Kohn, Ari Kahn, Vladimir Larionov, John N Weinstein, William Reinhold, Yves Pommier.   

Abstract

BACKGROUND: The NCI-60 is a panel of 60 diverse human cancer cell lines used by the U.S. National Cancer Institute to screen compounds for anticancer activity. We recently clustered genes based on correlation of expression profiles across the NCI-60. Many of the resulting clusters were characterized by cancer-associated biological functions. The set of curated glioblastoma (GBM) gene expression data from the Cancer Genome Atlas (TCGA) initiative has recently become available. Thus, we are now able to determine which of the processes are robustly shared by both the immortalized cell lines and clinical cancers.
RESULTS: Our central observation is that some sets of highly correlated genes in the NCI-60 expression data are also highly correlated in the GBM expression data. Furthermore, a "double fishing" strategy identified many sets of genes that show Pearson correlation ≥0.60 in both the NCI-60 and the GBM data sets relative to a given "bait" gene. The number of such gene sets far exceeds the number expected by chance.
CONCLUSION: Many of the gene-gene correlations found in the NCI-60 do not reflect just the conditions of cell lines in culture; rather, they reflect processes and gene networks that also function in vivo. A number of gene network correlations co-occur in the NCI-60 and GBM data sets, but there are others that occur only in NCI-60 or only in GBM. In sum, this analysis provides an additional perspective on both the utility and the limitations of the NCI-60 in furthering our understanding of cancers in vivo.

Entities:  

Mesh:

Year:  2012        PMID: 22848369      PMCID: PMC3406063          DOI: 10.1371/journal.pone.0040062

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The NCI-60 [1] is a panel of 60 human cancer cell lines used by the Developmental Therapeutics Program (DTP) of the U.S. National Cancer Institute to screen >100,000 compounds plus natural products since 1990. The NCI-60 panel includes cancers of colorectal (CO), renal (RE), ovarian (OV), prostate (PR), lung (LC), breast (BR), and central nervous system (CNS) origin, as well as leukemias (LE) and melanomas (ME). We and our many colleagues around the world have profiled the NCI-60 more comprehensively at the DNA, RNA, protein, mutation, functional, and pharmacological levels than any other panel of diverse cell types in existence. The NCI-60 data have been widely used in cancer research and bioinformatics, but the multiple datasets may be most informative for the recognition of complex ‘biosignatures’ (a ‘biosignature’ involves an ensemble of genes whose features are predictive). Analysis of such biosignatures has led to increased understanding of cell phenotypes and pathway relationships. When we recently clustered genes based on correlation of expression profiles across the NCI-60 [2], many of the clusters were associated with cancer-related biological functions. The number of such clusters far exceeded what would be expected by chance. One of the clusters, designated as “cluster 52 of the 160-cut,” was comprised of significant categories that generally reflected neuron development, immune response, and epithelial to mesenchymal transition (EMT) in addition to cell migration. In contrast, cluster 68 of the 160-cut was focused strongly on a single biological process, namely immune function.

Thumbnails of gene correlation clustering for Cluster 52 genes across (A) NCI-60 cell lines and (B) TCGA GBM samples.

The full size figures are available as Figures S1 and S2. The numbers appended after the gene name refer to the NCI-60 cluster in which that gene appeared. A previous study [3] compared the gene expression profiles between cell lines and breast tumor tissue samples. The authors noted that: “cell lines and tumors share many aspects of their gene expression patterns that can be related to the normal and pathological physiology that distinguishes breast cell types in vivo. These gene sets include 1) the basal epithelial cluster, 2) the luminal epithelial/ER+ cluster, 3) the ERBB2+ amplicon cluster, 4) the proliferation cluster, and 5) the interferon cluster.” The Fisher Exact p-value corresponding to this contingency table was 0.00039. Cancer cells in culture are subject to very different conditions than tumor cells in the host. They have been removed from their physiological milieu of other cell types, tissue architecture, hormonal influences, and autocrine/paracrine signals. So the question remained: “What does such a pattern of association in cell culture tell us about cancer cells in vivo?”. To explore that question, we analyzed the highly curated glioblastoma (GBM) transcript expression data set generated by the The Cancer Genome Atlas (TCGA) initiative [4]. TCGA was established to build a comprehensive catalogue of genomic and phenotypic abnormalities that drive carcinogenesis and potentially affect therapy in >20 different tumor types. In particular, TCGA has now provided a detailed view of genomic aberrations in a GBM cohort consisting of 206 patient samples. Verhaak and Hoadley et al. [5] recently described a gene expression-based molecular classification of GBM into Proneural, Neural, Classical and Mesenchymal subtypes and integrated multiple types of genomic data to establish patterns of somatic mutation, DNA copy number change, and gene expression. The Fisher’s Exact p-value for 100 randomizations corresponding to k = 3 was 0.464±0.279. The bold value indicates the lowest p-value model for the reals. This was the model used for the remainder of the cluster 52 analysis. In the present analysis, we tested whether sets of genes that we previously found to be (1) highly co-expressed across the NCI-60, and (2) functionally coherent were also highly co-expressed across the GBM samples. We then extended that basic analysis by a “double fishing” strategy. That is, we identified sets of genes that showed correlation ≥0.60 in both the NCI-60 and GBM data sets relative to a given “bait” gene. We found that the number of such gene sets far exceeded the number expected by chance. That analysis does not mean that cancer cells in culture share all, or even most, of their characteristics with cells in vivo, but it does indicate similarities. The Fisher Exact p-value corresponding to this contingency table was 0.00001. The Fisher’s Exact p-value for 100 randomizations corresponding to k = 3 was 0.529±0.283. The bold value indicates the lowest p-value model for the reals. This was the model used for the remainder of the cluster 68 analysis.

Methods

Datasets

For GBM expression data, the files unifiedScaled.txt (which contains a complete set of expression data, referred to as TCGA.GBM.complete) TCGA_unified_CORE_ClaNC840.txt (which includes the subtype tags of each sample) were downloaded from the TCGA web site http://tcga-data.nci.nih.gov/docs/publications/gbm_exp/. We used all 202 GBM samples that are available, representing roughly comparable numbers of samples of each subtype. Since the calculated correlation values will be more accurate if they come from a more diverse sampling population, we wanted to retain as much diversity as possible by looking at all subtypes together, so we did not report co-expression within or between subtypes.

Thumbnails of gene correlation clustering for Cluster 68 genes across (A) the NCI-60 cell lines and GBM samples (B).

The full size figures are available as Figures S3 and S4. The numbers appended after the gene name refer to the NCI-60 cluster in which that gene appeared.

Thumbnail of GO category versus gene list CIM for sets of genes with correlation ≥0.60 across both the NCI-60 and GBM samples.

The full size CIM is available as Figure S5. The gene name given as the column header is the representative of a list of genes. The full list of genes is available in the HTGM Download S1. NCI-60 expression data were obtained from CellMiner [6]. Determination of composite expression levels for each gene was performed as described previously [7]–[9]. A special request was made to the system administrator for the complete set of gene expression profiles (referred to as NCI-60.complete). That download would have been too large to perform through the standard web interface. Further details are provided in [2]). Briefly, NCI-60.complete was pre-processed by selecting only those genes that have both an HGNC symbol and annotation in the GO Biological Process ontology. Each gene profile vector was scaled to zero mean and unit variance. That reduced dataset is referred to here as NCI-60.BP. As mentioned above for the GBM samples, we are trying to achieve as high a degree of diversity as possible in the cell lines, so that the highly heterogeneous mixture of cell lines represented by the NCI-60 is ideal. For illustration, consider two genes. We are looking to see if the expression levels of those two genes go up and down together as we traverse the 60 cell lines. If all of the cells lines were essentially identical to one another, there would be no variation and we could not see how the two genes relate in different conditions. For most of the studies reported here, the expression data for GBM and for NCI-60 were restricted to those genes that were present in both TCGA.GBM.complete and NCI-60.BP. The designated gene (G) appearing in the gene column is the representative of a group of genes that correlate strongly with G. “NA” indicates that the gene set did not map to any statistically significant GO categories. The complete High-Throughput GoMiner (HTGM) download is provided in file Download S1. The files in the subdirectory work2026406846/inputFileDir are named according to each gene G. Each such file contains the complete list of genes correlating with G.

R Language

R language code [10] was developed to read and integrate the data in the two downloaded files, as well as to provide support for both basic and more complex queries [e.g., automatically find sets of genes meeting certain conditions with respect to both NCI-60 and GBM and then generate a relevant series of expression or correlation clustered image maps (CIMs)]. Historically, CIMs were first introduced in [11], [12].

Studies Predicated on Pre-existing Correlations Across the NCI-60

The key question we addressed here was whether genes that co-clustered with respect to their expression profiles across the NCI-60 cells also co-clustered with respect to their expression profiles across the GBM samples. To facilitate that analysis, we took advantage of the R language function cutree(). A key parameter in cutree() is “k” the number of clusters into which the cluster tree is to be divided. In the cluster 52 and cluster 68 studies (i.e., sets of genes reported in [2]), preliminary studies showed that k = 2 was optimal for NCI-60 expression clusters. Each such gene set had been derived from a clustering study using an absolute correlation metric, and therefore had two major partitionings (e.g., Figures 1A, S1). The two partitionings are designated as “cluster 1” and “cluster 2,” and are delineated by the number appended to each gene name on the right of the CIM. The genes within a single partitioning are mutually positively-correlated, and all genes in cluster 1 are negatively-correlated with all genes in cluster 2. We colloquially refer to the larger cluster (in the case of Figures 1A, S1, this would be cluster 2) as the “positively-correlated” genes and the smaller cluster as the “negatively-correlated” genes. In contrast to k = 2 for NCI-60, there was no a priori basis for selecting a particular value of k for the clustering across GBM, so we allowed k for GBM to range from 2 through 8.
Figure 1

Thumbnails of gene correlation clustering for Cluster 52 genes across (A) NCI-60 cell lines and (B) TCGA GBM samples.

The full size figures are available as Figures S1 and S2. The numbers appended after the gene name refer to the NCI-60 cluster in which that gene appeared.

To determine the optimal value of k, we constructed a 2×k contingency table (e.g., Table 1), each celli,j of which contains the number of genes that are both in the ith cluster of the NCI-60 clustering and the jth cluster of the GBM clustering. We computed a Fisher’s exact p-value for the null hypothesis that a distribution as extreme as the observed distribution could have occurred by chance. In addition, we randomized the gene names between performing the NCI-60 and GBM clusterings, to determine if the observed Fisher’s exact p-value could be achieved for a random gene set.
Table 1

Contingency table for cluster 52 re-clustered across NCI-60 and across GBM.

GBM CLUSTERMARGINALS
123
NCI-60 CLUSTER 1 91515
2 12341864
MARGINALS 21352379

The Fisher Exact p-value corresponding to this contingency table was 0.00039.

De novo Identification of Sets of Genes with Correlation ≥0.60 Across both NCI-60 and GBM

Without reference to any prior clustering analysis, the program constructed de novo a list of all pairs of genes having correlation ≥0.60 with respect to both NCI-60 and GBM expression profiles. The threshold of 0.60 was chosen for the calculations because it had been used in an earlier study of gene-gene correlations to minimize the number of false positives. Genes were ranked with respect to frequency of appearance in that list. Each gene “G” with frequency ≥5 was then used to “represent” the set of genes that showed correlation ≥0.60 with G. The top-ranking G gene was WAS (49 genes had correlation ≥0.60 with WAS). Many of the gene lists constructed by that method were highly redundant with respect to one another (i.e., pairs of lists may have many gene in common). To alleviate the redundancy problem, we computed the Jaccard similarity metric (the Jaccard coefficient measures similarity between sample sets, and is defined as the size of the intersection divided by the size of the union of the sample sets [13]) we eliminated highly redundant (Jaccard value ≥0.90; 0.90 was determined to be optimal in preliminary studies not shown here) gene sets from further analysis. Thus, we used a less-redundant set of 68 gene sets (from an initial selection of the top (possibly redundant) 100 gene sets) for the analysis. We wished to determine if the number of pairs of genes having correlation ≥0.60 with respect to both NCI-60 and GBM expression profiles exceeded the number expected by chance. We therefore performed a set of 10 studies in which we randomized the gene names in the GBM expression profiles. The number of such pairs obtained in the real study was 2708. In contrast, the number in the randomization studies was small in comparison (193±14).

Functional Categorization

Functional categorization of gene lists was performed using the High-Throughput GoMiner (HTGM) program [14]. The parameters used in running HTGM are tabulated in Table S1.

Clustered Image Maps

We used either the Genesis clustering program [15] or our own in-house R language code to construct CIMs presented here.

Results and Discussion

Studies Predicated on Existing Correlations Across the NCI-60

We recently clustered genes based on correlation of expression profiles across the NCI-60 [2]. Many of those clusters were characterized by cancer-associated biological functions. Using the expression profiles for the cluster 52 genes across the NCI-60 cell lines and also across the GBM samples, we were able to generate expression correlation CIMs across both of those sets of expression profiles (Figures 1A, S1, 1B, S2). The distinct patterns of red and green in the NCI-60 correlation CIM (Figures 1A, S1) results from the fact that cluster 52 had been derived by clustering the expression profiles in the NCI-60 cell lines using an absolute correlation metric. Thus, cluster 52 is composed of “negatively” and “positively” correlated subgroups. Not surprisingly, the patterns of red and green are less distinct in the GBM correlation CIM (Figures 1B, S2), since cluster 52 had been defined relative to NCI-60, not GBM, expression patterns. Although less distinct than for NCI-60, the GBM pattern is highly correlated with the pattern for NCI-60. That relationship is obvious by visual inspection. The quantitative analysis below confirms the visual impression. In the correlation CIMs, we appended a number (1 or 2) to the gene names, corresponding to membership in the two major clusters in the NCI-60 CIM. Those same numbers were retained in the gene names for the GBM CIM to allow identification of the cluster to which that gene belonged in the NCI-60 CIM. The pattern of clustering in the GBM correlation CIM (Figures 1B, S2) is markedly similar to that in the NCI-60 CIM. That observation shows that some gene co-expression patterns in the NCI-60 human tumor cell line panel are preserved in clinical glioblastoma, and supports our conjecture that NCI-60 gene expression correlations can indicate widely applicable gene-gene relationships. More precisely, Table 1 shows that there are 15 genes in cluster 1 and 64 genes in cluster 2, relative to the NCI-60 expression profile. Thirty-four of the 64 cluster 2 genes are the predominant members of GBM cluster 2. The remaining NCI-60 cluster 2 genes are distributed across GBM clusters 1 and 3. The concordance between the clustering patterns in NCI-60 and GBM is highly significant (Table 2). The Fisher’s exact p-value for k = 3 (0.00039) is strikingly lower than for the randomized controls (0.46±0.28). Furthermore, the large majority of the genes that were mutually correlated or anti-correlated in the NCI-60 preserved that relationship in the GBM tissue samples. The identities of the relevant genes are shown in Table 3.
Table 2

Fisher Exact p-values for concordance of re-clustering cluster 52 across NCI-60 and across GBM.

NUMBER OF CLUSTERS (parameter “k” of cutree())
2345678
0.00254 0.00039 0.001030.000930.001670.001570.00189

The Fisher’s Exact p-value for 100 randomizations corresponding to k = 3 was 0.464±0.279. The bold value indicates the lowest p-value model for the reals. This was the model used for the remainder of the cluster 52 analysis.

Table 3

Concordance of cluster 52 genes in NCI-60 and GBM clusters.

GBM CLUSTER
123
NCI-60 CLUSTER 1 ACCN2FOSGALK2
ALDOCMCAT
EPB41MRPS30
FOXO4SCO2
MUTYHSLC43A1
OXSM
PDE3B
PRMT7
SORD
2 AGERACTBACTN3
ALCAMADAM9CDH2
DBN1AXLCDH4
DDAH1CAV1CHST3
DYNLL1CAV2DKK3
GLRBCLIC4EGFR
MYBL1CNN2FADS3
NMT2DLC1FEZ2
ROBO3DSEFGF2
SYNE1DUSP1KCNMA1
TRPC1FGFR1MGLL
ZNF281FOSL2MT1X
GADD45AMYL6
GAS6SOCS5
IL6SORBS3
INHBATGFB2
ITGB1TUFT1
JUNVCAN
KLF7
LOXL2
MYH9
MYLK
PDGFC
PDLIM7
PTRF
PVR
QSOX1
RRAS
TGFB1I1
THBS1
TNFRSF12A
TRAM1
VCL
ZYX
A notable finding is that nearly half of the genes in GBM cluster 2 (Figures 1B, S2) are genes that were previously found to be involved in cell adhesion/migration and to form a mutually-high correlation subset of the cluster 52 genes [16]. Moreover, those genes were found to function coherently in a particular aspect of the cell migration process. With the exception of ALCAM and EGFR, the cell adhesion/migration tight cluster genes fall within GBM cluster 2. Sixteen of twenty-four genes of that tight cluster fall into GBM cluster 2. Thus, a set of genes previously found to be closely related in both gene expression and function in the NCI-60 cell lines [2], [16] are now found to be co-expressed also in clinical glioblastoma samples. To investigate other potential examples of coherence between gene expression clusters in NCI-60 cell lines and GBM samples, we repeated that analysis with the immune system-related cluster 68 genes [2] (Tables 4–6; Figures 2A, S3, 2B, S4). Again, the Fisher’s exact p-value (0.00001) (Table 5) validates the visual impression that there is a significant concordance between the NCI-60 and the GBM clustering.
Table 4

Contingency table for cluster 68 re-clustered across NCI-60 and across GBM.

GBM CLUSTERMARGINALS
1234
NCI-60 CLUSTER 1 8110524
2 1112381475
MARGINALS 1923381999

The Fisher Exact p-value corresponding to this contingency table was 0.00001.

Table 6

Concordance of cluster 68 genes in NCI-60 and GBM clusters.

GBM CLUSTER
1234
NCI-60 CLUSTER 1 ACVR2AADAM15APP
CSNK1G3AGPAT3DOCK1
CTBP2AHNAKMLF1
MAP3K13DUSP3NOL3
SMAD5EMP2OAT
YES1GRN
ZNF205LMNA
ZNF35MGAT4B
PLP2
SPR
ZFHX3
2 CHRNA3ADAAIF1C9
CYFIP2ALDH1A2ARHGDIBCD1A
ELOVL4CD1ECCR4GDF10
GNB1LCD79ACCR7GRAP
GRIK5GP5CD1DLY6H
MYBIGLL1CD27NKX2-5
NFKBIL1KRT1CD3DPRKCQ
SLIT1LAX1CD3ERAG2
SMPD3LTACD3GRASGRP2
TSPAN7PTGDRCD4RORB
USP20RAG1CD5SLC15A2
SEPT6CD52SLC18A2
CD7TAL1
CD84VPREB1
CD96
CECR1
CTSW
FLI1
FYB
GFI1
GMFG
GNA15
GRAP2
IL12RB1
IL2RG
ITGAL
ITGB2
ITK
LST1
MAP4K1
RHOH
SELL
SH2D1A
SIT1
SPN
TRAT1
TREML2
ZAP70
Figure 2

Thumbnails of gene correlation clustering for Cluster 68 genes across (A) the NCI-60 cell lines and GBM samples (B).

The full size figures are available as Figures S3 and S4. The numbers appended after the gene name refer to the NCI-60 cluster in which that gene appeared.

Table 5

Fisher Exact p-values for concordance of re-clustering cluster 68 across NCI-60 and across GBM.

NUMBER OF CLUSTERS (parameter “k” of cutree())
2345678
0.091620.10917 0.00001 0.000010.000010.000010.00001

The Fisher’s Exact p-value for 100 randomizations corresponding to k = 3 was 0.529±0.283. The bold value indicates the lowest p-value model for the reals. This was the model used for the remainder of the cluster 68 analysis.

De novo Identification of Sets of Genes with Correlation ≥0.60 Across both NCI-60 and TCGA GBM

There were 34,865 gene pairs with correlation ≥0.60 in the NCI-60 data set but not in GBM, 87,556 in GBM but not in the NCI-60, and 2,708 in both the NCI-60 and GBM. The highest-ranking gene of the 2,708 was WAS; 49 genes showed correlation ≥0.60 with WAS. Of the top 100 genes (i.e., genes with the highest number of correlations ≥0.60), 68 were non-redundant (i.e., the lists of correlating genes had Jaccard value ≤0.90). Functional categorization of those 68 gene lists by High-Throughput GoMiner (HTGM) revealed a complex set of significant categories (Figures 3, S5). The number of genes and the generalized functional correlations for the top 68 non-redundant gene sets are listed in Table 7. As is evident from Table 7, immune categories dominated, but Table 7 and Figure S5 reveal that there were also categories representing e.g. apoptosis, chemotaxis, DNA repair, chromatin assembly, angiogenesis, and adhesion.
Figure 3

Thumbnail of GO category versus gene list CIM for sets of genes with correlation ≥0.60 across both the NCI-60 and GBM samples.

The full size CIM is available as Figure S5. The gene name given as the column header is the representative of a list of genes. The full list of genes is available in the HTGM Download S1.

Table 7

Summary of high ranking genes with correlation ≥0.60 across both NCI-60 and GBM.

rankDesignated gene (G)number of genes with correlation ≥0.60generalized functional correlationnumber of genes incommon with cluster 52Number of genes in common with cluster 68
1WAS50immune021
2IL2RG37immune023
3CD437immune09
4CD4836immune013
5PTPRC35immune015
6PTPN735immune012
7HCLS135immune012
8CORO1A35immune011
9CD3734immune08
10PLCB233immune012
11RHOH32immune016
12LILRA231immune04
13LRMP29immune09
14RNASE628immune03
15NCF428immune00
16CD3D28immune020
17CSF3R27immune04
18CYBB26immune00
19SIT123immune019
20DOCK223immune09
21CD1D23immune014
22MYH921angiogenesis70
23CD221immune015
24SERPINE120angiogenesis40
25LOXL220angiogenesis40
26CYR6120angiogenesis60
27SH2D1A19immune014
28GNA1519immune010
29COL5A119extracellular matrix20
30BTK19immune02
31LY8618immune00
32LOX18angiogenesis20
33FLNA18cell-cell junciton60
34CD5218immune013
35S100A817immune02
36RNASE317immune02
37LGALS917immune08
38CSF2RB17immune00
39CD517immune013
40TNFRSF12A16chemotaxis50
41ST8SIA416tyrosine phosphorylation05
42PLK416mitosis00
43OAS216immune00
44MCM316DNA repair00
45LSP116immune02
46ITGB216immune010
47CD9616immune014
48CD716immune011
49ATP2A316immune012
50COL1A215extracellular matrix00
51CCR715immune013
52ADAM1215extracellular matrix10
53MED614NA00
54LGALS114NA00
55LCK14immune010
56FOSL214angiogenesis40
57DSE14immune40
58COL6A314extracellular matrix00
59COL1A114extracellular matrix10
60CLCF114tyrosine phosphorylation70
61ANXA214NA30
62PTPN2213NA04
63PLAUR13immune10
64LTB13immune011
65CTSW13immune013
66IRAK312immune00
67GRB712NA00
68CTCF12chromatin assembly00

The designated gene (G) appearing in the gene column is the representative of a group of genes that correlate strongly with G.

“NA” indicates that the gene set did not map to any statistically significant GO categories.

The complete High-Throughput GoMiner (HTGM) download is provided in file Download S1. The files in the subdirectory work2026406846/inputFileDir are named according to each gene G. Each such file contains the complete list of genes correlating with G.

The genes in cluster 52 or cluster 68 had been obtained by prior clustering of the gene expression profiles across NCI-60 cell lines, but not across TCGA GBM samples. We expect to find that some of the do novo gene lists derived from simultaneous consideration of both NCI-60 cell lines and TCGA GBM samples might overlap with genes in the cluster 52 or cluster 68 gene lists. In fact, Table 7 shows that the genes in certain of the de novo gene lists overlapped with the genes in NCI-60 clusters 52 (cell migration) and 68 (immune). Such overlap is particularly strong for cluster 68. This analysis shows ways in which strong gene-gene correlations and functional categorization (ie., GO assignments) obtained for the NCI-60 cell lines across tumor types can reflect in vivo relationships. It also shows the limitations of such similarity. The two types of sample sets represent major initiatives of the National Cancer Institute (NCI), in terms of both expense and research investment. Hence, a delineation of the similarities and differences remains a subject of considerable practical importance. Full version of Figure 1A. (PDF) Click here for additional data file. Full version of Figure 1B. (TIF) Click here for additional data file. Full version of Figure 2A. (PDF) Click here for additional data file. Full version of Figure 2B. (PDF) Click here for additional data file. HTGM GO categories versus gene set CIM for sets of genes with correlation ≥ 0.60 simultaneously in both NCI-60 and TCGA GBM. (PNG) Click here for additional data file. The parameters used in running HTGM. (JPG) Click here for additional data file. Zip archive of HTGM results. (ZIP) Click here for additional data file.
  14 in total

1.  Genesis: cluster analysis of microarray data.

Authors:  Alexander Sturn; John Quackenbush; Zlatko Trajanoski
Journal:  Bioinformatics       Date:  2002-01       Impact factor: 6.937

2.  Spotlight on molecular profiling: "Integromic" analysis of the NCI-60 cancer cell lines.

Authors:  John N Weinstein
Journal:  Mol Cancer Ther       Date:  2006-11-06       Impact factor: 6.261

3.  Genome-wide mRNA and microRNA profiling of the NCI 60 cell-line screen and comparison of FdUMP[10] with fluorouracil, floxuridine, and topoisomerase 1 poisons.

Authors:  William H Gmeiner; William C Reinhold; Yves Pommier
Journal:  Mol Cancer Ther       Date:  2010-12       Impact factor: 6.261

4.  An information-intensive approach to the molecular pharmacology of cancer.

Authors:  J N Weinstein; T G Myers; P M O'Connor; S H Friend; A J Fornace; K W Kohn; T Fojo; S E Bates; L V Rubinstein; N L Anderson; J K Buolamwini; W W van Osdol; A P Monks; D A Scudiero; E A Sausville; D W Zaharevitz; B Bunow; V N Viswanadhan; G S Johnson; R E Wittes; K D Paull
Journal:  Science       Date:  1997-01-17       Impact factor: 47.728

5.  Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1.

Authors:  Roel G W Verhaak; Katherine A Hoadley; Elizabeth Purdom; Victoria Wang; Yuan Qi; Matthew D Wilkerson; C Ryan Miller; Li Ding; Todd Golub; Jill P Mesirov; Gabriele Alexe; Michael Lawrence; Michael O'Kelly; Pablo Tamayo; Barbara A Weir; Stacey Gabriel; Wendy Winckler; Supriya Gupta; Lakshmi Jakkula; Heidi S Feiler; J Graeme Hodgson; C David James; Jann N Sarkaria; Cameron Brennan; Ari Kahn; Paul T Spellman; Richard K Wilson; Terence P Speed; Joe W Gray; Matthew Meyerson; Gad Getz; Charles M Perou; D Neil Hayes
Journal:  Cancer Cell       Date:  2010-01-19       Impact factor: 31.743

6.  Predictive statistics and artificial intelligence in the U.S. National Cancer Institute's Drug Discovery Program for Cancer and AIDS.

Authors:  J N Weinstein; T Myers; J Buolamwini; K Raghavan; W van Osdol; J Licht; V N Viswanadhan; K W Kohn; L V Rubinstein; A D Koutsoukos
Journal:  Stem Cells       Date:  1994-01       Impact factor: 6.277

7.  High-Throughput GoMiner, an 'industrial-strength' integrative gene ontology tool for interpretation of multiple-microarray experiments, with application to studies of Common Variable Immune Deficiency (CVID).

Authors:  Barry R Zeeberg; Haiying Qin; Sudarshan Narasimhan; Margot Sunshine; Hong Cao; David W Kane; Mark Reimers; Robert M Stephens; David Bryant; Stanley K Burt; Eldad Elnekave; Danielle M Hari; Thomas A Wynn; Charlotte Cunningham-Rundles; Donn M Stewart; David Nelson; John N Weinstein
Journal:  BMC Bioinformatics       Date:  2005-07-05       Impact factor: 3.169

8.  A comparison of gene expression signatures from breast tumors and breast tissue derived cell lines.

Authors:  D T Ross; C M Perou
Journal:  Dis Markers       Date:  2001       Impact factor: 3.434

9.  CellMiner: a relational database and query tool for the NCI-60 cancer cell lines.

Authors:  Uma T Shankavaram; Sudhir Varma; David Kane; Margot Sunshine; Krishna K Chary; William C Reinhold; Yves Pommier; John N Weinstein
Journal:  BMC Genomics       Date:  2009-06-23       Impact factor: 3.969

10.  Gene expression profiles of the NCI-60 human tumor cell lines define molecular interaction networks governing cell migration processes.

Authors:  Kurt W Kohn; Barry R Zeeberg; William C Reinhold; Margot Sunshine; Augustin Luna; Yves Pommier
Journal:  PLoS One       Date:  2012-05-03       Impact factor: 3.240

View more
  8 in total

Review 1.  An Interactive Resource to Probe Genetic Diversity and Estimated Ancestry in Cancer Cell Lines.

Authors:  Julie Dutil; Zhihua Chen; Alvaro N Monteiro; Jamie K Teer; Steven A Eschrich
Journal:  Cancer Res       Date:  2019-03-20       Impact factor: 12.701

2.  Coregulation of host-response genes in integument: switchover of gene expression correlation pattern and impaired immune responses induced by dipteran parasite infection in the silkworm, Bombyx mori.

Authors:  Anitha Jayaram; Appukuttan Nair R Pradeep; Arvind K Awasthi; Geetha N Murthy; Kangayam M Ponnuvel; Sirigineedi Sasibhushan; Guruprasad C Rao
Journal:  J Appl Genet       Date:  2013-12-06       Impact factor: 3.240

3.  Microarray Analysis in Glioblastomas.

Authors:  Kaumudi M Bhawe; Manish K Aghi
Journal:  Methods Mol Biol       Date:  2016

Review 4.  In vitro human cell line models to predict clinical response to anticancer drugs.

Authors:  Nifang Niu; Liewei Wang
Journal:  Pharmacogenomics       Date:  2015       Impact factor: 2.533

5.  High resolution copy number variation data in the NCI-60 cancer cell lines from whole genome microarrays accessible through CellMiner.

Authors:  Sudhir Varma; Yves Pommier; Margot Sunshine; John N Weinstein; William C Reinhold
Journal:  PLoS One       Date:  2014-03-26       Impact factor: 3.240

6.  A self-updating road map of The Cancer Genome Atlas.

Authors:  David E Robbins; Alexander Grüneberg; Helena F Deus; Murat M Tanik; Jonas S Almeida
Journal:  Bioinformatics       Date:  2013-04-17       Impact factor: 6.937

Review 7.  Cell line modeling for systems medicine in cancers (review).

Authors:  Nayoung Kim; Ningning He; Sukjoon Yoon
Journal:  Int J Oncol       Date:  2013-12-02       Impact factor: 5.650

8.  A membrane transporter determines the spectrum of activity of a potent platinum-acridine hybrid anticancer agent.

Authors:  Xiyuan Yao; Noah H Watkins; Heather Brown-Harding; Ulrich Bierbach
Journal:  Sci Rep       Date:  2020-09-16       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.