| Literature DB >> 19549304 |
Uma T Shankavaram1, Sudhir Varma, David Kane, Margot Sunshine, Krishna K Chary, William C Reinhold, Yves Pommier, John N Weinstein.
Abstract
BACKGROUND: Advances in the high-throughput omic technologies have made it possible to profile cells in a large number of ways at the DNA, RNA, protein, chromosomal, functional, and pharmacological levels. A persistent problem is that some classes of molecular data are labeled with gene identifiers, others with transcript or protein identifiers, and still others with chromosomal locations. What has lagged behind is the ability to integrate the resulting data to uncover complex relationships and patterns. Those issues are reflected in full form by molecular profile data on the panel of 60 diverse human cancer cell lines (the NCI-60) used since 1990 by the U.S. National Cancer Institute to screen compounds for anticancer activity. To our knowledge, CellMiner is the first online database resource for integration of the diverse molecular types of NCI-60 and related meta data. DESCRIPTION: CellMiner enables scientists to perform advanced querying of molecular information on NCI-60 (and additional types) through a single web interface. CellMiner is a freely available tool that organizes and stores raw and normalized data that represent multiple types of molecular characterizations at the DNA, RNA, protein, and pharmacological levels. Annotations for each project, along with associated metadata on the samples and datasets, are stored in a MySQL database and linked to the molecular profile data. Data can be queried and downloaded along with comprehensive information on experimental and analytic methods for each data set. A Data Intersection tool allows selection of a list of genes (proteins) in common between two or more data sets and outputs the data for those genes (proteins) in the respective sets. In addition to its role as an integrative resource for the NCI-60, the CellMiner package also serves as a shell for incorporation of molecular profile data on other cell or tissue sample types.Entities:
Mesh:
Year: 2009 PMID: 19549304 PMCID: PMC2709662 DOI: 10.1186/1471-2164-10-277
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic representation of CellMiner. CellMiner was constructed using four data resources (associated data). The user submits a job to CellMiner via a user-friendly web interface, and the job is then processed in background. Upon completion, results are returned to the user in a new HTML page or can be exported to various formats. CellMiner is publicly available at .
Description of the datasets included in the current version of CellMiner. More will be added on a continuing basis.
| S.no | Data set | Description | Reference |
| DNA | |||
| 1 | aCGH | DNA copy number changes from bacterial artificial chromosome array | Bussey et al., 2006[ |
| 2 | Mutation | DNA sequencing data on mutations on 24 human cancer genes | Ikediobi et al, 2006[ |
| 3 | Methylation of E-cadherin promoter | PCR amplification and sequencing of sodium bisulfite modified DNA | Reinhold et al, 2007[ |
| RNA | |||
| 4 | cDNA | cDNA clone microarray with 9,607 features | Scherf et al, 2000; Shankavaram et al., 2007[ |
| 5 | HU6800 | Affymetrix 6,800-feature microarray | Shankavaram et al., 2007[ |
| 6 | HGU95 | Affymetrix 64,000-feature microarray | Shankavaram et al., 2007[ |
| 7 | HGU133 | Affymetrix 44,000-feature microarray | Shankavaram et al., 2007[ |
| 8 | ABC transporter | RT-PCR data on 47 ABC transporters | Szakacs, et al., 2004[ |
| 9 | Ion transporter | 632-feature 70-mer oligo microarray | Huang et al., 2004[ |
| 10 | NCI-60 radiation | Microarray with 612 ESTs plus another set of 616 ESTs chosen on the basis of their known roles in cancer lymphoid biology | Amundson et al., 2008[ |
| 11 | microRNA | 627 human microRNA probes, including 321 mature microRNAs, as well as probes for most of their precursors. | Blower et al., 2007[ |
| Protein | |||
| 12 | RPLA | Reverse phase antibody lysate array with detection using 156 monoclonal antibodies | Nishizuka et al, 2003; Shankavaram et al., 2007[ |
| Drug | |||
| 13 | A118 | The "mechanism of action" set with 6 compound classes. The list of compounds was assembled for an earlier study as training set for neural network analysis of drug mechanism of action. | Weinstein et al., 1992 [ |
| 14 | A1429 | Combination of A118 and A1400 selected from > 70,000 tested, publicly available compounds by applying a series of filters (see text for description) | Scherf et al., 2000; Szakacs et al., 2004[ |
| 15 | A4463 | Selected compounds tested in the NCI DTP's sulforhodamine B assay two or more times and for which structure records are available. | Blower et al., 2002[ |
Summary of search functions and criteria available in the CellMiner resource.
| NCI-60/DU145-RC0.1/both | NCI-60/DU145/RC0.1 | NCI-60/DU145/RC0.1 | NCI-60/DU145/RC0.1/both | NCI-60/DU145/RC0.1 | NCI-60 | NCI-60 (59 cell lines) | |
| NA | DNA/RNA/Protein | DNA/RNA/Protein | DNA/RNA Protein | DNA/RNA/Protein | Drug sensitivity | DNA | |
| All/tissue type selection | All | All/tissue type selection | All | All | All/tissue type selection | NA | |
| NA | Raw | Normalized | gene or platform specific id, chromosome or genomic location | NA | NSC, Chemical name, Molecular formula | NA | |
| NA | NA | NA | File attachment, list, single value | NA | File attachment, list, single value | NA | |
| Information on Patient, cell line, Experimental details | Quantification of image files | Log base2 | HUGO, Entrez Gene id, Gene Symbol, Chromosome, Cytoband, mRNA-Refseq, Protein-Refseq, Transcription start and Transcription-end | HUGO, Entrez Gene id, Gene Symbol, Chromosome, Cytoband, mRNA-Refseq, Protein-Refseq, Transcription start and Transcription-end | Chemical name, SMILES, molecular formula, molecular weight, mechanism of action | HUGO, Zygosity, CDS mutation, AA mutation, mutation characterization | |
| HTML table of cell line information | zip | Text of log2 intensity values | HTML, MS-Excel, text file of log2 intensity values | Text file of log2 intensity values for each the matching datasets | HTML, MS-Excel, Text | HTML, Text | |
Figure 2Illustrative screen shot of the CellMiner graphical user interface. Shown is the result of a "Cell Line Metadata" query on a user-selected CNS tissue subset. Included in the output are literature citations for information on the lines selected. The default selection is the entire NCI-60. The results shown here resulted from background processing of the job and display via the graphical web interface.
Figure 3Screen shot of the "Query Datasets" input page. Shown is the result of a "Query Datasets" on a user-selected query options available to extract molecular profile data from CellMiner.
Figure 4Screen shot of the "Query Datasets" result page. An illustrative output page displaying results of a complex "Query Datasets" search. For this particular output, the query was constructed for TP53 (identified by HUGO name) to include two datasets containing Gene symbols and chromosome numbers.