| Literature DB >> 21930502 |
Junjun Zhang1, Joachim Baran, A Cros, Jonathan M Guberman, Syed Haider, Jack Hsu, Yong Liang, Elena Rivkin, Jianxin Wang, Brett Whitty, Marie Wong-Erasmus, Long Yao, Arek Kasprzyk.
Abstract
The International Cancer Genome Consortium (ICGC) is a collaborative effort to characterize genomic abnormalities in 50 different cancer types. To make this data available, the ICGC has created the ICGC Data Portal. Powered by the BioMart software, the Data Portal allows each ICGC member institution to manage and maintain its own databases locally, while seamlessly presenting all the data in a single access point for users. The Data Portal currently contains data from 24 cancer projects, including ICGC, The Cancer Genome Atlas (TCGA), Johns Hopkins University, and the Tumor Sequencing Project. It consists of 3478 genomes and 13 cancer types and subtypes. Available open access data types include simple somatic mutations, copy number alterations, structural rearrangements, gene expression, microRNAs, DNA methylation and exon junctions. Additionally, simple germline variations are available as controlled access data. The Data Portal uses a web-based graphical user interface (GUI) to offer researchers multiple ways to quickly and easily search and analyze the available data. The web interface can assist in constructing complicated queries across multiple data sets. Several application programming interfaces are also available for programmatic access. Here we describe the organization, functionality, and capabilities of the ICGC Data Portal.Entities:
Mesh:
Year: 2011 PMID: 21930502 PMCID: PMC3263593 DOI: 10.1093/database/bar026
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.International Cancer Genome Consortium (ICGC) projects (March 2011).
Figure 2.An overview of the ICGC Data Portal architecture.
The summary of data available on the ICGC Data Portal divided by cancer project
| Source | Cancer project | Data set | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Simple mutations | Copy number alterations | Structural rearrangements | Gene expression | miRNA expression | Exon junctions | DNA methylation | Germline variations | ||
| ICGC | Breast carcinoma (WTSI, UK) | ||||||||
| Liver cancer (NCC, JP) | |||||||||
| Liver cancer (RIKEN, JP) | |||||||||
| Malignant melanoma (WTSI, UK) | |||||||||
| Pancreatic cancer (OICR, CA) | |||||||||
| Pancreatic cancer (QCMG, AU) | |||||||||
| Small cell lung carcinoma (WTSI, UK) | |||||||||
| TCGA | Acute myeloid leukemia | ||||||||
| Breast invasive carcinoma | |||||||||
| Colon adenocarcinoma | |||||||||
| GlioblastomaMultiforme | |||||||||
| Kidney renal clear cell carcinoma | |||||||||
| Kidney renal papillary cell carcinoma | |||||||||
| Lung adenocarcinoma | |||||||||
| Lung squamous cell carcinoma | |||||||||
| Ovarian serous cystadenocarcinoma | |||||||||
| Rectum adenocarcinoma | |||||||||
| Stomach adenocarcinoma | |||||||||
| Uterine corpus endometrioid carcinoma | |||||||||
| Other | Breast cancer (JHU, US) | ||||||||
| Colorectal cancer (JHU, US) | |||||||||
| GlioblastomaMultiforme (JHU, US) | |||||||||
| Lung adenocarcinoma (TSP, US) | |||||||||
| Pancreatic cancer (JHU, US) | |||||||||
Public databases federated with the ICGC Data Portal
| Source | URL | Description of contents |
|---|---|---|
| Ensembl | Genome annotation | |
| Reactome | Pathway annotation | |
| KEGG | Pathway annotation | |
| COSMIC | Somatic mutations in cancer | |
| Pancreatic Expression Database | Pancreatic cancer expression data | |
| Breast Cancer Campaign Tissue Bank | Breast cancer expression data |
Figure 3.Screenshot of the ICGC Data Portal home page. Three main entry points are available: (A) Identifier search, (B) Analysis and (C) Database search.
Figure 4.Gene Report for KRAS includes: (A) Gene annotation data from Ensembl, Pathway annotation from (B) KEGG and (C) Reactome, (D) Summary of mutation frequencies in each cancer project, (E) Mutation data from COSMIC, Expression data from (F) Pancreatic Expression Database and (G) Breast Cancer Campaign Tissue Bank (BCCTB). Different sections of this page come from federated BioMart sources.
Figure 5.Results from affected pathways analysis for breast cancer (JHU, US). (A) Affected pathways are shown in a chart, with bars representing the number of affected genes in each pathway. (B) By clicking on the bar, users are able to view and download the genes that were mutated in each pathway.
Figure 6.A screenshot of the Quickquery interface.
Figure 7.A screenshot of the Flexiblequery interface.
Figure 8.A screenshot of the Advancedquery interface.
| Data sets | Cancer projects | Filters | Attributes |
|---|---|---|---|
| Genes | Pancreatic Cancer (QCMG, AU) | Rearrangement type: deletion | Ensembl Gene ID |
| Copy number alteration type: loss | Gene symbol | ||
| Gene description |
| Data sets | Cancer projects | Filters | Attributes |
|---|---|---|---|
| Genes | Colorectal cancer (JHU, US) | Pathway: Signaling by Wnt | Tumor sample ID |
| Consequence type: non_synonymous_ coding | Mutation ID | ||
| Mutation type | |||
| Donor ID | |||
| Diagnosis ID | |||
| Clinical staging (WHO) | |||
| Ensembl Gene ID | |||
| Gene Symbol |
| Data sets | Cancer projects | Filters | Attributes |
|---|---|---|---|
| Genes | Pancreatic Cancer (QCMG, AU) | Copy number alteration type: gain | Ensembl Gene ID |
| Platform: SOLiD sequencing | |||
| Normalized read count | |||
| Raw read count |