| Literature DB >> 21903633 |
Christian Perez-Llamas1, Gunes Gundem, Nuria Lopez-Bigas.
Abstract
Recently, we created IntOGen, a resource to integrate a large amount of cancer genomic data. IntOGen aims at facilitating the detection of the most recurrent alterations that drive tumorigenesis. It collates, annotates and analyzes high-throughput data about transcriptional, genomic and mutational changes taking place in tumors from different studies annotated with specific cancer types. Currently, it contains 118 studies for mRNA expression profiling and 188 studies for genomic alterations covering in total 64 different tumor topographies. In this article, we describe the Biomart portal for IntOGen. The portal provides easy access to different types of data and facilitates the bulk download of all the analysis results. Here, we describe the general features of IntOGen and give example queries to demonstrate its use. Database URL: www.intogen.org.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21903633 PMCID: PMC3169995 DOI: 10.1093/database/bar039
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Data annotation and classification. Each sample assay in a study is annotated for the platform and the ICD-O topography and morphology terms. An experiment in IntOGen consists of a set of assays coming from the same study that have been performed with the same platform. The analysis pipeline then generates overlapping groups of assays from the same experiment in ‘analysis units’ in two ways, 1) according to the topography and the morphology, and 2) according to the topography (with the morphology annotated as ‘Any morphology’).
Summary of the data content in IntOGen (v03)
| Alteration type | Main data sources | Number of independent studies | Number of experiments | Number of analysis units |
|---|---|---|---|---|
| Transcriptomic | GEO | 118 | 122 | 243 |
| ArrayExpress | ||||
| TCGA | ||||
| Genomic (copy number) | Progenetix | 188 | 188 | 343 |
| Sanger Cancer Genome Project | ||||
| TCGA | ||||
| Total | 306 | 310 | 586 |
Figure 2.Flowchart of the analyses in IntOGen. Each analysis unit (set of assays from the same study using the same platform and annotated with the same ICD-O terms) is analysed to detect the significantly altered genes. The gene-level experiment results are analyzed further to detect significantly altered modules. The experiment results with the same ICD-O terms are combined both at the gene level and at the module level. For methods details see (1).
Databases and data sets in the BioMart of IntOGen
| Databases | Data sets | Description |
|---|---|---|
| Experiments | Gene genomic alterations | Recurrence and significance of genomic alteration (gain and loss) for each gene at the level of experiments |
| Gene transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for each gene at the level of experiments | |
| KEGG genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for each KEGG pathway at the level of experiments | |
| KEGG transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for each KEGG pathway at the level of experiments | |
| GO genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for each GO term at the level of experiments | |
| GO transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for each GO term at the level of experiments | |
| TFBS genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for putative targets of each TF at the level of experiments | |
| TFBS transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for putative targets of each TF at the level of experiments | |
| miRNA genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for putative targets of each miRNA at the level of experiments | |
| miRNA transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for putative targets of each miRNA at the level of experiments | |
| Combinations | Gene genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for each gene at the level of combinations (tumor types and subtypes) |
| Gene transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for each gene at the level of combinations (tumor types and subtypes) | |
| KEGG genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for each KEGG pathway at the level of combinations (tumor types and subtypes) | |
| KEGG transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for each KEGG pathway at the level of combinations (tumor types and subtypes) | |
| GO genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for each GO term at the level of combinations (tumor types and subtypes) | |
| GO transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for each GO term at the level of combinations (tumor types and subtypes) | |
| TFBS genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for putative targets of each TF at the level of combinations (tumor types and subtypes) | |
| TFBS transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for putative targets of each TF at the level of combinations (tumor types and subtypes) | |
| miRNA genomic alterations | Recurrence and significance of genomic alterations (gain and loss) for putative targets of each miRNA at the level of combinations (tumor types and subtypes) | |
| miRNA transcriptomic alterations | Recurrence and significance of transcriptomic alterations (upregulation and downregulation) for putative targets of each miRNA at the level of combinations (tumor types and subtypes) | |
| Oncomodules | Combinations | Sets of genes significantly altered in each cancer type and subtype |
| Experiments | Sets of genes significantly altered in each experiment |
Figure 3.Screenshot showing the attribute selection for the query 1. On the left, the selected dataset, filters and attributes are shown. On the right the detailed attributes selection view. To retrieve the results the user should click on the ‘Results’ button on the upper-left black bar, the ‘Count’ button gives the number of rows that match the query and the ‘New’ button allows to start a new query.
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| Database: IntOGen Combinations | Gene Genomic Alterations | Genes: ID List limit by a file with ids (Ensembl, Entrez etc.) | Genes>Ensembl> Gene Ensembl ID |
| ICD-O topography and morphology: breast; ANY morphology | Genes>Ensembl> Gene symbol | ||
| References>External references>Entrez Gene id | |||
| Results>Genomics> Gain | |||
| Results>Genomics>Loss |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Oncomodules | Combinations: Oncomodules | Type of alteration: Gain | Genes>Ensembl> Gene Ensembl ID |
| ICD-O Topography and Morphology: lung; ANY morphology |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Combinations | Gene transcriptomic alterations | Genes: ID List from the previous query | Genes>Ensembl> Gene Ensembl ID |
| ICD-O topography and morphology: lung; ANY morphology | Genes>Ensembl> Gene symbol | ||
| Results > Transcriptomic > Upregulation | |||
| Results > Transcriptomic > Downregulation |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Combinations | Gene genomic alterations | ICD-O topography and morphology: brain; ANY morphology | Genes>Ensembl> Gene Ensembl ID |
| ICD-O topography and morphology: brain; astrocytoma, nos | Genes>Ensembl> Gene symbol | ||
| ICD-O topography and morphology: brain; epedymoma, nos | Results>Genomics> Gain | ||
| Results>Genomics> Loss |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Experiments | Gene transcriptomic alterations | ICD-O topography and morphology: breast; ANY morphology | Genes>Ensembl> Gene Ensembl ID |
| Filters: ID List limit by GO id: GO:0007049 | Genes>Ensembl> Gene symbol | ||
| Results> Transcriptomic> Upregulation, | |||
| Results> Transcriptomic> Downregulation, | |||
| Results> Transcriptomic> Upregulation: total number of samples |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Experiments | KEGG pathway transcriptomic alterations | ICD-O topography and morphology: prostate gland; ANY morphology | KEGG pathway id |
| KEGG name | |||
| Results> Transcriptomic> Upregulation | |||
| Results> Transcriptomic> Downregulation |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Experiments | KEGG pathway transcriptomic alterations | None selected | ICD-O: Topography and morphology |
| EXPERIMENT: publication authors, publication year, PubMed id, publication title, experiment id | |||
| PLATFORM: platform title |
| Database | Data sets | Filters | Attributes |
|---|---|---|---|
| IntOGen Experiments | KEGG pathway genomic alterations | None selected | ICD-O: Topography and morphology |
| EXPERIMENT: publication authors, publication year, PubMed id, publication title, experiment id | |||
| PLATFORM: platform title |