| Literature DB >> 29527288 |
Lance D Miller1, Davide Bedognetti2, Wouter Hendrickx2, Jessica Roelands2, Julie Decock3, Sabri Boughorbel4, Darawan Rinchai2, Cristina Maccalli2, Michele Ceccarelli5, Michael Black6, Cris Print7, Jeff Chou1, Scott Presnell8, Charlie Quinn8, Puthen Jithesh9, Najeeb Syed10, Salha B J Al Bader11, Shahinaz Bedri12, Ena Wang13, Francesco M Marincola14, Damien Chaussabel4, Peter Kuppen15.
Abstract
The increased application of high-throughput approaches in translational research has expanded the number of publicly available data repositories. Gathering additional valuable information contained in the datasets represents a crucial opportunity in the biomedical field. To facilitate and stimulate utilization of these datasets, we have recently developed an interactive data browsing and visualization web application, the Gene Expression Browser (GXB). In this note, we describe a curated compendium of 13 public datasets on human breast cancer, representing a total of 2142 transcriptome profiles. We classified the samples according to different immune based classification systems and integrated this information into the datasets. Annotated and harmonized datasets were uploaded to GXB. Study samples were categorized in different groups based on their immunologic tumor response profiles, intrinsic molecular subtypes and multiple clinical parameters. Ranked gene lists were generated based on relevant group comparisons. In this data note, we demonstrate the utility of GXB to evaluate the expression of a gene of interest, find differential gene expression between groups and investigate potential associations between variables with a specific focus on immunologic classification in breast cancer. This interactive resource is publicly available online at: http://breastcancer.gxbsidra.org/dm3/geneBrowser/list.Entities:
Keywords: Breast Cancer; Cancer Immune Phenotype; Gene Expression Browser; Immune Subtypes; Immunologic Constant of Rejection
Year: 2017 PMID: 29527288 PMCID: PMC5820610 DOI: 10.12688/f1000research.10960.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Basis of ICR and IDS/IBS classifications and prognostic value.
( A) Consensus clustering based on ICR genes segregates breast cancer patient in four different groups: ICR1, 2, 3 and 4. Patients with tumors categorized as ICR4 have the highest expression of the ICR gene signature and have a better prognosis compared with other ICR groups. ( B) Immune metagene model based on the relative expression of immune metagenes (B/P, T/NK and M/D) distinguishes PID, WID and FID tumors (horizontal axis: genes, vertical axis: individual cases). This classification has prognostic value in IBE tumors, and not in IBD tumors. Diagrams are based on Hendrickx ( A) and Miller ( B). This figure is for explanatory purposes only and does not serve as a demonstration of the GXB web application. ICR, Immunologic Constant of Rejection; IBE/D, Immune Benefit Enabled OR Disabled; F/P/WID, Favorable OR Poor OR Weak Immune Disposition.
Figure 2. Schematic representation of dataset selection and annotation.
Breast cancer cases included in 13 NCBI GEO datasets were uploaded in GXB (n=2142). 22 cases described data from breast cell lines and were excluded from our data collection. We annotated 1839 cases with survival data, IMS, IBS, IDS and ICR status. 281 cases were either neoadjuvant treated, did not represent a primary invasive tumor, were not sampled at the time of surgery or without available survival data and were therefore not annotated. The total collection includes 1839 cases from the original cohort described in Nagalla (n=1954). Of note, 115 cases of this cohort are not included in our collection as these were not shared via GEO. *251/1839 cases have been classified for IMS “Normal-like”. IDS is not applicable for normal-like breast cancer tissue; therefore, IDS is non-classified for these samples. DMFS, Distant Metastasis Free Survival; GXB, Gene Expression Browser; IMS, intrinsic molecular subtype; IBS, immune benefit status; IDS, immune disposition status; ICR, immunologic constant of rejection.
List of datasets uploaded to GXB.
| Dataset | Platforms | Diseases | Number of
| GEO ID | References |
|---|---|---|---|---|---|
|
| Affymetrix Human Genome
| Mixed Breast
| 139 |
|
|
|
| Affymetrix Human Genome
| Lymph Node
| 286 |
|
|
|
| Affymetrix Human Genome
| Lymph Node
| 58 |
|
|
|
| Affymetrix Human Genome
| ER + Breast
| 136 |
|
|
|
| Affymetrix Human Genome
| ER+ Breast
| 77 |
|
|
|
| Affymetrix Human Genome
| LN- Breast
| 200 |
|
|
|
| Affymetrix Human Genome
| Mixed Breast
| 159 |
|
|
|
| Affymetrix Human Genome
| Mixed Breast
| 99 |
|
|
|
| Affymetrix Human Genome
| Mixed Breast
| 1839 | multiple |
|
|
| Affymetrix Human Genome
| ER+ Breast
| 87 |
|
|
|
| Affymetrix Human Genome
| ER+ Breast
| 327 |
|
|
|
| Affymetrix Human Genome
| Lymph Node
| 198 |
|
|
|
| Affymetrix GeneChip
| ER+ Breast
| 54 |
|
|
|
| Affymetrix Human Genome
| Mixed Breast
| 289 |
|
|
Tumor classifications applied to breast cancer datasets.
| Classifications | Categories | Reference |
|---|---|---|
| Intrinsic Molecular
| Normal-like (Normal); HER2-enriched
| (
|
| Immune Benefit Status (IBS) | Immune Benefit-Enabled (IBE);
| (
|
| Immunogenic
| Poor Immunogenic Disposition (PID);
| (
|
| Immunologic Constant
| ICR1; ICR2; ICR3; ICR4 | (
|
Clinical data availability.
| Clinical variable | Available in
|
|---|---|
| IMS | 13 |
| IBS | 13 |
| IDS | 13 |
| ICR | 13 |
| DMFS 10Y EVENT | 13 |
| DMFS 10Y TIME | 13 |
| Disease free survival event | 11 |
| Disease free survival time | 11 |
| Distant metastasis free survival event | 6 |
| Distant metastasis free survival time | 6 |
| Age at initial pathologic diagnosis | 8 |
| Lymph node status | 8 |
| ER status | 8 |
| PR status | 5 |
| Histology differentiation grade | 7 |
| Tumor size | 8 |
| Pathology T Stage | 8 |
| Type treatment, bone metastasis event,
| <2 |
Figure 3. Illustrative example of abundance of HLA-G transcripts across ICR groups in multiple breast cancer datasets in GXB.
( A) Cross Project View in GXB showing HLA-G expression across ICR groups. ICR represents the immune gene signatures observed in association with tissue-specific destruction. In this view of GXB, expression of HLA-G can be visualized across projects listed on the left. ( B) Boxplots of HLA-G expression across ICR groups of three additional representative datasets selected from the dataset collection and the complete dataset including all annotated cases (right bottom plot). Plots indicate an increased HLA-G expression in breast tumors with a high expression of ICR genes. ICR, Immunologic Constant of Rejection.
Figure 4. GXB overview of expression of genes with known roles in tumor immunology across IBS/IDS subgroups in reconstituted Nagalla’s breast cancer dataset.
( A) Expression values of CD8 and CD19, indicators of immune cell infiltration, are similar in IBD-FID and IBE-FID groups, indicating equal immune cell infiltration in these subgroups. ( B) Expression values of CXCL10, GNZB, IFNG and STAT1, markers of immune functional orientation, are increased in the IBE-FID group compared with IBD-FID, indicating a differential functional orientation of the immune infiltrate between IBD-FID and IBE-FID tumors. IBE/D, Immune Benefit Enabled OR Disabled; F/P/WID, Favorable OR Poor OR Weak Immune Disposition.
Figure 5. Overlay of immunologic classifications in breast cancer as evaluated in GXB.
( A) Bar graph showing CXCL9 expression in individual samples from Erasmus Medical Center (EMC) dataset 2 split by ICR (single bar represents single case). Overlay of additional variables IBS, IDS and IMS is shown ( http://breastcancer.gxbsidra.org/dm3/miniURL/view/Lv). ( B) Frequency plot showing number of breast cancer cases across IBS/IDS subgroups split by ICR cluster. ICR, Immunologic Constant of Rejection; IBE/D, Immune Benefit Enabled OR Disabled; F/P/WID, Favorable OR Poor OR Weak Immune Disposition.