| Literature DB >> 24223926 |
Krishna R Kalari1, Brian M Necela, Xiaojia Tang, Kevin J Thompson, Melissa Lau, Jeanette E Eckel-Passow, Jennifer M Kachergus, S Keith Anderson, Zhifu Sun, Saurabh Baheti, Jennifer M Carr, Tiffany R Baker, Poulami Barman, Derek C Radisky, Richard W Joseph, Sarah A McLaughlin, High-seng Chai, Stephan Camille, David Rossell, Yan W Asmann, E Aubrey Thompson, Edith A Perez.
Abstract
Our goal in these analyses was to use genomic features from a test set of primary breast tumors to build an integrated transcriptome landscape model that makes relevant hypothetical predictions about the biological and/or clinical behavior of HER2-positive breast cancer. We interrogated RNA-Seq data from benign breast lesions, ER+, triple negative, and HER2-positive tumors to identify 685 differentially expressed genes, 102 alternatively spliced genes, and 303 genes that expressed single nucleotide sequence variants (eSNVs) that were associated with the HER2-positive tumors in our survey panel. These features were integrated into a transcriptome landscape model that identified 12 highly interconnected genomic modules, each of which represents a cellular processes pathway that appears to define the genomic architecture of the HER2-positive tumors in our test set. The generality of the model was confirmed by the observation that several key pathways were enriched in HER2-positive TCGA breast tumors. The ability of this model to make relevant predictions about the biology of breast cancer cells was established by the observation that integrin signaling was linked to lapatinib sensitivity in vitro and strongly associated with risk of relapse in the NCCTG N9831 adjuvant trastuzumab clinical trial dataset. Additional modules from the HER2 transcriptome model, including ubiquitin-mediated proteolysis, TGF-beta signaling, RHO-family GTPase signaling, and M-phase progression, were linked to response to lapatinib and paclitaxel in vitro and/or risk of relapse in the N9831 dataset. These data indicate that an integrated transcriptome landscape model derived from a test set of HER2-positive breast tumors has potential for predicting outcome and for identifying novel potential therapeutic strategies for this breast cancer subtype.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24223926 PMCID: PMC3815156 DOI: 10.1371/journal.pone.0079298
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Methods workflow for HER2 transcriptomic network.
High level analytical approach to build HER2-positive transcriptomic landscape from paired-end RNA-Seq data analysis of breast tumor subtypes.
Figure 2The eSNV-Detect workflow.
Flow chart of the eSNV-Detect – a method to identify expressed SNVs from paired-end RNA-Sequencing data.
Figure 3Schematic of analytical approach.
Computational approach to identify and characterize genomic features associated with HER2-positive tumors.
Figure 4Top 9 differentially expressed genes.
Box plot representation of nine differential expressed genes that are specific to the HER2-positive tumors compared to other tumors in our test set, shown in log2 scale.
Figure 5Clustering of genes uniquely expressed in HER2 tumors.
Hierarchical clustering of 685 significantly differentially expressed genes in HER2-positive tumors compared to other tumor subtypes in our test set of tumors.
Figure 6CASPER plots for 2 genes.
Visualization of output values from CASPER splicing analysis method for PPM1A and MPG genes. Data indicates that PPM1A and MPG transcripts are uniquely and alternately spliced in HER2-positive tumors compared to other groups.
Figure 7Visualization of single nucleotide variant validation.
Sanger sequence validation of highly expressed novel somatic SNVs for MRPL3 variant in the BCT40 HER2 tumor sample. RNA-Seq sequence reads shown above Sanger sequencing tracing, with mutation shown by an arrow.
Figure 8Representation of single nucleotide variant validation.
Sanger sequence validation of low expressed novel somatic SNVs for FOXA1 in the BCT40 HER2 tumor sample. RNA-Seq sequence reads shown over Sanger sequence trace with mutation indicated by an arrow.
Figure 9Overlap of genes from genomic features.
Venn diagram representation of genes obtained from three genomic features analyses.
Figure 10Visualization of single nucleotide variant validation.
Sanger sequencing validation of MPG eSNV in HER2 tumor. RNA-Seq reads shown over Sanger sequence tracing with mutation indicated by an arrow.
Figure 11Validation of splicing variants.
qPCR validation of the two isoforms in breast tumor samples for MPG splicing variants. Isoform abundance by qPCR is indicated by gray bars, whereas isoform abundance determined by CASPER is shown in black bars.
Figure 12HER2-positive tumor interactome.
HER2 tumor interactome network developed using cytoscape and functional reactome of 1055 genes obtained from integration analyses of genomic features. Different functional modules within the network are color coded.
Sub networks from the HER2-positive tumor interactome model.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 1 | Aurora A signaling (N) | 44 | 6 | 0 | <1.00e-03 | CYLD, BCL3, CHUK, RELA, TP53, BIRC2 |
| 2 | Integrin signaling pathway (P) | 33 | 10 | 0 | <1.00e-03 | CAV1, ACTG1, LAMB2, ITGAX, ARPC2, ITGAV, COL6A2 ,FLNB, FYN, ABL1 |
| 3 | Phosphatidyl-inositol signaling system(K) | 33 | 7 | 0 | <1.00e-03 | PIP5K1C, PLCB4, PIK3CA, INPP5E, PIK3R1, DGKQ, INPP4A |
| 4 | Transport of Mature Transcript to Cytoplasm (R) | 27 | 10 | 0 | <1.00e-03 | NUP214, CDC40, NUP54, RANBP2, TPR, NUP133, NUPL2, UPF3B, NUP205, NUP107 |
| 5 | Signaling by NGF(R) | 24 | 8 | 0 | <1.00e-03 | PPP2R1B, RTN4, CREB1, FOXO1, ATF1, GSK3B, PRKACA, AKT2 |
| 6 | FOXA1 transcription factor network (N) | 23 | 2 | 0.0009 | 2.26E-01 | FOXA1, NR2F2 |
| 7 | Ubiquitin mediated proteolysis (K) | 17 | 5 | 0 | <1.00e-03 | UBE3A, ANAPC10, STUB1, UBE2D1, UBE2E1 |
| 8 | Signaling by Rho GTPases (R) | 9 | 6 | 0 | <1.00e-03 | TAGAP, PLEKHG2, RHOT2, ARHGEF17, AKAP13, ARAP2 |
| 9 | G2/M Transition (R) | 9 | 5 | 0 | <1.00e-03 | PLK4, CEP57, CKAP5, ALMS1, TUBA1A |
| 10 | M Phase (R) | 8 | 3 | 0 | <1.00e-03 | SMC3, STAG2, STAG1 |
| 11 | Insulin Synthesis and Secretion (R) | 7 | 3 | 0 | 1.00E-03 | SRP54, TRAM1, SSR1 |
| 12 | TGF-beta receptor signaling | 6 | 4 | 0 | <1.00e-03 | SNX6, TGFBR1, YAP1, ENG |
The interactome model identified 12 discrete modules or sub-networks. The total number of genes in each module as well as key nodal genes within each module are given below.
Pathway enrichment statistics in breast cancer subtypes from TCGA.
|
| ||||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| p01 | Aurora A signaling(N) | 44 | 27 | 2.99E-08 | 25 | 8.46E-07 | 15 | 9.83E-02 |
| p02 | Integrin signalling pathway(P) | 33 | 23 | 9.59E-09 | 20 | 2.93E-06 | 16 | 1.19E-03 |
| p03 | Phosphatidy-linositol signaling system(K) | 33 | 23 | 9.59E-09 | 19 | 1.53E-05 | 11 | 2.05E-01 |
| p04 | Transport of Mature Transcript to Cytoplasm(R) | 27 | 18 | 1.27E-06 | 17 | 8.32E-06 | 9 | 2.45E-01 |
| p05 | Signalling by NGF(R) | 24 | 20 | 4.36E-10 | 13 | 8.41E-04 | 8 | 2.27E-01 |
| p06 | FOXA1 transcription factor network(N) | 23 | 14 | 9.59E-05 | 10 | 4.10E-02 | 2 | 1.32E-01 |
| p07 | Ubiquitin mediated proteolysis(K) | 17 | 14 | 3.18E-07 | 11 | 2.67E-04 | 8 | 3.59E-02 |
| p08 | Signaling by Rho GTPases(R) | 9 | 8 | 5.39E-05 | 6 | 6.43E-03 | 1 | 6.93E-01 |
| p09 | G2/M Transition(R) | 9 | 6 | 6.43E-03 | 7 | 7.67E-04 | 4 | 2.24E-01 |
| p10 | M Phase(R) | 8 | 5 | 1.92E-02 | 6 | 2.65E-03 | 3 | 3.96E-01 |
| p11 | Insulin Synthesis and Secretion(R) | 7 | 5 | 8.85E-03 | 4 | 5.40E-02 | 2 | 6.65E-01 |
| p12 | TGF-beta receptor signaling | 6 | 4 | 2.82E-02 | 4 | 2.82E-02 | 4 | 2.82E-02 |
Fisher’s Exact test was carried out using a 2X2 contingency table to compare the number of genes from the Ctyoscape network that were differentially expressed in TCGA tumors compared to the remaining genes that were defined as “HER2-specific” in our analysis but were not incorporated into the Cytoscape network. The contingency table was constructed as follows: The constants were the total number of genes (G, 1055 genes from our initial analysis), the total number of genes from this set that were incorporated into the cytoscape network (N, 244 genes), and the total number of genes in the specific pathway to be analyzed (P). For each pathway, we compared the number of genes within that pathway that were differentially expressed at p<0.05 in the TCGA data (m) to the remainder of genes in the candidate list (N-m) versus the genes in the pathway that were not differentially expressed (P-m) against the number of genes that were in neither the candidate list nor the pathway (G-N-P+m).
Correlation of genes with lapatinib response in breast cancer cell lines.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| ILK | integrin-linked kinase | DE | -0.791538 | 9.10E-05 | Integrin signaling |
| ALMS1 | Alstrom syndrome 1 | SNV | 0.76677 | 2.00E-04 | cell transport, microtubules |
| CDC40 | cell division cycle 40 homolog ( | DE | 0.680083 | 1.90E-03 | RNA processing |
| STAG2 | stromal antigen 2 | DE | 0.673891 | 2.17E-03 | M-phase |
| MBD4 | methyl-CpG binding domain protein 4 | DE | 0.636739 | 4.49E-03 | Mismatch repair |
| ANAPC10 | anaphase promoting complex subunit 10 | DE | 0.622291 | 5.82E-03 | M-phase |
| ATF1 | activating transcription factor 1 | DE | 0.587203 | 1.04E-02 | cAMP signaling |
| ZNF385A | zinc finger protein 385A | AS | -0.581011 | 1.15E-02 | unknown (transcription?) |
| NUP54 | nucleoporin 54kDa | DE | 0.560372 | 1.56E-02 | Nuclear pore comples |
| APPL1 | adaptor protein, phosphotyrosine interaction, PH domain and leucine zipper containing 1 | SNV | -0.547988 | 1.86E-02 | RHO-GTPase signaling |
| NFATC3 | nuclear factor of activated T-cells, cytoplasmic, calcineurin-dependent 3 | DE | 0.54386 | 1.96E-02 | Calcium signaling |
| MCL1 | myeloid cell leukemia sequence 1 (BCL2-related) | AS | 0.54386 | 1.96E-02 | Apoptotic regulation |
| RAB10 | RAB10, member RAS oncogene family | DE | 0.539732 | 2.08E-02 | RHO-GTPase signaling |
| VBP1 | von Hippel-Lindau binding protein 1 | SNV | 0.527348 | 2.45E-02 | Protein folding |
| ARFGEF1 | ADP-ribosylation factor guanine nucleotide-exchange factor 1(brefeldin A-inhibited) | SNV | -0.517028 | 2.80E-02 | RHO-GTPase signaling |
| RFX1 | regulatory factor X, 1 (influences HLA class II expression) | DE | 0.517028 | 2.80E-02 | Transcriptional regulation |
| RAP1A | RAP1A, member of RAS oncogene family | DE | 0.508772 | 3.11E-02 | RHO-GTPase signaling |
| RRN3 | RRN3 RNA polymerase I transcription factor homolog ( | DE | 0.506708 | 3.19E-02 | Transcriptional regulation |
| UPF3B | UPF3 regulator of nonsense transcripts homolog B | SNV | 0.504644 | 3.27E-02 | RNA processing |
| UBE2E1 | ubiquitin-conjugating enzyme E2E 1 | DE | 0.498452 | 3.53E-02 | Protein processing |
| TPR | translocated promoter region | DE | -0.494324 | 3.70E-02 | M-phase |
| LAMB2 | laminin, beta 2 (laminin S) | DE | -0.49226 | 3.80E-02 | Extracellular matrix/adhesion |
| NIF3L1 | NIF3 NGG1 interacting factor 3-like 1 ( | SNV | 0.490196 | 3.89E-02 | unknown (transcription?) |
| PRKACA | protein kinase, cAMP-dependent, catalytic, alpha | DE | 0.479876 | 4.39E-02 | cAMP signaling |
| PPP1R2 | protein phosphatase 1, regulatory subunit 2 | DE | 0.477812 | 4.49E-02 | Protein phosphorylation |
| ZNF638 | zinc finger protein 638 | DE | 0.475748 | 4.60E-02 | Transcriptional regulation |
Correlation of gene expression with lapatinib response in breast cancer cell lines from a freely available data source (Cancer Cell Line Encyclopedia project) using the list of genes obtained from our HER2 interactome network.
Correlation of genes with paclitaxel response in breast cancer cell lines.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| CCT6A | chaperonin containing subunit 6A | 3 | SNV | -0.60 | 5.31E-03 | tubulin folding |
| ATN1 | atrophin 1 | 2 | DE | 0.59 | 6.07E-03 | transcriptional co-regulator |
| STRN4 | striatin, calmodulin binding protein 4 | 2 | DE | 0.54 | 1.34E-02 | caclium signaling |
| RAB11B | RAB11B, RAS oncogene family | 2 | DE | 0.52 | 1.79E-02 | RHO-GTPase signaling and endosomal trafficking |
| NR2F2 | nuclear receptor subfamily 2, group F, member 2 | 4 | AS | 0.51 | 2.12E-02 | nuclear receptor activated by retinoids |
| TOPORS | topoisomerase I binding, arginine/serine-rich | 1 | DE | 0.51 | 2.26E-02 | ubiquitin ligase regulates TP53 turnover |
| IQGAP1 | IQ motif containing GTPase activating protein 1 | 2 | SNV | 0.51 | 2.31E-02 | RHO-GAP associated with cytoskeletal reorganization |
| TGFBR1 | transforming growth factor,betareceptor1 | 10 | DE | -0.50 | 2.61E-02 | TGF-beta signal transduction |
| CASP3 | caspase 3, apoptosis-related cysteine peptidase | 10 | DE | -0.50 | 2.61E-02 | apoptotic signaling |
| KIFAP3 | kinesin-associated protein 3 | 2 | DE,AS | 0.49 | 2.88E-02 | cytokinesis |
| SF3A2 | splicing factor 3a, subunit 2, 66kDa | 9 | DE | 0.49 | 2.99E-02 | RNA splicing |
| MAP3K10 | mitogen-activated protein kinase kinase kinase 10 | 2 | DE | 0.47 | 3.49E-02 | activates JNK signaling |
| TAGAP | T-cell activation RhoGTPase activating protein | 1 | SNV | 0.46 | 4.20E-02 | RHO-GTPase signaling |
| SON | SON DNA binding protein | 1 | SNV | -0.45 | 4.43E-02 | RNA splicing, tubulin |
| KDELR1 | KDEL endoplasmic reticulum protein retention receptor 1 | 2 | DE | 0.45 | 4.43E-02 | vesicular transport |
| SMARCE1 | SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 | 6 | DE | -0.45 | 4.51E-02 | chromatin remodeling, ER signaling |
| PLEKHG2 | pleckstrin homology domain containing, family G member 2 | 1 | DE | 0.45 | 4.84E-02 | RHO-GTPase signaling |
Correlation of gene expression with paclitaxel response in breast cancer cell lines from Cancer Cell line Encyclopedia project using HER2 interactome genes.
Gene set analysis of 12 sub-network genes with N9831 clinical trial data.
|
| |||||
|---|---|---|---|---|---|
| Gene_set_name | Pathway | Score | p-value | FDR | pos.neg.vector |
| HER2.network.01 | Aurora A signaling(N) | -0.1 | 0.1756 | 0.571 | negative |
| HER2.network.02 | Integrin signalling pathway(P) | -0.555 | 0.0032 | 0.042 | negative |
| HER2.network.03 | Phosphatidy-linositol signaling system(K) | 0.0859 | 0.2798 | 0.548 | positive |
| HER2.network.04 | Transport of Mature Transcript to Cytoplasm(R) | 0.0316 | 0.4296 | 0.621 | positive |
| HER2.network.05 | Signalling by NGF(R) | 0.2084 | 0.1126 | 0.366 | positive |
| HER2.network.06 | FOXA1 transcription factor network(N) | 0.0936 | 0.2948 | 0.548 | positive |
| HER2.network.07 | Ubiquitin mediated proteolysis(K) | 0.3351 | 0.0344 | 0.366 | positive |
| HER2.network.08 | Signaling by Rho GTPases(R) | -0.486 | 0.0739 | 0.48 | negative |
| HER2.network.09 | G2/M Transition(R) | 0.1197 | 0.3491 | 0.567 | positive |
| HER2.network.10 | M Phase(R) | -0.266 | 0.1583 | 0.571 | negative |
| HER2.network.11 | Insulin Synthesis and Secretion(R) | 0.2276 | 0.2188 | 0.548 | positive |
| HER2.network.12 | TGF-beta receptor signaling | 0.512 | 0.0611 | 0.366 | positive |
| HER2.network.all | 0.0936 | 0.366 | positive |
The direction of the GSA score indicates the relationship to outcome, as does the column entitled “pos.neg.vector”. Negative indicates an association with increased risk of relapse, whereas positive indicates decreased risk of relapse. GSA was carried out using gene expression data and outcomes (relapse free survival) as continuous variables.