| Literature DB >> 33361113 |
Kiley Graim1,2, Dmitriy Gorenshteyn2,3, David G Robinson2,3, Nicholas J Carriero1, James A Cahill4, Rumela Chakrabarti5, Michael H Goldschmidt6, Amy C Durham6, Julien Funk1, John D Storey2,7, Vessela N Kristensen8, Chandra L Theesfeld2, Karin U Sorenmo5, Olga G Troyanskaya1,2,9.
Abstract
Understanding the changes in diverse molecular pathways underlying the development of breast tumors is critical for improving diagnosis, treatment, and drug development. Here, we used RNA-profiling of canine mammary tumors (CMTs) coupled with a robust analysis framework to model molecular changes in human breast cancer. Our study leveraged a key advantage of the canine model, the frequent presence of multiple naturally occurring tumors at diagnosis, thus providing samples spanning normal tissue and benign and malignant tumors from each patient. We showed human breast cancer signals, at both expression and mutation level, are evident in CMTs. Profiling multiple tumors per patient enabled by the CMT model allowed us to resolve statistically robust transcription patterns and biological pathways specific to malignant tumors versus those arising in benign tumors or shared with normal tissues. We showed that multiple histological samples per patient is necessary to effectively capture these progression-related signatures, and that carcinoma-specific signatures are predictive of survival for human breast cancer patients. To catalyze and support similar analyses and use of the CMT model by other biomedical researchers, we provide FREYA, a robust data processing pipeline and statistical analyses framework.Entities:
Year: 2020 PMID: 33361113 PMCID: PMC7849403 DOI: 10.1101/gr.256388.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Multiple CMTs per patient model enables discovery of carcinoma-specific processes that inform human BRCA. (Left) CMT model. Tissue samples were collected and annotated for each of the 89 samples from 16 canine patients. For study inclusion, each patient was required to provide a minimum of one sample (represented by colored blocks) from each histological group: normal (green), benign (yellow), and malignant (red). Many of the dogs have multiple samples of different tumor histologies. (Right) FREYA framework. We developed the FREYA framework to study tumor development. Using FREYA, we analyzed multiple primary tumors per patient with RNA and mutation profiling and developed a statistical framework to determine differences in gene expression between normal, benign, and malignant samples, and we compared CMT molecular signals to human breast cancer.
Figure 2.Cancer hallmark processes found in CMT transcriptional programs. (A) Biological processes showing differential gene expression between normal and carcinoma samples (Supplemental Table S2) were identified by network-based enrichment method at https://humanbase.flatironinstitute.org. Differentially expressed genes were clustered using a shared nearest neighbor–based community-finding algorithm to identify distinct modules of tightly connected genes (Krishnan et al. 2016) within the mammary epithelium functional network (Greene et al. 2015). Gene Ontology (GO) enrichment was performed on each module, and representative significant processes are displayed (for the entire list, see Supplemental Table S3). Circles are genes and the size of the circle indicates the sum of connections in the graph. Gene expression values (SAM scores) are overlaid. Red indicates increased expression in carcinoma, and blue indicates decreased expression in carcinoma. COSMIC cancer census genes are indicated in each module (M1–M4). (B) Human PAM50 intrinsic subtype signals are found in CMTs. Each bar represents the number of samples predicted for each PAM50 subtype, human or canine. Predictions for CMT samples were based on gene expression programs using a classifier trained on human BRCA samples and PAM50 subtype gene expression signature data. In human samples, 98% were correctly predicted, reflecting the accuracy of the predictor. (C) Density plot showing the genome-wide number of mutations per tumor sample in human (gray) and canine (maroon). (D) OncoPrint showing histology, predicted PAM50 subtype, number of mutations, and histologic subtype (simple/complex) for each sample in the cohort.
Figure 3.Identification of progression expression patterns. Progression expression patterns (PEPs) are identified using differential expression analysis between histological groups: (top) Tumor PEP, 1023 genes; (bottom) Carcinoma-specific PEP, 136 genes. The diagrams illustrate how each PEP pattern is defined. For example, Tumor PEP includes genes up-regulated in tumors (significantly differentially expressed both between normal and benign samples and between normal and malignant samples). The heatmap shows the expression patterns for these genes, with patterns divided into up- and down-regulated (e.g., Tumor PEP includes 567 genes significantly up-regulated in tumors and 456 genes significantly down-regulated in tumors).
Figure 4.Resolution of cancer hallmark processes by PEPs to discern malignancy-specific processes. Genes differentially expressed between Normal and Carcinoma samples (as in traditional gene expression analysis) show Tumor or Carcinoma-specific signatures. This experimental design stratifies tumor processes into those specific to malignant tumors (carcinoma-specific pattern) and those that are perturbed in both benign and malignant tumors (tumor-specific pattern). Five representative example GO terms from each pattern are shown (for a complete list, see Supplemental Table S3).
Figure 5.Dog PEP signature is predictive of survival in human breast cancer. (A,B) Kaplan-Meier plots showing patients with breast cancers bearing strongest Carcinoma PEP signal have worse outcomes in two independent human breast cancer cohorts: (A) TCGA BRCA; (B) METABRIC. (C–E) Dogs, like humans, have strong hormone receptor expression signaling differences between PAM50 subtypes. Estrogen receptor 1 (ESR1), progesterone receptor (PGR), and erb-b2 receptor tyrosine kinase 2 (ERBB2) expression within each PAM50 subtype shown: (C) TCGA BRCA; (D) this CMT data; and (E) METABRIC. Horizontal lines across each graph indicate median receptor expression across the entire cohort.