| Literature DB >> 32457901 |
Tasha Thong1, Yutong Wang2, Michael D Brooks3, Christopher T Lee4, Clayton Scott2,5, Laura Balzano2, Max S Wicha3,5, Justin A Colacino1,5,6.
Abstract
Similarities between stem cells and cancer cells have implicated mammary stem cells in breast carcinogenesis. Recent evidence suggests that normal breast stem cells exist in multiple phenotypic states: epithelial, mesenchymal, and hybrid epithelial/mesenchymal (E/M). Hybrid E/M cells in particular have been implicated in breast cancer metastasis and poor prognosis. Mounting evidence also suggests that stem cell phenotypes change throughout the life course, for example, through embryonic development and pregnancy. The goal of this study was to use single cell RNA-sequencing to quantify cell state distributions of the normal mammary (NM) gland throughout developmental stages and when perturbed into a stem-like state in vitro using conditional reprogramming (CR). Using machine learning based dataset alignment, we integrate multiple mammary gland single cell RNA-seq datasets from human and mouse, along with bulk RNA-seq data from breast tumors in the Cancer Genome Atlas (TCGA), to interrogate hybrid stem cell states in the normal mammary gland and cancer. CR of human mammary cells induces an expanded stem cell state, characterized by increased expression of embryonic stem cell associated genes. Alignment to a mouse single-cell transcriptome atlas spanning mammary gland development from in utero to adulthood revealed that NM cells align to adult mouse cells and CR cells align across the pseudotime trajectory with a stem-like population aligning to the embryonic mouse cells. Three hybrid populations emerge after CR that are rare in NM: KRT18+/KRT14+ (hybrid luminal/basal), EPCAM+/VIM+ (hybrid E/M), and a quadruple positive population, expressing all four markers. Pseudotime analysis and alignment to the mouse developmental trajectory revealed that E/M hybrids are the most developmentally immature. Analyses of single cell mouse mammary RNA-seq throughout pregnancy show that during gestation, there is an enrichment of hybrid E/M cells, suggesting that these cells play an important role in mammary morphogenesis during lactation. Finally, pseudotime analysis and alignment of TCGA breast cancer expression data revealed that breast cancer subtypes express distinct developmental signatures, with basal tumors representing the most "developmentally immature" phenotype. These results highlight phenotypic plasticity of normal mammary stem cells and provide insight into the relationship between hybrid cell populations, stemness, and cancer.Entities:
Keywords: breast cancer; epithelial; hybrid; mesenchymal; pregnancy; single-cell RNA sequencing; stem cells
Year: 2020 PMID: 32457901 PMCID: PMC7227401 DOI: 10.3389/fcell.2020.00288
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1Unbiased clustering and cell type identification of NM cells. (A) tSNE dimension reduction of NM samples colored by individual. (B) Unbiased clustering of NM samples colored by cell cluster. (C) Expression of known cell type marker genes by cluster across all NM samples.
FIGURE 2Unbiased clustering and differential gene expression between NM and CR. (A) tSNE dimension reduction of NM and CR samples by individual. (B) FeaturePlots of myoepithelial marker gene (KRT14) and (C) luminal marker gene (KRT18) expression. (D) Differentially expressed genes between NM and CR epithelial cells. Significantly upregulated genes in CR (FDR < 0.05) are colored in orange. Significantly upregulated genes in NM are colored in purple. (E) Distribution of cells from NM and CR samples scored by embryonic stem cell gene expression. (F) Comparison of overlap between NM and CR differentially expressed genes and the mammary stem cell (MaSC) gene expression signature reported in Lim et al. (2009). Yellow genes indicate MaSC genes more highly expressed in CR vs. NM.
FIGURE 3Alignment of NM and CR cells to mouse mammary developmental trajectory and characterization of hybrid cells. (A) Principal component analysis plot of single cell RNA-seq data of mouse mammary gland at embryonic day 16 (E.16), embryonic day 18 (E. 18), post-natal day 4 (P.4), and adult basal (A. basal) and adult luminal (A.luminal) cells as reported in Giraddi et al. (2018). (B) Pseudotime estimates of mouse mammary developmental stages. (C) NM and CR cells mapped to the developmental trajectory with CoRGI. (D) CR samples mapped to the mouse mammary developmental trajectory labeled by individual. (E) Hybrid cell identification of mouse mammary cells along the developmental trajectory. Luminal/basal hybrids were identified by concurrent high KRT14/KRT18 expression. Epithelial/mesenchymal hybrids were identified by concurrent high EPCAM/VIM expression. Quadruple positive hybrid cells were identified by high expression of all four marker genes KRT14/KRT18,/EPCAM/VIM. (F) CR cells mapped to mouse developmental trajectory and labeled by hybrid status. (G) Pseudotime estimates of NM and CR cells relative to the mouse mammary developmental trajectory cells. (H) Pseudotime estimates of mouse hybrid cells. (I) Pseudotime estimates of CR hybrid cells.
FIGURE 4Alignment of Bach mouse mammary developmental dataset to the Giraddi mammary trajectory. (A) Pseudotime estimates of Bach mammary developmental stages: nulliparous (NP), mid-gestation (G), lactation (L), and post-involution (PI). (B) Bach mammary cells mapped to the Giraddi trajectory with CoRGI. (C) Bach luminal/basal hybrid cells mapped to Giraddi trajectory. (D) Bach epithelial/mesenchymal hybrid cells mapped to Giraddi trajectory. (E) Bach quadruple positive hybrid cells mapped to Giraddi trajectory (F) Proportions of Bach hybrid cells by developmental stage.
FIGURE 5Alignment of TCGA tumors to Giraddi mammary trajectory. (A) Principal component analysis of TCGA bulk breast tumor RNA-seq labeled by subtype. (B) Alignment of TCGA tumors to Giraddi developmental trajectory. (C) Pseudotime estimates of TCGA tumor subtypes. (D) Mortality hazard ratio estimates relative to expression of the top 10 genes most negatively correlated with mouse pseudotime. The more negative the pseudotime estimate, the more highly expressed the gene is in the earliest timepoints during development.