Holly Holliday1,2, Daniel Roden1,2, Simon Junankar1,2, Sunny Z Wu1,2, Laura A Baker1,2, Christoph Krisp3,4, Chia-Ling Chan1, Andrea McFarland1, Joanna N Skhinas1, Thomas R Cox1,2, Bhupinder Pal5,6, Nicholas D Huntington7, Christopher J Ormandy1,2, Jason S Carroll8, Jane Visvader5,9, Mark P Molloy3, Alexander Swarbrick1,2. 1. The Kinghorn Cancer Centre, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia. 2. St Vincent's Clinical School, Faculty of Medicine, UNSW Sydney, Sydney, NSW 2010, Australia. 3. Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia. 4. Institute of Clinical Chemistry and Laboratory Medicine, Mass Spectrometric Proteomics, University Medical Center Hamburg-Eppendorf, Hamburg 20251, Germany. 5. ACRF Cancer Biology and Stem Cells Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC 3052, Australia. 6. Olivia Newton-John Cancer Research Institute and School of Cancer Medicine, La Trobe University, Heidelberg, VIC 3084, Australia. 7. Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3168, Australia. 8. Cancer Research UK Cambridge Institute, University of Cambridge, Robinson Way, Cambridge CB2 0RE, UK. 9. Department of Medical Biology, The University of Melbourne, Parkville, VIC 3010, Australia.
Abstract
Inhibitor of differentiation (ID) proteins dimerize with basic HLH (bHLH) transcription factors, repressing transcription of lineage-specification genes across diverse cellular lineages. ID4 is a key regulator of mammary stem cells; however, the mechanism by which it achieves this is unclear. Here, we show that ID4 has a cell autonomous role in preventing myoepithelial differentiation of basal cells in mammary organoids and in vivo. ID4 positively regulates proliferative genes and negatively regulates genes involved in myoepithelial function. Mass spectrometry reveals that ID4 interacts with the bHLH protein HEB, which binds to E-box motifs in regulatory elements of basal developmental genes involved in extracellular matrix and the contractile cytoskeleton. We conclude that high ID4 expression in mammary basal stem cells antagonizes HEB transcriptional activity, preventing myoepithelial differentiation and allowing for appropriate tissue morphogenesis. Downregulation of ID4 during pregnancy modulates gene regulated by HEB, promoting specialization of basal cells into myoepithelial cells.
Inhibitor of differentiation (ID) proteins dimerize with basic HLH (bHLH) transcription factors, repressing transcription of lineage-specification genes across diverse cellular lineages. ID4 is a key regulator of mammary stem cells; however, the mechanism by which it achieves this is unclear. Here, we show that ID4 has a cell autonomous role in preventing myoepithelial differentiation of basal cells in mammary organoids and in vivo. ID4 positively regulates proliferative genes and negatively regulates genes involved in myoepithelial function. Mass spectrometry reveals that ID4 interacts with the bHLH protein HEB, which binds to E-box motifs in regulatory elements of basal developmental genes involved in extracellular matrix and the contractile cytoskeleton. We conclude that high ID4 expression in mammary basal stem cells antagonizes HEB transcriptional activity, preventing myoepithelial differentiation and allowing for appropriate tissue morphogenesis. Downregulation of ID4 during pregnancy modulates gene regulated by HEB, promoting specialization of basal cells into myoepithelial cells.
The mammary gland undergoes tissue remodeling throughout life. During murine pubertal development, terminal end buds (TEBs) located at the tips of the ducts invade into the surrounding stromal fat pad (Williams and Daniel, 1983; Macias and Hinck, 2012). This process is driven by collective migration and rapid proliferation of outer cap cells, which surround multiple layers of inner body cells (Williams and Daniel, 1983). As the ducts elongate, the cap cells differentiate into the basal cell layer, and the body cells adjacent to the basal cells give rise to the luminal cell layer (Williams and Daniel, 1983), whereas the innermost body cells undergo apoptosis to sculpt the bilayered ductal tree present in the adult gland (Paine and Lewis, 2017). During pregnancy the gland undergoes alveolar morphogenesis in preparation for production and secretion of milk at lactation. The luminal cells differentiate into milk-producing alveolar cells, and the milk is ejected from the gland by the contractile action of specialized myoepithelial cells, smooth muscle-like epithelial cells that differentiate from basal cells (Macias and Hinck, 2012). Here, “basal” refers to the basal lineage, encompassing cap cells, duct basal cells, and myoepithelial cells.The basal compartment contains bipotent mammary stem cells (MaSCs), giving rise to basal and luminal lineages upon transplantation (Shackleton et al., 2006; Stingl et al., 2006). Lineage tracing studies have identified both bipotent (Rios et al., 2014; Wang et al., 2015) and unipotent myoepithelial-restricted (Van Keymeulen et al., 2011; van Amerongen et al., 2012; Prater et al., 2014; Wuidart et al., 2016, 2018; Davis et al., 2016; Scheele et al., 2017; Lilja et al., 2018; Lloyd-Lewis et al., 2018) stem cells in the basal compartment under physiological conditions.Lineage-specifying transcription factors are responsible for directing luminal and myoepithelial differentiation and also for maintaining the self-renewal capacity of uncommitted stem cells upstream in the mammary epithelial hierarchy. Transcriptomic profiling of sorted epithelial subpopulations has identified lineage specifying transcription factors that regulate each step of luminal-alveolar differentiation (Carr et al., 2012; Kouros-Mehr et al., 2006; Asselin-Labat et al., 2007; Buchwalter et al., 2013; Bouras et al., 2008; Liu et al., 2008; Oakes et al., 2008; Chakrabarti et al., 2012; Yamaji et al., 2009). However, due to lack of specific cell markers that can resolve stem and myoepithelial populations, it has been challenging to dissect molecular regulators of basal differentiation. Although a number of basal-specific transcription factors have been identified, such as P63, SLUG, SOX9, SRF, and MRTFA (Mills et al., 1999; Yang et al., 1999; Guo et al., 2012; Li et al., 2006; Sun et al., 2006), their role in the basal compartment and myoepithelial specialization is poorly understood. ID proteins (ID1-4) are helix-loop-helix (HLH) transcriptional regulators that lack a DNA-binding domain. They function by dimerizing with basic HLH (bHLH) transcription factors and preventing them from binding to E-box DNA motifs and regulating transcription (Benezra et al., 1990). E-box motifs are found in regulatory regions of genes involved in lineage specification and as such ID proteins and bHLH transcription factors are critical regulators of stemness and differentiation across diverse cellular lineages (Massari and Murre, 2000). The expression of ID4 in mouse and human mammary epithelium is exclusive to the basal population (Lim et al., 2010). We and others have demonstrated that ID4 is a key regulator of mammary stem cells, required for ductal elongation during puberty (Best et al., 2014; Dong et al., 2011; Junankar et al., 2015). ID4 was also shown to have a role in blocking luminal differentiation (Best et al., 2014; Junankar et al., 2015). The precise molecular mechanisms by which ID4 functions in the mammary gland, including its full repertoire of transcriptional targets and interacting partners, have yet to be determined. Here we show that ID4 marks basal stem cells and demonstrate that ID4 also inhibits myoepithelial differentiation. Moreover, using unbiased interaction proteomics, we identify the bHLH transcription factor HEB as a factor in the mammary differentiation hierarchy. By mapping the genome-wide binding sites of HEB we show that it directly binds to regulatory elements of ID4 target genes involved in myoepithelial functions such as contraction and extracellular matrix (ECM) synthesis.
Results
Loss of ID4 causes upregulation of myoepithelial genes in basal cells
In order to determine the genes regulated by ID4 in mammary epithelial cells, we FACS-enriched basal, luminal progenitor, and mature luminal cells from wild-type (WT) and ID4 knockout (KO) mice (Yun et al., 2004) and performed RNA-sequencing (RNA-seq) (Figure 1A). No significant differences were observed in the proportion of mammary epithelial subpopulations between WT and KO mice (Figure S1A). One hundred four genes were significantly (FDR<0.05) upregulated and 145 genes downregulated in ID4 KO basal cells. In contrast, only 1 gene was differentially expressed in each of the two luminal subpopulations (Figures 1B and 1C and Table S1), suggesting that ID4 predominantly regulates gene expression within basal cells in vivo.
Figure 1
Loss of ID4 results in upregulation of myoepithelial genes in sorted mammary basal cells
(A) Live lineage negative CD24+CD29hiCD61+ basal, CD24+CD29loCD61+ luminal progenitor, and CD24+CD29loCD61− mature luminal cells were isolated by FACS from adult (10–12 weeks) ID4 wild-type (WT) and knockout (KO) mice at estrus for RNA-seq. Representative FACS plots shown from four experiments.
(B) Number of significantly (FDR<0.05) differentially expressed genes (DEGs) upregulated (red) or downregulated (blue) in ID4 KO epithelial subpopulations compared with ID4 WT. LP = luminal progenitor, ML = mature luminal.
(C) Heatmap displaying the significant differentially expressed genes between ID4 WT and ID4 KO basal cells.
(D) Genes were ranked based on the limma t-statistic comparing ID4 WT and KO basal cells, and GSEA was carried out using the C2all, C3TF, C5, and Hallmark gene sets. GSEA results were visualized using Cytoscape EnrichmentMap. Nodes represent gene sets, and edges represent overlap. Gene sets with an FDR<0.25 are shown.
(E) Representative GSEA enrichment plots displaying the profile of the running Enrichment Score (green) and positions of gene set members on the rank ordered list for pathways related to myoepithelial function. Normalized enrichment scores (NES) and FDR are indicated.
See also Figure S1 and Table S1.
Loss of ID4 results in upregulation of myoepithelial genes in sorted mammary basal cells(A) Live lineage negative CD24+CD29hiCD61+ basal, CD24+CD29loCD61+ luminal progenitor, and CD24+CD29loCD61− mature luminal cells were isolated by FACS from adult (10–12 weeks) ID4 wild-type (WT) and knockout (KO) mice at estrus for RNA-seq. Representative FACS plots shown from four experiments.(B) Number of significantly (FDR<0.05) differentially expressed genes (DEGs) upregulated (red) or downregulated (blue) in ID4 KO epithelial subpopulations compared with ID4 WT. LP = luminal progenitor, ML = mature luminal.(C) Heatmap displaying the significant differentially expressed genes between ID4 WT and ID4 KO basal cells.(D) Genes were ranked based on the limma t-statistic comparing ID4 WT and KO basal cells, and GSEA was carried out using the C2all, C3TF, C5, and Hallmark gene sets. GSEA results were visualized using Cytoscape EnrichmentMap. Nodes represent gene sets, and edges represent overlap. Gene sets with an FDR<0.25 are shown.(E) Representative GSEA enrichment plots displaying the profile of the running Enrichment Score (green) and positions of gene set members on the rank ordered list for pathways related to myoepithelial function. Normalized enrichment scores (NES) and FDR are indicated.See also Figure S1 and Table S1.In line with the known role of ID4 in promoting proliferation of mammary epithelial cells (Junankar et al., 2015; Dong et al., 2011), the genes downregulated in ID4 KO basal cells were enriched for pathways involved in cell growth such as translation and metabolism (Figures 1D, S1B, and S1C). Conversely, the genes upregulated in KO basal cells were enriched for pathways related to basal cells and smooth muscle function such as contraction, epithelial-mesenchymal transition (EMT), and serum response factor (SRF) targets (Figures 1E and S1B). These pathways shared common genes as indicated by the connecting edges in the enrichment map network (Figure 1D). SRF target genes are of relevance, as SRF is a master regulator of cytoskeletal contraction and is one of the only transcription factors implicated in myoepithelial differentiation (Miano et al., 2007; Li et al., 2006; Sun et al., 2006). Taken together, loss of ID4 causes basal cells to adopt a more differentiated myoepithelial and mesenchymal gene expression program, implicating ID4 in the repression of basal cell specialization.
ID4 expression decreases upon terminal myoepithelial differentiation of basal cells
ID4 is known to be heterogeneously expressed in basal cells, and ID4-positive cells have enhanced mammary reconstitution activity (Junankar et al., 2015). To further characterize the phenotype of ID4-positive basal cells, we used an ID4-GFP reporter mouse in which the ID4 promoter drives GFP expression (Best et al., 2014). Basal cells with high expression of the stem cell marker CD49f/ITGA6 and the epithelial marker EPCAM have been shown to be enriched for MaSC activity (Stingl et al., 2006; Prater et al., 2014). ID4-GFP expression was maximal in this EPCAMhi CD49fhi subset (Figure 2A). ID4-GFP expression within the basal gate was binned into three groups—bright (top 10%), intermediate (middle 80%), and dim (bottom 10%)—and the median fluorescent intensity (MFI) of EPCAM and CD49f was analyzed within these groups (Figures 2B and 2C). ID4-bright cells had significantly higher EPCAM and CD49f MFI than ID4-dim and -intermediate cells, suggesting ID4 marks basal stem cells.
Figure 2
ID4 expression decreases in terminally differentiated myoepithelial cells
(A–C) (A) FACS analysis of EPCAM and CD49f in live lineage negative mammary cells from adult (10–14 weeks) Id4floxGFP reporter mice. ID4-GFP expression is indicated in the heatmap scale. Representative plots from five experiments shown. Basal cells were binned into three groups based on ID4-GFP expression, ID4-bright (red), ID4-intermediate (orange), and ID4-dim (green), and the median fluorescence intensity (MFI) of EPCAM (B) and CD49f (C) were compared between the three gates. MFI expressed as a fold change relative to the ID4-low basal cells. Ordinary one-way ANOVA test was used to test significance. n = 5. Error bars represent SEM. ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001.
(D–H) (D) Representative co-immunofluorescent staining of ID4 and α-SMA in TEB and duct from a pubertal (6 weeks) mammary gland. Scale bar: 20 μm. Comparison of ID4 (E) and α-SMA (F) mean fluorescence between individual cap cells (dark blue) and basal duct cells (light blue). Unpaired two-tailed students t test. Error bars represent SEM. ∗∗∗∗p < 0.0001. Correlation between α-SMA and ID4 (G) and Hoescht (H) mean fluorescence in individual cap cells (dark blue) and basal duct cells (light blue). R2 and p values are displayed. Data are pooled from nine mice. Approximately 20 TEB cap cells and 20 ductal basal cells were analyzed per mouse.
(I–K) (I) tSNE plot of 9663 Krt5+/Krt14 + basal cells from (Bach et al., 2017). Two mice were analyzed per developmental stage. NP = Nulliparous (8 weeks), G = Gestation (Day 14.5), L = Lactation (Day 6), I = Involution (Day 11). Feature plots (J) and Violin plots (K) displaying expression of Id4, Epcam, and Acta2 in single cells in the different developmental stages.
(L–N) (L) Heatmap displaying top and bottom 30 differentially expressed genes between the top and bottom 200 Id4 high and Id4 low basal cells across all stages. Top 10 GO terms enriched in the top 50 genes upregulated in Id4 high (M) and low (N) basal cells.
See also Figure S2 and Table S2.
ID4 expression decreases in terminally differentiated myoepithelial cells(A–C) (A) FACS analysis of EPCAM and CD49f in live lineage negative mammary cells from adult (10–14 weeks) Id4floxGFP reporter mice. ID4-GFP expression is indicated in the heatmap scale. Representative plots from five experiments shown. Basal cells were binned into three groups based on ID4-GFP expression, ID4-bright (red), ID4-intermediate (orange), and ID4-dim (green), and the median fluorescence intensity (MFI) of EPCAM (B) and CD49f (C) were compared between the three gates. MFI expressed as a fold change relative to the ID4-low basal cells. Ordinary one-way ANOVA test was used to test significance. n = 5. Error bars represent SEM. ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001.(D–H) (D) Representative co-immunofluorescent staining of ID4 and α-SMA in TEB and duct from a pubertal (6 weeks) mammary gland. Scale bar: 20 μm. Comparison of ID4 (E) and α-SMA (F) mean fluorescence between individual cap cells (dark blue) and basal duct cells (light blue). Unpaired two-tailed students t test. Error bars represent SEM. ∗∗∗∗p < 0.0001. Correlation between α-SMA and ID4 (G) and Hoescht (H) mean fluorescence in individual cap cells (dark blue) and basal duct cells (light blue). R2 and p values are displayed. Data are pooled from nine mice. Approximately 20 TEB cap cells and 20 ductal basal cells were analyzed per mouse.(I–K) (I) tSNE plot of 9663 Krt5+/Krt14 + basal cells from (Bach et al., 2017). Two mice were analyzed per developmental stage. NP = Nulliparous (8 weeks), G = Gestation (Day 14.5), L = Lactation (Day 6), I = Involution (Day 11). Feature plots (J) and Violin plots (K) displaying expression of Id4, Epcam, and Acta2 in single cells in the different developmental stages.(L–N) (L) Heatmap displaying top and bottom 30 differentially expressed genes between the top and bottom 200 Id4 high and Id4 low basal cells across all stages. Top 10 GO terms enriched in the top 50 genes upregulated in Id4 high (M) and low (N) basal cells.See also Figure S2 and Table S2.During ductal elongation at puberty, the cap cells differentiate at the neck of the TEBs and mature into myoepithelial cells that form the outer basal layer of the ducts (Paine and Lewis, 2017). To locate ID4-high and ID4-low populations in a tissue context and to further investigate the association between ID4 and markers of myoepithelial differentiation, mammary gland sections from pubertal mice, in which TEBs and ducts are both present, were stained for ID4 and the myoepithelial marker alpha-smooth muscle actin (α-SMA). ID4 expression was highest in the nuclei of cap cells at the extremity of the TEBs and expressed at lower levels in basal cells of ducts (Figures 2D and S2A). Conversely, α-SMA expression was higher in ductal basal cells and lower in cap cells. ID4-high cap cells had a compact cuboidal epithelial appearance compared with the more separated elongated morphology of the ID4-low duct cells (Figures 2D and S2A). Quantification of fluorescence demonstrated that ID4 was more highly expressed in cap cells of TEBs than in basal cells of ducts, whereas the opposite was true for α-SMA expression (Figures 2E and 2F). A negative correlation between α-SMA and ID4 was observed, with clear separation between cap (dark blue Figure 2G) and duct basal cells (light blue Figure 2G). As a negative control, α-SMA fluorescence was compared with nuclear stain Hoechst and no correlation or separation based on region was observed (Figure 2H). Thus, based on marker expression, morphology, and spatial localization, ID4 expression is high in epithelial-like cap cells and is lower in more differentiated myoepithelial cells.Terminal differentiation of basal cells into contractile myoepithelial cells occurs during lactation. We interrogated a published single cell RNA-seq (scRNA-seq) dataset (Bach et al., 2017) to examine Id4 expression dynamics over postnatal murine mammary gland development. In this study, individual EPCAM+ mammary epithelial cells from four developmental stages—nulliparous (8 weeks), gestation (day 14.5), lactation (day 6), and involution (day 11)—were captured and profiled. We limited our analysis to basal cell clusters (9,663 cells), defined by expression of both Krt5 and Krt14. Basal cells broadly clustered by developmental time point (Figure 2I). Increased differentiation of myoepithelial cells appears to proceed from nulliparous to gestation to lactation, associated with decreased expression of epithelial marker Epcam and increased expression of Acta2 (encoding α-SMA) (Figures 2J and 2K), consistent with gradual acquisition of a smooth muscle phenotype and loss of adherent epithelial features (Deugnier et al., 1995). Like Epcam, Id4 expression was highest in basal cells from nulliparous mice and decreased in basal cells of pregnant and lactating mice (Figures 2J and 2K). This result was validated on the protein level by immunohistochemical staining for ID4 on mammary gland sections at different developmental time points (Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 62B). Using the Monocle 2 package (Trapnell et al., 2014), we performed pseudo-temporal ordering of all basal cells to form a myoepithelial differentiation trajectory (Figure S2C). The nulliparous and involution cells clustered together in pseudo-time space in the least differentiated part of the trajectory. Basal cells from gestating mice were dispersed between the nulliparous and lactation stages, whereas basal cells at lactation were the most differentiated. Id4 expression decreased, whereas several myoepithelial markers (Acta2, Cnn1, Mylk, Myh11, and Oxtr) increased over pseuodotime (Figure S2D).
Figure 3
ID4 inhibits myoepithelial differentiation of organoids
(A) Schematic diagram of 3D Matrigel organoid assay. ID4-GFP + basal cells were FACS purified from adult (10–11 weeks) ID4-GFP reporter mice. Exon 1 and 2 of Id4 are floxed and a GFP reporter cassette introduced. Basal cells are reprogrammed in culture using ROCK inhibitor Y-27632 and irradiated NIH-3T3 feeder cells. Adenoviral-Cre is used to knock out ID4 as shown by western blotting. Single cells are then seeded on top of a Matrigel plug and grown for 6 days followed by immunofluorescent staining and quantification. (B, C, and E–F) Organoids grown from conditionally reprogrammed basal cells were treated with control GFP adenovirus (ID4 WT) or with Cre adenovirus (ID4 KO). Organoids were stained for ID4 (B) and α-SMA (E). Scale bar: 10 μm. Fluorescence was quantified for ID4 (C) and α-SMA (F) in approximately 10 organoids per experiment. n = 4.
(D) The average organoid size was determined per chamber. Unpaired two-tailed students t test. Error bars represent SEM. ∗p < 0.05, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001.
See also Figure S3.
Figure 4
ID4 negatively regulates EMT and ECM production in mammary epithelial cells
(A) (i) Western blot analysis of ID4 expression in Comma-Dβ cells treated with non-targeting (NT) or ID4-targeting siRNA. Representative results from three western blots shown. (ii) Densitometry quantification of ID4 bands. Band intensity was normalized to β-Actin and expressed as fold change relative to NT control. N = 3. Unpaired two-tailed students t test.
(B) Genes were ranked based on the limma t-statistic comparing NT and siID4 cells, and GSEA was carried out using Gene Ontology (GO) gene sets. The top 10 positively (red) and negatively (blue) enriched pathways are displayed.
(C) GO GSEA results were visualized using Cytoscape EnrichmentMap. Nodes represent gene sets and edges represent overlap. Gene sets with an FDR<0.1 are shown.
(D) Representative GSEA enrichment plots displaying the profile of the running Enrichment Score (green) and positions of gene set members on the rank ordered list. NES and FDR are indicated.
(E) (i) Western blot analysis of ID4, α-SMA, CNN2, ZEB1, SLUG, and SNAIL in Comma-Dβ cells overexpressing ID4 (ID4 OE). (ii) Densitometry quantification of bands normalized to β-Actin as a fold-change relative to control cells. N = 3. Unpaired two-tailed students t test. Error bars represent SEM. (iii) Morphology of Comma-Dβ cells overexpressing ID4. Scale bar: 100 μm.
(F) (i) Collagen fibers were visualized by picrosirius red staining of TEBs from 6-week-old ID4 WT and KO mice. Scale bar: 50 μm. Total collagen staining (ii) and birefringence signal (iii) were quantified from approximately three TEBs from each mammary gland section. N = 9 for WT and N = 6 for KO mice. Unpaired two-tailed students t test. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001. Error bars represent SEM.
See also Figure S4 and Table S3.
Figure 5
ID4 interacts with E-proteins E2A and HEB
(A) Unsupervised hierarchical clustering heatmap of SWATH RIME data from Comma-Dβ cells. Proteins with significantly higher abundance (p value<0.05) in the ID4 IPs compared with IgG IPs are displayed. Log2 protein area was used to generate the heatmap. Data from three independent experiments shown, each with 2–3 technical replicates.
(B) Co-immunoprecipitation (co-IP) and western blotting of ID4, E2A, and HEB and IgG negative controls from uncrosslinked Comma-Dβ lysates. Irrelevant lanes were digitally removed as indicated by the space.
(C) Proximity ligation assay (PLA) in Comma-Dβ cells for ID4 and HEB or corresponding negative control IgG. The cytoskeleton is stained with phalloidin and nucleus with DAPI. Scale bar: 20 μm. High power insets are shown below.
(D) Quantification of PLA foci from six random fields of view for each condition, each with approximately 50 nuclei per image. Ordinary one-way ANOVA test was used to test significance. ∗∗∗∗p < 0.001. N = 3.
(E) Co-immunofluorescent staining of V5 and HEB in TEB (upper) and duct (lower) from ID4-FlagV5 mouse mammary glands. High-power insets feature cells positive for both proteins. Scale bar: 20 μm.
(F) co-IP and western blotting from ID4-FlagV5 mammary gland protein extracts. ID4 was immunoprecipitated using antibodies raised against ID4 and Flag and two independent V5 antibodies. ID4, V5, and HEB were detected by western blotting.
See also Figure S5 and Table S4.
Figure 6
HEB directly binds to a subset of ID4 target genes
(A) Top four enriched transcription factor binding motifs determined using MEME-ChIP for consensus HEB ChIP-seq peaks in Comma-Dβ cells. E-values are displayed.
(B) GREAT pathway analysis of consensus HEB peaks. Top 16 Gene Ontologies (Biological process, cellular component, molecular function) are displayed.
(C) Venn diagram showing overlap between genes associated with an HEB peak and ID4 RNA-seq differentially expressed genes.
(D) Profile plots of average HEB, H3K4Me3, H3K27Ac, and H3K27Me3 signal intensity at regions associated with siID4 downregulated (left) and upregulated (right) RNA-seq differentially expressed genes.
(E) Examples of HEB and histone mark peaks occurring upstream of siID4 upregulated genes Sparc, Col1a1, and Cnn2 from the Integrative Genomics Viewer (IGV). Bars beneath peaks represent consensus MACS call (FDR<0.05) in at least two of four biological replicates. Input was used as a negative control. Purple boxes highlight HEB-binding regions. Refseq genes shown in blue. Data scales for each track are indicated.
(F) Volcano plot of differential binding analysis. Analysis using edgeR of HEB binding in siID4 verses NT of three biological replicates. Regions with an FDR<0.05 are indicated in pink.
(G) Profile plots of average HEB signal intensity in NT control (pink) and siID4 (purple) conditions at regions associated with RNA-seq siID4 downregulated (left), upregulated (middle), and unchanged (right) genes. (H) Model of ID4 and HEB action in mammary epithelial cells. Left: when ID4 is expressed it interacts with HEB, antagonizing its transcriptional activity. Right: when ID4 is depleted, HEB dimerizes and binds to E-box motifs in the promoters and enhancers of developmental genes involved in contraction and ECM. Below are hypothetical ChIP signals for H3K27Ac (blue), H3K4Me3 (green), and HEB (pink).
See also Figure S6 and Table S5.
We sought to identify transcriptional signatures associated with high and low Id4-expressing cells in the mammary basal epithelium across all developmental time points (Figure 2L and Table S2). Basal cells with high Id4 expression were enriched for genes involved in RNA binding, metabolic processes, translation, and ribosome biogenesis (Figure 2M). Conversely, genes enriched in the Id4-low basal cells were involved in muscle contraction and response to cytokine and hormone stimuli and circulatory system development (Figure 2N). Myoepithelial genes such as Oxtr, encoding the oxytocin receptor, and contractile genes such as Mylk, Cnn1, Cav1, and Myh11 were among the top differentially expressed genes in the cells with low Id4 expression (Figure 2L and Table S2). Thus, ID4 downregulation during pregnancy and lactation is associated with terminal differentiation into functionally mature contractile myoepithelial cells consistent with its role as a basal stem cell marker.
Loss of ID4 results in myoepithelial differentiation of mammary organoids
To functionally validate the role of ID4 in suppressing the myoepithelial differentiation of basal cells we generated a primary basal cell organoid model (Figure 3A). The organoid model system complements and expands upon the findings from the KO mouse, as the acute consequence of ID4 loss on basal cell phenotype can be determined. Furthermore, organoids are less complex cellular systems than tissue, thus cell-autonomous effects can be isolated more precisely. ID4-positive basal cells from mice in which exons 1 and 2 of Id4 are floxed (Id4fl/fl) (Best et al., 2014) were isolated using cell sorting. To overcome culture-induced senescence, basal cells were conditionally reprogrammed into a proliferative stem/progenitor state using an established protocol utilizing irradiated 3T3 fibroblast feeders and Rho Kinase (ROCK) inhibition (Liu et al., 2012b; Prater et al., 2014) (Figure 3A). Conditionally reprogrammed basal cells adopted an epithelial cobblestone morphology and expressed high levels of ID4 (Figures S3A and S3B). Cells cultured in the absence of ROCK inhibitor or feeders adopted a flattened differentiated/senescent cell morphology and had reduced ID4 expression (Figures S3A and S3B). Reprogrammed cells maintained basal marker expression of P63 and KRT14 (Figure S3C) and could be grown as 3D organoids with basal marker KRT14 on the outer cell layer and luminal marker KRT8 on the inner cells of the organoids (Figure S3D).ID4 inhibits myoepithelial differentiation of organoids(A) Schematic diagram of 3D Matrigel organoid assay. ID4-GFP + basal cells were FACS purified from adult (10–11 weeks) ID4-GFP reporter mice. Exon 1 and 2 of Id4 are floxed and a GFP reporter cassette introduced. Basal cells are reprogrammed in culture using ROCK inhibitor Y-27632 and irradiated NIH-3T3 feeder cells. Adenoviral-Cre is used to knock out ID4 as shown by western blotting. Single cells are then seeded on top of a Matrigel plug and grown for 6 days followed by immunofluorescent staining and quantification. (B, C, and E–F) Organoids grown from conditionally reprogrammed basal cells were treated with control GFP adenovirus (ID4 WT) or with Cre adenovirus (ID4 KO). Organoids were stained for ID4 (B) and α-SMA (E). Scale bar: 10 μm. Fluorescence was quantified for ID4 (C) and α-SMA (F) in approximately 10 organoids per experiment. n = 4.(D) The average organoid size was determined per chamber. Unpaired two-tailed students t test. Error bars represent SEM. ∗p < 0.05, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001.See also Figure S3.ID4 expression was limited to the outer basal cells (Figure 3B), recapitulating expression of ID4 in cap cells of TEBs (Figure 2D). To test whether ID4 regulates differentiation of basal cells we deleted ID4 with adenoviral-Cre and compared these to cells treated with control adenoviral-GFP. ID4 protein was markedly downregulated in organoids infected with Cre adenovirus confirming successful gene deletion (Figures 3B and 3C). Loss of ID4 resulted in slightly smaller organoids with increased α-SMA fluorescent intensity (Figures 3D–3F). This finding demonstrates that as well as marking undifferentiated basal cells, ID4 has a cell autonomous role in impeding maturation of basal cells into myoepithelial cells.
ID4 inhibits expression of contractile and ECM genes
Given the negative association between ID4 and the differentiated myoepithelial phenotype, we sought to determine direct ID4 target genes by performing RNA-seq following siRNA-mediated ID4 depletion in the spontaneously immortalized mouse mammary epithelial cell line Comma-Dβ (Danielson et al., 1984). This normal-like cell line expresses basal markers and is commonly used as a model of mammary stem/progenitor cells, as they retain the capacity of multi-lineage differentiation when transplanted into mammary fat pads (Deugnier et al., 2006; Idoux-Gillet et al., 2018; Danielson et al., 1984; Junankar et al., 2015; Best et al., 2014). Western blotting confirmed 70%–80% knockdown (KD) of ID4 protein 48 h after siRNA transfection (Figure 4A). RNA-seq analysis of ID4 KD cells resulted in 471 and 421 (FDR<0.05) down- and upregulated genes, respectively, compared with the non-targeting siRNA control (Table S3).ID4 negatively regulates EMT and ECM production in mammary epithelial cells(A) (i) Western blot analysis of ID4 expression in Comma-Dβ cells treated with non-targeting (NT) or ID4-targeting siRNA. Representative results from three western blots shown. (ii) Densitometry quantification of ID4 bands. Band intensity was normalized to β-Actin and expressed as fold change relative to NT control. N = 3. Unpaired two-tailed students t test.(B) Genes were ranked based on the limma t-statistic comparing NT and siID4 cells, and GSEA was carried out using Gene Ontology (GO) gene sets. The top 10 positively (red) and negatively (blue) enriched pathways are displayed.(C) GO GSEA results were visualized using Cytoscape EnrichmentMap. Nodes represent gene sets and edges represent overlap. Gene sets with an FDR<0.1 are shown.(D) Representative GSEA enrichment plots displaying the profile of the running Enrichment Score (green) and positions of gene set members on the rank ordered list. NES and FDR are indicated.(E) (i) Western blot analysis of ID4, α-SMA, CNN2, ZEB1, SLUG, and SNAIL in Comma-Dβ cells overexpressing ID4 (ID4 OE). (ii) Densitometry quantification of bands normalized to β-Actin as a fold-change relative to control cells. N = 3. Unpaired two-tailed students t test. Error bars represent SEM. (iii) Morphology of Comma-Dβ cells overexpressing ID4. Scale bar: 100 μm.(F) (i) Collagen fibers were visualized by picrosirius red staining of TEBs from 6-week-old ID4 WT and KO mice. Scale bar: 50 μm. Total collagen staining (ii) and birefringence signal (iii) were quantified from approximately three TEBs from each mammary gland section. N = 9 for WT and N = 6 for KO mice. Unpaired two-tailed students t test. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p < 0.0001. Error bars represent SEM.See also Figure S4 and Table S3.Downregulated genes were predominantly involved in cell proliferation and growth pathways (Figures 4B and 4C; blue), including hallmark gene sets such as E2F targets, MYC targets, and G2M checkpoint (Figures S4A and S4B). Driving the enrichment of these gene sets were several key cell cycle genes such as Mki67, Cdk2, Cdk6, and Cdk17 (Table S3). This result complements the loss of cell growth gene expression programs in ID4 KO basal cells in vivo (Figure 1D).Genes acutely upregulated upon ID4 depletion were involved in development, morphogenesis, ECM remodeling, and immune signaling (Figures 4B and 4C; red). Consistent with ID4 repressing myoepithelial specialization, loss of ID4 resulted in upregulation of SRF targets, actomyosin cytoskeleton, EMT, and myogenesis gene signatures (Figures 4D, S4A, and S4B). Several contractile genes were increased in ID4 depleted cells including Cnn1, Cnn2, Tagln, Lmod1, and Acta2 (FDR = 0.06) (Table S3), many of which were inversely correlated with Id4 expression in the scRNA-seq analysis (Figure 2L and Table S2).To independently validate the transcriptomic results implicating ID4 in repression of contractile EMT genes, we overexpressed ID4 in Comma-Dβ cells and performed western blotting analysis for several EMT proteins. Overexpression of ID4 resulted in downregulation of α-SMA (Figure 4E i–ii), consistent with the upregulation of this marker in the primary organoid culture upon loss of ID4 (Figures 3E and 3F). CNN2, another smooth muscle contractile protein, as well as classical EMT markers ZEB1 and SLUG, were also suppressed by ID4 (Figure 4E i-ii). Finally, morphological inspection of ID4 overexpressing cells revealed a cobblestone epithelial appearance compared with the more mesenchymal control cells. Together, these results confirm that ID4 blocks expression of genes involved in myoepithelial contraction and EMT.Several ECM genes encoding collagens (e.g. Col1a1, Col1a2, and Col5a1), basement membrane laminins (e.g. Lamc1 and Lama4), and matricellular proteins (e.g. Sparc) were also upregulated upon ID4 depletion (Figures 4C and 4D and Table S3). The ECM provides physical support to the mammary gland and is a source of biochemical signals that coordinate morphogenesis (Muschler and Streuli, 2010). Changes in ECM gene expression are associated with EMT and cellular contractility (Kiemer et al., 2001; Liu et al., 2012a). To functionally validate the role of ID4 in regulating ECM proteins we examined whether ID4 represses ECM deposition in vivo using picrosirius red staining of mammary glands from ID4 WT and KO mice to visualize fibrillar collagen (Figure 4F). ID4 KO TEBs in pubertal mammary glands were surrounded by a thickened collagen-dense ECM when compared with ID4 WT TEBs (Figure 4F i–ii), indicating that ID4 normally restrains collagen expression by cap cells during puberty. In addition to total abundance, the thickness of bundled collagen fibers can be further distinguished using polarized light. Analysis of birefringence signal revealed an increase in thick fibers and a decrease in thin fibers in the ECM surrounding ID4 KO TEBs (Figure 4F iii), signifying a redistribution of collagen composition, as well as an overall increase in collagen abundance, in the absence of ID4.Hence, ID4 positively regulates proliferative genes and negatively regulates genes involved in myoepithelial functions such as contraction and ECM synthesis in mammary epithelial cells. These ID4-regulated functions are likely to be critical for morphogenesis of the ductal tree during pubertal development.
ID4 interacts with E-proteins in mammary epithelial cells
As ID4 lacks a DNA-binding domain, it influences transcription through its interaction with other DNA-binding proteins. We used Rapid Immunoprecipitation Mass spectrometry of Endogenous proteins (RIME) to discover the binding partners of ID4 in Comma-Dβ cells (Mohammed et al., 2013). We identified 48 proteins that were significantly (p < 0.05) more abundant in the ID4 IPs compared with the IgG-negative control IPs in three independent RIME experiments (Figure 5A and Table S4). ID4 was consistently identified in all replicates and was the top hit (Table S4), verifying the validity of the technique. Among the putative ID4-binding partners were many DNA- and RNA-interacting proteins (Figure 5A).ID4 interacts with E-proteins E2A and HEB(A) Unsupervised hierarchical clustering heatmap of SWATH RIME data from Comma-Dβ cells. Proteins with significantly higher abundance (p value<0.05) in the ID4 IPs compared with IgG IPs are displayed. Log2 protein area was used to generate the heatmap. Data from three independent experiments shown, each with 2–3 technical replicates.(B) Co-immunoprecipitation (co-IP) and western blotting of ID4, E2A, and HEB and IgG negative controls from uncrosslinked Comma-Dβ lysates. Irrelevant lanes were digitally removed as indicated by the space.(C) Proximity ligation assay (PLA) in Comma-Dβ cells for ID4 and HEB or corresponding negative control IgG. The cytoskeleton is stained with phalloidin and nucleus with DAPI. Scale bar: 20 μm. High power insets are shown below.(D) Quantification of PLA foci from six random fields of view for each condition, each with approximately 50 nuclei per image. Ordinary one-way ANOVA test was used to test significance. ∗∗∗∗p < 0.001. N = 3.(E) Co-immunofluorescent staining of V5 and HEB in TEB (upper) and duct (lower) from ID4-FlagV5 mouse mammary glands. High-power insets feature cells positive for both proteins. Scale bar: 20 μm.(F) co-IP and western blotting from ID4-FlagV5 mammary gland protein extracts. ID4 was immunoprecipitated using antibodies raised against ID4 and Flag and two independent V5 antibodies. ID4, V5, and HEB were detected by western blotting.See also Figure S5 and Table S4.The E-proteins E2A and HEB were identified as ID4 interactors in our RIME analysis. There are three members of the E-protein family (E2A, HEB, and ITF-2), that dimerize with other E-proteins or tissue-specific bHLH transcription factors (e.g. MyoD and NeuroD) to regulate expression of lineage commitment genes (Wang and Baker, 2015). E2A and HEB expression was confirmed in the mammary gland epithelium by IHC (Figure S5A). E2A has been implicated in branching morphogenesis of mammary organoids (Lee et al., 2011); however, there are no previous studies implicating HEB in mammary gland development. Given this, and that E-proteins are the canonical binding partners of ID proteins in other lineages, we chose to pursue the ID4-HEB interaction further.Reciprocal co-immunoprecipitation followed by western blotting (co-IP WB) experiments were performed to validate the ID4-E-protein interactions in the same Comma-Dβ cell line, as well as in normal human mammary epithelial cell lines PMC42 and MCF10A. IP of ID4 resulted in co-IP of E2A and HEB, and correspondingly, IP of E2A and HEB resulted in co-IP of ID4 (Figures 5B and S5B). E2A and HEB did not form heterodimers with each other in Comma-Dβ cells (Figure 5B), implying that E-proteins function either as homodimers or as heterodimers with other bHLH proteins in this context. To identify mammary-specific bHLH factors binding to HEB, we performed another RIME experiment by immunoprecipitating HEB protein. HEB (HTF4_MOUSE in Table S4) was identified as the top hit, and ID4 was also identified (Figure S5C). However, we did not identify any other bHLH transcription factors in this experiment (Figure S5C and Table S4). This suggests that HEB either binds DNA as a homodimer or binds another factor that was not detected by this assay. To independently validate the interaction between ID4 and HEB, the proximity ligation assay (PLA) was used to visualize protein-protein interactions in situ. Multiple PLA foci were detected in the nuclei of Comma-Dβ cells co-stained with ID4 and HEB antibodies (Figures 5C and 5D).To test if ID4 and HEB interacted in vivo, we engineered a tagged ID4mouse model in which a FlagV5 tag, a very efficient and specific target for immunoprecipitation, was integrated into the Id4 locus downstream of the open reading frame. The result was FlagV5-tagged ID4 protein under the control of endogenous regulatory elements. We first ensured that ID4 and HEB were co-expressed by performing co-immunofluorescent staining for V5 and HEB. ID4-FlagV5 expression was tightly restricted to the cap and ductal basal cells, whereas HEB was more ubiquitous in its expression, detected in basal and luminal epithelial cells and surrounding stromal cells (Figure 5E). ID4 and HEB co-localized in the nuclei of cap and duct basal cells (insets; Figure 5E). Reciprocal co-IP was carried out from digested mammary glands of ID4-FlagV5 mice using antibodies raised against ID4, Flag, V5 (two different antibodies), and HEB (Figure 5F). Each antibody tested was able to precipitate both ID4 and HEB, confirming interaction between the two transcription factors in vivo.
HEB binds to regulatory elements of a subset of ID4 differentially expressed genes
To establish if ID4 regulates gene expression through HEB, we sought to determine if HEB directly binds to genes regulated by ID4 by performing ChIP-seq for HEB in Comma-Dβ cells. Across four biological replicates there were a total of 2,752 HEB peaks identified (FDR<0.05). This was narrowed down to 956 consensus peaks, which were present in at least two replicates. Transcription factor motif enrichment was carried out using MEME-ChIP (Machanick and Bailey, 2011), and the top enriched motifs were canonical E-box motifs (CANNTG), which are the binding sites for E-proteins (Figure 6A). The majority of peaks were mapped to intergenic and intronic regions, and approximately 5% of the peaks occurred at gene promoters (Figure S6A). We used the Genomics Regions Enrichment of Annotations Tool (GREAT) to analyze the functional significance of the regions bound by HEB (McLean et al., 2010). In this unbiased analysis, top enriched pathways were related to actin cytoskeleton and ECM organization (Figure 6B), resembling the pathways negatively regulated by ID4 in the gene expression profiling (Figure 4B). This suggests that ID4 mediates repression through its physical interaction with HEB.HEB directly binds to a subset of ID4 target genes(A) Top four enriched transcription factor binding motifs determined using MEME-ChIP for consensus HEB ChIP-seq peaks in Comma-Dβ cells. E-values are displayed.(B) GREAT pathway analysis of consensus HEB peaks. Top 16 Gene Ontologies (Biological process, cellular component, molecular function) are displayed.(C) Venn diagram showing overlap between genes associated with an HEB peak and ID4 RNA-seq differentially expressed genes.(D) Profile plots of average HEB, H3K4Me3, H3K27Ac, and H3K27Me3 signal intensity at regions associated with siID4 downregulated (left) and upregulated (right) RNA-seq differentially expressed genes.(E) Examples of HEB and histone mark peaks occurring upstream of siID4 upregulated genes Sparc, Col1a1, and Cnn2 from the Integrative Genomics Viewer (IGV). Bars beneath peaks represent consensus MACS call (FDR<0.05) in at least two of four biological replicates. Input was used as a negative control. Purple boxes highlight HEB-binding regions. Refseq genes shown in blue. Data scales for each track are indicated.(F) Volcano plot of differential binding analysis. Analysis using edgeR of HEB binding in siID4 verses NT of three biological replicates. Regions with an FDR<0.05 are indicated in pink.(G) Profile plots of average HEB signal intensity in NT control (pink) and siID4 (purple) conditions at regions associated with RNA-seq siID4 downregulated (left), upregulated (middle), and unchanged (right) genes. (H) Model of ID4 and HEB action in mammary epithelial cells. Left: when ID4 is expressed it interacts with HEB, antagonizing its transcriptional activity. Right: when ID4 is depleted, HEB dimerizes and binds to E-box motifs in the promoters and enhancers of developmental genes involved in contraction and ECM. Below are hypothetical ChIP signals for H3K27Ac (blue), H3K4Me3 (green), and HEB (pink).See also Figure S6 and Table S5.The consensus peaks were annotated to 1,320 genes, using the default GREAT basal plus extension gene annotation rule (McLean et al., 2010). We overlapped these genes with those regulated following ID4 KD to determine the genes directly regulated by HEB (Figure 6C). Approximately 10% of these genes had an associated HEB peak, which is more than expected by chance (p < 5.65 × 10−13; hypergeometric test). The remainder of the genes regulated by ID4 KD are likely due to secondary effects of ID4 KD- or HEB-independent mechanisms. The fact that there was a similar number of genes overlapping in both upregulated (46) and downregulated (43) genes suggests that HEB can bind to sites both negatively and positively regulated by ID4. In line with this, E-proteins have previously been demonstrated to act as both activators and repressors of gene transcription by recruiting different co-factors (Bayly et al., 2004; Zhang et al., 2004).In parallel, we performed ChIP-seq for three histone modifications—H3K4Me3 (active promoter mark), H3K27Ac (active enhancer mark), and H3K27Me3 (repressive chromatin mark)—to elucidate the chromatin context of HEB-bound regions. A number of HEB peaks associated with ID4-regulated genes demonstrated a bimodal distribution of H3K27Ac signal (Figures 6D and S6B), suggesting that HEB binds to enhancers. Some of the peaks were also localized to active promoters (Figures 6D and S6B). HEB peaks were observed at enhancer-marked chromatin upstream of ID4-repressed ECM genes expressed in myoepithelial cells such as Sparc, Col1a1, Col1a2, Col3a1, and Col5a1 (Figures 6E and S6C) (Barsky and Karlin, 2005). HEB binding was also observed near the promoter of contractile gene Cnn2 (Figure 6E), whose RNA and protein product CNN2 was suppressed by ID4 (Table S3 and Figure 4E). Of relevance, Cnn2, encoding Calponin 2, was recently discovered to be regulated by a super enhancer specifically accessible in myoepithelial cells (Pervolarakis et al., 2019). This further indicates that ID4 inhibits HEB's ability to activate transcription of genes that define the myoepithelial fate.To determine whether HEB DNA binding is augmented when released from inhibition by ID4 we performed HEB ChIP-seq on Comma-Dβ cells in which ID4 had been depleted by siRNA. Western blotting revealed that ID4 protein was reduced to approximately 20% of control levels, whereas HEB expression was unchanged (Figure S6D). E-box motifs were again enriched in both conditions (Figure S6E). Differential binding analysis revealed a total of 290 regions changing upon ID4 depletion (p < 0.05) (Table S5). More peaks were increased than decreased (263 compared with 27), suggesting that depletion of ID4 increased HEB's DNA-binding activity (Figures 6F and S6F). GREAT analysis revealed that the peaks that increased were involved in processes such as gland morphogenesis, skeletal development, and branching morphogenesis (Figure S6G). No pathways were enriched in the regions that were decreased when ID4 was knocked down. Finally, we observed an increase in HEB binding in cells depleted of ID4, specifically at genes that were differentially expressed by ID4 KD (Figure 6G). Together, our ChIP-seq analysis suggests that HEB binds to E-box motifs in regulatory elements of basal developmental genes involved in ECM and the contractile cytoskeleton, and this is antagonized by its interaction with ID4 (Figure 6H).
Discussion
We show that ID4 represses genes associated with myoepithelial differentiation in mammary basal stem cells, in part through the inhibition of the E-protein HEB. ID4 has previously been demonstrated to block luminal commitment of basal cells via inhibition of key luminal driver genes including Elf5, Notch, Brca1, Esr1, PR, and FoxA1 (Junankar et al., 2015; Best et al., 2014). The dual inhibition of both luminal and myoepithelial differentiation by ID4 likely protects the stem-cell phenotype of uncommitted basal cells during development. Subsequent downregulation of ID4, through a currently unknown mechanism, may then allow basal cells to adopt a luminal or myoepithelial fate depending on the cellular context.HEB has not been associated with lineage commitment of epithelial tissues. It is, however, known to be involved in the specification of lymphocyte (Braunstein and Anderson, 2012), hematopoietic (Li et al., 2017), mesodermal (Yoon et al., 2015), neuronal (Mesman and Smidt, 2017), and skeletal muscle (Conway et al., 2004) lineages. Our proteogenomic analyses support the model outlined in Figure 6H. When ID4 is highly expressed, such as in cap cells, it sequesters HEB off chromatin, preventing expression of differentiation genes, thus determining a stem-like state (Figure 6H; left). When ID4 expression is low, such as in differentiating cells, HEB is able to bind to E-box DNA motifs at promoters/enhancers to activate transcription of developmental genes that specify functional myoepithelial cells (Figure 6H; right). Further functional studies are needed to demonstrate HEB's requirement in promoting myoepithelial differentiation.Compared with the luminal lineage, the molecular regulators controlling the basal lineage remain poorly understood. One of the few transcription factors known to promote myoepithelial differentiation is MADS-box protein SRF and its associated co-activator MRTFA (Sun et al., 2006; Li et al., 2006). HEB and SRF cooperatively activate transcription of Acta2 in cultured fibroblasts (Kumar et al., 2003). This occurred in an E-box-dependent manner and was inhibited by ID1 and ID2 overexpression (Kumar et al., 2003). Although we show that ID4 suppresses Acta2, we did not observe HEB binding to the Acta2 promoter. However, it is possible that in the absence of ID4, HEB and SRF cooperate to drive expression of other myoepithelial genes. This is supported by the positive enrichment of SRF targets upon ID4 depletion. Subsequent studies should test whether HEB and SRF cooperate in mammary epithelial cells.The roles of ID4 in regulating myoepithelial commitment and ECM deposition expand upon why ID4 is required for pubertal mammary gland morphogenesis (Junankar et al., 2015; Dong et al., 2011; Best et al., 2014). TEBs undergo collective migration, enabling the coordinated movement of adherent cells into the stromal fat pad (Ewald et al., 2008). We hypothesize that the high levels of ID4 in cap cells prevent epithelial cells from acquiring a mesenchymal/myoepithelial phenotype. Similar mechanisms have been observed for the transcriptional repressor OVOL2, which inhibits EMT to allow for collective migration (Watanabe et al., 2014), and for C/EBPα, which maintains epithelial homeostasis of human mammary epithelial cells (Lourenço et al., 2020). During ductal elongation, collagenous stromal ECM is absent directly in front of the invading TEBs (Silberstein et al., 1990; Sternlicht, 2006). We show that ID4 suppresses collagen synthesis and deposition around TEBs, which may otherwise act as a physical barrier to impede invasion. In support of this idea, ectopic deposition of collagen by mammary epithelial cells induced by exogeneous TGF-β, or forced expression of recombinant type I collagen that is resistant to collagenase attack, causes ensheathment of TEBs by collagen and retardation of ductal elongation (Silberstein et al., 1990; Feinberg et al., 2018).Developmental transcription factors are often dysregulated in cancer. ID4 is highly expressed in ~50% of basal-like breast cancer (BLBC) cases and associates with poor prognosis (Junankar et al., 2015; Baker et al., 2016). Interestingly, ID4 has also been implicated in prostate development (Sharma et al., 2013) and acts as a tumor suppressor in this context (Carey et al., 2009). It is likely that the different repertoire of binding partners in different cell types gives rise to the organ-specific functions of ID4 in the breast and prostate. Given the cell-intrinsic role of ID4 in promoting growth/proliferation and inhibiting differentiation in the mammary gland, it is easy to envision how overexpression of ID4 could lead to an aggressive breast cancer phenotype. In addition, the suppression of ECM synthesis may allow tumor cells to easily invade into the surrounding stroma, akin to the invasion of cap cells during ductal elongation. Future work should test whether the mechanisms discovered here are conserved in breast cancer initiation and progression.To conclude, these insights into ID4 and HEB function help unravel regulation within the basal differentiation hierarchy, with broader implications for the regulation of epithelial stem cells in general, and also in tumor progression.
Limitations of the study
A caveat of this study is that dissection of ID4's molecular mechanism by RIME and ChIP-seq was performed in one cell line. However, the Comma-D cell line is a well-accepted model for mammary development often used for the purpose of genomic and biochemical studies (Ibarra et al., 2007; Wellberg et al., 2010; Best et al., 2014), and results were validated in human cell lines and transgenic mouse models (Figures S5, 5E, and 5F).
Resource availability
Lead contact
Further information and requests for resources and reagents should be directed to the Lead Contact, Alexander Swarbrick (a.swarbrick@garvan.org.au).
Materials availability
The ID4-FlagV5 mouse model is available upon request with a Material Transfer Agreement.
Data and code availability
The accession number for the RNA-seq and ChIP-seq data reported in this paper isGEO: GSE149969. The accession number for the mass spectrometry proteomics data is PRIDE :: PXD017517.
Methods
All methods can be found in the accompanying Transparent methods supplemental file.
Authors: Jie Dong; Shixia Huang; Marian Caikovski; Shaoquan Ji; Amanda McGrath; Myra G Custorio; Chad J Creighton; Paul Maliakkal; Ekaterina Bogoslovskaia; Zhijun Du; Xiaomei Zhang; Michael T Lewis; Fred Sablitzky; Cathrin Brisken; Yi Li Journal: Development Date: 2011-12 Impact factor: 6.868
Authors: Marie-Liesse Asselin-Labat; Kate D Sutherland; Holly Barker; Richard Thomas; Mark Shackleton; Natasha C Forrest; Lynne Hartley; Lorraine Robb; Frank G Grosveld; Jacqueline van der Wees; Geoffrey J Lindeman; Jane E Visvader Journal: Nat Cell Biol Date: 2006-12-24 Impact factor: 28.824
Authors: Sounak Sahu; Mary E Albaugh; Betty K Martin; Nimit L Patel; Lisa Riffle; Susan Mackem; Joseph D Kalen; Shyam K Sharan Journal: Sci Rep Date: 2022-05-03 Impact factor: 4.996