| Literature DB >> 35377806 |
Sarah K Nyquist1,2,3,4, Patricia Gao3, Tessa K J Haining3, Michael R Retchin3, Yarden Golan5, Riley S Drake1,3,6, Kellie Kolb1,3, Benjamin E Mead1,3, Nadav Ahituv5, Micaela E Martinez7, Alex K Shalek1,2,3,6,8,9,10, Bonnie Berger1,4, Brittany A Goods11.
Abstract
Human breast milk (hBM) is a dynamic fluid that contains millions of cells, but their identities and phenotypic properties are poorly understood. We generated and analyzed single-cell RNA-sequencing (scRNA-seq) data to characterize the transcriptomes of cells from hBM across lactational time from 3 to 632 d postpartum in 15 donors. We found that the majority of cells in hBM are lactocytes, a specialized epithelial subset, and that cell-type frequencies shift over the course of lactation, yielding greater epithelial diversity at later points. Analysis of lactocytes reveals a continuum of cell states characterized by transcriptional changes in hormone-, growth factor-, and milk production-related pathways. Generalized additive models suggest that one subcluster, LC1 epithelial cells, increases as a function of time postpartum, daycare attendance, and the use of hormonal birth control. We identify several subclusters of macrophages in hBM that are enriched for tolerogenic functions, possibly playing a role in protecting the mammary gland during lactation. Our description of the cellular components of breast milk, their association with maternal–infant dyad metadata, and our quantification of alterations at the gene and pathway levels provide a detailed longitudinal picture of hBM cells across lactational time. This work paves the way for future investigations of how a potential division of cellular labor and differential hormone regulation might be leveraged therapeutically to support healthy lactation and potentially aid in milk production.Entities:
Keywords: breast milk; macrophage; mammary epithelial cell; maternal health; single-cell RNA-sequencing
Mesh:
Year: 2022 PMID: 35377806 PMCID: PMC9169737 DOI: 10.1073/pnas.2121720119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.Atlas of cell types present in hBM across lactation. (A) Sampling timeline showing collection of samples for each donor as a function of time postpartum (days). (B) Projection of dimensionality reduced (UMAP) scRNA-seq data (n = 48,478 cells across 15 donors) colored by cell type, lactation stage (early, transitional, mature, and several late stages), and donor. (C) Marker genes (x axis and grouping labels on top) for each major cell type cluster (y axis). Circle size describes percent of cells in cluster expressing the gene. Color represents the mean log-normalized gene expression in that cluster standardized across clusters within each gene.
Fig. 2.Frequency of cell types over the course of lactation. (A) Frequency of cell types identified for each sample (Upper) and associated maternal and infant health information metadata (Lower) collected in user-reported questionnaires. Colored circles to the left of metadata names indicate associations of metadata with cell-type (specified by color) abundances and the direction of the association via plus (+) or minus (−). Different donors show associations in different directions with cell-type’s proportions () (B) Normalized cell frequencies as function of time postpartum for samples at timepoints <400 d postpartum are shown for all identified cell types. Spearman correlation coefficients (R) and P values from generalized additive models are shown below each plot, and confidence intervals around Spearman correlations are displayed in gray.
Fig. 3.Macrophage subclusters across lactation stage. (A) UMAP projection of hBM-macrophages, colored by lactation stage (Left), donor (Center), and macrophage subcluster (Right). (B) Heatmap of top marker genes for each identified macrophage subcluster (Dataset S3). (C) Reactome enrichment results for each subcluster. Full results are shown in Dataset S4. (D) Module scoring results for M1 or M2 gene sets for each subcluster (**P < 2.2e-16, *P < 2.3e-6).
Fig. 4.Subclustering analysis of epithelial cells reveals an increase in epithelial diversity over the course of lactation. (A) UMAP visualization of epithelial cells colored by epithelial subcluster (Left) or donor (Right). (B) Mean expression in cell subset standardized within genes (color) and percent of cells expression (dot size) of canonical mammary epithelial marker genes in each epithelial subgroup. (C) Mean expression in cell subset standardized within genes (color) and percent of cells expression (dot size) of marker genes for each epithelial subgroup identified by pseudobulk marker gene identification. (D) Reduced top Enrichr results from the GO biological processes 2021 database on the marker genes for each subgroup, colored by the mean gene-set score for all genes in that pathway on cells in that subgroup, scaled by a z-score across subgroups. GO annotation categories are labeled by manual curation of related GO classifications. (E) Proportions of each subgroup per sample, split by milk stage. Error bars show SD.
Fig. 5.Transcriptional programs of luminal epithelial cells change over the course of lactation. (A) Genes of interest changing over all epithelial clusters over the course of lactation, standardized expression over time; full results in Dataset S8. (B) Reduced top Enrichr GO biological process results on genes changing over time shared across both LC1 epithelial cells and secretory epithelial cells. Heatmaps represent sample means of gene-set scores of each pathway (rows) z-scored across samples (columns) and samples ordered by increasing time postpartum. Pathways colored by curated related GO term classifications; full results are in Dataset S9. (C) Hormone receptors, growth factor pathway components, AP-1 subunits, STAT5 and downstream targets, and milk component genes change with different dynamics in the LC1 epithelial and secretory lactocyte subclusters. Plots colored by mean expression of cells in each milk stage and time point z-scored across all time points and both subgroups (an asterisk indicates adjusted P < 0.05 via DESeq2 analysis over time) (Materials and Methods). Full results are in Dataset S8.