| Literature DB >> 33207190 |
Marie Saitou1, Eliza A Gaylord2, Erica Xu3, Alison J May2, Lubov Neznanova4, Sara Nathan2, Anissa Grawe2, Jolie Chang5, William Ryan5, Stefan Ruhl6, Sarah M Knox7, Omer Gokcumen8.
Abstract
Salivary proteins are essential for maintaining health in the oral cavity and proximal digestive tract, and they serve as potential diagnostic markers for monitoring human health and disease. However, their precise organ origins remain unclear. Through transcriptomic analysis of major adult and fetal salivary glands and integration with the saliva proteome, the blood plasma proteome, and transcriptomes of 28+ organs, we link human saliva proteins to their source, identify salivary-gland-specific genes, and uncover fetal- and adult-specific gene repertoires. Our results also provide insights into the degree of gene retention during gland maturation and suggest that functional diversity among adult gland types is driven by specific dosage combinations of hundreds of transcriptional regulators rather than by a few gland-specific factors. Finally, we demonstrate the heterogeneity of the human acinar cell lineage. Our results pave the way for future investigations into glandular biology and pathology, as well as saliva's use as a diagnostic fluid.Entities:
Keywords: cell-line heterogeneity; developmental regulation; gene regulation; glycosylation; lincRNAs; regulatory architecture; salivary biomarkers; salivary mucins; salivary proteins; transcriptome
Year: 2020 PMID: 33207190 PMCID: PMC7703872 DOI: 10.1016/j.celrep.2020.108402
Source DB: PubMed Journal: Cell Rep Impact factor: 9.423
Figure 1.Overview of the Transcriptome Analysis
(A) Anatomical location of the three major glands in humans.
(B) Principal-component analysis of gene expression levels in adult and fetal salivary glands. Blue symbols, adult samples; yellow symbols, fetal samples. Triangle, square, and circle shapes represent the parotid (PAR), submandibular (SM), and sublingual (SL) glands, respectively (fetal gland types in lowercase letters).
(C) Hierarchical clustering analysis transcriptome data from the different adult and fetal gland types without a priori clustering information.
(D) Volcano plots showing the expression differences among gland types in a pairwise fashion for adult (top) and fetus (bottom). The x axis indicates gene expression log2 fold changes (log2). The y axis indicates a −log10 value of the adjusted p value. Genes with significantly different expressions among glands are indicated in red (adjusted p < 0.0001).
(E) Heatmaps of the log10 normalized expression values of gland-specific genes showing the 30 most highly expressed genes in the different mature and fetal gland types. The top gene, RN7SL1, was excluded because of its role as a housekeeping gene.
(F) Immunofluorescent localization of MUC5B (left panel) and MUC7 (right panel) in fetal glandular tissues. Left-side images of each panel show mucin (red) and E cadherin (blue) immunostaining without nuclei, and right-side images show a lower magnification of the same glandular region and include nuclei. ECAD, E cadherin. Scale bars, 25 μm.
Genes Expressed in Abundance in Salivary Glands
| Top Transcribed Genes[ |
|---|
| Top 20 Transcribed Genes in the SL[ |
| Top 20 Transcribed Genes in the PAR[ |
| Top 20 Transcribed Genes in the SM[ |
| Additional Highly Transcribed Genes of Reported Functional Relevance in Gland Development, Physiology, or Pathology[ |
| Additional Highly Transcribed Genes of Reported Functional Relevance[ |
| Top 10 Proteins Abundantly Found in Whole-Mouth Saliva that Likely Originate from Extrinsic Sources Such as Blood Plasma or Epithelial Linings of the Oral Cavity |
| Top 10 Transcribed TF Genes[ |
| Additional Highly Transcribed TF Genes of Reported Functional Relevance[ |
Listed are the top 10 genes expressed in each major gland type. Because some of these genes overlap, the total of genes listed here is lower than 30. For a more systematic look into their expression in salivary glands, see Tables S2 and S4.
Underlined gene names designate those that are predominantly expressed in the respective gland category (log2 fold change > 2).
A more detailed description of these genes, including references, is provided in the main text.
Figure 2.Categorization of Genes Based on Their Expression Trends in Fetal and Mature Salivary Glands
(A) Pie chart on the left indicates the proportions and numbers of genes (i.e., expressed >100 DESeq2 normalized counts [NCs]) that showed no significant differences (adjusted p > 0.0001, dark gray), were downregulated (adjusted p < 0.0001, green), or were either retained or upregulated in mature salivary glands compared with their fetal counterparts.
(B) Parallel set graph to summarize the breakdown of genes that show variable gene expression in adult glands, indicating how the differential transcriptome repertoires of mature glands are a product of gland-specific retention and upregulation of gene expression. Smaller pie charts at the right side of the parallel set graph indicate the proportion of retained and upregulated genes for each mature gland type.
(C) Heatmap showing genes highly expressed in fetal glands (>1,000 NCs) shown in relative abundance (Z score) that are retained in only one mature gland type from its fetal counterpart, but not in the other two mature gland types.
Figure 3.The Diverse Transcription Factor (TF) Repertoire of Mature Salivary Glands May Shape Hotspots of Salivary-Gland-Specific Expression across the Genome
(A) Heatmap of expression levels of TF genes (as listed in TF2DNA database; Pujato et al., 2014) across fetal and mature salivary gland tissues. Four categories of TFs are shown in the heatmap: TFs that are (1) differentially expressed (p < 0.0001) among mature glands, (2) abundant (>2,000 NCs) in adult or fetal glands, (3) previously associated with organogenesis, and (4) salivary gland specific (that is, >100 NCs in the salivary glands but negligible expression in all 53 GTEx tissue [<10 transcripts per million (TPM)]). LTF, a secreted protein in saliva, is listed here because one of its isoforms, delta lactoferrin, displays TF activity (He and Furmanski, 1995; Mariller et al., 2007).
(B) Immunofluorescent analysis of TF BHLHA15/MIST1 in adult glandular tissues. The SM and PAR cells are highly enriched for MIST1 compared with the SL cells. NKCC1/SLC12A2, Na-K-Cl cotransporter 1. Scale bar, 25 μm.
(C) Heatmap of expression levels of genes in mature salivary glands (>100 NCs) that show negligible expression in other tissues and organs (expression in all 53 GTEx tissues < 10 TPM). The specific tissues in the GTEx database used for this analysis are listed in Table S5. Epithelial or secretory tissues and organs important for comparison to salivary glands are indicated on top of the heatmap. Deviation from the mean expression for each column is shown as a Z score with a scale similar to that used in (A).
(D) Circos plot showing the locations of genes with salivary-gland-specific expression. These genes show considerable expression in the salivary glands (>100 NCs) but negligible expression in all 53 GTeX tissues. Clusters of genes located within 1 Mb of one another are pointed out with gene names inside the Circos plot. Genes that previously had not been reported within the context of salivary glands are indicated in red.
Figure 4.The Shaping of the Salivary Proteome
(A) Each graph represents a comparison of transcript abundances of a specific gland type, with protein abundances in that gland’s corresponding ductal saliva. x axis, log10 DESeq2 NCs; y axis, log10 normalized protein abundances. Blue dots indicate genes coding for secreted proteins. Genes showing the highest abundance (top 10%) at both the transcript and the protein level are highlighted in the top-right quadrant by a gray background and enlarged in the right panels, with their protein names indicated.
(B) Comparison of the most abundant proteins in human saliva with the protein abundances in 29 human organs from the Human Protein Atlas database (Wang et al., 2019). Genes were chosen based on their protein expression levels, according to the HSP-Wiki database, and their transcript levels, according to our salivary gland RNA-seq analysis. Heatmap colors indicate Z scores normalized for each row of data. Genes are ordered from top (highest) to bottom (lowest) based on their enrichment in salivary glands.
(C) Schematic showing the glandular origins of the most abundant saliva proteins in whole-mouth saliva. The central group of circles represents the most abundant proteins detected in whole-mouth saliva (data source: HSP-Wiki). The groups of circles on the outside represent the transcript levels in the PAR (orange), SL (blue), and SM (green) coding for the most abundant salivary proteins in the corresponding glandular secretions (data source: HSP-Wiki). The sizes (areas) of the circles symbolize relative RNA abundances normalized for each gland type. Colors in the central group of circles indicate the putative salivary gland origin of the proteins or their origin from blood plasma (gray). Blood plasma values are based on protein abundances. Data source: Human Plasma Proteome Project Data Central at PeptideAtlas http://www.peptideatlas.org/hupo/hppp/ (Schwenk et al., 2017). For blood plasma, only those proteins that were abundantly detected in whole-mouth saliva are shown. Proteins, indicated by an asterisk, are detected as secreted proteins at the glandular level but were not among the most abundant proteins detected in whole-mouth saliva.
(D) Heatmap of transcript levels for genes involved according to GO categorization in protein N-linked or O-linked glycosylation. Heatmap colors indicate Z scores normalized for each row of data. Table S2 provides the list of glycosylation-related genes and their gene expression in salivary glands.
Figure 5.Gland- and Cell-Specific Expression of Salivary Proteins
(A) Immunohistochemistry of glandular tissues. The SM and PAR acinar cells are highly enriched for amylase (AMY1), statherin (STATH), and lactoperoxidase (LPO) compared with the SL, consistent with these proteins being markers of serous cells. MUC7 and CRISP3 are expressed by a subset of acinar cells of the SM and SL, with little to no expression in the PAR. MUC5B is highly expressed by the SL mucous acinar cells, but not by the acinar cells of the PAR or SM, indicating it to be a marker of SL function. Scale bar, 25 μm.
(B) MUC7 and amylase are expressed by distinct subtypes of serous acinar cells. The mature SM tissue section was immunolabeled for MUC7 (red), AMY (green), and ECAD (gray).
(C) Gel electrophoretic separation of whole mouth saliva (WS) and glandular ductal secretions (PAR and SM/SL), followed by staining of proteins with Coomassie blue and of glycoproteins with periodic acid Schiff stain (pink bands in left panel) and probing of transfers with antibodies against MUC7, AMY, BPIFA2, CRISP3, and STATH (right panel).
KEY RESOURCES TABLE
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Antibodies | ||
| Rat polyclonal anti-E cadherin | Sigma | Cat# U3254; RRID: AB_477600 |
| Rabbit polyclonal Anti-MUC7 | Sigma | Cat# HPA006411; RRID: AB_1854204 |
| Mouse monoclonal anti-MUC7 | Abcam | Cat# 4D2-1D7; RRID: AB_10866568 |
| Mouse monoclonal anti-MUC5B | Abcam | Cat# ab105460; RRID: AB_10862195 |
| Rabbit polyclonal anti-CRISP3 | Sigma | Cat# HPA054392; RRID: AB_2682472 |
| Rabbit polyclonal anti-AMY1A | Sigma | Cat# HPA045394; RRID: AB_2679311 |
| Rabbit polyclonal anti-LPO1 | Sigma | Cat# HPA028688; RRID: AB_10601909 |
| Rabbit polyclonal anti-STATH | Dundee Cell Products Ltd., Dundee, UK | Custom |
| Rabbit polyclonal anti-BPIFA2 | Eurogentec | |
| Biological Samples | ||
| Human Salivary Gland Tissues | Laboratory of Sarah Knox | UCSF Biospecimen Resources Program (BIOS) 17-22669 |
| Human Saliva - Whole Saliva | Laboratory of Stefan Ruhl | Custom |
| Human Saliva - Parotid Saliva | Laboratory of Stefan Ruhl | Custom |
| Human Saliva - Submandibular/Sublingual Saliva | Laboratory of Stefan Ruhl | Custom |
| Chemicals, Peptides, and Recombinant Proteins | ||
| Optimal cutting temperature compound (OCT) | Tissue-Tek | Cat# 4583 |
| RNA lysis buffer, filter cartilages and collection tubes | Invitrogen | Cat# AM1912 |
| Pierce bicinchoninic acid (BCA) protein assay kit | Thermo Scientific | Cat# 23227 |
| Deposited Data | ||
| RNaseq data - Salivary glands | This study | |
| Salivary and serum proteomics data | Human Salivary Proteome Wiki | |
| RNaseq data from multiple tissues | GTEx | |
| Proteome data from multiple tissues | Human Protein Atlas | |
| Software and Algorithms | ||
| Custom codes for downstream analysis | Gokcumen Laboratory | |
| FastQC | Babraham Bioinformatics | v0.11.9 |
| Trimmomatic | v0.39 | |
| Kallisto | v0.46.1 | |
| DESeq2 | Bioconductor | v1.28.1 |
| GOrilla | vMar.8.2013 | |
| ggplot2 | tidyverse | V3.3.2 |