| Literature DB >> 33112891 |
Harlan Barker1, Seppo Parkkila1.
Abstract
The World Health Organization declared the COVID-19 epidemic a public health emergency of international concern on March 11th, 2020, and the pandemic is rapidly spreading worldwide. COVID-19 is caused by a novel coronavirus SARS-CoV-2, which enters human target cells via angiotensin converting enzyme 2 (ACE2). We used a number of bioinformatics tools to computationally characterize ACE2 by determining its cell-specific expression in trachea, lung, and small intestine, derive its putative functions, and predict transcriptional regulation. The small intestine expressed higher levels of ACE2 mRNA than any other organ. By immunohistochemistry, duodenum, kidney and testis showed strong signals, whereas the signal was weak in the respiratory tract. Single cell RNA-Seq data from trachea indicated positive signals along the respiratory tract in key protective cell types including club, goblet, proliferating, and ciliary epithelial cells; while in lung the ratio of ACE2-expressing cells was low in all cell types (<2.6%), but was highest in vascular endothelial and goblet cells. Gene ontology analysis suggested that, besides its classical role in the renin-angiotensin system, ACE2 may be functionally associated with angiogenesis/blood vessel morphogenesis. Using a novel tool for the prediction of transcription factor binding sites we identified several putative binding sites within two tissue-specific promoters of the ACE2 gene as well as a new putative short form of ACE2. These include several interferon-stimulated response elements sites for STAT1, IRF8, and IRF9. Our results also confirmed that age and gender play no significant role in the regulation of ACE2 mRNA expression in the lung.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33112891 PMCID: PMC7592753 DOI: 10.1371/journal.pone.0240647
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 2Immunohistochemical localization of ACE2 protein in selected human tissues.
In the duodenum (A), the protein is most strongly localized to the apical plasma membrane of absorptive enterocytes (arrows). The goblet cells (arrowheads) show weaker apical staining. Intracellular staining is confined to the absorptive enterocytes. In the kidney (B), ACE2 shows strong apical staining in the epithelial cells of the proximal convoluted tubules (arrows) and Bowman´s capsule epithelium (arrowheads). The distal convoluted tubules are negative (asterisk). The testis specimen (C) shows strong immunostaining in the seminiferous tubules (arrows) and interstitial cells (arrowheads). The lung sample (D) is negative. In the nasopharyngeal mucosa (E), ACE2 signal is very weak and only occasional epithelial cells show weak signals (arrows). Immunostained specimens were taken from the Protein Expression Atlas (https://www.proteinatlas.org/).
Fig 1Expression of ACE2 mRNA in selected human tissues.
Expression values as TPM have been extracted from the FANTOM5 dataset.
Fig 3Single cell RNA-Seq analysis of different cell types from the lung (human), derived from data from GEO dataset GSE136831 [50].
ACE2 mRNA expression as normalized, batch-corrected counts is shown for comparison in upper panel. The force directed layout plot was computed and visualized in ScanPy [54]. For each cell type the ratio of cells expressing ACE2 is presented in addition to a stacked barplot of the relative cell type frequencies in the whole dataset. Alveolar type I (ATI), alveolar type II (ATII), pulmonary neuroendocrine cells (PNEC), smooth muscle cells (SMC), vascular endothelia (VE).
Fig 4Single cell RNA-Seq analysis of different cell types from the respiratory tract (mouse tracheal epithelium), derived from data from GEO dataset GSE103354 [51].
ACE2 mRNA expression as normalized, batch-corrected counts is shown for comparison in upper panel. The force directed layout plot was computed and visualized in ScanPy [54]. For each cell type the ratio of cells expressing ACE2 is presented in addition to a stacked barplot of the relative cell type frequencies in the whole dataset.
Fig 5Single cell RNA-Seq analysis of mouse intestinal epithelial cells.
Data is from GEO dataset GSE92332 [52]. ACE2 mRNA expression as normalized, batch-corrected counts is shown for comparison in upper panel. The force directed layout plot was computed and visualized in ScanPy [54]. For each cell type the ratio of cells expressing ACE2 is presented in addition to a stacked barplot of the relative cell type frequencies in the whole dataset.
Fig 6Effect of age on ACE2 mRNA expression levels.
Data is extracted from the GTEx dataset as TPM. In these organs, ANOVA revealed significant differences between age categories in tibial nerve (p = 8.58 x 10−6), minor salivary gland (p = 0.002), and whole blood (p = 0.005). In other tissues, the differences did not reach statistical significance. The highest TPM values are seen in the small intestine, testis, and kidney.
Fig 7Effect of gender on ACE2 mRNA expression levels.
Data is extracted from the GTEx dataset as TPM. The expression levels in males and females overlap in all tissue categories. Statistically significant differences studied by ANOVA analysis were determined in esophagus-gastroesophageal junction (p = 0.024) and whole blood (p = 0.0002). The ACE2 mRNA expression levels in testis specimens are shown here for comparison. 1 = male, 2 = female.
Fig 8Prediction of transcription factor binding sites in the human ACE2 gene promoter regions of Ensembl transcripts ENST00000252519 and ENST00000427411 using TFBSfootprinter.
A) Promoter region of the intestine specific ACE2 transcript. The results show putative binding sites for several transcription factors which have a strong correlation of expression with ACE2 in colon, kidney, and ileum. The predicted binding sites overlap regions of conservation in mammal species (Ensembl GERP), and cluster within 400 base pairs (bp) of the transcription start site. B) Promoter region of the lung specific ACE2 transcript. The predicted binding sites cluster within 200 bp of the TSS and overlap regions of conservation in mammal species, ATAC-Seq peaks (ENCODE), and TFBS metaclusters (GTRD).
Fig 9Prediction of interferon-stimulation mediating transcription factor binding sites in the promoter region of the putative short-form of human ACE2 gene using TFBSfootprinter.
The long terminal repeat 16A1 (LTR16A1) is cataloged in the FANTOM dataset [36] and has been identified as potentially relevant for interferon-mediated transcription of this new ACE2 transcript [56]; it is displayed in the CAGE peaks track along with an arrow indicating the TSS. The region analyzed represents –1,000 bp to +200 bp relative to the putative TSS, while nucleotide positions at bottom are given relative to the ACE2 full length transcript.
Genes associated with ACE2 mRNA expression in selected human tissues.
Derived from GTEx bulk RNA-Seq data [39].
| Tissue | Correlated_gene | Correlation | p-value | HGNC | UniProt | Description | Panther protein class |
|---|---|---|---|---|---|---|---|
| 0.9653 | 8.29E-110 | 3355 | Q07075 | Glutamyl aminopeptidase | metalloprotease (PC00153) | ||
| 0.9653 | 8.35E-110 | 7020 | Q16820 | Meprin A subunit beta | metalloprotease (PC00153) | ||
| 0.9621 | 2.19E-106 | 10920 | P46059 | Solute carrier family 15 member 1 | transporter (PC00227) | ||
| 0.9619 | 4.4E-106 | 7015 | Q16819 | Meprin A subunit alpha | metalloprotease (PC00153) | ||
| 0.9608 | 5.63E-105 | 11025 | Q07837 | Neutral and basic amino acid transport protein rBAT | amylase (PC00048) | ||
| 0.9587 | 6.45E-103 | 603 | P04114 | Apolipoprotein B-100 | |||
| 0.9581 | 2.33E-102 | 27960 | Q695T7 | Sodium-dependent neutral amino acid transporter B(0)AT1 | primary active transporter (PC00068) | ||
| 0.9562 | 1.16E-100 | 500 | P15144 | Aminopeptidase N | metalloprotease (PC00153) | ||
| 0.9559 | 2.45E-100 | 7467 | P55157 | Microsomal triglyceride transfer protein large subunit | transporter (PC00227) | ||
| 0.9554 | 6.24E-100 | 23534 | A6NHL2 | Tubulin alpha chain-like 3 | tubulin (PC00228) | ||
| 0.9092 | 1.07E-138 | 9086 | P60201 | Myelin proteolipid protein | myelin protein (PC00161) | ||
| 0.9056 | 1.52E-152 | 7015 | Q16819 | Meprin A subunit alpha | metalloprotease (PC00153) | ||
| 0.9012 | 7.07E-32 | 14599 | Q9UJW2 | Tubulointerstitial nephritis antigen | cysteine protease (PC00081) | ||
| 0.8964 | 8.19E-145 | 4212 | Q9Y2T3 | Guanine deaminase | deaminase (PC00088) | ||
| 0.8963 | 9E-145 | 28101 | Q2M2H8 | Probable maltase-glucoamylase 2 | glucosidase (PC00108) | ||
| 0.894 | 6.66E-143 | 23473 | Q5W0B7 | Transmembrane protein 236 | |||
| 0.8935 | 1.58E-142 | 25974 | Q9HCC8 | Glycerophosphoinositol inositolphosphodiesterase GDPD2 | |||
| 0.8932 | 2.81E-142 | 9090 | Q14651 | Plastin-1 | non-motor actin binding protein (PC00165) | ||
| 0.893 | 3.59E-142 | 4905 | Q9UM44 | HERV-H LTR-associating protein 2 | immunoglobulin receptor superfamily (PC00124) | ||
| 0.8928 | 5.27E-142 | 11025 | Q07837 | Neutral and basic amino acid transport protein rBAT | amylase (PC00048) | ||
| 0.8914 | 6.12E-141 | 2194 | Q9UMD9 | Collagen alpha-1(XVII) chain | extracellular matrix structural protein (PC00103) | ||
| 0.8911 | 2.91E-125 | 17592 | Q9BT67 | NEDD4 family-interacting protein 1 | |||
| 0.8908 | 4.67E-125 | 11219 | P09486 | SPARC | extracellular matrix glycoprotein (PC00100) | ||
| 0.8905 | 2.94E-140 | 14599 | Q9UJW2 | Tubulointerstitial nephritis antigen | cysteine protease (PC00081) | ||
| 0.8886 | 1.34E-123 | 10606 | P22307 | Non-specific lipid-transfer protein | transfer/carrier protein (PC00219) | ||
| 0.8852 | 2.44E-121 | 11097 | P28370 | Probable global transcription activator SNF2L1 | DNA helicase (PC00011) | ||
| 0.8829 | 6.45E-120 | 814 | P20020 | Plasma membrane calcium-transporting ATPase 1 | primary active transporter (PC00068) | ||
| 0.8815 | 4.6E-119 | 28675 | Q86XK7 | V-set and immunoglobulin domain-containing protein 1 | |||
| 0.8809 | 1.09E-118 | 5276 | O43719 | HIV Tat-specific factor 1 | RNA splicing factor (PC00148) | ||
| 0.8808 | 1.26E-118 | 117 | P21399 | Cytoplasmic aconitate hydratase | RNA binding protein (PC00031) | ||
| 0.8802 | 3.01E-118 | 13717 | Q9UJA9 | Ectonucleotide pyrophosphatase/phosphodiesterase family member 5 | nucleotide phosphatase (PC00173) | ||
| 0.8705 | 2.78E-27 | 28276 | Q6UX53 | Methyltransferase-like protein 7B | methyltransferase (PC00155) | ||
| 0.8683 | 5.3E-27 | 500 | P15144 | Aminopeptidase N | metalloprotease (PC00153) | ||
| 0.8671 | 7.59E-27 | 6694 | P98164 | Low-density lipoprotein receptor-related protein 2 | |||
| 0.8667 | 8.5E-27 | 10916 | Q9BZW2 | Solute carrier family 13 member 1 | secondary carrier transporter (PC00258) | ||
| 0.8608 | 4.52E-26 | 26625 | Q6NUS8 | UDP-glucuronosyltransferase 3A1 | |||
| 0.8535 | 3.26E-25 | 10996 | O14975 | Very long-chain acyl-CoA synthetase | secondary carrier transporter (PC00258) | ||
| 0.8526 | 4.1E-25 | 29437 | Q9HBJ8 | Collectrin | |||
| 0.8443 | 3.36E-24 | 20795 | Q8NCR9 | Clarin-3 | |||
| 0.8367 | 2.05E-23 | 124 | P13686 | Tartrate-resistant acid phosphatase type 5 | |||
| 0.6275 | 1.32E-64 | 24536 | Q6UWI2 | Prostate androgen-regulated mucin-like protein 1 | |||
| 0.6055 | 4.21E-59 | 2159 | P21554 | Cannabinoid receptor 1 | G-protein coupled receptor (PC00021) | ||
| 0.5918 | 6.52E-56 | 26442 | Q96DE0 | U8 snoRNA-decapping enzyme | |||
| 0.5817 | 1.23E-53 | 3062 | Q9NRD9 | Dual oxidase 1 | oxidase (PC00175) | ||
| 0.5739 | 6.2E-52 | 26507 | Q1HG43 | Dual oxidase maturation factor 1 | |||
| 0.571 | 2.53E-51 | 3768 | Q68DA7 | Formin-1 | |||
| 0.5692 | 6.05E-51 | 26217 | Q5U3C3 | Transmembrane protein 164 | |||
| 0.5673 | 1.54E-50 | 5213 | P51659 | Peroxisomal multifunctional enzyme type 2 | |||
| 0.5631 | 1.16E-49 | 7061 | P10620 | Microsomal glutathione S-transferase 1 | |||
| 0.5594 | 6.77E-49 | 10850 | P34896 | Serine hydroxymethyltransferase, cytosolic | methyltransferase (PC00155) |
Fig 10Expression of genes most highly correlated with ACE2 in single cell datasets of trachea and intestinal epithelia.
Trachea expression data is taken from GSE103354 [51] and intestinal epithelia data is derived from GSE92332 [52]. Visualized in ScanPy [54].
Gene ontology annotation results for the processes associated with genes strongly coexpressed (≥0.5) with ACE2 across all tissues in GTEx dataset.
Calculated with GProfiler Python library [41].
| Source | Native | Process | p-value |
|---|---|---|---|
| GO:BP | GO:0001525 | angiogenesis | 1.797E-08 |
| GO:BP | GO:0048514 | blood vessel morphogenesis | 4.611E-08 |
| GO:BP | GO:0001944 | vasculature development | 1.893E-07 |
| GO:BP | GO:0072358 | cardiovascular system development | 2.365E-07 |
| GO:BP | GO:0001568 | blood vessel development | 4.746E-07 |
| GO:BP | GO:0035239 | tube morphogenesis | 5.907E-07 |
| GO:BP | GO:0072359 | circulatory system development | 6.641E-07 |
| GO:BP | GO:0048646 | anatomical structure formation involved in morphogenesis | 1.885E-05 |
| GO:BP | GO:0035295 | tube development | 2.900E-05 |
| GO:BP | GO:1901342 | regulation of vasculature development | 0.025 |
| GO:BP | GO:0008217 | regulation of blood pressure | 0.035 |
| GO:BP | GO:0045765 | regulation of angiogenesis | 0.009 |
| GO:CC | GO:0071944 | cell periphery | 0.001 |
| GO:CC | GO:0009986 | cell surface | 0.002 |
| GO:CC | GO:0005886 | plasma membrane | 0.013 |
| GO:CC | GO:0046930 | pore complex | 0.003 |
| HP | HP:0005381 | recurrent meningococcal disease | 0.004 |
| HP | HP:0005430 | recurrent Neisserial infections | 0.007 |
| HP | HP:0100601 | eclampsia | 0.021 |
| KEGG | KEGG:04610 | complement and coagulation cascades | 0.002 |
| KEGG | KEGG:04923 | regulation of lipolysis in adipocytes | 0.037 |
| WP | WP:WP558 | complement and coagulation cascades | 0.011 |