| Literature DB >> 31547151 |
Stefaan Verhulst1, Tania Roskams2, Pau Sancho-Bru3, Leo A van Grunsven4.
Abstract
BACKGROUND: Chronic liver diseases are frequently accompanied with activation of biliary epithelial cells (BECs) that can differentiate into hepatocytes and cholangiocytes, providing an endogenous back-up system. Functional studies on BECs often rely on isolations of an BEC cell population from healthy and/or injured livers. However, a consensus on the characterization of these cells has not yet been reached. The aim of this study was to compare the publicly available transcriptome profiles of human and mouse BECs and to establish gene signatures that can identify quiescent and activated human and mouse BECs.Entities:
Keywords: BEC; gene signature; scRNAseq; transcriptome
Mesh:
Year: 2019 PMID: 31547151 PMCID: PMC6829476 DOI: 10.3390/cells8101117
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Biliary epithelial cells (BEC) gene expression data.
| LPC Marker | Reference | Healthy | Injury | Hepatocytes | Niche | Injury Model | Species | Platform | GSEA | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LPC | Liver | Negative Fraction | LPC | Liver | Negative Fraction | ||||||||
| LGR5 | [ | x | x | x | x | CCl4 | Mouse | Agilent 014868 G4122F | GSE32210 | ||||
| MIC1 1C3 | [ | x | x | x | x | DDC | Mouse | Agilent 014868 G4122F | GSE29121 | ||||
| HNF1b | [ | x | x | x | DDC / CDE | Mouse | Mogene 2.0st | GSE51389 | |||||
| Foxl1 | [ | x | x | x | DDC | Mouse | Agilent 014868 G4122F | GSE28892 | |||||
| LPC cell lines | [ | x | x | None | Mouse | Mouse 430_2 | GSE85114/GSE12908/GSE18269 | ||||||
| Side population | [ | x | x | x | ASH | Human | Truseq LT | GSE102683 | |||||
| EpCAM | [ | x | x | x | ASH | Human | Truseq LT | GSE102683 | |||||
| TROP2 | [ | x | x | x | ASH | Human | Truseq LT | GSE102683 | |||||
| EpCAM | [ | x | None | Mouse | Mouse 430_2 | GSE63793 | |||||||
Microarray GEO sample (GSM) number associated with cell type or tissue.
| GSM Number | Cell Type or Tissue | Specie |
|---|---|---|
| GSM1061907, GSM1061908, GSM686644, GSM686645 | Bcell | Mouse |
| GSM1071644, GSM361592, GSM389821 | platelet_megakaryocyte | Mouse |
| GSM1081398 - GSM1081400 | CD8_T_cell | Mouse |
| GSM1129665, GSM686658, GSM686659 | macrophage | Mouse |
| GSM1214478-GSM1214480, GSM686650, GSM686651 | NK | Mouse |
| GSM1232678 - GSM1232680, GSM686652, GSM686653 | NKT | Mouse |
| GSM1281320 - GSM1281322, GSM525382 - GSM525384 | hepatocyte | Mouse |
| GSM1301661, GSM1301662 | neutrophil | Mouse |
| GSM190795 - GSM190797, GSM598999 - GSM599002, GSM658893 - GSM658898, GSM765922 - GSM765924 | epithelial | Mouse |
| GSM216494 - GSM216497, GSM686654 | erythroblast | Mouse |
| GSM298115 - GSM298117, GSM378250 - GSM378254 | fetal_liver | Mouse |
| GSM344315 - GSM344318 | neutrophil | Mouse |
| GSM547762 - GSM547766, GSM690763 - GSM690765 | endothelial | Mouse |
| GSM555381 - GSM555384 | CD4_T_cell | Mouse |
| GSM571897 | macrophage | Mouse |
| GSM591473, GSM591475, GSM591477, GSM591480, GSM602665 - GSM602667 | Healthy Liver | Mouse |
| GSM686646, GSM686647 | CD4_T_cell | Mouse |
| GSM686648, GSM686649 | CD8_T_cell | Mouse |
| GSM852330 - GSM852334 | Quiescent HSC | Mouse |
| GSM852341 - GSM852343 | HSC from BDL | Mouse |
| GSM852344 - GSM852346 | CCl4_HSC | Mouse |
| GSM1557526 - GSM1557528 | LPC EPCAM | Mouse |
| GSM2257924 - GSM2257940 | PIL | Mouse |
| GSM2257941 - GSM2257944 | BMOL | Mouse |
| GSM2257945 - GSM2257947 | T_LPC | Mouse |
| GSM323977 - GSM323981 | BMEL | Mouse |
| GSM715841 - GSM715844 | LPC_Foxl1_D0 | Mouse |
| GSM715856 - GSM715859 | LPC_Foxl1_POS_D3 | Mouse |
| GSM715860 - GSM715863 | LPC_Foxl1_POS_D7 | Mouse |
| GSM715864 - GSM715866 | LPC_Foxl1_POS_D14 | Mouse |
| GSM715867 - GSM715870 | LPC_Foxl1_undiff | Mouse |
| GSM721145 - GSM721149 | LPC_MIC_POS_CTL | Mouse |
| GSM721153 - GSM721156 | LPC_MIC_POS_DDC | Mouse |
| GSM1047599, GSM1047603 | LPC_LGR5 | Mouse |
| GSM1244523, GSM1244525 | LPC_HNF1b_POS_CDE | Mouse |
| GSM1244526 - GSM1244528 | LPC_HNF1b_POS_DDC | Mouse |
| GSM1244529, GSM1244530 | LPC_HNF1b_POS_CTL | Mouse |
| GSM709348 - GSM709354 | Healthy Livers for ASH | Human |
| GSM709355 - GSM709369 | ASH | Human |
| GSM1974233, GSM1974234 | Primary hepatocytes | Human |
| GSM1974235, GSM1974236 | Fetal hepatocytes | Human |
| GSM1627740 - GSM1627773 | Healthy livers for NASH | Human |
| GSM1627805, GSM1627806 | Definite NASH | Human |
| GSM155919, GSM155926 - GSM155928, GSM155947, GSM155948, GSM155961, GSM155964, GSM155988, GSM155989 | Healthy livers as control for Cirrhotic livers | Human |
| GSM155920 - GSM155923, GSM155931, GSM155951, GSM155952, GSM155965 - GSM155969, GSM155984 | Cirrhotic livers | Human |
| GSM2787428, GSM2787427 | Human iPSC | Human |
| GSM2787426, GSM2787425 | Cultured iPSC-LPC | Human |
| GSM2787422, GSM2787421 | Fresh iPSC-LPC | Human |
| GSM456340 - GSM456342 | HepaRG_diff | Human |
| GSM456343 - GSM456345 | HepaRG_undiff | Human |
| GSM456349 - GSM456351 | Primary hepatocytes Control for HepaRG | Human |
Figure 1Workflow and clustering of BEC transcriptome data sets. (A) Schematic overview of the workflow to merge microarray data from different platforms. Colored bars represent boxplots of the expression of all genes from one sample. Three colors are an example of three different datasets with multiple samples. (B,C) T-distributed stochastic neighbor embedding (t-SNE) and Pearson correlation analysis of mouse transcriptomic data from BEC cell lines, primary BECs and healthy livers. (D) Relative gene expression analysis (fold change, log2) of common BEC markers in primary human and mouse BECs compared to livers.
Figure 2Gene expression of popularly used BEC markers for isolation in primary human and mouse BECs. Gene expression of mouse and human livers (grey), quiescent (orange) and activated (blue) BECs. Left axis represents normalized intensity values (log2) for all mouse gene expression and right axis normalized transcript per million (log2) for human data.
Figure 3Pathway analysis of BEC populations. Gene set enrichment analysis of Hallmark pathways on transcriptomic profiles of BECs compared to healthy (mouse) or injured (human) livers. Gradient of the blue color represents positively enriched pathway (False discovery rate, FDR).
Figure 4Selection of genes enriched in BECs. Heatmap of genes enriched in BECs compared to healthy livers (A and C) or different cell types (B and D). For A only genes enriched in BECs isolated from healthy mouse livers were used while for C only genes enriched in BECs isolated from either choline-deficient, ethionine-supplemented (CDE)- or 3,5-diethoxycarbonyl-1,4-dihydrocollidine (DDC)-treated mice.
Figure 5Generation of BEC gene signatures. (A) Venn diagrams of the genes enriched in BECs compared to healthy livers or different cell types and both merged gene sets compared to genes enriched in human BECs. Gene ontology (GO) analysis of quiescent (B) and activated (C) BEC gene signature. The size of the circle is correlated to the number of genes involved in that GO. The color of the circle represents significance (adjusted p-value) and the x-axis stands for gene ratio (genes within signature versus total number of genes in GO). (D) Venn diagram of genes enriched in quiescent and activated BECs (microarray/RNAseq) and in BECs from scRNAseq of human livers.
List of human and mouse genes in quiescent and activated BECs.
| HUMAN | MOUSE | HUMAN/MOUSE | ||||
|---|---|---|---|---|---|---|
| Gene Name | Gene Description | Entrez ID | Ensemble ID | Gene Name | Ensemble ID | LPC expression |
| AKAP7 | A-kinase anchoring protein 7 | 9465 | ENSG00000118507 | Akap7 | ENSMUSG00000039166 | Activated |
| ALDH1A2 | aldehyde dehydrogenase 1 family member A2 | 8854 | ENSG00000128918 | Aldh1a2 | ENSMUSG00000013584 | Quiescent |
| ANKRD1 | ankyrin repeat domain 1 | 27063 | ENSG00000148677 | Ankrd1 | ENSMUSG00000024803 | Activated |
| ANKRD42 | ankyrin repeat domain 42 | 338699 | ENSG00000137494 | Ankrd42 | ENSMUSG00000041343 | Activated |
| ARL14 | ADP ribosylation factor like GTPase 14 | 80117 | ENSG00000179674 | Arl14 | ENSMUSG00000098207 | Quiescent and Activated |
| ATF3 | activating transcription factor 3 | 467 | ENSG00000162772 | Atf3 | ENSMUSG00000026628 | Quiescent |
| B4GALT5 | beta-1,4-galactosyltransferase 5 | 9334 | ENSG00000158470 | B4galt5 | ENSMUSG00000017929 | Activated |
| BICC1 | BicC family RNA binding protein 1 | 80114 | ENSG00000122870 | Bicc1 | ENSMUSG00000014329 | Quiescent and Activated |
| C1orf116 | chromosome 1 open reading frame 116 | 79098 | ENSG00000182795 | AA986860 | ENSMUSG00000042510 | Quiescent and Activated |
| CDH1 | cadherin 1 | 999 | ENSG00000039068 | Cdh1 | ENSMUSG00000000303 | Activated |
| CDH6 | cadherin 6 | 1004 | ENSG00000113361 | Cdh6 | ENSMUSG00000039385 | Activated |
| CREB5 | cAMP responsive element binding protein 5 | 9586 | ENSG00000146592 | Creb5 | ENSMUSG00000053007 | Activated |
| CRYAB | crystallin alpha B | 1410 | ENSG00000109846 | Cryab | ENSMUSG00000032060 | Quiescent and Activated |
| CTGF | connective tissue growth factor | 1490 | ENSG00000118523 | Ctgf | ENSMUSG00000019997 | Quiescent and Activated |
| CYR61 | cysteine rich angiogenic inducer 61 | 3491 | ENSG00000142871 | Cyr61 | ENSMUSG00000028195 | Quiescent and Activated |
| DCDC2 | doublecortin domain containing 2 | 51473 | ENSG00000146038 | Dcdc2a | ENSMUSG00000035910 | Quiescent and Activated |
| DDR1 | discoidin domain receptor tyrosine kinase 1 | 780 | ENSG00000223680 | Ddr1 | ENSMUSG00000003534 | Activated |
| DSP | desmoplakin | 1832 | ENSG00000096696 | Dsp | ENSMUSG00000054889 | Activated |
| EGR2 | early growth response 2 | 1959 | ENSG00000122877 | Egr2 | ENSMUSG00000037868 | Quiescent |
| EHF | ETS homologous factor | 26298 | ENSG00000135373 | Ehf | ENSMUSG00000012350 | Quiescent and Activated |
| ELOVL7 | ELOVL fatty acid elongase 7 | 79993 | ENSG00000164181 | Elovl7 | ENSMUSG00000021696 | Activated |
| ENC1 | ectodermal-neural cortex 1 | 8507 | ENSG00000171617 | Enc1 | ENSMUSG00000041773 | Activated |
| ENTPD2 | ectonucleoside triphosphate diphosphohydrolase 2 | 954 | ENSG00000054179 | Entpd2 | ENSMUSG00000015085 | Quiescent |
| EPCAM | epithelial cell adhesion molecule | 4072 | ENSG00000119888 | Epcam | ENSMUSG00000045394 | Quiescent and Activated |
| FBRS | fibrosin | 64319 | ENSG00000156860 | Fbrs | ENSMUSG00000042423 | Activated |
| FLRT3 | fibronectin leucine rich transmembrane protein 3 | 23767 | ENSG00000125848 | Flrt3 | ENSMUSG00000051379 | Quiescent and Activated |
| FOSB | FosB proto-oncogene, AP-1 transcription factor subunit | 2354 | ENSG00000125740 | Fosb | ENSMUSG00000003545 | Quiescent and Activated |
| FOXJ1 | forkhead box J1 | 2302 | ENSG00000129654 | Foxj1 | ENSMUSG00000034227 | Activated |
| FRAS1 | Fraser extracellular matrix complex subunit 1 | 80144 | ENSG00000138759 | Fras1 | ENSMUSG00000034687 | Quiescent |
| GADD45B | growth arrest and DNA damage inducible beta | 4616 | ENSG00000099860 | Gadd45b | ENSMUSG00000015312 | Activated |
| GLIS2 | GLIS family zinc finger 2 | 84662 | ENSG00000274636 | Glis2 | ENSMUSG00000014303 | Quiescent and Activated |
| GLIS3 | GLIS family zinc finger 3 | 169792 | ENSG00000107249 | Glis3 | ENSMUSG00000052942 | Quiescent and Activated |
| GOLGB1 | golgin B1 | 2804 | ENSG00000173230 | Golgb1 | ENSMUSG00000034243 | Quiescent |
| HBEGF | heparin binding EGF like growth factor | 1839 | ENSG00000113070 | Hbegf | ENSMUSG00000024486 | Quiescent and Activated |
| HSPA1A | heat shock protein family A (Hsp70) member 1A | 3303 | ENSG00000237724 | Hspa1a | ENSMUSG00000091971 | Quiescent |
| HSPA1B | heat shock protein family A (Hsp70) member 1B | 3304 | ENSG00000224501 | Hspa1b | ENSMUSG00000090877 | Quiescent |
| ITGB4 | integrin subunit beta 4 | 3691 | ENSG00000132470 | Itgb4 | ENSMUSG00000020758 | Activated |
| ITGB8 | integrin subunit beta 8 | 3696 | ENSG00000105855 | Itgb8 | ENSMUSG00000025321 | Quiescent and Activated |
| JUNB | JunB proto-oncogene, AP-1 transcription factor subunit | 3726 | ENSG00000171223 | Junb | ENSMUSG00000052837 | Activated |
| JUND | JunD proto-oncogene, AP-1 transcription factor subunit | 3727 | ENSG00000130522 | Jund | ENSMUSG00000071076 | Quiescent and Activated |
| KIAA1324 | KIAA1324 | 57535 | ENSG00000116299 | 5330417C22Rik | ENSMUSG00000040412 | Quiescent and Activated |
| KLF5 | Kruppel like factor 5 | 688 | ENSG00000102554 | Klf5 | ENSMUSG00000005148 | Quiescent and Activated |
| KRT17 | keratin 17 | 3872 | ENSG00000128422 | Krt17 | ENSMUSG00000035557 | Activated |
| KRT19 | keratin 19 | 3880 | ENSG00000171345 | Krt19 | ENSMUSG00000020911 | Quiescent and Activated |
| KRT7 | keratin 7 | 3855 | ENSG00000135480 | Krt7 | ENSMUSG00000023039 | Quiescent and Activated |
| LAMB2 | laminin subunit beta 2 | 3913 | ENSG00000172037 | Lamb2 | ENSMUSG00000052911 | Quiescent |
| LAMC2 | laminin subunit gamma 2 | 3918 | ENSG00000058085 | Lamc2 | ENSMUSG00000026479 | Quiescent and Activated |
| LRRC49 | leucine rich repeat containing 49 | 54839 | ENSG00000137821 | Lrrc49 | ENSMUSG00000047766 | Activated |
| MACC1 | MET transcriptional regulator MACC1 | 346389 | ENSG00000183742 | Macc1 | ENSMUSG00000041886 | Quiescent and Activated |
| MAGI1 | membrane associated guanylate kinase, WW and PDZ domain containing 1 | 9223 | ENSG00000151276 | Magi1 | ENSMUSG00000045095 | Quiescent and Activated |
| MYO5C | myosin VC | 55930 | ENSG00000128833 | Myo5c | ENSMUSG00000033590 | Activated |
| MYO6 | myosin VI | 4646 | ENSG00000196586 | Myo6 | ENSMUSG00000033577 | Activated |
| NFAT5 | nuclear factor of activated T cells 5 | 10725 | ENSG00000102908 | Nfat5 | ENSMUSG00000003847 | Activated |
| NFE2L3 | nuclear factor, erythroid 2 like 3 | 9603 | ENSG00000050344 | Nfe2l3 | ENSMUSG00000029832 | Quiescent |
| NFKBIE | NFKB inhibitor epsilon | 4794 | ENSG00000146232 | Nfkbie | ENSMUSG00000023947 | Activated |
| NOTCH2 | notch 2 | 4853 | ENSG00000134250 | Notch2 | ENSMUSG00000027878 | Quiescent and Activated |
| NSD1 | nuclear receptor binding SET domain protein 1 | 64324 | ENSG00000165671 | Nsd1 | ENSMUSG00000021488 | Quiescent and Activated |
| PEG10 | paternally expressed 10 | 23089 | ENSG00000242265 | Peg10 | ENSMUSG00000092035 | Activated |
| POGZ | pogo transposable element derived with ZNF domain | 23126 | ENSG00000143442 | Pogz | ENSMUSG00000038902 | Activated |
| PPP1R9A | protein phosphatase 1 regulatory subunit 9A | 55607 | ENSG00000158528 | Ppp1r9a | ENSMUSG00000032827 | Activated |
| RAI2 | retinoic acid induced 2 | 10742 | ENSG00000131831 | Rai2 | ENSMUSG00000043518 | Quiescent and Activated |
| RASSF9 | Ras association domain family member 9 | 9182 | ENSG00000198774 | Rassf9 | ENSMUSG00000044921 | Quiescent and Activated |
| RBM25 | RNA binding motif protein 25 | 58517 | ENSG00000119707 | Rbm25 | ENSMUSG00000010608 | Activated |
| RIPK4 | receptor interacting serine/threonine kinase 4 | 54101 | ENSG00000183421 | Ripk4 | ENSMUSG00000005251 | Activated |
| S100A6 | S100 calcium binding protein A6 | 6277 | ENSG00000197956 | S100a6 | ENSMUSG00000001025 | Quiescent and Activated |
| SERPINH1 | serpin family H member 1 | 871 | ENSG00000149257 | Serpinh1 | ENSMUSG00000070436 | Quiescent |
| SF1 | splicing factor 1 | 7536 | ENSG00000168066 | Sf1 | ENSMUSG00000024949 | Activated |
| SHROOM3 | shroom family member 3 | 57619 | ENSG00000138771 | Shroom3 | ENSMUSG00000029381 | Quiescent and Activated |
| SLC5A1 | solute carrier family 5 member 1 | 6523 | ENSG00000100170 | Slc5a1 | ENSMUSG00000011034 | Quiescent and Activated |
| SLC6A6 | solute carrier family 6 member 6 | 6533 | ENSG00000131389 | Slc6a6 | ENSMUSG00000030096 | Activated |
| SLC7A1 | solute carrier family 7 member 1 | 6541 | ENSG00000139514 | Slc7a1 | ENSMUSG00000041313 | Activated |
| SLCO3A1 | solute carrier organic anion transporter family member 3A1 | 28232 | ENSG00000176463 | Slco3a1 | ENSMUSG00000025790 | Activated |
| SNRNP200 | small nuclear ribonucleoprotein U5 subunit 200 | 23020 | ENSG00000144028 | Snrnp200 | ENSMUSG00000003660 | Activated |
| SNRNP70 | small nuclear ribonucleoprotein U1 subunit 70 | 6625 | ENSG00000104852 | Snrnp70 | ENSMUSG00000063511 | Quiescent and Activated |
| SOX9 | SRY-box 9 | 6662 | ENSG00000125398 | Sox9 | ENSMUSG00000000567 | Quiescent and Activated |
| SPHK1 | sphingosine kinase 1 | 8877 | ENSG00000176170 | Sphk1 | ENSMUSG00000061878 | Activated |
| SPINT1 | serine peptidase inhibitor, Kunitz type 1 | 6692 | ENSG00000166145 | Spint1 | ENSMUSG00000027315 | Quiescent and Activated |
| SREBF2 | sterol regulatory element binding transcription factor 2 | 6721 | ENSG00000198911 | Srebf2 | ENSMUSG00000022463 | Activated |
| STK35 | serine/threonine kinase 35 | 140901 | ENSG00000125834 | Stk35 | ENSMUSG00000037885 | Activated |
| SYNJ2 | synaptojanin 2 | 8871 | ENSG00000078269 | Synj2 | ENSMUSG00000023805 | Quiescent and Activated |
| TACSTD2 | tumor associated calcium signal transducer 2 | 4070 | ENSG00000184292 | Tacstd2 | ENSMUSG00000051397 | Activated |
| TCF20 | transcription factor 20 | 6942 | ENSG00000100207 | Tcf20 | ENSMUSG00000041852 | Activated |
| TGFB2 | transforming growth factor beta 2 | 7042 | ENSG00000092969 | Tgfb2 | ENSMUSG00000039239 | Quiescent and Activated |
| THSD4 | thrombospondin type 1 domain containing 4 | 79875 | ENSG00000187720 | Thsd4 | ENSMUSG00000032289 | Activated |
| TNFRSF19 | TNF receptor superfamily member 19 | 55504 | ENSG00000127863 | Tnfrsf19 | ENSMUSG00000060548 | Activated |
| TUBB2B | tubulin beta 2B class IIb | 347733 | ENSG00000137285 | Tubb2b | ENSMUSG00000045136 | Activated |
| UBAP2L | ubiquitin associated protein 2 like | 9898 | ENSG00000143569 | Ubap2l | ENSMUSG00000042520 | Quiescent and Activated |
| UGT8 | UDP glycosyltransferase 8 | 7368 | ENSG00000174607 | Ugt8a | ENSMUSG00000032854 | Quiescent and Activated |
| UNC119B | unc-119 lipid binding chaperone B | 84747 | ENSG00000175970 | Unc119b | ENSMUSG00000046562 | Activated |
| VTCN1 | V-set domain containing T cell activation inhibitor 1 | 79679 | ENSG00000134258 | Vtcn1 | ENSMUSG00000051076 | Activated |
| WFDC2 | WAP four-disulfide core domain 2 | 10406 | ENSG00000101443 | Wfdc2 | ENSMUSG00000017723 | Activated |
| WWC1 | WW and C2 domain containing 1 | 23286 | ENSG00000113645 | Wwc1 | ENSMUSG00000018849 | Quiescent and Activated |
| ZFP36 | ZFP36 ring finger protein | 7538 | ENSG00000128016 | Zfp36 | ENSMUSG00000044786 | Activated |
| ZFP36L1 | ZFP36 ring finger protein like 1 | 677 | ENSG00000185650 | Zfp36l1 | ENSMUSG00000021127 | Activated |
Figure 6BEC signatures to identify BECs. (A and B) Visual representation (chord diagram) of gene set enrichment analysis with our quiescent and activated BEC signatures, Sancho-Bru signature (2012) [44] and BEC cell line signature of Passman et al. 2016 [30] on BEC-like cells and liver diseases (AC: alcoholic cirrhosis, AH: alcoholic hepatitis, NASH: non-alcoholic steatohepatitis, PBC: peribiliary cirrhosis, PSC: primary sclerosis cholangitis). The size of the arrow presents the positive enrichment (significance, -log p-value).
Figure 7Validation of scBEC signature. TSNE plots of human (A) and mouse (B) liver cells from respectively Aizarani et al. 2019 [28] and Pepe-Mooney et al. 2019 [27], with expression of every single gene of the scBEC signature. Blue scale bars represent normalized counts.