| Literature DB >> 30357379 |
Shawez Khan1,2,3, Federico Taverna1,3, Katerina Rohlenova1,3, Lucas Treps1,3, Vincent Geldhof1,3, Laura de Rooij1,3, Liliana Sokol1,3, Andreas Pircher1,3, Lena-Christin Conradi1,3, Joanna Kalucka1,3, Luc Schoonjans1,2,3, Guy Eelen1,3, Mieke Dewerchin1,3, Tobias Karakach1,3, Xuri Li2, Jermaine Goveia1,3, Peter Carmeliet1,2,3.
Abstract
Endothelial cells (ECs) line blood vessels, regulate homeostatic processes (blood flow, immune cell trafficking), but are also involved in many prevalent diseases. The increasing use of high-throughput technologies such as gene expression microarrays and (single cell) RNA sequencing generated a wealth of data on the molecular basis of EC (dys-)function. Extracting biological insight from these datasets is challenging for scientists who are not proficient in bioinformatics. To facilitate the re-use of publicly available EC transcriptomics data, we developed the endothelial database EndoDB, a web-accessible collection of expert curated, quality assured and pre-analyzed data collected from 360 datasets comprising a total of 4741 bulk and 5847 single cell endothelial transcriptomes from six different organisms. Unlike other added-value databases, EndoDB allows to easily retrieve and explore data of specific studies, determine under which conditions genes and pathways of interest are deregulated and assess reprogramming of metabolism via principal component analysis, differential gene expression analysis, gene set enrichment analysis, heatmaps and metabolic and transcription factor analysis, while single cell data are visualized as gene expression color-coded t-SNE plots. Plots and tables in EndoDB are customizable, downloadable and interactive. EndoDB is freely available at https://vibcancer.be/software-tools/endodb, and will be updated to include new studies.Entities:
Mesh:
Year: 2019 PMID: 30357379 PMCID: PMC6324065 DOI: 10.1093/nar/gky997
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schematic overview of EndoDB construction and functionalities.
List of species in the EndoDB
| Species | Colloquial species name | Number of datasets in EndoDB |
|---|---|---|
|
| Human | 270 |
|
| Mouse | 76 |
|
| Rat | 5 |
|
| Cow | 3 |
|
| Zebrafish | 3 |
|
| Pig | 3 |
Complete list of species included in the EndoDB, the number of datasets per species is indicated.
Top 10 most common EC types in the EndoDB
| Cell type | Number of datasets in EndoDB |
|---|---|
| Umbilical vein ECs | 142 |
| Aorta ECs | 29 |
| Dermal ECs | 23 |
| Lymphatic ECs | 21 |
| Coronary artery ECs | 17 |
| Pulmonary ECs | 16 |
| Brain ECs | 11 |
| Retinal ECs | 11 |
| Dermal lymphatic ECs | 10 |
| Blood outgrowth ECs | 9 |
Top 10 most common EC types in the EndoDB, the number of datasets per cell type is indicated.
Figure 2.Most common experimental conditions in the EndoDB. Relative representation of the 15 most common sample types in the EndoDB grouped by experimental conditions. Abbreviations—EC: endothelial cell; VEGF: vascular endothelial growth factor.
Figure 3.Study-centered data exploration. (A) PCA of normal hindbrain, Shh-medulloblastoma and Wnt-medulloblastoma ECs (study E-GEOD-73753). (B) Gene set enrichment analysis of normal hindbrain ECs versus Shh-medulloblastoma ECs. The upregulated gene sets are shown in red, the downregulated gene sets are shown in blue. (C) Differential analysis of normal hindbrain ECs versus Shh-medulloblastoma ECs shown in volcano plot; some highly deregulated proliferation-associated genes are indicated. (D) Gene expression heatmap of normal hindbrain ECs versus Shh-medulloblastoma and Wnt-medulloblastoma. The high-gene expression levels are shown in red, the low-expression levels in blue. (E) Expression of the indicated genes in normal hindbrain ECs and Shh-medulloblastoma ECs. (F) Metabolic gene set enrichment analysis of normal hindbrain ECs versus Shh-medulloblastoma ECs. The upregulated gene sets are shown in red, the downregulated gene sets are shown in blue. (G and H) Differential analysis of normal hindbrain ECs versus Shh-medulloblastoma ECs for the subset of metabolic genes (G) and transcription factors (H) shown as a volcano plot. Abbreviations—Ccna2: cyclin A2; Mki67: marker of proliferation ki67; NES: normalized enrichment score; Rrm2: ribonucleotide reductase regulatory subunit M2; Shh: Sonic Hedgehog; TEC: tumor endothelial cell; Top2a: topoisomerase 2 alpha; Uck2: uridine-cytidine kinase 2.
Figure 4.Gene and pathway-centered data exploration, example of Mki67 gene search. (A) Dot plot showing log fold change of Mki67 gene expression in all sample comparisons in EndoDB datasets. The dot indicated by the green arrow head in the enlarged circle represents the pair-wise comparison selected for further analysis (data from study E-GEOD-14375). (B) Expression of the Mki67 gene in normal hindbrain ECs and endothelioma ECs. (C) Differential analysis of normal brain ECs versus endothelioma ECs for all genes shown in volcano plot. Mki67 and Top2a genes are indicated. (D and E) Gene set enrichment analysis of normal brain ECs versus endothelioma ECs for all genes (D) and for the subset of detected metabolic genes (E). The upregulated gene sets are shown in red, the downregulated gene sets are shown in blue. Abbreviations—Mki67: marker of proliferation ki67; NES: normalized enrichment score; Top2a: topoisomerase 2 alpha.
Figure 5.Single cell data exploration. t-SNE plots of COLO205 TECs color-coded for the expression of the indicated proliferation-related genes (GSE110501). Abbreviations—Mki67: marker of proliferation ki67; Rrm2: ribonucleotide reductase regulatory subunit M2; Top2a: topoisomerase 2 alpha.
Top 10 most common tissue types in the EndoDB
| Organ | Number of datasets in EndoDB |
|---|---|
| Umbilical cord | 149 |
| Skin | 42 |
| Aorta | 28 |
| Lung | 28 |
| Heart | 25 |
| Brain | 22 |
| Liver | 16 |
| Eye | 15 |
| Blood | 13 |
| Lymphatic system | 13 |
Top 10 most common tissue types in the EndoDB, the number of datasets per tissue type is indicated.