| Literature DB >> 31164042 |
Bosu Hu1, Lei Zheng1, Chunshen Long1, Mingmin Song1, Tao Li2, Lei Yang3, Yongchun Zuo1.
Abstract
Understanding early development offers a striking opportunity to investigate genetic disease, stem cell and assisted reproductive technology. Recent advances in high-throughput sequencing technology have led to the rising influx of omics data, which have rapidly boosted our understanding of mammalian developmental mechanisms. Here, we review the database EmExplorer (a database for exploring time activation of gene expression in mammalian embryos), which systematically organizes the genes from development-related pathways, and which we have already established and continue to update it. The current version of EmExplorer incorporates over 26 000 genes obtained from 306 functional pathways in five species. The function annotations of development-related genes were also integrated into EmExplorer. To facilitate data extraction, the database also contains the following information. (i) The dynamic expression values for each development stage are matched to the corresponding genes. (ii) A two-layer search tool which supports multi-option searching, such as by official symbol, pathway name and function annotation. The returned entries can directly link to the analysis results for the corresponding gene or pathway in the analysis module. (iii) The analysis module provides different gene comparisons at the multi-species level and functional pathway level, which shows the species specificity and stage specificity at the gene or pathway level. (iv) The analysis based on the hypergeometric distribution test reveals the enrichment of gene functions at a particular stage of one organism's pathway. (v) The browser is designed for users with ambiguous searching goals and greatly helps new users to get a general idea of the contents of the database. (vi) The experimentally validated pathways are manually curated and shown on the home page. EmExplorer will be helpful for elucidating early developmental mechanisms and exploring time activation genes. EmExplorer is freely available at http://bioinfor.imu.edu.cn/emexplorer .Entities:
Keywords: database; dynamic expression; pathway and function; preimplantation embryonic development; time activation
Mesh:
Year: 2019 PMID: 31164042 PMCID: PMC6597754 DOI: 10.1098/rsob.190054
Source DB: PubMed Journal: Open Biol ISSN: 2046-2441 Impact factor: 6.411
Figure 1.Overview of the establishment process and workflow of EmExplorer. EmExplorer integrates development-associated genes from public resources and sorts them into functional pathways. Users can input key words to the query engine, and the relevant information will be extracted from the database. The analysis tools enable users to make comparisons between genes on different levels. All search and analysis results will be helpful for further analysis.
EmExplorer data content and detailed statistics .
| species | pathway | gene | function annotation | gene with dynamic expression value | gene with function annotation |
|---|---|---|---|---|---|
| 306 | 5098 | 131 | 3663 | 399 | |
| 292 | 4290 | 14 | 3123 | 24 | |
| 302 | 5874 | 132 | 3333 | 356 | |
| 302 | 5565 | 106 | 3180 | 182 | |
| 302 | 6073 | 99 | 5253 | 250 | |
| total | 306 | 13 040 | 195 | 484 |
Figure 2.The analysis of the pluripotent stem cell (PSC) regulation pathway at the pathway level. (a) The experimentally validated information is provided on the home page. Each developmental stage contains several stage-specific functional pathways. We selected one functional pathway, which was confirmed by previous experiments and was the same as the results we obtained after data processing, at each stage. The current mainstream browsers can completely support the special effects of the home page. (b–f) The HEGs are defined as the group of genes whose expression values in a stage are higher than the median of the overall genes in a certain pathway. The median expression value of HEGs represents the overall level of HEGs. As the figure shows, the analysis result of the PSC regulation pathway for multiple species indicates that the common feature of these five organisms is that genes in the PSC pathway are rapidly raised at the eight-cell stage and decreased when it comes to the morula stage. The eight-cell phase may a key node for cellular pluripotency activation. Except for human and mouse, the temporal expression in the other three organisms reaches its lowest level at the oocyte stage. Such a difference between human and mouse and the other three species relates to the differences in biological functions performed in this stage.
The experimentally validated content in each developmental stage.
| stage | ||||||
|---|---|---|---|---|---|---|
| oocyte | pathway | 2 | 0 | 2 | 2 | 1 |
| gene | 18 | 0 | 22 | 11 | 11 | |
| function | 21 | 0 | 22 | 14 | 16 | |
| zygote | pathway | 10 | 4 | 9 | 9 | 8 |
| gene | 77 | 3 | 81 | 55 | 42 | |
| function | 48 | 6 | 52 | 46 | 42 | |
| 2-cell | pathway | 3 | 2 | 3 | 3 | 2 |
| gene | 51 | 6 | 34 | 20 | 18 | |
| function | 53 | 7 | 43 | 38 | 38 | |
| 4-cell | pathway | 15 | 6 | 15 | 15 | 15 |
| gene | 107 | 4 | 107 | 76 | 59 | |
| function | 6 | 3 | 56 | 40 | 54 | |
| 8-cell | pathway | 4 | 1 | 5 | 4 | 4 |
| gene | 38 | 1 | 43 | 30 | 23 | |
| function | 31 | 2 | 34 | 27 | 26 | |
| morula | pathway | 7 | 0 | 7 | 4 | 5 |
| gene | 11 | 0 | 11 | 5 | 6 | |
| function | 11 | 0 | 11 | 6 | 7 | |
| blastocyst | pathway | 7 | 3 | 7 | 7 | 7 |
| gene | 74 | 2 | 64 | 55 | 31 | |
| function | 47 | 4 | 46 | 39 | 35 | |
Figure 3.The basic operations in EmExplorer are shown above. Taking the significant gene POU5F1 as a case in point, the search browser results are presented in a table and clicking the corresponding linkage leads users to the detailed information website. Analysis tools enable users to make comparisons at the gene, organism, and pathway levels, and the results are intuitively visualized. Functional enrichment analysis shows the significance of biological functions at a certain stage. All these data can be downloaded for further analysis. The Statistics box shows the current content in EmExplorer.