| Literature DB >> 22369103 |
Elisa Donnard1, Adriano Barbosa-Silva, Rafael L M Guedes, Gabriel R Fernandes, Henrique Velloso, Matthew J Kohn, Miguel A Andrade-Navarro, J Miguel Ortega.
Abstract
BACKGROUND: The integration of sequencing and gene interaction data and subsequent generation of pathways and networks contained in databases such as KEGG Pathway is essential for the comprehension of complex biological processes. We noticed the absence of a chart or pathway describing the well-studied preimplantation development stages; furthermore, not all genes involved in the process have entries in KEGG Orthology, important information for knowledge application with relation to other organisms.Entities:
Mesh:
Year: 2011 PMID: 22369103 PMCID: PMC3287586 DOI: 10.1186/1471-2164-12-S4-S3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Biointeraction extraction from PESCADOR. Top: Sample abstract tagged by PESCADOR. Gene or protein names (terms) recognized are highlighted in violet and the biointeraction words in yellow. The platform allows users to search for their interactions of interest by terms, abstracts or concepts of interest added initially by the user. Bottom: Manual curation of the information presented in the abstract and its graphical representation in the form of a regulatory pathway.
Figure 2Preimplantation development pathway. The figure shows a pathway representation of the genes involved in the regulation of the preimplantation development and interactions between them. Some functions are also detailed in the grey rectangles. In the upper part of the figure are located genes involved in the early stages of development (until blastocyst formation) below these, the left part corresponds to the regulations that occur in the inner cell mass, the portion of cells that remains undifferentiated during a longer period of time, part of these cells will give rise to the primitive endoderm and the genes that regulate this process are shown in the bottom left. In the right are the genes involved in the development of the outer cells of the blastocyst, which differentiate to form the trophectoderm. The interactions are described in the text. KEGG Markup Language was used for pathway representation. The developmental stages figures were adapted from Yamanaka et al. 2006 [14]. The pathway genes are represented according to their ancestry based on the determination of their Last Common Ancestor. Genes considered recent are shown in green while genes of more ancient origin are shown in lilac. Genes that present an ortholog in D. melanogaster are marked (*). This will be further adressed in the text section “Pathway Ancestry”. DPPA1 and hRSCP are shown in grey due to the fact that the lack of corresponding SwissProt annotated gene product to be used as seed prevented their use in this analysis.
Figure 3Gene origin in human evolution. Distribution of the genes in the preimplantation pathway according to their origin in clades of the human lineage, based on the determination of the Last Common Ancestor for the ortholog clusters generated by SeedServer. The y-axis represents the number of genes and the x-axis represents the taxonomical groups in which the genes originated.
Figure 4Pathway construction flowchart. The initial step consists of a PubMed search with the subject of interest (e.g. preimplantation development). The list of PubMed identifiers (PMIDs) obtained in the search is then used in the web tool Medline Ranker as the background set along with a list of PMIDs of manually selected abstracts considered informative which form the test set. The tool generates a list of abstracts classified by order of relevance. Best 1000 abstracts are recovered and their corresponding PMID is then introduced in the PESCADOR platform. Abstracts are tagged by PESCADOR and provide a source of biointeractions for manual curation and pathway construction. UniProt IDs for products of the genes present in the final pathway are obtained and used as seed in SeedServer. The software recruits homologues for each gene and creates the final clusters. Taxonomy IDs from each cluster can be used for Last Common Ancestor (LCA) determination.