| Literature DB >> 26699919 |
Min Zhao1, Yining Liu1, Tracy A O'Mara2.
Abstract
Endometrial cancer (EC) ranks as the sixth common cancer for women worldwide. To better distinguish cancer subtypes and identify effective early diagnostic biomarkers, we need improved understanding of the biological mechanisms associated with EC dysregulated genes. Although there is a wealth of clinical and molecular information relevant to EC in the literature, there has been no systematic summary of EC-implicated genes. In this study, we developed a literature-based database ECGene (Endometrial Cancer Gene database) with comprehensive annotations. ECGene features manual curation of 414 genes from thousands of publications, results from eight EC gene expression datasets, precomputation of coexpressed long noncoding RNAs, and an EC-implicated gene interactome. In the current release, we generated and comprehensively annotated a list of 458 EC-implicated genes. We found the top-ranked EC-implicated genes are frequently mutated in The Cancer Genome Atlas (TCGA) tumor samples. Furthermore, systematic analysis of coexpressed lncRNAs provided insight into the important roles of lncRNA in EC development. ECGene has a user-friendly Web interface and is freely available at http://ecgene.bioinfo-minzhao.org/. As the first literature-based online resource for EC, ECGene serves as a useful gateway for researchers to explore EC genetics.Entities:
Keywords: cancer genomics; database; endometrial cancer; long noncoding RNA; network analysis; systems biology
Mesh:
Substances:
Year: 2016 PMID: 26699919 PMCID: PMC5066700 DOI: 10.1002/humu.22950
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Web interface of ECGene database. A: Example of the general information provided for each endometrial cancer‐implicated gene. B: Example of gene‐supporting literature with keywords highlighted. C: Box plots of gene expression profiles in normal tissues. D: Gene expression data from endometrial cancer tissue expression studies. E: Coexpressed lncRNAs in TCGA endometrial cancer samples. F: Query interface for text search. G: The quick search interface for ECGene database. H: BLAST search interface for comparing query against all sequences in ECGene.
Figure 2The enriched gene ontology terms of the top 100 ranked EC‐implicated genes. The X‐axis is the logarithm of corrected P values. The Y‐axis represents the similarity between the enriched gene ontology terms.
Figure 3The mutational landscape for the top 99 ranked endometrial cancer genes in multiple cancers.
Figure 4Reconstructed endometrial cancer map using protein–protein interaction data. A: The 246 genes in orange are genes from the core dataset in ECGene. The remaining 44 genes in blue are linker genes that bridge the 246 genes. B: The degree distribution. C: The short path length frequency.