| Literature DB >> 30143675 |
Asna Tungekar1,2, Sumana Mandarthi1,3, Pooja Rajendra Mandaviya1,2, Veerendra P Gadekar1,2,4, Ananthajith Tantry1,5, Sowmya Kotian1,2, Jyotshna Reddy1,2, Divya Prabha1, Sushma Bhat1,2, Sweta Sahay1, Roshan Mascarenhas1,2,6, Raghavendra Rao Badkillaya1,7, Manoj Kumar Nagasampige1,8, Mohan Yelnadu1,5,9,10, Harsh Pawar10, Prashantha Hebbar11,12, Manoj Kumar Kashyap13,14,15,16.
Abstract
Esophageal cancer (EC) is the eighth most aggressive malignancy and its treatment remains a challenge due to the lack of biomarkers that can facilitate early detection. EC is identified in two major histological forms namely - Adenocarcinoma (EAC) and Squamous cell carcinoma (ESCC), each showing differences in the incidence among populations that are geographically separated. Hence the detection of potential drug target and biomarkers demands a population-centric understanding of the molecular and cellular mechanisms of EC. To provide an adequate impetus to the biomarker discovery for ESCC, which is the most prevalent esophageal cancer worldwide, here we have developed ESCC ATLAS, a manually curated database that integrates genetic, epigenetic, transcriptomic, and proteomic ESCC-related genes from the published literature. It consists of 3475 genes associated to molecular signatures such as, altered transcription (2600), altered translation (560), contain copy number variation/structural variations (233), SNPs (102), altered DNA methylation (82), Histone modifications (16) and miRNA based regulation (261). We provide a user-friendly web interface ( http://www.esccatlas.org , freely accessible for academic, non-profit users) that facilitates the exploration and the analysis of genes among different populations. We anticipate it to be a valuable resource for the population specific investigation and biomarker discovery for ESCC.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30143675 PMCID: PMC6109081 DOI: 10.1038/s41598-018-30579-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schema for annotation of different types of molecular alterations in esophageal squamous cell carcinoma. The research articles published on esophageal squamous cell carcinoma (ESCC) are screened to filter the differentially expressed molecules at DNA, mRNA, miRNA and protein levels in ESCC tissues/cell lines as compared with their normal cell line or adjacent normal epithelia. The screened articles fulfilling the criteria described in the schema are manually curated to catalog the molecular alternations at DNA, mRNA, miRNA, and protein level. The information pertaining to ESCC and gene regulation status, the experimental approach used, analysis design, region of sample collection and along with the PubMed citation is provided for each molecule. The molecules are provided with external link to other database like OMIM, HPRD, HGNC and Ensembl to additional information about the molecule.
Figure 2Population specific ESCC incidence and ESCC ATLAS entries. The observance of ESCC incidence was higher among Iranian and South African population, but the number of investigations focusing on these populations are very low. Hence, there are very few known ESCC associated molecular signatures. Here, x-axis indicates the total number of published articles focusing on specific population (that were surveyed and recorded in ESCC ATLAS) and the y-axis indicates the age-standardized rate (ASR) of ESCC incidence available from GLOBOCAN 2012. The size of scatter plots represents the total number of recorded molecular signatures to be associated with ESCC from the surveyed article.
Figure 3Distribution of molecular signatures vs. number of curated articles or number of genes across different populations. It shows statistics of (A) distribution of molecular signature (transcriptomics, proteomics, structural variation, SNV, miRNA, methylation, and histone modification) vs. number of curated research articles in different populations (coded with different colors) and (B) distribution of molecular signature (trasncriptomics, proteomics, structural variation, SNV, miRNA, methylation, and histone modification) vs. number of genes observed or curated from different populations (coded with different colors).
Figure 4Distribution and overlapping of Transcriptomic and Proteomic data. Venn diagram representing distribution of (A) transcriptomic, and (B) proteomic evidences between different populations and indicating unique and common genes involved in etiology of ESCC between the different populations. We had only CLDN4 gene with Transcriptomic evidences for Korean population, the same gene also found in Indian population. However, for better representation, we did not plot Korean population in (A).
Figure 5Data analysis for population specific overrepresented GO terms. Data analysis for population specific overrepresented GO terms corresponding to (A) biological process, (B) molecular function and (C) cellular component. In the scatterplot the semantically similar GO terms remain close together in cluster and are labeled with the representative GO term with the highest enrichment score. The bubbles corresponding to common GO terms between different populations are adjusted in two-dimensional space by adding or subtracting 0.15 semantic space units. The bubble size indicates the frequency of GO term in the underlying GOA database.
Figure 6Protein-protein interactions. Hexagonal red nodes correspond to proteins that were added by PEPPER algorithm (expansions), the red shades of the hexagonal nodes correspond to the scores of relevance computed in the post-processing step. The purple square represents a protein of interest (bait, which is STAT3 in this case). The rounded green nodes represent the list of input proteins (preys). The green edges represent the initial interactions between the seeds (bait and preys). The red, dark and light blue edges are used to represent the interaction between the seeds and the expansions.
Figure 7(A) screenshot of the primary information page for OPN/Osteopontin (gene/protein in ESCCDb. The query, browse and results tabs/pages for Osteopontin protein are shown (B) The molecule page for OPN with the DNA, mRNA, miRNA and protein level alterations, level of regulation, experimental approach used, PubMed citation and external links to publicly available resources. The figure was created using software Adobe Illustrator CS5 Version 15.0.0.