| Literature DB >> 27114494 |
Rahul Agarwal1, Binayak Kumar1, Msk Jayadev1, Dhwani Raghav2, Ashutosh Singh3.
Abstract
Cancer of large intestine is commonly referred as colorectal cancer, which is also the third most frequently prevailing neoplasm across the globe. Though, much of work is being carried out to understand the mechanism of carcinogenesis and advancement of this disease but, fewer studies has been performed to collate the scattered information of alterations in tumorigenic cells like genes, mutations, expression changes, epigenetic alteration or post translation modification, genetic heterogeneity. Earlier findings were mostly focused on understanding etiology of colorectal carcinogenesis but less emphasis were given for the comprehensive review of the existing findings of individual studies which can provide better diagnostics based on the suggested markers in discrete studies.Colon Rectal Cancer Gene Database (CoReCG), contains 2056 colon-rectal cancer genes information involved in distinct colorectal cancer stages sourced from published literature with an effective knowledge based information retrieval system. Additionally, interactive web interface enriched with various browsing sections, augmented with advance search facility for querying the database is provided for user friendly browsing, online tools for sequence similarity searches and knowledge based schema ensures a researcher friendly information retrieval mechanism.Colorectal cancer gene database (CoReCG) is expected to be a single point source for identification of colorectal cancer-related genes, thereby helping with the improvement of classification, diagnosis and treatment of human cancers. DATABASE URL: lms.snu.edu.in/corecg.Entities:
Mesh:
Year: 2016 PMID: 27114494 PMCID: PMC4843536 DOI: 10.1093/database/baw059
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.The schema of CoReCG database. Figure explains the data collection steps used in CoReCG resourced from literature and various databases. It also showing methods for retrieval of information and databases linked to CoReCG.
Figure 2.The flowchart of CoReCG data collection. The figure shows the steps involved in collecting data for CoReCG which includes, preprocessing of data using related keywords against pubmed to fetch relevant articles, followed by the extraction of information from literature and annotation and submitting information in CoReCG, the whole process were manually curated and verified from cross references and updating in CoReCG.
Literature statistics of CoReCG
| Publication year | Number of papers |
|---|---|
| 1980–2000 | |
| 2001–2005 | |
| 2006–2010 | |
| 2011– |
Figure 3.Database structure of CoReCG. The detailed MySQL database structure which consists of 14 tables.
Figure 4.Web interface of CoReCG. (A) Search CoReCG using a keyword (Gene Symbol). (B) Query result obtained after keyword search. (C) Detailed information obtained after selecting a gene id. (D) Advance Search page of CoReCG. (E) Search CoReCG using a sequence.
Figure 5.(A) Major pathways shared by the genes. (B) Major domains shared by the genes. (C) Number of domains present in various major pathways.
CoReCG compared with other available cancer related databases
| Database name (Pubmed ID) | No. of genes | Common genes in database and CoReCG | Unique genes in database | Unique genes in CoReCG | |
|---|---|---|---|---|---|