| Literature DB >> 36035123 |
Xinchang Zheng1,2, Wenting Zong1,2,3, Zhaohua Li1,2,3, Yingke Ma1,2, Yanling Sun1,2, Zhuang Xiong1,2,3, Song Wu1,2,3, Fei Yang1,2,3, Wei Zhao1,2,3, Congfan Bu1,2, Zhenglin Du1,2, Jingfa Xiao1,2,3, Yiming Bao1,2,3.
Abstract
Due to the explosion of cancer genome data and the urgent needs for cancer treatment, it is becoming increasingly important and necessary to easily and timely analyze and annotate cancer genomes. However, tumor heterogeneity is recognized as a serious barrier to annotate cancer genomes at the individual patient level. In addition, the interpretation and analysis of cancer multi-omics data rely heavily on existing database resources that are often located in different data centers or research institutions, which poses a huge challenge for data parsing. Here we present CCAS (Cancer genome Consensus Annotation System, https://ngdc.cncb.ac.cn/ccas/#/home), a one-stop and comprehensive annotation system for the individual patient at multi-omics level. CCAS integrates 20 widely recognized resources in the field to support data annotation of 10 categories of cancers covering 395 subtypes. Data from each resource are manually curated and standardized by using ontology frameworks. CCAS accepts data on single nucleotide variant/insertion or deletion, expression, copy number variation, and methylation level as input files to build a consensus annotation. Outputs are arranged in the forms of tables or figures and can be searched, sorted, and downloaded. Expanded panels with additional information are used for conciseness, and most figures are interactive to show additional information. Moreover, CCAS offers multidimensional annotation information, including mutation signature pattern, gene set enrichment analysis, pathways and clinical trial related information. These are helpful for intuitively understanding the molecular mechanisms of tumors and discovering key functional genes.Entities:
Keywords: comprehensive annotation; databases integration; individual cancer patient; multi-omics; web server
Year: 2022 PMID: 36035123 PMCID: PMC9403316 DOI: 10.3389/fgene.2022.956781
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1The workflow of CCAS can be divided into three modules: Submission, Pre-processing and Annotation, and Interpretation. After the user submits data to CCAS, CCAS first converts the format of the files. SNV/Indels data will be converted to VCF format and other data types will be converted to “Gene ID \t Value” format. CCAS will then annotate the patient data using the integrated data sources at multiple levels. Mutation Signature and ssGSEA calculations are also performed. The annotation results are stored in sqlite3 database (a single file database) and json file. CCAS has built user-friendly interface to help users navigate and interpret the annotation results, enabling efficient identification of key functional genes at the individual patient level.
FIGURE 2The workflow of submitting data and checking job progress in CCAS. (A) Submitting portal at home page. CCAS receives data at multi-omics level including SNV/Indels (required), Expression, Copy Number Variation (CNV), and Methylation along with, job title, notification email, the Disease Ontology ID, and reference version. (B) Check results page in CCAS. On this page the user can check the progress of the job. (C) Notification emails sent to users at the start of a job and at the end of a job.
FIGURE 3Overview of the annotation results. The annotation results of CCAS consists of patient level annotation and gene level annotation. Gene detailed pages can be viewed by clicking “View” button at the end of each record in the gene annotation table.