| Literature DB >> 25691825 |
Katarzyna Tomczak1, Patrycja Czerwińska1, Maciej Wiznerowicz2.
Abstract
The Cancer Genome Atlas (TCGA) is a public funded project that aims to catalogue and discover major cancer-causing genomic alterations to create a comprehensive "atlas" of cancer genomic profiles. So far, TCGA researchers have analysed large cohorts of over 30 human tumours through large-scale genome sequencing and integrated multi-dimensional analyses. Studies of individual cancer types, as well as comprehensive pan-cancer analyses have extended current knowledge of tumorigenesis. A major goal of the project was to provide publicly available datasets to help improve diagnostic methods, treatment standards, and finally to prevent cancer. This review discusses the current status of TCGA Research Network structure, purpose, and achievements.Entities:
Keywords: The Cancer Genome Atlas (TCGA); big data analysis; cancer genomics
Year: 2015 PMID: 25691825 PMCID: PMC4322527 DOI: 10.5114/wo.2014.47136
Source DB: PubMed Journal: Contemp Oncol (Pozn) ISSN: 1428-2526
The Cancer Genome Atlas (TCGA) organisation centres. Based on [7]
| Centre Name | Centre Description | Localisation |
|---|---|---|
| Tissue Source Sites (TSSs) | Collection of the samples (blood and tissue from tumour and normal controls) and clinical metadata from patients (donors) | |
| Biospecimen Core Resource (BCR) | Coordination of sample delivery and data collection, cataloguing, processing, and verifying the quality and quantity | Research Institute at Nationwide Children's Hospital in Columbus, Ohio |
| Genome Sequencing Centers (GSCS) | High-throughput sequencing (data are available in TCGA Data Portal or at NIH's database of Genotype and Phenotype) | Broad Institute Sequencing Platform in Cambridge |
| Cancer Genome Characterisation Centers (GCCs) | Utilisation of novel technologies and multiple platforms | Copy Number Alteration (Brigham and Women's Hospital and Harvard Medical School in Boston, The Broad Institute in Cambridge) |
| Proteome Characterization Centres (PCCs) | Identification of cancer-specific proteins | Cancer Proteomic Center |
| Data Coordinating Center (DCC) | Management of all generated data and transfer them to public databases (TCGA Data Portal and Cancer Genomics Hub) | |
| Cancer Genomics Hub (CGHub) | Storage, catalogue, and access to lower levels of cancer genome sequences and alignments | University of California Santa Cruz |
| Genome Data Analysis Centers (GDACs) | Development of novel informatics tools to facilitate with processing and integrating data analyses across the entire genome | Broad Institute, Cambridge, Massachusetts |
Fig. 1The Cancer Genome Atlas (TCGA) Research Network Centres flowchart. Based on [6]
The TCGA structure involves several cooperating centres for processing the samples and managing all the obtained datasets. First, different Tissue Source Sites (TSSs) collect clinical metadata and biospecimens from eligible cancer patients. After preliminary pathology review, TSSs deliver biospecimens and metadata to the Biospecimen Core Resource (BCR), where they are approved. Next, the BCR catalogues and submits metadata to the Data Coordinating Centre (DCC), as well as processing and verifying the quality and quantity of isolated molecular analytes, which are further provided to Genome Characterisation Centres (GCCs) and Genome Sequencing Centres (GSCs) for further genomic characterisation and high-throughput sequencing. Then, sequence-related data are deposited in DCC. The GSCs also submit trace files, sequences, and alignment mappings to NCI's Cancer Genomics Hub (CGHub) secure repository. Generated genomic data submitted to the DCC and CGHub are made available to the research community and Genome Data Analysis Centres (GDACs). The GDACs provide new information-processing, analysis, and visualisation tools to the entire research community. Furthermore, the information generated by the TCGA Research Network is centrally managed at the DCC and entered into public free-access databases (TCGA Portal, NCBI's Trace Archive, CGHub), allowing scientists to continually access the cancer datasets