| Literature DB >> 28794857 |
Erin K Wagner1, Satyajeet Raje2, Liz Amos2, Jessica Kurata3, Abhijit S Badve4, Yingquan Li5, Ben Busby6.
Abstract
Data sharing is critical to advance genomic research by reducing the demand to collect new data by reusing and combining existing data and by promoting reproducible research. The Cancer Genome Atlas (TCGA) is a popular resource for individual-level genotype-phenotype cancer related data. The Database of Genotypes and Phenotypes (dbGaP) contains many datasets similar to those in TCGA. We have created a software pipeline that will allow researchers to discover relevant genomic data from dbGaP, based on matching TCGA metadata. The resulting research provides an easy to use tool to connect these two data sources.Entities:
Keywords: GDC; SRA; TCGA; The Cancer Genome Atlas; cancer; database; dbGaP; genome
Year: 2017 PMID: 28794857 PMCID: PMC5538035 DOI: 10.12688/f1000research.9837.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. Module organization and a typical end-to-end workflow.
List of allowable values for Study Type, Primary Site and Disease in the Genomic Data Commons (The Cancer Genome Atlas) data.
The mapping between the Disease and Primary Site can be found in our GitHub repository.
| Study Type | Primary Site | Disease | ||
|---|---|---|---|---|
| Genotyping Array | Adrenal Gland | Pheochromocytoma and
| ||
| miRNA-Seq | Bile Duct | Adrenocortical Carcinoma | ||
| RNA-Seq | Bladder | Cholangiocarcinoma | ||
| WXS (Whole Exome Sequencing) | Blood | Bladder Urothelial Carcinoma | ||
| Bone | Acute Myeloid Leukemia | |||
| Brain | Osteosarcoma | |||
| Breast | Glioblastoma Multiforme | |||
| Cervix | Brain Lower Grade Glioma | |||
| Colorectal | Breast Invasive Carcinoma | |||
| Esophagus | Cervical Squamous Cell Carcinoma and
| |||
| Eye | Colon Adenocarcinoma | |||
| Head and Neck | Rectum Adenocarcinoma | |||
| Kidney | Esophageal Carcinoma | |||
| Liver | Uveal Melanoma | |||
| Lung | Head and Neck Squamous Cell
| |||
| Lymph Nodes | High-Risk Wilms Tumor | |||
| Nervous System | Kidney Renal Clear Cell Carcinoma | |||
| Ovary | Kidney Renal Papillary Cell Carcinoma | |||
| Pancreas | Kidney Chromophobe | |||
| Pleura | Rhabdoid Tumor | |||
| Prostate | Clear Cell Sarcoma of the Kidney | |||
| Skin | Liver Hepatocellular Carcinoma | |||
| Soft Tissue | Lung Adenocarcinoma | |||
| Stomach | Lung Squamous Cell Carcinoma | |||
| Testis | Lymphoid Neoplasm Diffuse Large
| |||
| Thymus | Neuroblastoma | |||
| Thyroid | Ovarian Serous Cystadenocarcinoma | |||
| Uterus | Pancreatic Adenocarcinoma | |||
| Mesothelioma | ||||
| Prostate Adenocarcinoma | ||||
| Skin Cutaneous Melanoma | ||||
| Sarcoma | ||||
| Stomach Adenocarcinoma | ||||
| Testicular Germ Cell Tumors | ||||
| Thymoma | ||||
| Thyroid Carcinoma | ||||
| Uterine Corpus Endometrial Carcinoma | ||||
| Uterine Carcinosarcoma |