| Literature DB >> 29092930 |
Shane Wilson1, Michael Fitzsimons2, Martin Ferguson3, Allison Heath2, Mark Jensen3, Josh Miller2, Mark W Murphy2, James Porter2, Himanso Sahni3, Louis Staudt4, Yajing Tang2, Zhining Wang4, Christine Yu1, Junjun Zhang1, Vincent Ferretti1, Robert L Grossman5.
Abstract
The NCI Genomic Data Commons (GDC) was launched in 2016 and makes available over 4 petabytes (PB) of cancer genomic and associated clinical data to the research community. This dataset continues to grow and currently includes over 14,500 patients. The GDC is an example of a biomedical data commons, which collocates biomedical data with storage and computing infrastructure and commonly used web services, software applications, and tools to create a secure, interoperable, and extensible resource for researchers. The GDC is (i) a data repository for downloading data that have been submitted to it, and also a system that (ii) applies a common set of bioinformatics pipelines to submitted data; (iii) reanalyzes existing data when new pipelines are developed; and (iv) allows users to build their own applications and systems that interoperate with the GDC using the GDC Application Programming Interface (API). We describe the GDC API and how it has been used both by the GDC itself and by third parties. Cancer Res; 77(21); e15-18. ©2017 AACR. ©2017 American Association for Cancer Research.Entities:
Mesh:
Year: 2017 PMID: 29092930 PMCID: PMC5683428 DOI: 10.1158/0008-5472.CAN-17-0598
Source DB: PubMed Journal: Cancer Res ISSN: 0008-5472 Impact factor: 12.701