Lin Wei1, Zhilin Jin2, Shengjie Yang1, Yanxun Xu2, Yitan Zhu1, Yuan Ji1,3. 1. Program of Computational Genomics & Medicine, NorthShore University HealthSystem, Evanston, IL 60201, USA. 2. Department of Applied Mathematics & Statistics, Johns Hopkins University, Baltimore, MD 21218, USA. 3. Department of Public Health Sciences, University of Chicago, Chicago, IL 60637, USA.
Abstract
Motivation: The Cancer Genome Atlas (TCGA) program has produced huge amounts of cancer genomics data providing unprecedented opportunities for research. In 2014, we developed TCGA-Assembler, a software pipeline for retrieval and processing of public TCGA data. In 2016, TCGA data were transferred from the TCGA data portal to the Genomic Data Commons (GDCs), which is supported by a different set of data storage and retrieval mechanisms. In addition, new proteomics data of TCGA samples have been generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) program, which were not available for downloading through TCGA-Assembler. It is desirable to acquire and integrate data from both GDC and CPTAC. Results: We develop TCGA-assembler 2 (TA2) to automatically download and integrate data from GDC and CPTAC. We make substantial improvement on the functionality of TA2 to enhance user experience and software performance. TA2 together with its previous version have helped more than 2000 researchers from 64 countries to access and utilize TCGA and CPTAC data in their research. Availability of TA2 will continue to allow existing and new users to conduct reproducible research based on TCGA and CPTAC data. Availability and implementation: http://www.compgenome.org/TCGA-Assembler/ or https://github.com/compgenome365/TCGA-Assembler-2. Contact: zhuyitan@gmail.com or koaeraser@gmail.com. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: The Cancer Genome Atlas (TCGA) program has produced huge amounts of cancer genomics data providing unprecedented opportunities for research. In 2014, we developed TCGA-Assembler, a software pipeline for retrieval and processing of public TCGA data. In 2016, TCGA data were transferred from the TCGA data portal to the Genomic Data Commons (GDCs), which is supported by a different set of data storage and retrieval mechanisms. In addition, new proteomics data of TCGA samples have been generated by the Clinical Proteomic Tumor Analysis Consortium (CPTAC) program, which were not available for downloading through TCGA-Assembler. It is desirable to acquire and integrate data from both GDC and CPTAC. Results: We develop TCGA-assembler 2 (TA2) to automatically download and integrate data from GDC and CPTAC. We make substantial improvement on the functionality of TA2 to enhance user experience and software performance. TA2 together with its previous version have helped more than 2000 researchers from 64 countries to access and utilize TCGA and CPTAC data in their research. Availability of TA2 will continue to allow existing and new users to conduct reproducible research based on TCGA and CPTAC data. Availability and implementation: http://www.compgenome.org/TCGA-Assembler/ or https://github.com/compgenome365/TCGA-Assembler-2. Contact: zhuyitan@gmail.com or koaeraser@gmail.com. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Yitan Zhu; Yanxun Xu; Donald L Helseth; Kamalakar Gulukota; Shengjie Yang; Lorenzo L Pesce; Riten Mitra; Peter Müller; Subhajit Sengupta; Wentian Guo; Jonathan C Silverstein; Ian Foster; Nigel Parsad; Kevin P White; Yuan Ji Journal: J Natl Cancer Inst Date: 2015-05-08 Impact factor: 13.506
Authors: Philip L Ross; Yulin N Huang; Jason N Marchese; Brian Williamson; Kenneth Parker; Stephen Hattan; Nikita Khainovski; Sasi Pillai; Subhakar Dey; Scott Daniels; Subhasish Purkayastha; Peter Juhasz; Stephen Martin; Michael Bartlet-Jones; Feng He; Allan Jacobson; Darryl J Pappin Journal: Mol Cell Proteomics Date: 2004-09-22 Impact factor: 5.911
Authors: Hui Zhang; Tao Liu; Zhen Zhang; Samuel H Payne; Bai Zhang; Jason E McDermott; Jian-Ying Zhou; Vladislav A Petyuk; Li Chen; Debjit Ray; Shisheng Sun; Feng Yang; Lijun Chen; Jing Wang; Punit Shah; Seong Won Cha; Paul Aiyetan; Sunghee Woo; Yuan Tian; Marina A Gritsenko; Therese R Clauss; Caitlin Choi; Matthew E Monroe; Stefani Thomas; Song Nie; Chaochao Wu; Ronald J Moore; Kun-Hsing Yu; David L Tabb; David Fenyö; Vineet Bafna; Yue Wang; Henry Rodriguez; Emily S Boja; Tara Hiltke; Robert C Rivers; Lori Sokoll; Heng Zhu; Ie-Ming Shih; Leslie Cope; Akhilesh Pandey; Bing Zhang; Michael P Snyder; Douglas A Levine; Richard D Smith; Daniel W Chan; Karin D Rodland Journal: Cell Date: 2016-06-29 Impact factor: 41.582
Authors: Joseph R Iacona; Nicholas J Monteleone; Alexander D Lemenze; Ashley L Cornett; Carol S Lutz Journal: RNA Biol Date: 2019-08-23 Impact factor: 4.652
Authors: Xingxin Pan; Brandon Burgman; Erxi Wu; Jason H Huang; Nidhi Sahni; S Stephen Yi Journal: Comput Struct Biotechnol J Date: 2022-06-30 Impact factor: 6.155
Authors: Timothy A Dinh; Ramja Sritharan; F Donelson Smith; Adam B Francisco; Rosanna K Ma; Rodica P Bunaciu; Matt Kanke; Charles G Danko; Andrew P Massa; John D Scott; Praveen Sethupathy Journal: Cell Rep Date: 2020-04-14 Impact factor: 9.423