Literature DB >> 26209429

Alternative preprocessing of RNA-Sequencing data in The Cancer Genome Atlas leads to improved analysis results.

Mumtahena Rahman1, Laurie K Jackson2, W Evan Johnson3, Dean Y Li4, Andrea H Bild5, Stephen R Piccolo6.   

Abstract

MOTIVATION: The Cancer Genome Atlas (TCGA) RNA-Sequencing data are used widely for research. TCGA provides 'Level 3' data, which have been processed using a pipeline specific to that resource. However, we have found using experimentally derived data that this pipeline produces gene-expression values that vary considerably across biological replicates. In addition, some RNA-Sequencing analysis tools require integer-based read counts, which are not provided with the Level 3 data. As an alternative, we have reprocessed the data for 9264 tumor and 741 normal samples across 24 cancer types using the Rsubread package. We have also collated corresponding clinical data for these samples. We provide these data as a community resource.
RESULTS: We compared TCGA samples processed using either pipeline and found that the Rsubread pipeline produced fewer zero-expression genes and more consistent expression levels across replicate samples than the TCGA pipeline. Additionally, we used a genomic-signature approach to estimate HER2 (ERBB2) activation status for 662 breast-tumor samples and found that the Rsubread data resulted in stronger predictions of HER2 pathway activity. Finally, we used data from both pipelines to classify 575 lung cancer samples based on histological type. This analysis identified various non-coding RNA that may influence lung-cancer histology.
AVAILABILITY AND IMPLEMENTATION: The RNA-Sequencing and clinical data can be downloaded from Gene Expression Omnibus (accession number GSE62944). Scripts and code that were used to process and analyze the data are available from https://github.com/srp33/TCGA_RNASeq_Clinical. CONTACT: stephen_piccolo@byu.edu or andreab@genetics.utah.edu SUPPLEMENTARY INFORMATION: Supplementary material is available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2015        PMID: 26209429      PMCID: PMC4804769          DOI: 10.1093/bioinformatics/btv377

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  26 in total

1.  Multiplatform single-sample estimates of transcriptional activation.

Authors:  Stephen R Piccolo; Michelle R Withers; Owen E Francis; Andrea H Bild; W Evan Johnson
Journal:  Proc Natl Acad Sci U S A       Date:  2013-10-15       Impact factor: 11.205

2.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

3.  Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples.

Authors:  Günter P Wagner; Koryu Kin; Vincent J Lynch
Journal:  Theory Biosci       Date:  2012-08-08       Impact factor: 1.919

4.  Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal.

Authors:  Jianjiong Gao; Bülent Arman Aksoy; Ugur Dogrusoz; Gideon Dresdner; Benjamin Gross; S Onur Sumer; Yichao Sun; Anders Jacobsen; Rileen Sinha; Erik Larsson; Ethan Cerami; Chris Sander; Nikolaus Schultz
Journal:  Sci Signal       Date:  2013-04-02       Impact factor: 8.192

5.  edgeR for differential RNA-seq and ChIP-seq analysis: an application to stem cell biology.

Authors:  Olga Nikolayeva; Mark D Robinson
Journal:  Methods Mol Biol       Date:  2014

6.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

7.  The UCSC Cancer Genomics Browser.

Authors:  Jingchun Zhu; J Zachary Sanborn; Stephen Benz; Christopher Szeto; Fan Hsu; Robert M Kuhn; Donna Karolchik; John Archie; Marc E Lenburg; Laura J Esserman; W James Kent; David Haussler; Ting Wang
Journal:  Nat Methods       Date:  2009-04       Impact factor: 28.547

8.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.

Authors: 
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

9.  The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote.

Authors:  Yang Liao; Gordon K Smyth; Wei Shi
Journal:  Nucleic Acids Res       Date:  2013-04-04       Impact factor: 16.971

10.  Exploring TCGA Pan-Cancer data at the UCSC Cancer Genomics Browser.

Authors:  Melissa S Cline; Brian Craft; Teresa Swatloski; Mary Goldman; Singer Ma; David Haussler; Jingchun Zhu
Journal:  Sci Rep       Date:  2013-10-02       Impact factor: 4.379

View more
  87 in total

1.  Falling from grace: HPRT is not suitable as an endogenous control for cancer-related studies.

Authors:  Michelle H Townsend; Abigail M Felsted; Zac E Ence; Stephen R Piccolo; Richard A Robison; Kim L O'Neill
Journal:  Mol Cell Oncol       Date:  2019-02-26

2.  Cancer Cells Employ Nuclear Caspase-8 to Overcome the p53-Dependent G2/M Checkpoint through Cleavage of USP28.

Authors:  Ines Müller; Elwira Strozyk; Sebastian Schindler; Stefan Beissert; Htoo Zarni Oo; Thomas Sauter; Philippe Lucarelli; Sebastian Raeth; Angelika Hausser; Nader Al Nakouzi; Ladan Fazli; Martin E Gleave; He Liu; Hans-Uwe Simon; Henning Walczak; Douglas R Green; Jiri Bartek; Mads Daugaard; Dagmar Kulms
Journal:  Mol Cell       Date:  2020-01-22       Impact factor: 17.970

3.  Transcription Factor Activities Enhance Markers of Drug Sensitivity in Cancer.

Authors:  Luz Garcia-Alonso; Francesco Iorio; Angela Matchan; Nuno Fonseca; Patricia Jaaks; Gareth Peat; Miguel Pignatelli; Fiammetta Falcone; Cyril H Benes; Ian Dunham; Graham Bignell; Simon S McDade; Mathew J Garnett; Julio Saez-Rodriguez
Journal:  Cancer Res       Date:  2017-12-11       Impact factor: 12.701

4.  G protein αq exerts expression level-dependent distinct signaling paradigms.

Authors:  Dinesh Kankanamge; Mithila Tennakoon; Amila Weerasinghe; Luis Cedeno-Rosario; Deborah N Chadee; Ajith Karunarathne
Journal:  Cell Signal       Date:  2019-03-05       Impact factor: 4.315

5.  An LTR retrotransposon-derived lncRNA interacts with RNF169 to promote homologous recombination.

Authors:  Bing Deng; Wenli Xu; Zelin Wang; Chang Liu; Penghui Lin; Bin Li; Qiaojuan Huang; Jianhua Yang; Hui Zhou; Lianghu Qu
Journal:  EMBO Rep       Date:  2019-09-05       Impact factor: 8.807

6.  ΔNp63γ/SRC/Slug Signaling Axis Promotes Epithelial-to-Mesenchymal Transition in Squamous Cancers.

Authors:  Simon S McDade; Dennis J McCance; Kirtiman Srivastava; Adam Pickard; Stephanie G Craig; Gerard P Quinn; Shauna M Lambe; Jacqueline A James
Journal:  Clin Cancer Res       Date:  2018-05-08       Impact factor: 12.531

7.  Reproducible RNA-seq analysis using recount2.

Authors:  Leonardo Collado-Torres; Abhinav Nellore; Kai Kammers; Shannon E Ellis; Margaret A Taub; Kasper D Hansen; Andrew E Jaffe; Ben Langmead; Jeffrey T Leek
Journal:  Nat Biotechnol       Date:  2017-04-11       Impact factor: 54.908

Review 8.  Cloud computing for genomic data analysis and collaboration.

Authors:  Ben Langmead; Abhinav Nellore
Journal:  Nat Rev Genet       Date:  2018-01-30       Impact factor: 53.242

Review 9.  The value of genomics in dissecting the RAS-network and in guiding therapeutics for RAS-driven cancers.

Authors:  Gajendra Shrestha; Shelley M MacNeil; Jasmine A McQuerry; David F Jenkins; Sunil Sharma; Andrea H Bild
Journal:  Semin Cell Dev Biol       Date:  2016-06-20       Impact factor: 7.727

10.  Dual inhibition of BCL-XL and MCL-1 is required to induce tumour regression in lung squamous cell carcinomas sensitive to FGFR inhibition.

Authors:  Clare E Weeden; Casey Ah-Cann; Aliaksei Z Holik; Julie Pasquet; Jean-Marc Garnier; Delphine Merino; Guillaume Lessene; Marie-Liesse Asselin-Labat
Journal:  Oncogene       Date:  2018-05-10       Impact factor: 9.867

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.