Andrea Rau1,2, Michael Flister3,4,5, Hallgeir Rui6, Paul L Auer2. 1. GABI, INRA, AgroParisTech, Université Paris-Saclay, Jouy-en-Josas, France. 2. Joseph J. Zilber School of Public Health, University of Wisconsin-Milwaukee, Milwaukee, WI, USA. 3. Department of Physiology, Medical College of Wisconsin, Milwaukee, WI, USA. 4. Cancer Center, Medical College of Wisconsin, Milwaukee, WI, USA. 5. Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, WI, USA. 6. Department of Pathology, Medical College of Wisconsin, Milwaukee, WI, USA.
Abstract
Motivation: The Cancer Genome Atlas (TCGA) has greatly advanced cancer research by generating, curating and publicly releasing deeply measured molecular data from thousands of tumor samples. In particular, gene expression measures, both within and across cancer types, have been used to determine the genes and proteins that are active in tumor cells. Results: To more thoroughly investigate the behavior of gene expression in TCGA tumor samples, we introduce a statistical framework for partitioning the variation in gene expression due to a variety of molecular variables including somatic mutations, transcription factors (TFs), microRNAs, copy number alternations, methylation and germ-line genetic variation. As proof-of-principle, we identify and validate specific TFs that influence the expression of PTPN14 in breast cancer cells. Availability and implementation: We provide a freely available, user-friendly, browseable interactive web-based application for exploring the results of our transcriptome-wide analyses across 17 different cancers in TCGA at http://ls-shiny-prod.uwm.edu/edge_in_tcga. All TCGA Open Access tier data are available at the Broad Institute GDAC Firehose and were downloaded using the TCGA2STAT R package. TCGA Controlled Access tier data are available via controlled access through the Genomic Data Commons (GDC). R scripts used to download, format and analyze the data and produce the interactive R/Shiny web app have been made available on GitHub at https://github.com/andreamrau/EDGE-in-TCGA.
Motivation: The Cancer Genome Atlas (TCGA) has greatly advanced cancer research by generating, curating and publicly releasing deeply measured molecular data from thousands of tumor samples. In particular, gene expression measures, both within and across cancer types, have been used to determine the genes and proteins that are active in tumor cells. Results: To more thoroughly investigate the behavior of gene expression in TCGA tumor samples, we introduce a statistical framework for partitioning the variation in gene expression due to a variety of molecular variables including somatic mutations, transcription factors (TFs), microRNAs, copy number alternations, methylation and germ-line genetic variation. As proof-of-principle, we identify and validate specific TFs that influence the expression of PTPN14 in breast cancer cells. Availability and implementation: We provide a freely available, user-friendly, browseable interactive web-based application for exploring the results of our transcriptome-wide analyses across 17 different cancers in TCGA at http://ls-shiny-prod.uwm.edu/edge_in_tcga. All TCGA Open Access tier data are available at the Broad Institute GDAC Firehose and were downloaded using the TCGA2STAT R package. TCGA Controlled Access tier data are available via controlled access through the Genomic Data Commons (GDC). R scripts used to download, format and analyze the data and produce the interactive R/Shiny web app have been made available on GitHub at https://github.com/andreamrau/EDGE-in-TCGA.
Authors: Jianmin Zhang; Song Yao; Qiang Hu; Qianqian Zhu; Song Liu; Kathryn L Lunetta; Stephen A Haddad; Nuo Yang; He Shen; Chi-Chen Hong; Lara Sucheston-Campbell; Edward A Ruiz-Narvaez; Jeannette T Bensen; Melissa A Troester; Elisa V Bandera; Lynn Rosenberg; Christopher A Haiman; Andrew F Olshan; Julie R Palmer; Christine B Ambrosone Journal: Carcinogenesis Date: 2016-08-01 Impact factor: 4.944
Authors: Alexander Gusev; Arthur Ko; Huwenbo Shi; Gaurav Bhatia; Wonil Chung; Brenda W J H Penninx; Rick Jansen; Eco J C de Geus; Dorret I Boomsma; Fred A Wright; Patrick F Sullivan; Elina Nikkola; Marcus Alvarez; Mete Civelek; Aldons J Lusis; Terho Lehtimäki; Emma Raitoharju; Mika Kähönen; Ilkka Seppälä; Olli T Raitakari; Johanna Kuusisto; Markku Laakso; Alkes L Price; Päivi Pajukanta; Bogdan Pasaniuc Journal: Nat Genet Date: 2016-02-08 Impact factor: 38.330
Authors: Yang I Li; Bryce van de Geijn; Anil Raj; David A Knowles; Allegra A Petti; David Golan; Yoav Gilad; Jonathan K Pritchard Journal: Science Date: 2016-04-28 Impact factor: 47.728
Authors: Andrea Rau; Regina Manansala; Michael J Flister; Hallgeir Rui; Florence Jaffrézic; Denis Laloë; Paul L Auer Journal: Biostatistics Date: 2022-04-13 Impact factor: 5.279
Authors: Brandi L Carofino; Kayla M Dinshaw; Pui Yan Ho; Christophe Cataisson; Aleksandra M Michalowski; Andrew Ryscavage; Addie Alkhas; Nathan W Wong; Vishal Koparde; Stuart H Yuspa Journal: Oncotarget Date: 2019-12-31