Literature DB >> 27530686

Exploring cancer genomic data from the cancer genome atlas project.

Abstract

The Cancer Genome Atlas (TCGA) has compiled genomic, epigenomic, and proteomic data from more than 10,000 samples derived from 33 types of cancer, aiming to improve our understanding of the molecular basis of cancer development. Availability of these genome-wide information provides an unprecedented opportunity for uncovering new key regulators of signaling pathways or new roles of pre-existing members in pathways. To take advantage of the advancement, it will be necessary to learn systematic approaches that can help to uncover novel genes reflecting genetic alterations, prognosis, or response to treatments. This minireview describes the updated status of TCGA project and explains how to use TCGA data. [BMB Reports 2016; 49(11): 607-611].

Entities: Chemical Disease Gene Species

Mesh：

Substances：
MicroRNAs
RNA, Messenger

Year: 2016 PMID： 27530686 PMCID： PMC5346320 DOI： 10.5483/bmbrep.2016.49.11.145

Source DB: PubMed Journal: BMB Rep ISSN： 1976-6696 Impact factor: 4.778

INTRODUCTION

Human Genome Project (HGP) was successfully completed in 2003 and began a new era of genome-based medicine (1–3). Success of HGP motivated development of new technologies for genome-wide copy number alteration analysis, gene expression profiling, and better sequencing methods that can comprehensively characterize entire genomes at low cost. The comprehensive genome map has improved our understanding of complex genetic networks involved in development of human disease and allowed us to uncover functions of associated genetic elements in molecular level. There are two major branches in genomics. Structural genomics generally collect and catalogues all genetic and epigenetic elements by massive sequencing. On the contrary, functional genomics aims to uncover the functional roles of genetic or epigenetic elements in context of different biological systems. Microarrays is another most important technology for genomics that is designed to capture the expression patterns of coding/non-coding genes, alterations in copy number and methylation status in entire genome simultaneously. Because functional activity of genes is well reflected in gene expression patterns, microarray technology has been extensively used to generate expression profile data from diseased tissues for identifying disease-associated genes and from cell lines for characterizing newly discovered genes. In recent years, by using publicly available genomic and epigenomic data, many genome-wide analyses uncovered novel genes associated with human diseases and genes with unexpected roles in different cellular context (4–6). Thus, in-depth analysis of multiple genomic data will undoubtedly reveal novel insights into the regulation of many signaling pathways or novel key regulators of the pathways. In this review, I will provide a short description of major progress on cancer genomics, particularly in The Cancer Genome Atlas (TCGA) project. Furthermore, I will provide description on the data generated by different platforms and analytical tools that have been developed through the progression of TCGA projects.

THE CANCER GENOME ATLAS (TCGA)

The Cancer Genome Atlas (TCGA) Project is a multi-institutional innovative research program supported by the National Institutes of Health. TCGA was launched to facilitate the comprehensive understanding of the cancer genetics using state-of-art genomic technologies and analysis tools to catalogue all of the potential cancer drivers, identify robust prognostic and predictive biomarkers and novel druggable therapeutic targets, and uncover molecular subtypes of tumors that are different in prognosis and response to treatments. With use of several different technical platforms, TCGA currently collects and maintains many different genome-wide data including expression of coding and non-coding RNA, somatic mutations, copy-number alteration, and epigenomic data like promoter methylation. In addition to genomic and epigenomic data, it collects proteomic data by using state-of-art technology reverse phase protein arrays (RPPA). The project plans to collect multi-platform data from hundreds of tissues per each cancer type and share the data with any investigators who are interested in genome-based medicine or those who are interested in studying function of genes without any restriction in use of the data. In 2005, a pilot study (phase I) started aiming to test the feasibility of ideas and develop the research infrastructure by characterizing few selected cancer types that are understudied: lung squamous cell cancer, glioblastoma, and ovarian cancers (7–10). Phase 2 study was started in 2009 and expanded to additional cancer types (33 cancer types). New oncogenes and tumor suppressor genes were identified through analysis of TCGA data. Some of findings are unexpected and showed significant association with clinical outcomes. For instance, genomic level analysis showed that non-hypermutated adenocarcinomas from the colon and rectum are almost indistinguishable at molecular level (11). Alterations in the FGFR kinase genes are very common in lung squamous cell cancer, while KRAS and EGFR mutations are most commonly activated oncogenes in lung adenocarcinoma (10). Thus, TCGA data provides an unprecedented opportunity for exploring human genome to uncover previously unrecognized genetic disruptions in smaller scale and gene-focused studies. TCGA has established a pipeline for collecting and processing tissues from numerous source sites (tissue banks at hospitals), generation of high quality genomic and proteomic data, and distribution and analysis of the data. Most importantly, major bodies for data generation and analysis are consisted of the Genome Characterization Centers (GCCs), Genome Sequencing Centers (GSCs) and Genome Data Analysis Centers (GDACs). The GCCs aim to identify all genomic alterations in the tumors in each cancer type. Each GCC uses most advanced platform technologies to generate mRNA and miRNA expression data, DNA methylation data, and copy number alteration data. The genetic changes identified by the GCCs are further characterized by the GSCs that perform large-scale genomic sequencing using the latest sequencing technologies to identify small genomic changes that could play a role in cancer. All of the data generated by the GCCs and GSCs on the multiple genomic platform technologies from thousands of tissue samples are transferred to GDAC through Data Coordinating Center (DCC). The GDACs are responsible for analysis of the data and development of new bioinformatics tools that can facilitate use of TCGA data by the entire research community.

TYPES OF DATA GENERATED FROM TCGA PROJECT

Six different platform data are currently generated from GCC and GSC and available to general public. These include somatic mutation data, mRNA and miRNA expression data, DNA methylation data, copy number alteration data, and proteomic data.

Whole exome sequencing data

Majority of mutation data were generated by whole exome sequencing using second-generation DNA sequencing instruments (mostly Illumina and ABI SOLiD). Whole exome sequencing analysis is carried out by sequencing the DNA coding for protein products, but not DNA sequences that do not directly code for proteins. However, about 10% of samples in TCGA project underwent whole genome sequencing, which sequences every base-pair of DNA and that can reveal any alteration in regulatory regions of genome.

mRNA expression data

mRNA expression profile data were first generated by using microarray technologies from Affymetrix or Agilent, but RNA sequencing (RNA-seq) technology from Illumina was used in later stage of TCGA project. RNA-seq technology has several advantages over microarray platform as it can quantify rare and common transcripts, alternative splicing, previously unrecognized transcripts, gene fusions, as well as non-coding RNAs. It can also quantify distribution of somatic mutations and edited RNAs (12).

microRNA expression data

microRNA (miRNA) is a small non-coding RNA (~22 nucleotides in size) that regulates other genes through post-transcriptional manner (13). miRNA expression profile data were generated by directly sequencing small molecule RNAs using RNA-seq technology from Illumina. These data were separately processed and maintained from data from mRNA-seq data as their biological and molecular characteristics are different from coding RNAs.

DNA methylation data

DNA methylation is an epigenetic mark which is frequently associated with transcriptional activity of genes. TCGA DNA methylation data were initially generated by using Illumina 27K DNA methylation array (HumanMethylation27 containing 27,578 probes in 14,495 genes). Later, it was replaced by 450K methylation arrays (HumanMethylation450 containing 485,512 probes covering 99% RefSeq genes).

DNA copy number alteration data

Copy number alteration is probably most frequent genetic events during the course of tumor development. Copy number data were generated by using Affymetrix SNP 6.0 arrays containing 1.8 million genetic markers, including more than 906,600 single nucleotide polymorphisms (SNPs) and more than 946,000 probes for the detection of copy number variation.

Reverse-phase protein array (RPPA) data

RPPA is an antibody-based quantitative methods assessing hundreds of protein markers in thousands samples in a cost-effective, sensitive and high-throughput manner (14). This technology has been extensively validated for both cell line and patient samples, and its applications range from building reproducible prognostic models to assessing underlying biology associated with prognosis. Current RPPA data from TCGA project include expression and modification of ~200 proteins. In addition to genomic and proteomic data, TCGA data also include slide images for histopathology and details on patients information such as tumor stages, races, potential etiology, treatments and survival.

WHERE TO GET TCGA DATA

All of genomic, proteomic, and clinical data from TCGA project were available from TCGA data portal site. However, as of July 15th, 2016, the TCGA Data Portal is no longer operational and all TCGA data now resides at the Genomic Data Commons (GDC, https://gdc-portal.nci.nih.gov/). While a vast majority of TCGA data in the GDC are publically available without restriction, meaning that no authentication or authorization is necessary to access it, some of the data are controlled access, meaning that special authorization process is necessary to access the data. Access to controlled data is typically granted by program-specific Data Access Committees (https://gdc.nci.nih.gov/access-data/obtaining-access-controlleddata). Public availability of the data is ruled by the NIH Genomic Data Sharing Policy (https://gds.nih.gov/). Open access data typically includes the data that cannot identify individuals such as high level genomic and proteomic data as well as most clinical and all biospecimen data elements. Controlled data includes individually identifiable data such as low level genomic sequencing data, germline variants, SNP6 genotype data, and certain clinical data elements. Processed high level data are also available from UCSC Cancer Genomics Browser (https://genome-cancer.ucsc.edu/). It offers more user-friendly processed data and limited visualization tools are also available. Histology information is also available from The Cancer Digital Slide Archive, CDSA (http://cancer.digitalslidearchive.net/), which provides the interactive tools for viewing and annotating diagnostic and tissue slide images from TCGA project (15). In addition to genomic, proteomic, and clinical data, TCGA also offers radiological imaging data from TCGA patients through The Cancer Imaging Archive, TCIA (http://www.cancerimagingarchive.net) in order to stimulate imaging phenotype-genotype study (16).

HOW TO ANALYZE TCGA DATA

Comprehensive genomic data from large number of patients would undoubtedly improve our knowledge in understanding of cancer-related genes and their clinical relevance. However, analysis of such “big data” would require substantial skills in computational tools, statistics, and programming languages. Thus, it would be necessary to develop easy-to-use and intuitive genomic tools that can help researchers or clinicians in analysis and interpretation of all the data types in a meaningful way. TCGA provides intuitive web-based tools. The cBioPortal for Cancer Genomics (http://cbioportal.org) offers probably best web-based tool for beginners who have limited experience in analysis of genomic data and only wish to analyze limited number of genes (17). The cBioPortal is an open-access resource developed by investigators at the Memorial Sloan-Kettering Cancer Centre (MSKCC). It allows users to search gene(s) of interest in certain cancers or all cancers in TCGA data and provides a flexible interface to multiple data sets and easy-to-use visualization options. The cBioportal offers unique analysis and visualization tools such as MEMo (Mutual Exclusivity Modules) analysis, correlation plots for expression and copy number alteration or methylation of genes, assessing clinical relevance of genes by Kaplan-Meier plots, co-expression analysis, network analysis. In addition, it also offers highly useful OncoPrint diagrams that are an intuitive diagram of genomic alterations such as somatic mutations and copy number alterations across a set of samples. Mutationmapper provides summary diagram of all mutations on a linear protein map and links to protein 3D structure database to examine potential effects of mutations. More importantly, all analyzed data can be downloaded in table format for further analysis. The Broad GDAC Firehose (http://gdac.broadinstitute.org/) is a web portal site that has been developed by the Broad Institute, aiming to deliver automated analyses of the TCGA data to general users. It provided preprocessed annotated data and association analysis across all types of data including clinical data. For example, it can provide list of genes whose copy number alteration, methylation, mRNA expression, and mutations are significantly correlated with tumor stages, survival of patients, sex, ages, or ethnic groups. Expression of genes of interest across all cancer types can be also easily assessed in firebrowse (http://firebrowse.org/). PROGgeneV2 (http://watson.compbio.iupui.edu/chirayu/proggene/database/) provide survival analysis of patients from multiple cohorts in database (18). Users can choose either single gene or set of genes to estimate their association with prognosis of patients. Because typical molecular biologists would not have good background on statistics that is necessary to run survival analysis, it would be useful tool for them. Mexpress (http://mexpress.be/) provides easy-to-use data visualization tool of the TCGA data including mRNA expression, DNA methylation, and clinical data (19). In addition, it also provide the correlation among data sets. The Cancer Proteome Atlas (TCPA) (http://app1.bioinformatics.mdanderson.org/tcpa/_design/basic/index.html) is data portal for proteomic data from TCGA project (20). It provides correlation analysis between proteins and association of proteins with prognosis of patients. In addition to TCGA data, it also provide data from established cancer cell lines.

EXPLORATION OF GENOMIC DATA

Analysis tools from TCGA project developed to make that basic scientists without training in informatics, statistics, and clinical knowledge can analyze the data and interpret the results. The potential involvement of genes of interest in cancer development can be easily assessed. For example, genetic alterations of peroxiredoxin family in all cancer types can be assessed through cBioPortal (Fig. 1A) and alterations of individual genes in certain cancer type (i.e., ovarian cancer) are visualized in oncoprint format (Fig. 1B). Furthermore, the clinical relevance of alteration is estimated and displayed in Kaplan-Meier plots (Fig. 1C). Clinical association of genes of interest can be further validated by using tools in PROGgeneV2. Correlation between different genomic data is also readily visualized through cBioProtal and Firehose (Fig. 2).

Fig. 1

Visualization of analyzed data. (A) The spectrum of genetic alteration in PRDX genes in different cancer types. (B) Genetic alterations of PRDX genes in ovarian cancer. (C) Kaplan-Meier plot of patients with ovarian cancer stratified according to genetic alteration of PRDX genes.

Fig. 2

Scatter plots between mRNA expression and copy number alteration of PRDX1 and PRDX2 in ovarian cancer.

CLOSING REMARK

TCGA is an unprecedented powerful public resource of cancer genomic data providing researchers with a great opportunity to increase present knowledge on cancer. Multi-layer analyses performed on different platforms reflecting distinct biological characteristics provide a better understanding of cancer biology, leading to improvement in patient stratification, identification of novel prognostic or predictive markers, and finding novel potentially druggable therapeutic targets. The translation of genomic knowledge into biological insights will move these new findings to the next level and guide to a new era in data-driven molecular biology.

20 in total

1. The Genomic Landscape and Clinical Relevance of A-to-I RNA Editing in Human Cancers.

Authors: Leng Han; Lixia Diao; Shuangxing Yu; Xiaoyan Xu; Jie Li; Rui Zhang; Yang Yang; Henrica M J Werner; A Karina Eterovic; Yuan Yuan; Jun Li; Nikitha Nair; Rosalba Minelli; Yiu Huen Tsang; Lydia W T Cheung; Kang Jin Jeong; Jason Roszik; Zhenlin Ju; Scott E Woodman; Yiling Lu; Kenneth L Scott; Jin Billy Li; Gordon B Mills; Han Liang
Journal: Cancer Cell Date: 2015-10-01 Impact factor: 31.743

2. The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository.

Authors: Kenneth Clark; Bruce Vendt; Kirk Smith; John Freymann; Justin Kirby; Paul Koppel; Stephen Moore; Stanley Phillips; David Maffitt; Michael Pringle; Lawrence Tarbox; Fred Prior
Journal: J Digit Imaging Date: 2013-12 Impact factor: 4.056

3. Finishing the euchromatic sequence of the human genome.

Authors:
Journal: Nature Date: 2004-10-21 Impact factor: 49.962

4. International network of cancer genome projects.

Authors: Thomas J Hudson; Warwick Anderson; Axel Artez; Anna D Barker; Cindy Bell; Rosa R Bernabé; M K Bhan; Fabien Calvo; Iiro Eerola; Daniela S Gerhard; Alan Guttmacher; Mark Guyer; Fiona M Hemsley; Jennifer L Jennings; David Kerr; Peter Klatt; Patrik Kolar; Jun Kusada; David P Lane; Frank Laplace; Lu Youyong; Gerd Nettekoven; Brad Ozenberger; Jane Peterson; T S Rao; Jacques Remacle; Alan J Schafer; Tatsuhiro Shibata; Michael R Stratton; Joseph G Vockley; Koichi Watanabe; Huanming Yang; Matthew M F Yuen; Bartha M Knoppers; Martin Bobrow; Anne Cambon-Thomsen; Lynn G Dressler; Stephanie O M Dyke; Yann Joly; Kazuto Kato; Karen L Kennedy; Pilar Nicolás; Michael J Parker; Emmanuelle Rial-Sebbag; Carlos M Romeo-Casabona; Kenna M Shaw; Susan Wallace; Georgia L Wiesner; Nikolajs Zeps; Peter Lichter; Andrew V Biankin; Christian Chabannon; Lynda Chin; Bruno Clément; Enrique de Alava; Françoise Degos; Martin L Ferguson; Peter Geary; D Neil Hayes; Thomas J Hudson; Amber L Johns; Arek Kasprzyk; Hidewaki Nakagawa; Robert Penny; Miguel A Piris; Rajiv Sarin; Aldo Scarpa; Tatsuhiro Shibata; Marc van de Vijver; P Andrew Futreal; Hiroyuki Aburatani; Mónica Bayés; David D L Botwell; Peter J Campbell; Xavier Estivill; Daniela S Gerhard; Sean M Grimmond; Ivo Gut; Martin Hirst; Carlos López-Otín; Partha Majumder; Marco Marra; John D McPherson; Hidewaki Nakagawa; Zemin Ning; Xose S Puente; Yijun Ruan; Tatsuhiro Shibata; Michael R Stratton; Hendrik G Stunnenberg; Harold Swerdlow; Victor E Velculescu; Richard K Wilson; Hong H Xue; Liu Yang; Paul T Spellman; Gary D Bader; Paul C Boutros; Peter J Campbell; Paul Flicek; Gad Getz; Roderic Guigó; Guangwu Guo; David Haussler; Simon Heath; Tim J Hubbard; Tao Jiang; Steven M Jones; Qibin Li; Nuria López-Bigas; Ruibang Luo; Lakshmi Muthuswamy; B F Francis Ouellette; John V Pearson; Xose S Puente; Victor Quesada; Benjamin J Raphael; Chris Sander; Tatsuhiro Shibata; Terence P Speed; Lincoln D Stein; Joshua M Stuart; Jon W Teague; Yasushi Totoki; Tatsuhiko Tsunoda; Alfonso Valencia; David A Wheeler; Honglong Wu; Shancen Zhao; Guangyu Zhou; Lincoln D Stein; Roderic Guigó; Tim J Hubbard; Yann Joly; Steven M Jones; Arek Kasprzyk; Mark Lathrop; Nuria López-Bigas; B F Francis Ouellette; Paul T Spellman; Jon W Teague; Gilles Thomas; Alfonso Valencia; Teruhiko Yoshida; Karen L Kennedy; Myles Axton; Stephanie O M Dyke; P Andrew Futreal; Daniela S Gerhard; Chris Gunter; Mark Guyer; Thomas J Hudson; John D McPherson; Linda J Miller; Brad Ozenberger; Kenna M Shaw; Arek Kasprzyk; Lincoln D Stein; Junjun Zhang; Syed A Haider; Jianxin Wang; Christina K Yung; Anthony Cros; Anthony Cross; Yong Liang; Saravanamuttu Gnaneshan; Jonathan Guberman; Jack Hsu; Martin Bobrow; Don R C Chalmers; Karl W Hasel; Yann Joly; Terry S H Kaan; Karen L Kennedy; Bartha M Knoppers; William W Lowrance; Tohru Masui; Pilar Nicolás; Emmanuelle Rial-Sebbag; Laura Lyman Rodriguez; Catherine Vergely; Teruhiko Yoshida; Sean M Grimmond; Andrew V Biankin; David D L Bowtell; Nicole Cloonan; Anna deFazio; James R Eshleman; Dariush Etemadmoghadam; Brooke B Gardiner; Brooke A Gardiner; James G Kench; Aldo Scarpa; Robert L Sutherland; Margaret A Tempero; Nicola J Waddell; Peter J Wilson; John D McPherson; Steve Gallinger; Ming-Sound Tsao; Patricia A Shaw; Gloria M Petersen; Debabrata Mukhopadhyay; Lynda Chin; Ronald A DePinho; Sarah Thayer; Lakshmi Muthuswamy; Kamran Shazand; Timothy Beck; Michelle Sam; Lee Timms; Vanessa Ballin; Youyong Lu; Jiafu Ji; Xiuqing Zhang; Feng Chen; Xueda Hu; Guangyu Zhou; Qi Yang; Geng Tian; Lianhai Zhang; Xiaofang Xing; Xianghong Li; Zhenggang Zhu; Yingyan Yu; Jun Yu; Huanming Yang; Mark Lathrop; Jörg Tost; Paul Brennan; Ivana Holcatova; David Zaridze; Alvis Brazma; Lars Egevard; Egor Prokhortchouk; Rosamonde Elizabeth Banks; Mathias Uhlén; Anne Cambon-Thomsen; Juris Viksna; Fredrik Ponten; Konstantin Skryabin; Michael R Stratton; P Andrew Futreal; Ewan Birney; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Sancha Martin; Jorge S Reis-Filho; Andrea L Richardson; Christos Sotiriou; Hendrik G Stunnenberg; Giles Thoms; Marc van de Vijver; Laura van't Veer; Fabien Calvo; Daniel Birnbaum; Hélène Blanche; Pascal Boucher; Sandrine Boyault; Christian Chabannon; Ivo Gut; Jocelyne D Masson-Jacquemier; Mark Lathrop; Iris Pauporté; Xavier Pivot; Anne Vincent-Salomon; Eric Tabone; Charles Theillet; Gilles Thomas; Jörg Tost; Isabelle Treilleux; Fabien Calvo; Paulette Bioulac-Sage; Bruno Clément; Thomas Decaens; Françoise Degos; Dominique Franco; Ivo Gut; Marta Gut; Simon Heath; Mark Lathrop; Didier Samuel; Gilles Thomas; Jessica Zucman-Rossi; Peter Lichter; Roland Eils; Benedikt Brors; Jan O Korbel; Andrey Korshunov; Pablo Landgraf; Hans Lehrach; Stefan Pfister; Bernhard Radlwimmer; Guido Reifenberger; Michael D Taylor; Christof von Kalle; Partha P Majumder; Rajiv Sarin; T S Rao; M K Bhan; Aldo Scarpa; Paolo Pederzoli; Rita A Lawlor; Massimo Delledonne; Alberto Bardelli; Andrew V Biankin; Sean M Grimmond; Thomas Gress; David Klimstra; Giuseppe Zamboni; Tatsuhiro Shibata; Yusuke Nakamura; Hidewaki Nakagawa; Jun Kusada; Tatsuhiko Tsunoda; Satoru Miyano; Hiroyuki Aburatani; Kazuto Kato; Akihiro Fujimoto; Teruhiko Yoshida; Elias Campo; Carlos López-Otín; Xavier Estivill; Roderic Guigó; Silvia de Sanjosé; Miguel A Piris; Emili Montserrat; Marcos González-Díaz; Xose S Puente; Pedro Jares; Alfonso Valencia; Heinz Himmelbauer; Heinz Himmelbaue; Victor Quesada; Silvia Bea; Michael R Stratton; P Andrew Futreal; Peter J Campbell; Anne Vincent-Salomon; Andrea L Richardson; Jorge S Reis-Filho; Marc van de Vijver; Gilles Thomas; Jocelyne D Masson-Jacquemier; Samuel Aparicio; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Hendrik G Stunnenberg; Laura van't Veer; Douglas F Easton; Paul T Spellman; Sancha Martin; Anna D Barker; Lynda Chin; Francis S Collins; Carolyn C Compton; Martin L Ferguson; Daniela S Gerhard; Gad Getz; Chris Gunter; Alan Guttmacher; Mark Guyer; D Neil Hayes; Eric S Lander; Brad Ozenberger; Robert Penny; Jane Peterson; Chris Sander; Kenna M Shaw; Terence P Speed; Paul T Spellman; Joseph G Vockley; David A Wheeler; Richard K Wilson; Thomas J Hudson; Lynda Chin; Bartha M Knoppers; Eric S Lander; Peter Lichter; Lincoln D Stein; Michael R Stratton; Warwick Anderson; Anna D Barker; Cindy Bell; Martin Bobrow; Wylie Burke; Francis S Collins; Carolyn C Compton; Ronald A DePinho; Douglas F Easton; P Andrew Futreal; Daniela S Gerhard; Anthony R Green; Mark Guyer; Stanley R Hamilton; Tim J Hubbard; Olli P Kallioniemi; Karen L Kennedy; Timothy J Ley; Edison T Liu; Youyong Lu; Partha Majumder; Marco Marra; Brad Ozenberger; Jane Peterson; Alan J Schafer; Paul T Spellman; Hendrik G Stunnenberg; Brandon J Wainwright; Richard K Wilson; Huanming Yang
Journal: Nature Date: 2010-04-15 Impact factor: 49.962

5. Cancer Digital Slide Archive: an informatics resource to support integrated in silico analysis of TCGA pathology data.

Authors: David A Gutman; Jake Cobb; Dhananjaya Somanna; Yuna Park; Fusheng Wang; Tahsin Kurc; Joel H Saltz; Daniel J Brat; Lee A D Cooper
Journal: J Am Med Inform Assoc Date: 2013-07-25 Impact factor: 4.497

6. Comprehensive molecular characterization of human colon and rectal cancer.

Authors:
Journal: Nature Date: 2012-07-18 Impact factor: 49.962

7. Reconstruction of nuclear receptor network reveals that NR2E3 is a novel upstream regulator of ESR1 in breast cancer.

Authors: Yun-Yong Park; Kyounghyun Kim; Sang-Bae Kim; Bryan T Hennessy; Soo Mi Kim; Eun Sung Park; Jae Yun Lim; Jane Li; Yiling Lu; Ana Maria Gonzalez-Angulo; Woojin Jeong; Gordon B Mills; Stephen Safe; Ju-Seog Lee
Journal: EMBO Mol Med Date: 2011-12-15 Impact factor: 12.137

Review 8. The long and short of microRNA.

Authors: Luke A Yates; Chris J Norbury; Robert J C Gilbert
Journal: Cell Date: 2013-04-25 Impact factor: 41.582

9. MEXPRESS: visualizing expression, DNA methylation and clinical TCGA data.

Authors: Alexander Koch; Tim De Meyer; Jana Jeschke; Wim Van Criekinge
Journal: BMC Genomics Date: 2015-08-26 Impact factor: 3.969

10. TCPA: a resource for cancer functional proteomics data.

Authors: Jun Li; Yiling Lu; Rehan Akbani; Zhenlin Ju; Paul L Roebuck; Wenbin Liu; Ji-Yeon Yang; Bradley M Broom; Roeland G W Verhaak; David W Kane; Chris Wakefield; John N Weinstein; Gordon B Mills; Han Liang
Journal: Nat Methods Date: 2013-09-15 Impact factor: 28.547

23 in total

Review 1. What is the potential of nanolock- and nanocross-nanopore technology in cancer diagnosis?

Authors: Li-Qun Gu; Kent S Gates; Michael X Wang; Guangfu Li
Journal: Expert Rev Mol Diagn Date: 2017-12-01 Impact factor: 5.225

2. Alterations in cancer stem-cell marker CD44 expression predict oncologic outcome in soft-tissue sarcomas.

Authors: Timothy Henderson; Mingyi Chen; Morgan A Darrow; Chin-Shang Li; Chi-Lu Chiu; Arta M Monjazeb; William J Murphy; Robert J Canter
Journal: J Surg Res Date: 2017-12-22 Impact factor: 2.192

3. Five Novel Oncogenic Signatures Could Be Utilized as AFP-Related Diagnostic Biomarkers for Hepatocellular Carcinoma Based on Next-Generation Sequencing.

Authors: Zheng Yu; Rongchang Wang; Fan Chen; Jianru Wang; Xiaohui Huang
Journal: Dig Dis Sci Date: 2018-02-13 Impact factor: 3.199

4. Expression profiles analysis identifies a novel three-mRNA signature to predict overall survival in oral squamous cell carcinoma.

Authors: Xinyuan Zhao; Shuyu Sun; Xiongqun Zeng; Li Cui
Journal: Am J Cancer Res Date: 2018-03-01 Impact factor: 6.166

5. Comprehensive and integrative analysis identifies COX7A1 as a critical methylation-driven gene in breast invasive carcinoma.

Authors: Zhixian He; Feiran Wang; Wei Zhang; Jinhua Ding; Sujie Ni
Journal: Ann Transl Med Date: 2019-11

6. LncRNA PVT1 as an effective biomarker for cancer diagnosis and detection based on transcriptome data and meta-analysis.

Authors: Yunhong Zeng; Tieqiang Wang; Yi Liu; Zhan Su; Pingtao Lu; Xiaoliang Chen; Dongsheng Hu
Journal: Oncotarget Date: 2017-09-04

7. Bcl6/p53 expression, macrophages/mast cells infiltration and microvascular density in invasive breast carcinoma.

Authors: Roberto Tamma; Simona Ruggieri; Tiziana Annese; Giovanni Simone; Anita Mangia; Serena Rega; Francesco A Zito; Beatrice Nico; Domenico Ribatti
Journal: Oncotarget Date: 2018-04-27

8. Clinical and genomic landscape of gastric cancer with a mesenchymal phenotype.

Authors: Sang Cheul Oh; Bo Hwa Sohn; Jae-Ho Cheong; Sang-Bae Kim; Jae Eun Lee; Ki Cheong Park; Sang Ho Lee; Jong-Lyul Park; Yun-Yong Park; Hyun-Sung Lee; Hee-Jin Jang; Eun Sung Park; Sang-Cheol Kim; Jeonghoon Heo; In-Sun Chu; You-Jin Jang; Young-Jae Mok; WonKyung Jung; Baek-Hui Kim; Aeree Kim; Jae Yong Cho; Jae Yun Lim; Yuki Hayashi; Shumei Song; Elena Elimova; Jeannelyn S Estralla; Jeffrey H Lee; Manoop S Bhutani; Yiling Lu; Wenbin Liu; Jeeyun Lee; Won Ki Kang; Sung Kim; Sung Hoon Noh; Gordon B Mills; Seon-Young Kim; Jaffer A Ajani; Ju-Seog Lee
Journal: Nat Commun Date: 2018-05-03 Impact factor: 14.919

Review 9. lncRNA PVT1 identified as an independent biomarker for prognosis surveillance of solid tumors based on transcriptome data and meta-analysis.

Authors: Xiaoliang Chen; Yueying Yang; Yong Cao; Changjun Wu; Shuxian Wu; Zhan Su; Hongwei Jin; Dongli Wang; Gengxin Zhang; Wei Fan; Jinbo Lin; Yunhong Zeng; Dongsheng Hu
Journal: Cancer Manag Res Date: 2018-08-16 Impact factor: 3.989

10. A pan-cancer study of the transcriptional regulation of uricogenesis in human tumours: pathological and pharmacological correlates.

Authors: Zuzana Saidak; Christophe Louandre; Samy Dahmani; Chloé Sauzay; Sara Guedda; Bruno Chauffert; Denis Chatelain; Irene Ceballos-Picot; Antoine Galmiche
Journal: Biosci Rep Date: 2018-09-19 Impact factor: 3.840