Literature DB >> 31344359

Before and After: Comparison of Legacy and Harmonized TCGA Genomic Data Commons' Data.

Galen F Gao1, Joel S Parker2, Sheila M Reynolds3, Tiago C Silva4, Liang-Bo Wang5, Wanding Zhou6, Rehan Akbani7, Matthew Bailey5, Saianand Balu8, Benjamin P Berman9, Denise Brooks10, Hu Chen11, Andrew D Cherniack12, John A Demchok13, Li Ding5, Ina Felau13, Sharon Gaheen14, Daniela S Gerhard13, David I Heiman15, Kyle M Hernandez16, Katherine A Hoadley2, Reyka Jayasinghe17, Anab Kemal13, Theo A Knijnenburg3, Peter W Laird6, Michael K A Mensah13, Andrew J Mungall10, A Gordon Robertson10, Hui Shen6, Roy Tarnuzzer13, Zhining Wang13, Matthew Wyczalkowski5, Liming Yang13, Jean C Zenklusen13, Zhenyu Zhang18, Han Liang19, Michael S Noble20.   

Abstract

We present a systematic analysis of the effects of synchronizing a large-scale, deeply characterized, multi-omic dataset to the current human reference genome, using updated software, pipelines, and annotations. For each of 5 molecular data platforms in The Cancer Genome Atlas (TCGA)-mRNA and miRNA expression, single nucleotide variants, DNA methylation and copy number alterations-comprehensive sample, gene, and probe-level studies were performed, towards quantifying the degree of similarity between the 'legacy' GRCh37 (hg19) TCGA data and its GRCh38 (hg38) version as 'harmonized' by the Genomic Data Commons. We offer gene lists to elucidate differences that remained after controlling for confounders, and strategies to mitigate their impact on biological interpretation. Our results demonstrate that the hg19 and hg38 TCGA datasets are very highly concordant, promote informed use of either legacy or harmonized omics data, and provide a rubric that encourages similar comparisons as new data emerge and reference data evolve.
Copyright © 2019 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  DNA methylation; The Cancer Genome Atlas; human reference genome; mRNA expression; microRNA expression; quality control; somatic copy number alteration; somatic mutation

Mesh:

Substances:

Year:  2019        PMID: 31344359      PMCID: PMC6707074          DOI: 10.1016/j.cels.2019.06.006

Source DB:  PubMed          Journal:  Cell Syst        ISSN: 2405-4712            Impact factor:   11.091


  47 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Novel genes in the PAGE and GAGE family of tumor antigens found by homology walking in the dbEST database.

Authors:  U Brinkmann; G Vasmatzis; B Lee; I Pastan
Journal:  Cancer Res       Date:  1999-04-01       Impact factor: 12.701

3.  Circular binary segmentation for the analysis of array-based DNA copy number data.

Authors:  Adam B Olshen; E S Venkatraman; Robert Lucito; Michael Wigler
Journal:  Biostatistics       Date:  2004-10       Impact factor: 5.899

4.  Emerging roles for PAX8 in ovarian cancer and endosalpingeal development.

Authors:  Nathan J Bowen; Sanjay Logani; Erin B Dickerson; Laura B Kapa; Mariam Akhtar; Benedict B Benigno; John F McDonald
Journal:  Gynecol Oncol       Date:  2006-10-24       Impact factor: 5.482

5.  A cluster of 21 keratin-associated protein genes within introns of another gene on human chromosome 21q22.3.

Authors:  Kazunori Shibuya; Izumi Obayashi; Shuichi Asakawa; Shinsei Minoshima; Jun Kudoh; Nobuyoshi Shimizu
Journal:  Genomics       Date:  2004-04       Impact factor: 5.736

6.  Expression of pax8 as a useful marker in distinguishing ovarian carcinomas from mammary carcinomas.

Authors:  Daisuke Nonaka; Luis Chiriboga; Robert A Soslow
Journal:  Am J Surg Pathol       Date:  2008-10       Impact factor: 6.394

7.  miRBase: microRNA sequences, targets and gene nomenclature.

Authors:  Sam Griffiths-Jones; Russell J Grocock; Stijn van Dongen; Alex Bateman; Anton J Enright
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.

Authors:  Kim D Pruitt; Tatiana Tatusova; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2006-11-27       Impact factor: 16.971

9.  The human olfactory receptor repertoire.

Authors:  S Zozulya; F Echeverri; T Nguyen
Journal:  Genome Biol       Date:  2001-06-01       Impact factor: 13.583

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  46 in total

Review 1.  Online informatics resources to facilitate cancer target and chemical probe discovery.

Authors:  Xuan Yang; Haian Fu; Andrey A Ivanov
Journal:  RSC Med Chem       Date:  2020-04-09

Review 2.  Utilising extracellular vesicles for early cancer diagnostics: benefits, challenges and recommendations for the future.

Authors:  Ryan Charles Pink; Ellie-May Beaman; Priya Samuel; Susan Ann Brooks; David Raul Francisco Carter
Journal:  Br J Cancer       Date:  2022-01-10       Impact factor: 7.640

3.  Deciphering specific miRNAs in brain tumors: a 5-miRNA signature in glioblastoma.

Authors:  João Basso; Marco G Paggi; Ana Fortuna; Carla Vitorino; Rui Vitorino
Journal:  Mol Genet Genomics       Date:  2022-02-17       Impact factor: 3.291

Review 4.  Computational analysis of cancer genome sequencing data.

Authors:  Isidro Cortés-Ciriano; Doga C Gulhan; Jake June-Koo Lee; Giorgio E M Melloni; Peter J Park
Journal:  Nat Rev Genet       Date:  2021-12-08       Impact factor: 53.242

5.  BoxCar and shotgun proteomic analyses reveal molecular networks regulated by UBR5 in prostate cancer.

Authors:  Yiwu Yan; Bo Zhou; Yeon-Joo Lee; Sungyong You; Michael R Freeman; Wei Yang
Journal:  Proteomics       Date:  2021-12-23       Impact factor: 3.984

6.  Identifying tumorigenic non-coding mutations through altered cis-regulation.

Authors:  Zhongshan Cheng; Michael Vermeulen; Micheal Rollins-Green; Tomas Babak; Brian DeVeale
Journal:  STAR Protoc       Date:  2021-11-08

7.  re-Searcher: GUI-based bioinformatics tool for simplified genomics data mining of VCF files.

Authors:  Daniyar Karabayev; Ulykbek Kairov; Askhat Molkenov; Kaiyrgali Yerulanuly; Ilyas Kabimoldayev; Asset Daniyarov; Aigul Sharip; Ainur Seisenova; Zhaxybay Zhumadilov
Journal:  PeerJ       Date:  2021-05-03       Impact factor: 2.984

8.  ZBP1 not RIPK1 mediates tumor necroptosis in breast cancer.

Authors:  Jin Young Baik; Zhaoshan Liu; Delong Jiao; Hyung-Joon Kwon; Jiong Yan; Chamila Kadigamuwa; Moran Choe; Ross Lake; Michael Kruhlak; Mayank Tandon; Zhenyu Cai; Swati Choksi; Zheng-Gang Liu
Journal:  Nat Commun       Date:  2021-05-11       Impact factor: 14.919

9.  Multi-resolution characterization of molecular taxonomies in bulk and single-cell transcriptomics data.

Authors:  Eric R Reed; Stefano Monti
Journal:  Nucleic Acids Res       Date:  2021-09-27       Impact factor: 16.971

10.  Covalent Chemistry-Mediated Multimarker Purification of Circulating Tumor Cells Enables Noninvasive Detection of Molecular Signatures of Hepatocellular Carcinoma.

Authors:  Na Sun; Yi-Te Lee; Minhyung Kim; Jasmine J Wang; Ceng Zhang; Pai-Chi Teng; Dongping Qi; Ryan Y Zhang; Benjamin V Tran; Yue Tung Lee; Jinglei Ye; Juvelyn Palomique; Nicholas N Nissen; Steven-Huy B Han; Saeed Sadeghi; Richard S Finn; Sammy Saab; Ronald W Busuttil; Edwin M Posadas; Li Liang; Renjun Pei; Ju Dong Yang; Sungyong You; Vatche G Agopian; Hsian-Rong Tseng; Yazhen Zhu
Journal:  Adv Mater Technol       Date:  2021-04-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.