Julie Wu1, Jordan Bryan2, Samuel M Rubinstein3, Lucy Wang3, Michele Lenoue-Newton3, Raed Zuhour4, Mia Levy1,3,5, Christine Micheel3, Yaomin Xu5,6, Suresh K Bhavnani7, Lester Mackey8, Jeremy L Warner1,3,5. 1. Department of Internal Medicine, Vanderbilt University, Nashville, TN. 2. Duke University, Durham, NC. 3. Vanderbilt-Ingram Cancer Center, Vanderbilt University Medical Center, Nashville, TN. 4. Department of Radiation Oncology, University of Texas Medical Branch, Galveston, TX. 5. Department of Biomedical Informatics, Vanderbilt University, Nashville, TN. 6. Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN. 7. Institute for Translational Sciences, University of Texas Medical Branch, Galveston, TX. 8. Microsoft Research, Cambridge, MA.
Abstract
PURPOSE: Our goal was to identify the opportunities and challenges in analyzing data from the American Association of Cancer Research Project Genomics Evidence Neoplasia Information Exchange (GENIE), a multi-institutional database derived from clinically driven genomic testing, at both the inter- and the intra-institutional level. Inter-institutionally, we identified genotypic differences between primary and metastatic tumors across the 3 most represented cancers in GENIE. Intra-institutionally, we analyzed the clinical characteristics of the Vanderbilt-Ingram Cancer Center (VICC) subset of GENIE to inform the interpretation of GENIE as a whole. METHODS: We performed overall cohort matching on the basis of age, ethnicity, and sex of 13,208 patients stratified by cancer type (breast, colon, or lung) and sample site (primary or metastatic). We then determined whether detected variants, at the gene level, were associated with primary or metastatic tumors. We extracted clinical data for the VICC subset from VICC's clinical data warehouse. Treatment exposures were mapped to a 13-class schema derived from the HemOnc ontology. RESULTS: Across 756 genes, there were significant differences in all cancer types. In breast cancer, ESR1 variants were over-represented in metastatic samples (odds ratio, 5.91; q < 10-6). TP53 mutations were over-represented in metastatic samples across all cancers. VICC had a significantly different cancer type distribution than that of GENIE but patients were well matched with respect to age, sex, and sample type. Treatment data from VICC was used for a bipartite network analysis, demonstrating clusters with a mix of histologies and others being more histology specific. CONCLUSION: This article demonstrates the feasibility of deriving meaningful insights from GENIE at the inter- and intra-institutional level and illuminates the opportunities and challenges of the data GENIE contains. The results should help guide future development of GENIE, with the goal of fully realizing its potential for accelerating precision medicine.
PURPOSE: Our goal was to identify the opportunities and challenges in analyzing data from the American Association of Cancer Research Project Genomics Evidence Neoplasia Information Exchange (GENIE), a multi-institutional database derived from clinically driven genomic testing, at both the inter- and the intra-institutional level. Inter-institutionally, we identified genotypic differences between primary and metastatic tumors across the 3 most represented cancers in GENIE. Intra-institutionally, we analyzed the clinical characteristics of the Vanderbilt-Ingram Cancer Center (VICC) subset of GENIE to inform the interpretation of GENIE as a whole. METHODS: We performed overall cohort matching on the basis of age, ethnicity, and sex of 13,208 patients stratified by cancer type (breast, colon, or lung) and sample site (primary or metastatic). We then determined whether detected variants, at the gene level, were associated with primary or metastatic tumors. We extracted clinical data for the VICC subset from VICC's clinical data warehouse. Treatment exposures were mapped to a 13-class schema derived from the HemOnc ontology. RESULTS: Across 756 genes, there were significant differences in all cancer types. In breast cancer, ESR1 variants were over-represented in metastatic samples (odds ratio, 5.91; q < 10-6). TP53 mutations were over-represented in metastatic samples across all cancers. VICC had a significantly different cancer type distribution than that of GENIE but patients were well matched with respect to age, sex, and sample type. Treatment data from VICC was used for a bipartite network analysis, demonstrating clusters with a mix of histologies and others being more histology specific. CONCLUSION: This article demonstrates the feasibility of deriving meaningful insights from GENIE at the inter- and intra-institutional level and illuminates the opportunities and challenges of the data GENIE contains. The results should help guide future development of GENIE, with the goal of fully realizing its potential for accelerating precision medicine.
Authors: Jeremy L Warner; Dmitry Dymshyts; Christian G Reich; Michael J Gurley; Harry Hochheiser; Zachary H Moldwin; Rimma Belenkaya; Andrew E Williams; Peter C Yang Journal: J Biomed Inform Date: 2019-06-22 Impact factor: 6.317
Authors: Neal I Lindeman; Philip T Cagle; Mary Beth Beasley; Dhananjay Arun Chitale; Sanja Dacic; Giuseppe Giaccone; Robert Brian Jenkins; David J Kwiatkowski; Juan-Sebastian Saldivar; Jeremy Squire; Erik Thunnissen; Marc Ladanyi Journal: J Thorac Oncol Date: 2013-07 Impact factor: 15.609
Authors: Albert Y Lin; Mei-Sze Chua; Yoon-La Choi; William Yeh; Young H Kim; Raymond Azzi; Gregg A Adams; Kristin Sainani; Matt van de Rijn; Samuel K So; Jonathan R Pollack Journal: PLoS One Date: 2011-02-24 Impact factor: 3.240
Authors: A Rose Brannon; Efsevia Vakiani; Brooke E Sylvester; Sasinya N Scott; Gregory McDermott; Ronak H Shah; Krishan Kania; Agnes Viale; Dayna M Oschwald; Vladimir Vacic; Anne-Katrin Emde; Andrea Cercek; Rona Yaeger; Nancy E Kemeny; Leonard B Saltz; Jinru Shia; Michael I D'Angelica; Martin R Weiser; David B Solit; Michael F Berger Journal: Genome Biol Date: 2014-08-28 Impact factor: 13.583
Authors: Katherine Lawler; Efterpi Papouli; Cristina Naceur-Lombardelli; Anca Mera; Kayleigh Ougham; Andrew Tutt; Siker Kimbung; Ingrid Hedenfalk; Jun Zhan; Hongquan Zhang; Richard Buus; Mitch Dowsett; Tony Ng; Sarah E Pinder; Peter Parker; Lars Holmberg; Cheryl E Gillett; Anita Grigoriadis; Arnie Purushotham Journal: Breast Cancer Res Date: 2017-10-13 Impact factor: 6.466