Literature DB >> 32502231

CoMut: visualizing integrated molecular information with comutation plots.

Jett Crowdis1,2, Meng Xiao He1,2,3, Brendan Reardon1,2, Eliezer M Van Allen1,2.   

Abstract

MOTIVATION: Large-scale sequencing studies have created a need to succinctly visualize genomic characteristics of patient cohorts linked to widely variable phenotypic information. This is often done by visualizing the co-occurrence of variants with comutation plots. Current tools lack the ability to create highly customizable and publication quality comutation plots from arbitrary user data.
RESULTS: We developed CoMut, a stand-alone, object-oriented Python package that creates comutation plots from arbitrary input data, including categorical data, continuous data, bar graphs, side bar graphs and data that describes relationships between samples.
AVAILABILITY AND IMPLEMENTATION: The CoMut package is open source and is available at https://github.com/vanallenlab/comut under the MIT License, along with documentation and examples. A no installation, easy-to-use implementation is available on Google Colab (see GitHub).
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Year:  2020        PMID: 32502231      PMCID: PMC7520041          DOI: 10.1093/bioinformatics/btaa554

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

A common representation of cohort-level mutation data in large-scale sequencing studies is a comutation plot, which shows sample-level mutation status along with other relevant clinical and genomic characteristics. Originally designed in 2011 (Stransky ), comutation plots are now ubiquitous and provide a way to communicate mutations and other patterns in a cohort. Software currently exists to create comutation plots: CoMutPlotter (Huang ) and jsComut (Pierce ) provide a web interface for creating comutation plots while larger bioinformatic packages often have a function for creating comutation plots, including oncoplot in maftools (Mayakonda ) and waterfall in GenVisR (Skidmore ). However, these softwares only plot specific genomic and phenotypic data types, and as clinically integrated sequencing uses rapidly rise, comutation software that can capture the complexity of advances in molecular analyses and phenotypic information is necessary.

2 Materials and methods

Here we present a Python package named ‘CoMut’ to streamline the creation of comutation plots. Implemented in Python’s plotting library, matplotlib, it is the first of its kind to utilize an object-oriented framework for maximum customizability. As a result, users can freely edit individual parts of the plot after its creation and fine-tune plots for publication. Furthermore, instead of being constrained to a limited number of pre-specified genomic data types, CoMut is data agnostic and enables the plotting of arbitrary data types. For example categorical data encompasses mutation data (e.g. variant types) and most clinical variables (e.g. tumour stage). Categorical data are drawn as boxes with user specified colors, and two mutations in the same gene within one sample are drawn as triangles rather than boxes. This allows CoMut to depict allele-specific copy number alterations or plot mutations and copy number alterations together, which are both major advantages relative to existing software. CoMut also supports continuous data, bar graphs (e.g. mutation burden), side bar graphs and sample indicators, which indicate relationships between samples. CoMut uses the Pandas library to handle data and accepts a variety of file types, including tsv, csv and maf file formats. CoMut includes helper functions to parse common file types into dataframes, and it can export plots in both raster (.jpg, .png) and vector (.pdf, .svg) forms. It can also handle missing data, an important feature for clinical sequencing studies where some data types for individuals may be unavailable. We provide a quickstart notebook in GitHub connected to Google Colab that allows users to create basic comutation plots from maf files using their browser without any installations.

3 Usage scenario

To illustrate the features of CoMut, we created a comutation plot visualizing a cohort from a study of selective response to immunotherapy in melanoma (Liu ) (Fig. 1). We obtained mutation and clinical data from the supplement and used allele-specific copy number profiles from ABSOLUTE (Carter ) to classify allele-specific copy number alterations. In brief, we defined samples as whole genome doubled if they had an average ploidy greater than 2.5. We classified copy number alterations in genes by comparing the integer allelic copy number of the segment on which the gene was located to baseline allelic ploidy (2 if a sample was WGD, 1 otherwise).
Fig. 1.

A comutation plot generated with CoMut using data provided in the study by Liu . For visualization purposes, only 52 samples are shown. Each column represents a tumour. Tumours are ordered by best RECIST criteria response (CR, PR, PD, SD or MR) and within each subgroup by non-synonymous mutation load. For copy number data, each triangle represents one allele and allele-specific copy number alterations are classified relative to baseline ploidy (2 if sample has whole genome duplication, 1 otherwise). Unfilled boxes with a slash indicate that allelic copy number data was unavailable due to too few heterozygous SNP sites. Complex indicates that a segment breakpoint occurred within a gene, creating conflicting copy number. CN-LOH indicates copy neutral loss of heterozygosity. Sample indicators are added for demonstration purposes and do not represent data from the study.

A comutation plot generated with CoMut using data provided in the study by Liu . For visualization purposes, only 52 samples are shown. Each column represents a tumour. Tumours are ordered by best RECIST criteria response (CR, PR, PD, SD or MR) and within each subgroup by non-synonymous mutation load. For copy number data, each triangle represents one allele and allele-specific copy number alterations are classified relative to baseline ploidy (2 if sample has whole genome duplication, 1 otherwise). Unfilled boxes with a slash indicate that allelic copy number data was unavailable due to too few heterozygous SNP sites. Complex indicates that a segment breakpoint occurred within a gene, creating conflicting copy number. CN-LOH indicates copy neutral loss of heterozygosity. Sample indicators are added for demonstration purposes and do not represent data from the study. To create the comutation plot, we added the following datasets to the plot using CoMut, specifying color mappings for each. Mutation type, copy number alterations and discrete clinical data (e.g. primary type) were all added as categorical data. Purity was added as continuous data, and mutation burden and mutational signatures were added as bar graphs. We calculated the number of samples mutated in each gene and added this data as a side bar graph. We added sample indicators for demonstration, though they do not represent data from the study. All of this was completed using CoMut’s built-in functions for adding specific data types and was achieved with only a few lines of code. Leveraging CoMut’s object-oriented framework, small edits to the figure were easily made by acting on specific subplots after the plot was made (e.g. moving the y-axis labels inside the side bar graph). The legend was constructed using CoMut’s legend functions. This visualization reveals that allelic copy number alterations are common and that CDKN2A and TP53 often experience deletions after whole genome duplication in this study. The data and code to create this comutation plot can be found in the CoMut GitHub repository.

4 Conclusion

CoMut is a highly customizable tool for creating comutation plots to visualize arbitrary genomic and clinical characteristics of samples in sequencing studies. It supports a variety of data types and allows the user complete control over the structure and appearance of the plot. Its object-oriented framework allows users to customize the plot for publication and allows developers to extend CoMut’s functionality. By providing a quickstart notebook integrated with Google Colab, we also provide an easy way for those without programming experience to create comutation plots using only input files and a browser.
  7 in total

1.  Interactive Browser-Based Genomics Data Visualization Tools for Translational and Clinical Laboratory Applications.

Authors:  Thomas M Pearce; Marina N Nikiforova; Somak Roy
Journal:  J Mol Diagn       Date:  2019-08-02       Impact factor: 5.568

2.  The mutational landscape of head and neck squamous cell carcinoma.

Authors:  Nicolas Stransky; Ann Marie Egloff; Aaron D Tward; Aleksandar D Kostic; Kristian Cibulskis; Andrey Sivachenko; Gregory V Kryukov; Michael S Lawrence; Carrie Sougnez; Aaron McKenna; Erica Shefler; Alex H Ramos; Petar Stojanov; Scott L Carter; Douglas Voet; Maria L Cortés; Daniel Auclair; Michael F Berger; Gordon Saksena; Candace Guiducci; Robert C Onofrio; Melissa Parkin; Marjorie Romkes; Joel L Weissfeld; Raja R Seethala; Lin Wang; Claudia Rangel-Escareño; Juan Carlos Fernandez-Lopez; Alfredo Hidalgo-Miranda; Jorge Melendez-Zajgla; Wendy Winckler; Kristin Ardlie; Stacey B Gabriel; Matthew Meyerson; Eric S Lander; Gad Getz; Todd R Golub; Levi A Garraway; Jennifer R Grandis
Journal:  Science       Date:  2011-07-28       Impact factor: 47.728

3.  Absolute quantification of somatic DNA alterations in human cancer.

Authors:  Scott L Carter; Kristian Cibulskis; Elena Helman; Aaron McKenna; Hui Shen; Travis Zack; Peter W Laird; Robert C Onofrio; Wendy Winckler; Barbara A Weir; Rameen Beroukhim; David Pellman; Douglas A Levine; Eric S Lander; Matthew Meyerson; Gad Getz
Journal:  Nat Biotechnol       Date:  2012-05       Impact factor: 54.908

4.  GenVisR: Genomic Visualizations in R.

Authors:  Zachary L Skidmore; Alex H Wagner; Robert Lesurf; Katie M Campbell; Jason Kunisaki; Obi L Griffith; Malachi Griffith
Journal:  Bioinformatics       Date:  2016-06-10       Impact factor: 6.937

5.  CoMutPlotter: a web tool for visual summary of mutations in cancer cohorts.

Authors:  Po-Jung Huang; Hou-Hsien Lin; Chi-Ching Lee; Ling-Ya Chiu; Shao-Min Wu; Yuan-Ming Yeh; Petrus Tang; Cheng-Hsun Chiu; Ping-Chiang Lyu; Pei-Chien Tsai
Journal:  BMC Med Genomics       Date:  2019-07-11       Impact factor: 3.063

6.  Maftools: efficient and comprehensive analysis of somatic variants in cancer.

Authors:  Anand Mayakonda; De-Chen Lin; Yassen Assenov; Christoph Plass; H Phillip Koeffler
Journal:  Genome Res       Date:  2018-10-19       Impact factor: 9.043

7.  Integrative molecular and clinical modeling of clinical outcomes to PD1 blockade in patients with metastatic melanoma.

Authors:  David Liu; Bastian Schilling; Derek Liu; Antje Sucker; Elisabeth Livingstone; Livnat Jerby-Arnon; Lisa Zimmer; Ralf Gutzmer; Imke Satzger; Carmen Loquai; Stephan Grabbe; Natalie Vokes; Claire A Margolis; Jake Conway; Meng Xiao He; Haitham Elmarakeby; Felix Dietlein; Diana Miao; Adam Tracy; Helen Gogas; Simone M Goldinger; Jochen Utikal; Christian U Blank; Ricarda Rauschenberg; Dagmar von Bubnoff; Angela Krackhardt; Benjamin Weide; Sebastian Haferkamp; Felix Kiecker; Ben Izar; Levi Garraway; Aviv Regev; Keith Flaherty; Annette Paschen; Eliezer M Van Allen; Dirk Schadendorf
Journal:  Nat Med       Date:  2019-12-02       Impact factor: 53.440

  7 in total
  12 in total

1.  Comparison of the tumor immune microenvironment and checkpoint blockade biomarkers between stage III and IV non-small cell lung cancer.

Authors:  Yinjie Gao; Michelle M Stein; Matthew Kase; Amy L Cummings; Ramit Bharanikumar; Denise Lau; Edward B Garon; Sandip P Patel
Journal:  Cancer Immunol Immunother       Date:  2022-07-26       Impact factor: 6.630

2.  Mutational Footprint of Platinum Chemotherapy in a Secondary Thyroid Cancer.

Authors:  Julia Schiantarelli; Theodora Pappa; Jake Conway; Jett Crowdis; Brendan Reardon; Felix Dietlein; Julian Huang; Darren Stanizzi; Evan Carey; Alice Bosma-Moody; Alma Imamovic; Seunghun Han; Sabrina Camp; Eric Kofman; Erin Shannon; Justine A Barletta; Meng Xiao He; David Liu; Jihye Park; Jochen H Lorch; Eliezer M Van Allen
Journal:  JCO Precis Oncol       Date:  2022-08

3.  Tumor treating fields affect mesothelioma cell proliferation by exerting histotype-dependent cell cycle checkpoint activations and transcriptional modulations.

Authors:  Laura Mannarino; Federica Mirimao; Monica Lupi; Maurizio D'Incalci; Nicolò Panini; Lara Paracchini; Sergio Marchini; Luca Beltrame; Rosy Amodeo; Federica Grosso; Roberta Libener; Irene De Simone; Giovanni L Ceresoli; Paolo A Zucali
Journal:  Cell Death Dis       Date:  2022-07-15       Impact factor: 9.685

4.  Integrative clinical and molecular characterization of translocation renal cell carcinoma.

Authors:  Ziad Bakouny; Ananthan Sadagopan; Praful Ravi; Nebiyou Y Metaferia; Jiao Li; Shatha AbuHammad; Stephen Tang; Thomas Denize; Emma R Garner; Xin Gao; David A Braun; Laure Hirsch; John A Steinharter; Gabrielle Bouchard; Emily Walton; Destiny West; Chris Labaki; Shaan Dudani; Chun-Loo Gan; Vidyalakshmi Sethunath; Filipe L F Carvalho; Alma Imamovic; Cora Ricker; Natalie I Vokes; Jackson Nyman; Jacob E Berchuck; Jihye Park; Michelle S Hirsch; Rizwan Haq; Gwo-Shu Mary Lee; Bradley A McGregor; Steven L Chang; Adam S Feldman; Catherine J Wu; David F McDermott; Daniel Y C Heng; Sabina Signoretti; Eliezer M Van Allen; Toni K Choueiri; Srinivas R Viswanathan
Journal:  Cell Rep       Date:  2022-01-04       Impact factor: 9.995

5.  A phase II study of efficacy, toxicity, and the potential impact of genomic alterations on response to eribulin mesylate in combination with trastuzumab and pertuzumab in women with human epidermal growth factor receptor 2 (HER2)+ metastatic breast cancer.

Authors:  Nikhil Wagle; Rachel A Freedman; Sara M Balch; Ines Vaz-Luis; Tianyu Li; Nabihah Tayob; Esha Jain; Karla Helvie; Jorge E Buendia-Buendia; Erin Shannon; Steven J Isakoff; Nadine M Tung; Ian E Krop; Nancy U Lin
Journal:  Breast Cancer Res Treat       Date:  2021-07-24       Impact factor: 4.872

6.  Genomic Features of Muscle-invasive Bladder Cancer Arising After Prostate Radiotherapy.

Authors:  Matthew Mossanen; Filipe L F Carvalho; Vinayak Muralidhar; Mark A Preston; Brendan Reardon; Jake R Conway; Catherine Curran; Dory Freeman; Sybil Sha; Guru Sonpavde; Michelle Hirsch; Adam S Kibel; Eliezer M Van Allen; Kent W Mouw
Journal:  Eur Urol       Date:  2021-12-23       Impact factor: 24.267

7.  Transcriptional mediators of treatment resistance in lethal prostate cancer.

Authors:  Meng Xiao He; Michael S Cuoco; Jett Crowdis; Alice Bosma-Moody; Zhenwei Zhang; Kevin Bi; Abhay Kanodia; Mei-Ju Su; Sheng-Yu Ku; Maria Mica Garcia; Amalia R Sweet; Christopher Rodman; Laura DelloStritto; Rebecca Silver; John Steinharter; Parin Shah; Benjamin Izar; Nathan C Walk; Kelly P Burke; Ziad Bakouny; Alok K Tewari; David Liu; Sabrina Y Camp; Natalie I Vokes; Keyan Salari; Jihye Park; Sébastien Vigneau; Lawrence Fong; Joshua W Russo; Xin Yuan; Steven P Balk; Himisha Beltran; Orit Rozenblatt-Rosen; Aviv Regev; Asaf Rotem; Mary-Ellen Taplin; Eliezer M Van Allen
Journal:  Nat Med       Date:  2021-03-04       Impact factor: 53.440

8.  Mapping molecular subtype specific alterations in breast cancer brain metastases identifies clinically relevant vulnerabilities.

Authors:  Nicola Cosgrove; Damir Varešlija; Stephen Keelan; Ashuvinee Elangovan; Jennifer M Atkinson; Sinéad Cocchiglia; Fiona T Bane; Vikrant Singh; Simon Furney; Chunling Hu; Jodi M Carter; Steven N Hart; Siddhartha Yadav; Matthew P Goetz; Arnold D K Hill; Steffi Oesterreich; Adrian V Lee; Fergus J Couch; Leonie S Young
Journal:  Nat Commun       Date:  2022-01-26       Impact factor: 14.919

9.  Profile of Basal Cell Carcinoma Mutations and Copy Number Alterations - Focus on Gene-Associated Noncoding Variants.

Authors:  Paulina Maria Nawrocka; Paulina Galka-Marciniak; Martyna Olga Urbanek-Trzeciak; Ilamathi M-Thirusenthilarasan; Natalia Szostak; Anna Philips; Laura Susok; Michael Sand; Piotr Kozlowski
Journal:  Front Oncol       Date:  2021-11-25       Impact factor: 6.244

10.  Rare Occurrence of Aristolochic Acid Mutational Signatures in Oro-Gastrointestinal Tract Cancers.

Authors:  Abner Herbert Lim; Jason Yongsheng Chan; Ming-Chin Yu; Tsung-Han Wu; Jing Han Hong; Cedric Chuan Young Ng; Zhen Jie Low; Wei Liu; Rajasegaran Vikneswari; Pin-Cheng Sung; Wen-Lang Fan; Bin Tean Teh; Sen-Yung Hsieh
Journal:  Cancers (Basel)       Date:  2022-01-24       Impact factor: 6.639

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.