Literature DB >> 24903420

Sushi.R: flexible, quantitative and integrative genomic visualizations for publication-quality multi-panel figures.

Douglas H Phanstiel1, Alan P Boyle1, Carlos L Araya1, Michael P Snyder1.   

Abstract

MOTIVATION: Interpretation and communication of genomic data require flexible and quantitative tools to analyze and visualize diverse data types, and yet, a comprehensive tool to display all common genomic data types in publication quality figures does not exist to date. To address this shortcoming, we present Sushi.R, an R/Bioconductor package that allows flexible integration of genomic visualizations into highly customizable, publication-ready, multi-panel figures from common genomic data formats including Browser Extensible Data (BED), bedGraph and Browser Extensible Data Paired-End (BEDPE). Sushi.R is open source and made publicly available through GitHub (https://github.com/dphansti/Sushi) and Bioconductor (http://bioconductor.org/packages/release/bioc/html/Sushi.html).
© The Author 2014. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2014        PMID: 24903420      PMCID: PMC4173017          DOI: 10.1093/bioinformatics/btu379

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Genomic science is a rich data-intensive field in which diverse data types are combined to uncover and explore characteristics of sequence elements on a large scale. However, despite a growing set of mature standard visualization techniques and file formats, no comprehensive tools exist to facilitate multi-panel visualization across a broad range of standard genomic data types. To address this deficiency, we developed Sushi.R, a flexible R library that leverages standard visualization techniques and file formats to produce highly customizable publication-quality figures of genomic data within the widespread analysis environment, R(R Core Team, 2013).

2 METHODS

Sushi.R is written exclusively in the R software environment. The Sushi.R package includes 13 example datasets and a vignette detailing the usage of each (Sanyal ; Li ; ENCODE Project Consortium ; Neph ; International Consortium for Blood Pressure Genome-Wide Association Studies ; Dixon ; Rhee and Pugh, 2011). Datasets that were mapped to hg19 were converted to hg18 using the liftOver tool. Sushi is compatible with all organisms and genome builds. Large datasets were filtered to include only regions shown in Figure 1. ChIA-PET interactions were additionally filtered to remove interactions between regions ≤1000 bp apart. To facilitate use, Sushi.R is open source and is distributed through both Bioconductor for one-step installation and GitHub for version control, issue management and third-party development (Gentleman ).
Fig. 1.

Multi-panel Sushi plot made without modification by external image-editing software. The Sushi functions used to create the plot include (A) plotManhattan, (B) plotHic, (C) plotBedpe, (D) plotBedpe, (E) plotBedgraph, (F) plotBedgraph, (G) plotBed, (H) plotManhattan, (I) plotBed, (J) plotGenes, (K) plotBed, (L) plotBedgraph, (M) plotBedgraph and (N) plotGenes. The code and data to make this figure are included as part of the Sushi.R package

Multi-panel Sushi plot made without modification by external image-editing software. The Sushi functions used to create the plot include (A) plotManhattan, (B) plotHic, (C) plotBedpe, (D) plotBedpe, (E) plotBedgraph, (F) plotBedgraph, (G) plotBed, (H) plotManhattan, (I) plotBed, (J) plotGenes, (K) plotBed, (L) plotBedgraph, (M) plotBedgraph and (N) plotGenes. The code and data to make this figure are included as part of the Sushi.R package

3 FEATURES

Quantitative and qualitative genomic information can typically be broken down into three data types: features, signals and interactions. Sushi.R provides flexible methods to plot each data type, allowing users to represent virtually any type of genomic data in an aesthetically pleasing, coherent and integrative fashion. A Sushi plot made entirely within R (without any modifications in image-editing software) displaying multiple data types is shown in Figure 1. The code and data to make Figure 1 are included as part of the Sushi.R package. Feature data describe genomic regions characterized by a unique combination of chromosome, start and stop coordinates. Often stored in Browser Extensible Data (BED) format, feature data can be used to represent sites of transcription factor binding, gene structures, transcript structures, sequence read alignments, Genome-Wide Association Studies (GWAS) hits and data from an array of other sources. The Sushi functions plotBed, plotGenes and plotManhattan facilitate the visualization of feature data in a host of different formats ranging from heatmaps of feature density to feature pileups (Figure 1A, G–K, N). Signal data representing quantitative values across genomic coordinates are commonly stored in bedGraph format and can be used to represent diverse forms of data including sequence conservation, transcription, transcription factor binding, chromatin accessibility and nascent transcription rates, among others. The Sushi function plotBedgraph provides flexible methods to plot, overlay and compare signal track data with appropriately represented data from each one of these disparate sources (Figure 1E, F, L, M). Finally, interaction data can be used to describe interactions between distal genomic elements in both a qualitative or quantitative fashion. Interaction data describing, for example, 3D chromatin structure are commonly stored in Browser Extensible Data Paired-End (BEDPE) format or in interaction matrices. Sushi functions plotHiC and plotBedpe are used to plot interactions data as either trapezoidal heatmaps, arched lines or box and line structures, and support quantitative mapping of interaction signals on y-axis values, color scales and line widths (Figure 1B–D). Sushi plots can easily be combined and augmented via a number of annotation functions including zoomsregion, zoombox, maptocolor and addlegend, allowing customizable scaling of colors, line types and line widths for flexible quantitative presentation. Zoom inset features facilitate visualization at multiple scales and diverse genomic contexts. Images can be written to all formats supported by R including Encapsulated PostScript (EPS), Portable Document Format (PDF) and Portable Network Graphics (PNG).

4 DISCUSSION

The rapid proliferation and complexity of genomics experiments—fueled by high-throughput sequencing—has concomitantly driven demand for analysis and visualization tools that facilitate interpretation and communication of rich and diverse genomic data types. Sushi fills a critical void among currently available visualization tools by providing a means to easily produce sophisticated, customizable, genomic visualizations. Sushi.R will be of great use to the genomic community, as it accelerates our ability to uncover, document and communicate important scientific findings derived from increasingly abundant, and complex, genomic data. Funding: This project is funded by NIH grant U54HG006996 (to M.P.S) and K99HG007356 (to A.P.B). D.H.P. is a Damon Runyon fellow supported by the Damon Runyon Cancer Research Foundation (DRG-2122-12). Conflict of interest: M.P.S. is a cofounder and scientific advisory board (SAB) member of Personalis and also on the SAB of Genapsys.
  8 in total

1.  Extensive promoter-centered chromatin interactions provide a topological basis for transcription regulation.

Authors:  Guoliang Li; Xiaoan Ruan; Raymond K Auerbach; Kuljeet Singh Sandhu; Meizhen Zheng; Ping Wang; Huay Mei Poh; Yufen Goh; Joanne Lim; Jingyao Zhang; Hui Shan Sim; Su Qin Peh; Fabianus Hendriyan Mulawadi; Chin Thing Ong; Yuriy L Orlov; Shuzhen Hong; Zhizhuo Zhang; Steve Landt; Debasish Raha; Ghia Euskirchen; Chia-Lin Wei; Weihong Ge; Huaien Wang; Carrie Davis; Katherine I Fisher-Aylor; Ali Mortazavi; Mark Gerstein; Thomas Gingeras; Barbara Wold; Yi Sun; Melissa J Fullwood; Edwin Cheung; Edison Liu; Wing-Kin Sung; Michael Snyder; Yijun Ruan
Journal:  Cell       Date:  2012-01-20       Impact factor: 41.582

2.  Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution.

Authors:  Ho Sung Rhee; B Franklin Pugh
Journal:  Cell       Date:  2011-12-09       Impact factor: 41.582

3.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

4.  Topological domains in mammalian genomes identified by analysis of chromatin interactions.

Authors:  Jesse R Dixon; Siddarth Selvaraj; Feng Yue; Audrey Kim; Yan Li; Yin Shen; Ming Hu; Jun S Liu; Bing Ren
Journal:  Nature       Date:  2012-04-11       Impact factor: 49.962

5.  Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk.

Authors:  Georg B Ehret; Patricia B Munroe; Kenneth M Rice; Murielle Bochud; Andrew D Johnson; Daniel I Chasman; Albert V Smith; Martin D Tobin; Germaine C Verwoert; Shih-Jen Hwang; Vasyl Pihur; Peter Vollenweider; Paul F O'Reilly; Najaf Amin; Jennifer L Bragg-Gresham; Alexander Teumer; Nicole L Glazer; Lenore Launer; Jing Hua Zhao; Yurii Aulchenko; Simon Heath; Siim Sõber; Afshin Parsa; Jian'an Luan; Pankaj Arora; Abbas Dehghan; Feng Zhang; Gavin Lucas; Andrew A Hicks; Anne U Jackson; John F Peden; Toshiko Tanaka; Sarah H Wild; Igor Rudan; Wilmar Igl; Yuri Milaneschi; Alex N Parker; Cristiano Fava; John C Chambers; Ervin R Fox; Meena Kumari; Min Jin Go; Pim van der Harst; Wen Hong Linda Kao; Marketa Sjögren; D G Vinay; Myriam Alexander; Yasuharu Tabara; Sue Shaw-Hawkins; Peter H Whincup; Yongmei Liu; Gang Shi; Johanna Kuusisto; Bamidele Tayo; Mark Seielstad; Xueling Sim; Khanh-Dung Hoang Nguyen; Terho Lehtimäki; Giuseppe Matullo; Ying Wu; Tom R Gaunt; N Charlotte Onland-Moret; Matthew N Cooper; Carl G P Platou; Elin Org; Rebecca Hardy; Santosh Dahgam; Jutta Palmen; Veronique Vitart; Peter S Braund; Tatiana Kuznetsova; Cuno S P M Uiterwaal; Adebowale Adeyemo; Walter Palmas; Harry Campbell; Barbara Ludwig; Maciej Tomaszewski; Ioanna Tzoulaki; Nicholette D Palmer; Thor Aspelund; Melissa Garcia; Yen-Pei C Chang; Jeffrey R O'Connell; Nanette I Steinle; Diederick E Grobbee; Dan E Arking; Sharon L Kardia; Alanna C Morrison; Dena Hernandez; Samer Najjar; Wendy L McArdle; David Hadley; Morris J Brown; John M Connell; Aroon D Hingorani; Ian N M Day; Debbie A Lawlor; John P Beilby; Robert W Lawrence; Robert Clarke; Jemma C Hopewell; Halit Ongen; Albert W Dreisbach; Yali Li; J Hunter Young; Joshua C Bis; Mika Kähönen; Jorma Viikari; Linda S Adair; Nanette R Lee; Ming-Huei Chen; Matthias Olden; Cristian Pattaro; Judith A Hoffman Bolton; Anna Köttgen; Sven Bergmann; Vincent Mooser; Nish Chaturvedi; Timothy M Frayling; Muhammad Islam; Tazeen H Jafar; Jeanette Erdmann; Smita R Kulkarni; Stefan R Bornstein; Jürgen Grässler; Leif Groop; Benjamin F Voight; Johannes Kettunen; Philip Howard; Andrew Taylor; Simonetta Guarrera; Fulvio Ricceri; Valur Emilsson; Andrew Plump; Inês Barroso; Kay-Tee Khaw; Alan B Weder; Steven C Hunt; Yan V Sun; Richard N Bergman; Francis S Collins; Lori L Bonnycastle; Laura J Scott; Heather M Stringham; Leena Peltonen; Markus Perola; Erkki Vartiainen; Stefan-Martin Brand; Jan A Staessen; Thomas J Wang; Paul R Burton; Maria Soler Artigas; Yanbin Dong; Harold Snieder; Xiaoling Wang; Haidong Zhu; Kurt K Lohman; Megan E Rudock; Susan R Heckbert; Nicholas L Smith; Kerri L Wiggins; Ayo Doumatey; Daniel Shriner; Gudrun Veldre; Margus Viigimaa; Sanjay Kinra; Dorairaj Prabhakaran; Vikal Tripathy; Carl D Langefeld; Annika Rosengren; Dag S Thelle; Anna Maria Corsi; Andrew Singleton; Terrence Forrester; Gina Hilton; Colin A McKenzie; Tunde Salako; Naoharu Iwai; Yoshikuni Kita; Toshio Ogihara; Takayoshi Ohkubo; Tomonori Okamura; Hirotsugu Ueshima; Satoshi Umemura; Susana Eyheramendy; Thomas Meitinger; H-Erich Wichmann; Yoon Shin Cho; Hyung-Lae Kim; Jong-Young Lee; James Scott; Joban S Sehmi; Weihua Zhang; Bo Hedblad; Peter Nilsson; George Davey Smith; Andrew Wong; Narisu Narisu; Alena Stančáková; Leslie J Raffel; Jie Yao; Sekar Kathiresan; Christopher J O'Donnell; Stephen M Schwartz; M Arfan Ikram; W T Longstreth; Thomas H Mosley; Sudha Seshadri; Nick R G Shrine; Louise V Wain; Mario A Morken; Amy J Swift; Jaana Laitinen; Inga Prokopenko; Paavo Zitting; Jackie A Cooper; Steve E Humphries; John Danesh; Asif Rasheed; Anuj Goel; Anders Hamsten; Hugh Watkins; Stephan J L Bakker; Wiek H van Gilst; Charles S Janipalli; K Radha Mani; Chittaranjan S Yajnik; Albert Hofman; Francesco U S Mattace-Raso; Ben A Oostra; Ayse Demirkan; Aaron Isaacs; Fernando Rivadeneira; Edward G Lakatta; Marco Orru; Angelo Scuteri; Mika Ala-Korpela; Antti J Kangas; Leo-Pekka Lyytikäinen; Pasi Soininen; Taru Tukiainen; Peter Würtz; Rick Twee-Hee Ong; Marcus Dörr; Heyo K Kroemer; Uwe Völker; Henry Völzke; Pilar Galan; Serge Hercberg; Mark Lathrop; Diana Zelenika; Panos Deloukas; Massimo Mangino; Tim D Spector; Guangju Zhai; James F Meschia; Michael A Nalls; Pankaj Sharma; Janos Terzic; M V Kranthi Kumar; Matthew Denniff; Ewa Zukowska-Szczechowska; Lynne E Wagenknecht; F Gerald R Fowkes; Fadi J Charchar; Peter E H Schwarz; Caroline Hayward; Xiuqing Guo; Charles Rotimi; Michiel L Bots; Eva Brand; Nilesh J Samani; Ozren Polasek; Philippa J Talmud; Fredrik Nyberg; Diana Kuh; Maris Laan; Kristian Hveem; Lyle J Palmer; Yvonne T van der Schouw; Juan P Casas; Karen L Mohlke; Paolo Vineis; Olli Raitakari; Santhi K Ganesh; Tien Y Wong; E Shyong Tai; Richard S Cooper; Markku Laakso; Dabeeru C Rao; Tamara B Harris; Richard W Morris; Anna F Dominiczak; Mika Kivimaki; Michael G Marmot; Tetsuro Miki; Danish Saleheen; Giriraj R Chandak; Josef Coresh; Gerjan Navis; Veikko Salomaa; Bok-Ghee Han; Xiaofeng Zhu; Jaspal S Kooner; Olle Melander; Paul M Ridker; Stefania Bandinelli; Ulf B Gyllensten; Alan F Wright; James F Wilson; Luigi Ferrucci; Martin Farrall; Jaakko Tuomilehto; Peter P Pramstaller; Roberto Elosua; Nicole Soranzo; Eric J G Sijbrands; David Altshuler; Ruth J F Loos; Alan R Shuldiner; Christian Gieger; Pierre Meneton; Andre G Uitterlinden; Nicholas J Wareham; Vilmundur Gudnason; Jerome I Rotter; Rainer Rettig; Manuela Uda; David P Strachan; Jacqueline C M Witteman; Anna-Liisa Hartikainen; Jacques S Beckmann; Eric Boerwinkle; Ramachandran S Vasan; Michael Boehnke; Martin G Larson; Marjo-Riitta Järvelin; Bruce M Psaty; Gonçalo R Abecasis; Aravinda Chakravarti; Paul Elliott; Cornelia M van Duijn; Christopher Newton-Cheh; Daniel Levy; Mark J Caulfield; Toby Johnson
Journal:  Nature       Date:  2011-09-11       Impact factor: 49.962

6.  An integrated encyclopedia of DNA elements in the human genome.

Authors: 
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

7.  The long-range interaction landscape of gene promoters.

Authors:  Amartya Sanyal; Bryan R Lajoie; Gaurav Jain; Job Dekker
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

8.  An expansive human regulatory lexicon encoded in transcription factor footprints.

Authors:  Shane Neph; Jeff Vierstra; Andrew B Stergachis; Alex P Reynolds; Eric Haugen; Benjamin Vernot; Robert E Thurman; Sam John; Richard Sandstrom; Audra K Johnson; Matthew T Maurano; Richard Humbert; Eric Rynes; Hao Wang; Shinny Vong; Kristen Lee; Daniel Bates; Morgan Diegel; Vaughn Roach; Douglas Dunn; Jun Neri; Anthony Schafer; R Scott Hansen; Tanya Kutyavin; Erika Giste; Molly Weaver; Theresa Canfield; Peter Sabo; Miaohua Zhang; Gayathri Balasundaram; Rachel Byron; Michael J MacCoss; Joshua M Akey; M A Bender; Mark Groudine; Rajinder Kaul; John A Stamatoyannopoulos
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

  8 in total
  88 in total

1.  Germ Granules Coordinate RNA-Based Epigenetic Inheritance Pathways.

Authors:  Anne E Dodson; Scott Kennedy
Journal:  Dev Cell       Date:  2019-08-08       Impact factor: 12.270

2.  DNA Rchitect: an R based visualizer for network analysis of chromatin interaction data.

Authors:  R N Ramirez; K Bedirian; S M Gray; A Diallo
Journal:  Bioinformatics       Date:  2020-01-15       Impact factor: 6.937

3.  epiTAD: a web application for visualizing chromosome conformation capture data in the context of genetic epidemiology.

Authors:  Jordan H Creed; Garrick Aden-Buie; Alvaro N Monteiro; Travis A Gerke
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

4.  De novo RNA sequence assembly during in vivo inflammatory stress reveals hundreds of unannotated lincRNAs in human blood CD14+ monocytes and in adipose tissue.

Authors:  Chenyi Xue; Xuan Zhang; Hanrui Zhang; Jane F Ferguson; Ying Wang; Christine C Hinkle; Mingyao Li; Muredach P Reilly
Journal:  Physiol Genomics       Date:  2017-04-07       Impact factor: 3.107

5.  Widespread antisense transcription of Populus genome under drought.

Authors:  Yinan Yuan; Su Chen
Journal:  Mol Genet Genomics       Date:  2018-06-06       Impact factor: 3.291

6.  Enhancer-gene rewiring in the pathogenesis of Quebec platelet disorder.

Authors:  Minggao Liang; Asim Soomro; Subia Tasneem; Luis E Abatti; Azad Alizada; Xuefei Yuan; Liis Uusküla-Reimand; Lina Antounians; Sana Akhtar Alvi; Andrew D Paterson; Georges-Étienne Rivard; Ian C Scott; Jennifer A Mitchell; Catherine P M Hayward; Michael D Wilson
Journal:  Blood       Date:  2020-12-03       Impact factor: 22.113

7.  Regional centromeres in the yeast Candida lusitaniae lack pericentromeric heterochromatin.

Authors:  Shivali Kapoor; Lisha Zhu; Cara Froyd; Tao Liu; Laura N Rusche
Journal:  Proc Natl Acad Sci U S A       Date:  2015-09-14       Impact factor: 11.205

8.  Maternal Ribosomes Are Sufficient for Tissue Diversification during Embryonic Development in C. elegans.

Authors:  Elif Sarinay Cenik; Xuefeng Meng; Ngang Heok Tang; Richard Nelson Hall; Joshua A Arribere; Can Cenik; Yishi Jin; Andrew Fire
Journal:  Dev Cell       Date:  2019-02-21       Impact factor: 12.270

9.  B chromosomes of multiple species have intense evolutionary dynamics and accumulated genes related to important biological processes.

Authors:  Syed F Ahmad; Maryam Jehangir; Adauto L Cardoso; Ivan R Wolf; Vladimir P Margarido; Diogo C Cabral-de-Mello; Rachel O'Neill; Guilherme T Valente; Cesar Martins
Journal:  BMC Genomics       Date:  2020-09-23       Impact factor: 3.969

10.  Charting the cis-regulome of activated B cells by coupling structural and functional genomics.

Authors:  Virendra K Chaudhri; Krista Dienger-Stambaugh; Zhiguo Wu; Mahesh Shrestha; Harinder Singh
Journal:  Nat Immunol       Date:  2019-12-23       Impact factor: 25.606

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.