Raymond G Cavalcante1, Maureen A Sartor1,2. 1. Department of Computational Medicine and Bioinformatics. 2. Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA.
Abstract
MOTIVATION: Analysis of next-generation sequencing data often results in a list of genomic regions. These may include differentially methylated CpGs/regions, transcription factor binding sites, interacting chromatin regions, or GWAS-associated SNPs, among others. A common analysis step is to annotate such genomic regions to genomic annotations (promoters, exons, enhancers, etc.). Existing tools are limited by a lack of annotation sources and flexible options, the time it takes to annotate regions, an artificial one-to-one region-to-annotation mapping, a lack of visualization options to easily summarize data, or some combination thereof. RESULTS: We developed the annotatr Bioconductor package to flexibly and quickly summarize and plot annotations of genomic regions. The annotatr package reports all intersections of regions and annotations, giving a better understanding of the genomic context of the regions. A variety of graphics functions are implemented to easily plot numerical or categorical data associated with the regions across the annotations, and across annotation intersections, providing insight into how characteristics of the regions differ across the annotations. We demonstrate that annotatr is up to 27× faster than comparable R packages. Overall, annotatr enables a richer biological interpretation of experiments. AVAILABILITY AND IMPLEMENTATION: http://bioconductor.org/packages/annotatr/ and https://github.com/rcavalcante/annotatr. CONTACT: rcavalca@umich.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Analysis of next-generation sequencing data often results in a list of genomic regions. These may include differentially methylated CpGs/regions, transcription factor binding sites, interacting chromatin regions, or GWAS-associated SNPs, among others. A common analysis step is to annotate such genomic regions to genomic annotations (promoters, exons, enhancers, etc.). Existing tools are limited by a lack of annotation sources and flexible options, the time it takes to annotate regions, an artificial one-to-one region-to-annotation mapping, a lack of visualization options to easily summarize data, or some combination thereof. RESULTS: We developed the annotatr Bioconductor package to flexibly and quickly summarize and plot annotations of genomic regions. The annotatr package reports all intersections of regions and annotations, giving a better understanding of the genomic context of the regions. A variety of graphics functions are implemented to easily plot numerical or categorical data associated with the regions across the annotations, and across annotation intersections, providing insight into how characteristics of the regions differ across the annotations. We demonstrate that annotatr is up to 27× faster than comparable R packages. Overall, annotatr enables a richer biological interpretation of experiments. AVAILABILITY AND IMPLEMENTATION: http://bioconductor.org/packages/annotatr/ and https://github.com/rcavalcante/annotatr. CONTACT: rcavalca@umich.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Lihua J Zhu; Claude Gazin; Nathan D Lawson; Hervé Pagès; Simon M Lin; David S Lapointe; Michael R Green Journal: BMC Bioinformatics Date: 2010-05-11 Impact factor: 3.169
Authors: Felipe J Núñez; Flor M Mendez; Padma Kadiyala; Mahmoud S Alghamri; Masha G Savelieff; Maria B Garcia-Fabiani; Santiago Haase; Carl Koschmann; Anda-Alexandra Calinescu; Neha Kamran; Meghna Saxena; Rohin Patel; Stephen Carney; Marissa Z Guo; Marta Edwards; Mats Ljungman; Tingting Qin; Maureen A Sartor; Rebecca Tagett; Sriram Venneti; Jacqueline Brosnan-Cashman; Alan Meeker; Vera Gorbunova; Lili Zhao; Daniel M Kremer; Li Zhang; Costas A Lyssiotis; Lindsey Jones; Cameron J Herting; James L Ross; Dolores Hambardzumyan; Shawn Hervey-Jumper; Maria E Figueroa; Pedro R Lowenstein; Maria G Castro Journal: Sci Transl Med Date: 2019-02-13 Impact factor: 17.956
Authors: Eirene Markenscoff-Papadimitriou; Sean Whalen; Pawel Przytycki; Reuben Thomas; Fadya Binyameen; Tomasz J Nowakowski; Arnold R Kriegstein; Stephan J Sanders; Matthew W State; Katherine S Pollard; John L Rubenstein Journal: Cell Date: 2020-06-30 Impact factor: 41.582
Authors: Joseph Kochmanski; Elizabeth H Marchlewicz; Raymond G Cavalcante; Maureen A Sartor; Dana C Dolinoy Journal: Epigenetics Date: 2018-08-23 Impact factor: 4.528
Authors: Neeraj K Sharma; Mary E Comeau; Dennis Montoya; Matteo Pellegrini; Timothy D Howard; Carl D Langefeld; Swapan K Das Journal: Diabetes Date: 2020-09-14 Impact factor: 9.461