Literature DB >> 32528659

Fluent genomics with  plyranges and  tximeta.

Stuart Lee1,2, Michael Lawrence3, Michael I Love4,5.   

Abstract

We construct a simple workflow for fluent genomics data analysis using the R/Bioconductor ecosystem. This involves three core steps: import the data into an appropriate abstraction, model the data with respect to the biological questions of interest, and integrate the results with respect to their underlying genomic coordinates. Here we show how to implement these steps to integrate published RNA-seq and ATAC-seq experiments on macrophage cell lines. Using tximeta, we import RNA-seq transcript quantifications into an analysis-ready data structure, called the SummarizedExperiment, that contains the ranges of the reference transcripts and metadata on their provenance. Using SummarizedExperiments to represent the ATAC-seq and RNA-seq data, we model differentially accessible (DA) chromatin peaks and differentially expressed (DE) genes with existing Bioconductor packages. Using plyranges we then integrate the results to see if there is an enrichment of DA peaks near DE genes by finding overlaps and aggregating over log-fold change thresholds. The combination of these packages and their integration with the Bioconductor ecosystem provide a coherent framework for analysts to iteratively and reproducibly explore their biological data. Copyright:
© 2020 Lee S et al.

Entities:  

Keywords:  Bioconductor; Chromatin Accessibility; Data Integration; Gene Expression; Workflow; plyranges; tximeta

Year:  2020        PMID: 32528659      PMCID: PMC7243206          DOI: 10.12688/f1000research.22259.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  14 in total

1.  Snakemake--a scalable bioinformatics workflow engine.

Authors:  Johannes Köster; Sven Rahmann
Journal:  Bioinformatics       Date:  2012-08-20       Impact factor: 6.937

2.  RNA-Seq workflow: gene-level exploratory analysis and differential expression.

Authors:  Michael I Love; Simon Anders; Vladislav Kim; Wolfgang Huber
Journal:  F1000Res       Date:  2015-10-14

3.  Software for computing and annotating genomic ranges.

Authors:  Michael Lawrence; Wolfgang Huber; Hervé Pagès; Patrick Aboyoun; Marc Carlson; Robert Gentleman; Martin T Morgan; Vincent J Carey
Journal:  PLoS Comput Biol       Date:  2013-08-08       Impact factor: 4.475

4.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

5.  RNA-seq analysis is easy as 1-2-3 with limma, Glimma and edgeR.

Authors:  Charity W Law; Monther Alhamdoosh; Shian Su; Gordon K Smyth; Matthew E Ritchie
Journal:  F1000Res       Date:  2016-06-17

6.  Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences.

Authors:  Charlotte Soneson; Michael I Love; Mark D Robinson
Journal:  F1000Res       Date:  2015-12-30

7.  Shared genetic effects on chromatin and gene expression indicate a role for enhancer priming in immune response.

Authors:  Kaur Alasoo; Julia Rodrigues; Subhankar Mukhopadhyay; Andrew J Knights; Alice L Mann; Kousik Kundu; Christine Hale; Gordon Dougan; Daniel J Gaffney
Journal:  Nat Genet       Date:  2018-01-29       Impact factor: 38.330

8.  plyranges: a grammar of genomic data transformation.

Authors:  Stuart Lee; Dianne Cook; Michael Lawrence
Journal:  Genome Biol       Date:  2019-01-04       Impact factor: 13.583

9.  GENCODE reference annotation for the human and mouse genomes.

Authors:  Adam Frankish; Mark Diekhans; Anne-Maud Ferreira; Rory Johnson; Irwin Jungreis; Jane Loveland; Jonathan M Mudge; Cristina Sisu; James Wright; Joel Armstrong; If Barnes; Andrew Berry; Alexandra Bignell; Silvia Carbonell Sala; Jacqueline Chrast; Fiona Cunningham; Tomás Di Domenico; Sarah Donaldson; Ian T Fiddes; Carlos García Girón; Jose Manuel Gonzalez; Tiago Grego; Matthew Hardy; Thibaut Hourlier; Toby Hunt; Osagie G Izuogu; Julien Lagarde; Fergal J Martin; Laura Martínez; Shamika Mohanan; Paul Muir; Fabio C P Navarro; Anne Parker; Baikang Pei; Fernando Pozo; Magali Ruffier; Bianca M Schmitt; Eloise Stapleton; Marie-Marthe Suner; Irina Sycheva; Barbara Uszczynska-Ratajczak; Jinuri Xu; Andrew Yates; Daniel Zerbino; Yan Zhang; Bronwen Aken; Jyoti S Choudhary; Mark Gerstein; Roderic Guigó; Tim J P Hubbard; Manolis Kellis; Benedict Paten; Alexandre Reymond; Michael L Tress; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Tximeta: Reference sequence checksums for provenance identification in RNA-seq.

Authors:  Michael I Love; Charlotte Soneson; Peter F Hickey; Lisa K Johnson; N Tessa Pierce; Lori Shepherd; Martin Morgan; Rob Patro
Journal:  PLoS Comput Biol       Date:  2020-02-25       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.