Literature DB >> 27742662

Synthetic data sets for the identification of key ingredients for RNA-seq differential analysis.

Guillem Rigaill, Sandrine Balzergue, Véronique Brunaud, Eddy Blondet, Andrea Rau, Odile Rogier, José Caius, Cathy Maugis-Rabusseau, Ludivine Soubigou-Taconnat, Sébastien Aubourg, Claire Lurin, Marie-Laure Martin-Magniette, Etienne Delannoy.   

Abstract

Numerous statistical pipelines are now available for the differential analysis of gene expression measured with RNA-sequencing technology. Most of them are based on similar statistical frameworks after normalization, differing primarily in the choice of data distribution, mean and variance estimation strategy and data filtering. We propose an evaluation of the impact of these choices when few biological replicates are available through the use of synthetic data sets. This framework is based on real data sets and allows the exploration of various scenarios differing in the proportion of non-differentially expressed genes. Hence, it provides an evaluation of the key ingredients of the differential analysis, free of the biases associated with the simulation of data using parametric models. Our results show the relevance of a proper modeling of the mean by using linear or generalized linear modeling. Once the mean is properly modeled, the impact of the other parameters on the performance of the test is much less important. Finally, we propose to use the simple visualization of the raw P-value histogram as a practical evaluation criterion of the performance of differential analysis methods on real data sets.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  RNA-seq; benchmark data set; differential analysis

Mesh:

Substances:

Year:  2018        PMID: 27742662     DOI: 10.1093/bib/bbw092

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  20 in total

1.  Two interacting PPR proteins are major Arabidopsis editing factors in plastid and mitochondria.

Authors:  Damien Guillaumot; Mauricio Lopez-Obando; Kevin Baudry; Alexandra Avon; Guillem Rigaill; Andéol Falcon de Longevialle; Benjamin Broche; Mizuki Takenaka; Richard Berthomé; Geert De Jaeger; Etienne Delannoy; Claire Lurin
Journal:  Proc Natl Acad Sci U S A       Date:  2017-07-31       Impact factor: 11.205

2.  Landscape of the Noncoding Transcriptome Response of Two Arabidopsis Ecotypes to Phosphate Starvation.

Authors:  Thomas Blein; Coline Balzergue; Thomas Roulé; Marc Gabriel; Laetitia Scalisi; Tracy François; Céline Sorin; Aurélie Christ; Christian Godon; Etienne Delannoy; Marie-Laure Martin-Magniette; Laurent Nussaume; Caroline Hartmann; Daniel Gautheret; Thierry Desnos; Martin Crespi
Journal:  Plant Physiol       Date:  2020-05-13       Impact factor: 8.340

3.  Analysis of the Plant Mitochondrial Transcriptome.

Authors:  Kevin Baudry; Etienne Delannoy; Catherine Colas des Francs-Small
Journal:  Methods Mol Biol       Date:  2022

4.  Transcriptomics at Maize Embryo/Endosperm Interfaces Identifies a Transcriptionally Distinct Endosperm Subdomain Adjacent to the Embryo Scutellum.

Authors:  Nicolas M Doll; Jeremy Just; Véronique Brunaud; José Caïus; Aurélie Grimault; Nathalie Depège-Fargeix; Eddi Esteban; Asher Pasha; Nicholas J Provart; Gwyneth C Ingram; Peter M Rogowsky; Thomas Widiez
Journal:  Plant Cell       Date:  2020-02-21       Impact factor: 11.277

5.  Silicon supply affects the root transcriptome of Brassica napus L.

Authors:  Cylia Haddad; Jacques Trouverie; Mustapha Arkoun; Jean-Claude Yvin; José Caïus; Véronique Brunaud; Philippe Laîné; Philippe Etienne
Journal:  Planta       Date:  2019-02-28       Impact factor: 4.116

Review 6.  Quantitative single-cell transcriptomics.

Authors:  Christoph Ziegenhain; Beate Vieth; Swati Parekh; Ines Hellmann; Wolfgang Enard
Journal:  Brief Funct Genomics       Date:  2018-07-01       Impact factor: 4.241

7.  DiCoExpress: a tool to process multifactorial RNAseq experiments from quality controls to co-expression analysis through differential analysis based on contrasts inside GLM models.

Authors:  Ilana Lambert; Christine Paysant-Le Roux; Stefano Colella; Marie-Laure Martin-Magniette
Journal:  Plant Methods       Date:  2020-05-12       Impact factor: 4.993

Review 8.  Essential guidelines for computational method benchmarking.

Authors:  Lukas M Weber; Wouter Saelens; Robrecht Cannoodt; Charlotte Soneson; Alexander Hapfelmeier; Paul P Gardner; Anne-Laure Boulesteix; Yvan Saeys; Mark D Robinson
Journal:  Genome Biol       Date:  2019-06-20       Impact factor: 13.583

9.  Data-based RNA-seq simulations by binomial thinning.

Authors:  David Gerard
Journal:  BMC Bioinformatics       Date:  2020-05-24       Impact factor: 3.169

10.  The Genomic Impact of Mycoheterotrophy in Orchids.

Authors:  Marcin Jąkalski; Julita Minasiewicz; José Caius; Michał May; Marc-André Selosse; Etienne Delannoy
Journal:  Front Plant Sci       Date:  2021-06-09       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.