Literature DB >> 34320923

MUREN: a robust and multi-reference approach of RNA-seq transcript normalization.

Yance Feng1,2, Lei M Li3,4,5.   

Abstract

BACKGROUND: Normalization of RNA-seq data aims at identifying biological expression differentiation between samples by removing the effects of unwanted confounding factors. Explicitly or implicitly, the justification of normalization requires a set of housekeeping genes. However, the existence of housekeeping genes common for a very large collection of samples, especially under a wide range of conditions, is questionable.
RESULTS: We propose to carry out pairwise normalization with respect to multiple references, selected from representative samples. Then the pairwise intermediates are integrated based on a linear model that adjusts the reference effects. Motivated by the notion of housekeeping genes and their statistical counterparts, we adopt the robust least trimmed squares regression in pairwise normalization. The proposed method (MUREN) is compared with other existing tools on some standard data sets. The goodness of normalization emphasizes on preserving possible asymmetric differentiation, whose biological significance is exemplified by a single cell data of cell cycle. MUREN is implemented as an R package. The code under license GPL-3 is available on the github platform: github.com/hippo-yf/MUREN and on the conda platform: anaconda.org/hippo-yf/r-muren.
CONCLUSIONS: MUREN performs the RNA-seq normalization using a two-step statistical regression induced from a general principle. We propose that the densities of pairwise differentiations are used to evaluate the goodness of normalization. MUREN adjusts the mode of differentiation toward zero while preserving the skewness due to biological asymmetric differentiation. Moreover, by robustly integrating pre-normalized counts with respect to multiple references, MUREN is immune to individual outlier samples.
© 2021. The Author(s).

Entities:  

Keywords:  Asymmetrically regulated transcription profiles (ART); Mode; Multi-reference; Normalization; RNA-seq; Skewness

Mesh:

Year:  2021        PMID: 34320923     DOI: 10.1186/s12859-021-04288-0

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  25 in total

1.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

2.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

3.  Differential expression in RNA-seq: a matter of depth.

Authors:  Sonia Tarazona; Fernando García-Alcalde; Joaquín Dopazo; Alberto Ferrer; Ana Conesa
Journal:  Genome Res       Date:  2011-09-08       Impact factor: 9.043

4.  RNA-Seq gene expression estimation with read mapping uncertainty.

Authors:  Bo Li; Victor Ruotti; Ron M Stewart; James A Thomson; Colin N Dewey
Journal:  Bioinformatics       Date:  2009-12-18       Impact factor: 6.937

Review 5.  From RNA-seq reads to differential expression results.

Authors:  Alicia Oshlack; Mark D Robinson; Matthew D Young
Journal:  Genome Biol       Date:  2010-12-22       Impact factor: 13.583

6.  Improving RNA-Seq expression estimates by correcting for fragment bias.

Authors:  Adam Roberts; Cole Trapnell; Julie Donaghey; John L Rinn; Lior Pachter
Journal:  Genome Biol       Date:  2011-03-16       Impact factor: 13.583

7.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

9.  Transcriptome sequencing to detect gene fusions in cancer.

Authors:  Christopher A Maher; Chandan Kumar-Sinha; Xuhong Cao; Shanker Kalyana-Sundaram; Bo Han; Xiaojun Jing; Lee Sam; Terrence Barrette; Nallasivam Palanisamy; Arul M Chinnaiyan
Journal:  Nature       Date:  2009-01-11       Impact factor: 49.962

10.  Comprehensive comparative analysis of strand-specific RNA sequencing methods.

Authors:  Joshua Z Levin; Moran Yassour; Xian Adiconis; Chad Nusbaum; Dawn Anne Thompson; Nir Friedman; Andreas Gnirke; Aviv Regev
Journal:  Nat Methods       Date:  2010-08-15       Impact factor: 28.547

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.