
Modeling and analysis of RNA-seq data: a review from a statistical perspective.

Wei Vivian Li, Jingyi Jessica Li.

Abstract

BACKGROUND: Since their invention, next-generation RNA sequencing (RNA-seq) technologies have become a powerful tool for studying the presence and quantity of RNA molecules in biological samples, and they have revolutionized transcriptomic studies. The analysis of RNA-seq data at four different levels (samples, genes, transcripts, and exons) involves multiple statistical and computational questions, some of which remain challenging to date.
RESULTS: We review RNA-seq analysis tools at the sample, gene, transcript, and exon levels from a statistical perspective. We also highlight the biological and statistical questions of greatest practical relevance.
CONCLUSIONS: Statistical and computational methods for analyzing RNA-seq data have advanced significantly in the past decade. However, methods developed to answer the same biological question often rely on diverse statistical models and perform differently under different scenarios. This review discusses and compares multiple commonly used statistical models with regard to their assumptions, in the hope of helping users select appropriate methods as needed and assisting developers in future method development.


Keywords:  RNA-seq; alternatively spliced exons; differentially expressed genes; isoform reconstruction and quantification; statistical modeling

Year: 2018; PMID: 31456901; PMCID: PMC6711375; DOI: 10.1007/s40484-018-0144-7

Source DB: PubMed; Journal: Quant Biol; ISSN: 2095-4689


