Literature DB >> 33509298

High heterogeneity undermines generalization of differential expression results in RNA-Seq analysis.

Weitong Cui1, Huaru Xue1, Lei Wei1, Jinghua Jin2, Xuewen Tian3, Qinglu Wang4.   

Abstract

BACKGROUND: RNA sequencing (RNA-Seq) has been widely applied in oncology for monitoring transcriptome changes. However, the emerging problem that high variation of gene expression levels caused by tumor heterogeneity may affect the reproducibility of differential expression (DE) results has rarely been studied. Here, we investigated the reproducibility of DE results for any given number of biological replicates between 3 and 24 and explored why a great many differentially expressed genes (DEGs) were not reproducible.
RESULTS: Our findings demonstrate that poor reproducibility of DE results exists not only for small sample sizes, but also for relatively large sample sizes. Quite a few of the DEGs detected are specific to the samples in use, rather than genuinely differentially expressed under different conditions. Poor reproducibility of DE results is mainly caused by high variation of gene expression levels for the same gene in different samples. Even though biological variation may account for much of the high variation of gene expression levels, the effect of outlier count data also needs to be treated seriously, as outlier data severely interfere with DE analysis.
CONCLUSIONS: High heterogeneity exists not only in tumor tissue samples of each cancer type studied, but also in normal samples. High heterogeneity leads to poor reproducibility of DEGs, undermining generalization of differential expression results. Therefore, it is necessary to use large sample sizes (at least 10 if possible) in RNA-Seq experimental designs to reduce the impact of biological variability and DE results should be interpreted cautiously unless soundly validated.

Entities:  

Keywords:  Differential expression; Heterogeneity; Outlier; RNA sequencing; Reproducibility; Tumor

Mesh:

Substances:

Year:  2021        PMID: 33509298      PMCID: PMC7845028          DOI: 10.1186/s40246-021-00308-5

Source DB:  PubMed          Journal:  Hum Genomics        ISSN: 1473-9542            Impact factor:   4.639


  40 in total

1.  Drug development: Raise standards for preclinical cancer research.

Authors:  C Glenn Begley; Lee M Ellis
Journal:  Nature       Date:  2012-03-28       Impact factor: 49.962

2.  Differential expression in RNA-seq: a matter of depth.

Authors:  Sonia Tarazona; Fernando García-Alcalde; Joaquín Dopazo; Alberto Ferrer; Ana Conesa
Journal:  Genome Res       Date:  2011-09-08       Impact factor: 9.043

3.  1,500 scientists lift the lid on reproducibility.

Authors:  Monya Baker
Journal:  Nature       Date:  2016-05-26       Impact factor: 49.962

Review 4.  Tumor heterogeneity: causes and consequences.

Authors:  Andriy Marusyk; Kornelia Polyak
Journal:  Biochim Biophys Acta       Date:  2009-11-18

5.  Error estimates for the analysis of differential expression from RNA-seq count data.

Authors:  Conrad J Burden; Sumaira E Qureshi; Susan R Wilson
Journal:  PeerJ       Date:  2014-09-23       Impact factor: 2.984

6.  CORNAS: coverage-dependent RNA-Seq analysis of gene expression data without biological replicates.

Authors:  Joel Z B Low; Tsung Fei Khang; Martti T Tammi
Journal:  BMC Bioinformatics       Date:  2017-12-28       Impact factor: 3.169

7.  Optimization of an RNA-Seq Differential Gene Expression Analysis Depending on Biological Replicate Number and Library Size.

Authors:  Sophie Lamarre; Pierre Frasse; Mohamed Zouine; Delphine Labourdette; Elise Sainderichin; Guojian Hu; Véronique Le Berre-Anton; Mondher Bouzayen; Elie Maza
Journal:  Front Plant Sci       Date:  2018-02-14       Impact factor: 5.753

8.  Choice of library size normalization and statistical methods for differential gene expression analysis in balanced two-group comparisons for RNA-seq studies.

Authors:  Xiaohong Li; Nigel G F Cooper; Timothy E O'Toole; Eric C Rouchka
Journal:  BMC Genomics       Date:  2020-01-28       Impact factor: 3.969

Review 9.  Tumour heterogeneity and the evolution of polyclonal drug resistance.

Authors:  Rebecca A Burrell; Charles Swanton
Journal:  Mol Oncol       Date:  2014-07-10       Impact factor: 6.603

10.  A comparison of methods for differential expression analysis of RNA-seq data.

Authors:  Charlotte Soneson; Mauro Delorenzi
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

View more
  4 in total

1.  Determination of WWOX Function in Modulating Cellular Pathways Activated by AP-2α and AP-2γ Transcription Factors in Bladder Cancer.

Authors:  Damian Kołat; Żaneta Kałuzińska; Andrzej K Bednarek; Elżbieta Płuciennik
Journal:  Cells       Date:  2022-04-19       Impact factor: 7.666

2.  An integrative transcriptome analysis reveals potential predictive, prognostic biomarkers and therapeutic targets in colorectal cancer.

Authors:  Pouria Samadi; Meysam Soleimani; Fatemeh Nouri; Fatemeh Rahbarizadeh; Rezvan Najafi; Akram Jalali
Journal:  BMC Cancer       Date:  2022-07-30       Impact factor: 4.638

Review 3.  Recent developments and future directions in meta-analysis of differential gene expression in livestock RNA-Seq.

Authors:  Brittney N Keel; Amanda K Lindholm-Perry
Journal:  Front Genet       Date:  2022-09-19       Impact factor: 4.772

4.  Deep Radiotranscriptomics of Non-Small Cell Lung Carcinoma for Assessing Molecular and Histology Subtypes with a Data-Driven Analysis.

Authors:  Eleftherios Trivizakis; John Souglakos; Apostolos Karantanas; Kostas Marias
Journal:  Diagnostics (Basel)       Date:  2021-12-17
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.