Literature DB >> 29768462

A comparative study of RNA-Seq and microarray data analysis on the two examples of rectal-cancer patients and Burkitt Lymphoma cells.

Alexander Wolff1, Michaela Bayerlová1, Jochen Gaedcke2, Dieter Kube3, Tim Beißbarth1.   

Abstract

BACKGROUND: Pipeline comparisons for gene expression data are highly valuable for applied real data analyses, as they enable the selection of suitable analysis strategies for the dataset at hand. Such pipelines for RNA-Seq data should include mapping of reads, counting and differential gene expression analysis or preprocessing, normalization and differential gene expression in case of microarray analysis, in order to give a global insight into pipeline performances.
METHODS: Four commonly used RNA-Seq pipelines (STAR/HTSeq-Count/edgeR, STAR/RSEM/edgeR, Sailfish/edgeR, TopHat2/Cufflinks/CuffDiff)) were investigated on multiple levels (alignment and counting) and cross-compared with the microarray counterpart on the level of gene expression and gene ontology enrichment. For these comparisons we generated two matched microarray and RNA-Seq datasets: Burkitt Lymphoma cell line data and rectal cancer patient data.
RESULTS: The overall mapping rate of STAR was 98.98% for the cell line dataset and 98.49% for the patient dataset. Tophat's overall mapping rate was 97.02% and 96.73%, respectively, while Sailfish had only an overall mapping rate of 84.81% and 54.44%. The correlation of gene expression in microarray and RNA-Seq data was moderately worse for the patient dataset (ρ = 0.67-0.69) than for the cell line dataset (ρ = 0.87-0.88). An exception were the correlation results of Cufflinks, which were substantially lower (ρ = 0.21-0.29 and 0.34-0.53). For both datasets we identified very low numbers of differentially expressed genes using the microarray platform. For RNA-Seq we checked the agreement of differentially expressed genes identified in the different pipelines and of GO-term enrichment results.
CONCLUSION: In conclusion the combination of STAR aligner with HTSeq-Count followed by STAR aligner with RSEM and Sailfish generated differentially expressed genes best suited for the dataset at hand and in agreement with most of the other transcriptomics pipelines.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29768462      PMCID: PMC5955523          DOI: 10.1371/journal.pone.0197162

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  33 in total

1.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

Review 2.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

3.  A comparison of massively parallel nucleotide sequencing with oligonucleotide microarrays for global transcription profiling.

Authors:  James R Bradford; Yvonne Hey; Tim Yates; Yaoyong Li; Stuart D Pepper; Crispin J Miller
Journal:  BMC Genomics       Date:  2010-05-05       Impact factor: 3.969

Review 4.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

5.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

6.  ArrayExpress update--simplifying data submissions.

Authors:  Nikolay Kolesnikov; Emma Hastings; Maria Keays; Olga Melnichuk; Y Amy Tang; Eleanor Williams; Miroslaw Dylag; Natalja Kurbatova; Marco Brandizi; Tony Burdett; Karyn Megy; Ekaterina Pilicheva; Gabriella Rustici; Andrew Tikhonov; Helen Parkinson; Robert Petryszak; Ugis Sarkans; Alvis Brazma
Journal:  Nucleic Acids Res       Date:  2014-10-31       Impact factor: 16.971

7.  A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium.

Authors: 
Journal:  Nat Biotechnol       Date:  2014-08-24       Impact factor: 54.908

8.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

9.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

10.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.

Authors:  Daehwan Kim; Geo Pertea; Cole Trapnell; Harold Pimentel; Ryan Kelley; Steven L Salzberg
Journal:  Genome Biol       Date:  2013-04-25       Impact factor: 13.583

View more
  6 in total

1.  Extraction-free whole transcriptome gene expression analysis of FFPE sections and histology-directed subareas of tissue.

Authors:  Christy L Trejo; Miloš Babić; Elliot Imler; Migdalia Gonzalez; Sergei I Bibikov; Peter J Shepard; Harper C VanSteenhouse; Joanne M Yeakley; Bruce E Seligmann
Journal:  PLoS One       Date:  2019-02-22       Impact factor: 3.240

2.  A key genomic subtype associated with lymphovascular invasion in invasive breast cancer.

Authors:  Sasagu Kurozumi; Chitra Joseph; Sultan Sonbul; Sami Alsaeed; Yousif Kariri; Abrar Aljohani; Sara Raafat; Mansour Alsaleem; Angela Ogden; Simon J Johnston; Mohammed A Aleskandarany; Takaaki Fujii; Ken Shirabe; Carlos Caldas; Ibraheem Ashankyty; Leslie Dalton; Ian O Ellis; Christine Desmedt; Andrew R Green; Nigel P Mongan; Emad A Rakha
Journal:  Br J Cancer       Date:  2019-05-22       Impact factor: 7.640

3.  Correction: A comparative study of RNA-Seq and microarray data analysis on the two examples of rectal-cancer patients and Burkitt Lymphoma cells.

Authors:  Alexander Wolff; Michaela Bayerlová; Jochen Gaedcke; Dieter Kube; Tim Beißbarth
Journal:  PLoS One       Date:  2019-10-24       Impact factor: 3.240

4.  Relationship between HSPA1A-regulated gene expression and alternative splicing in mouse cardiomyocytes and cardiac hypertrophy.

Authors:  Shuai Li; Ping Yang
Journal:  J Thorac Dis       Date:  2021-09       Impact factor: 2.895

5.  An immuno-score signature of tumor immune microenvironment predicts clinical outcomes in locally advanced rectal cancer.

Authors:  Zhengfa Xue; Shuxin Yang; Yun Luo; Ming He; Huimin Qiao; Wei Peng; Suxin Tong; Guini Hong; You Guo
Journal:  Front Oncol       Date:  2022-09-29       Impact factor: 5.738

6.  Integrative Analysis of Axolotl Gene Expression Data from Regenerative and Wound Healing Limb Tissues.

Authors:  Mustafa Sibai; Cüneyd Parlayan; Pelin Tuğlu; Gürkan Öztürk; Turan Demircan
Journal:  Sci Rep       Date:  2019-12-30       Impact factor: 4.379

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.