A comparative study of RNA-Seq and microarray data analysis on the two examples of rectal-cancer patients and Burkitt Lymphoma cells.

Alexander Wolff¹, Michaela Bayerlová¹, Jochen Gaedcke², Dieter Kube³, Tim Beißbarth¹.

Abstract

BACKGROUND: Pipeline comparisons for gene expression data are highly valuable for applied real data analyses, as they enable the selection of suitable analysis strategies for the dataset at hand. To give a global insight into pipeline performance, such comparisons should cover mapping of reads, counting, and differential gene expression analysis for RNA-Seq data, and preprocessing, normalization, and differential gene expression analysis for microarray data.
METHODS: Four commonly used RNA-Seq pipelines (STAR/HTSeq-Count/edgeR, STAR/RSEM/edgeR, Sailfish/edgeR, TopHat2/Cufflinks/CuffDiff) were investigated on multiple levels (alignment and counting) and cross-compared with the microarray counterpart on the level of gene expression and gene ontology enrichment. For these comparisons we generated two matched microarray and RNA-Seq datasets: Burkitt Lymphoma cell line data and rectal cancer patient data.
RESULTS: The overall mapping rate of STAR was 98.98% for the cell line dataset and 98.49% for the patient dataset. TopHat2's overall mapping rate was 97.02% and 96.73%, respectively, while Sailfish reached an overall mapping rate of only 84.81% and 54.44%. The correlation of gene expression between microarray and RNA-Seq data was moderately lower for the patient dataset (ρ = 0.67-0.69) than for the cell line dataset (ρ = 0.87-0.88). The exception was Cufflinks, whose correlation results were substantially lower (ρ = 0.21-0.29 and 0.34-0.53). For both datasets we identified very low numbers of differentially expressed genes using the microarray platform. For RNA-Seq we checked the agreement of differentially expressed genes identified in the different pipelines and of GO-term enrichment results.
CONCLUSION: The combination of the STAR aligner with HTSeq-Count, followed by the STAR aligner with RSEM and then by Sailfish, generated differentially expressed genes best suited to the dataset at hand and in agreement with most of the other transcriptomics pipelines.
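
The two comparisons reported in the abstract, cross-platform correlation of gene expression and agreement of differentially expressed gene lists between pipelines, can be illustrated with a minimal sketch. It is not the authors' code. It assumes that ρ denotes a rank correlation such as Spearman's (the abstract does not say which coefficient was used) and it uses the Jaccard index as one possible agreement measure, which the abstract does not name. All gene identifiers, pipeline labels, and expression values below are hypothetical toy data.

```python
from itertools import combinations


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def jaccard(a, b):
    """Shared fraction of two gene sets (defined as 1.0 if both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0


# Hypothetical per-gene expression values for the same five genes.
microarray = [7.1, 8.4, 5.2, 9.9, 6.3]
rnaseq = [3.0, 4.5, 1.2, 6.8, 3.9]
print("rho:", round(spearman_rho(microarray, rnaseq), 3))

# Hypothetical differentially expressed gene lists from three pipelines.
de_genes = {
    "pipeline_A": {"g1", "g2", "g3"},
    "pipeline_B": {"g2", "g3", "g4"},
    "pipeline_C": {"g3", "g5"},
}
for p, q in combinations(sorted(de_genes), 2):
    print(p, q, round(jaccard(de_genes[p], de_genes[q]), 3))
```

In practice the two expression vectors would first have to be matched gene by gene across platforms, a step this sketch takes for granted.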

Substances: RNA, Neoplasm

Year: 2018 PMID: 29768462 PMCID: PMC5955523 DOI: 10.1371/journal.pone.0197162

Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240

6 in total

1. Extraction-free whole transcriptome gene expression analysis of FFPE sections and histology-directed subareas of tissue.

Authors: Christy L Trejo; Miloš Babić; Elliot Imler; Migdalia Gonzalez; Sergei I Bibikov; Peter J Shepard; Harper C VanSteenhouse; Joanne M Yeakley; Bruce E Seligmann
Journal: PLoS One Date: 2019-02-22 Impact factor: 3.240

2. A key genomic subtype associated with lymphovascular invasion in invasive breast cancer.

Authors: Sasagu Kurozumi; Chitra Joseph; Sultan Sonbul; Sami Alsaeed; Yousif Kariri; Abrar Aljohani; Sara Raafat; Mansour Alsaleem; Angela Ogden; Simon J Johnston; Mohammed A Aleskandarany; Takaaki Fujii; Ken Shirabe; Carlos Caldas; Ibraheem Ashankyty; Leslie Dalton; Ian O Ellis; Christine Desmedt; Andrew R Green; Nigel P Mongan; Emad A Rakha
Journal: Br J Cancer Date: 2019-05-22 Impact factor: 7.640

3. Correction: A comparative study of RNA-Seq and microarray data analysis on the two examples of rectal-cancer patients and Burkitt Lymphoma cells.

Authors: Alexander Wolff; Michaela Bayerlová; Jochen Gaedcke; Dieter Kube; Tim Beißbarth
Journal: PLoS One Date: 2019-10-24 Impact factor: 3.240

4. Relationship between HSPA1A-regulated gene expression and alternative splicing in mouse cardiomyocytes and cardiac hypertrophy.

Authors: Shuai Li; Ping Yang
Journal: J Thorac Dis Date: 2021-09 Impact factor: 2.895

5. An immuno-score signature of tumor immune microenvironment predicts clinical outcomes in locally advanced rectal cancer.

Authors: Zhengfa Xue; Shuxin Yang; Yun Luo; Ming He; Huimin Qiao; Wei Peng; Suxin Tong; Guini Hong; You Guo
Journal: Front Oncol Date: 2022-09-29 Impact factor: 5.738

6. Integrative Analysis of Axolotl Gene Expression Data from Regenerative and Wound Healing Limb Tissues.

Authors: Mustafa Sibai; Cüneyd Parlayan; Pelin Tuğlu; Gürkan Öztürk; Turan Demircan
Journal: Sci Rep Date: 2019-12-30 Impact factor: 4.379
