| Literature DB >> 30581915 |
Ewa Grabowiecka1, David Martin1, Louise Crozier2, Nicola Holden2.
Abstract
Whole transcriptome analysis to investigate differential gene expression and regulatory adaption can be carried out on two different technological platforms: by probe hybridisation to microarrays or by RNAseq for deep sequencing. Since there are difference in terms of their genome coverage, sensitivity and cost, there is a requirement for robust comparisons to determine the platform of choice. Here, we present datasets for the whole transcriptional response verocytoxigenic Escherichia coli (VTEC) obtained from RNA-seq and microarray platforms in response to spinach, together with a comparison between the datasets (available at Array Express: E-MTAB-3249, E-MTAB-4120, E-MTAB-7441).Entities:
Year: 2018 PMID: 30581915 PMCID: PMC6297237 DOI: 10.1016/j.dib.2018.11.136
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Flow diagram of analysis steps and processes. RNAseq (blue) and microarray (green) process steps are in blue and green respectively. Other steps were done in R (pink) or externally (yellow).
Fig. 2Correlation graph of expression profiles. Correlation of differential expression of genes by Log fold-change (logFC), acquired from the RNAseq and microarray datasets.
Analysis steps (EA) and associated scripts and processes.
| 1 | Process flow diagram | n/a: graphics ( |
| 2 | Script for microarray analysis | 1.Microarray_Data_Analysis |
| 3A | Script for generating the Bowtie2 index | Bowtie2-build –f sakaigenome.fasta EcoliSakai |
| 3B | Script for both paired end and unpaired alignment using Bowtie2 | Unpaired: bowtie2 -N -x EcoliSakai -U file1.fastq.gz, file2.fastq.gz.sam file |
| Paired end: bowtie2 -N -x EcoliSakai -1 file.fastq.gz -2 file2.fastq.gz –S.sam file | ||
| 3C | Script for index of featureCounts | { printf ׳GeneID/tChr/tStart/tEnd/tStrand/n׳; (grep -v CDS Sakai.gff | grep gene| sed -e ׳s/[^/t]*gene=//׳ -e ׳s//.[0–9]//׳| awk ׳BEGIN{OFS="/t"}{print $9, $1, $4, $5,$7}׳)} |uniq > exons.saf |
| 3D | featureCounts script | featureCounts –a exons.saf –F SAF –o outputfile.txt.sam file |
| 3E | Script for using blastdb for creation of Sakai database | formatdb name.fasta –n databaseName –t title –p F |
| 3F | Script for using matching the probe sequences to the database | blastall -p blastn –m8 –I Agilentprobes.fasta –d SakaiDatabase –v |-o output.txt |
| 4 | Script for RNA-Seq analysis | 2.RNASeq_Data_Analysis/3. DEG_analysis |
| 5 | Python Script for changing sequence headers in a fasta file | 3.Microarray_Vs_RNASeq/1.fasta_file |
| 6 | Pyhon Script for removing blast hits which are not suitable | 3.Microarray_Vs_RNASeq/2.blastn and sort |
| 7 | FastQC reports | 2.RNASeq_Data_Analysis/1.FastQC |
| 8 | Generation of Volcano Plots | Volcano plot |
| 9 | Microarray Top Table | 1.Microarray_Data_Analysis |
| 10 | RNASeq Top Table | 2.RNASeq_Data_Analysis/3. DEG_analysis |
| 11 | Comparison between RNAseq samples for paired end and random alignment | 2.RNASeq_Data_Analysis/2.Alighment_and_Count |
| 12 | Comparison between Microarray and RNASeq | 3.Microarray_Vs_RNASeq/3.table merge |
| Subject area | Biology |
| More specific subject area | Bioinformatics; Microbiology |
| Type of data | Table, graph, figure |
| How data were acquired | High-throughput RNA-sequencing; Microarray |
| Data format | Filtered and analysed with statistical tests |
| Experimental factors | |
| Experimental features | Total RNA was extracted using commercial kits and a cDNA library generated with enterobacteria-specific primers and hybridized to a microarray (E. coli v2 array – Agilent), or rRNA was depleted and paired-end cDNA libraries generated for sequence on an Illumina Hi-Seq. 2000. A series of statistical analyses was used for comparison between the datasets. |
| Data source location | James Hutton Institute, Dundee, DD2 5DA, UK. |
| Data accessibility | Data are with this article and also available at ArrayExpress: |
| E-MTAB-3249 (microarray) https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-3249/ | |
| E-MTAB-4120 (microarray) https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-4120/ | |
| E-MTAB-7441 (RNAseq) https://www.ebi.ac.uk/arrayexpress/experiments/E-MTAB-7441 | |
| Scripts used for data analysis are available on GitHub: https://github.com/TheMicroGirl/SakaiRNASeq | |
| Related research article | L. Crozier, P. Hedley, J. Morris, C. Wagstaff, S.C. Andrews, I. Toth, R.W. Jackson, N. Holden, Whole-transcriptome analysis of verocytotoxigenic |