Guy M Harris1, Shahroze Abbas2, Michael F Miles3,4,5. 1. VCU Pharmacology and Toxicology, Richmond, Virginia, 23298, USA. 2. VCU Center for the Study of Biological Complexity, Richmond, Virginia, 23298, USA. 3. VCU Pharmacology and Toxicology, Richmond, Virginia, 23298, USA. michael.miles@vcuhealth.org. 4. VCU Center for the Study of Biological Complexity, Richmond, Virginia, 23298, USA. michael.miles@vcuhealth.org. 5. Department of Pharmacology & Toxicology and Neurology, Virginia Commonwealth University, Richmond, VA, 23298, USA. michael.miles@vcuhealth.org.
Abstract
BACKGROUND: Despite the increasing use of RNAseq for transcriptome analysis, microarrays remain a widely-used methodology for genomic studies. The latest generation of Affymetrix/Thermo-Fisher microarrays, the ClariomD/XTA and ClariomS array, provide a sensitive and facile method for complex transcriptome expression analysis. However, existing methods of analysis for these high-density arrays do not leverage the statistical power contained in having multiple oligonucleotides representing each gene/exon, but rather summarize probes into a single expression value. We previously developed a methodology, the Sscore algorithm, for probe-level identification of differentially expressed genes (DEGs) between treatment and control samples with oligonucleotide microarrays. The Sscore algorithm was validated for sensitive detection of DEGs by comparison with existing methods. However, the prior version of the Sscore algorithm and a R-based implementation software, sscore, do not function with the latest generations of Affymetrix/Fisher microarrays due to changes in microarray design that eliminated probes previously used for estimation of non-specific binding. RESULTS: Here we describe the GCSscore algorithm, which utilizes the GC-content of a given oligonucleotide probe to estimate non-specific binding using antigenomic background probes found on new generations of arrays. We implemented this algorithm in an improved GCSscore R package for analysis of modern oligonucleotide microarrays. GCSscore has multiple methods for grouping individual probes on the ClariomD/XTA chips, providing the user with differential expression analysis at the gene-level and the exon-level. By utilizing the direct probe-level intensities, the GCSscore algorithm was able to detect DEGs under stringent statistical criteria for all Clariom-based arrays. We demonstrate that for older 3'-IVT arrays, GCSscore produced very similar differential gene expression analysis results compared to the original Sscore method. However, GCSscore functioned well for both the ClariomS and ClariomD/XTA newer microarrays and outperformed existing analysis approaches insofar as the number of DEGs and cognate biological functions identified. This was particularly striking for analysis of the highly complex ClariomD/XTA based arrays. CONCLUSIONS: The GCSscore package represents a powerful new application for analysis of the newest generation of oligonucleotide microarrays such as the ClariomS and ClariomD/XTA arrays produced by Affymetrix/Fisher.
BACKGROUND: Despite the increasing use of RNAseq for transcriptome analysis, microarrays remain a widely-used methodology for genomic studies. The latest generation of Affymetrix/Thermo-Fisher microarrays, the ClariomD/XTA and ClariomS array, provide a sensitive and facile method for complex transcriptome expression analysis. However, existing methods of analysis for these high-density arrays do not leverage the statistical power contained in having multiple oligonucleotides representing each gene/exon, but rather summarize probes into a single expression value. We previously developed a methodology, the Sscore algorithm, for probe-level identification of differentially expressed genes (DEGs) between treatment and control samples with oligonucleotide microarrays. The Sscore algorithm was validated for sensitive detection of DEGs by comparison with existing methods. However, the prior version of the Sscore algorithm and a R-based implementation software, sscore, do not function with the latest generations of Affymetrix/Fisher microarrays due to changes in microarray design that eliminated probes previously used for estimation of non-specific binding. RESULTS: Here we describe the GCSscore algorithm, which utilizes the GC-content of a given oligonucleotide probe to estimate non-specific binding using antigenomic background probes found on new generations of arrays. We implemented this algorithm in an improved GCSscore R package for analysis of modern oligonucleotide microarrays. GCSscore has multiple methods for grouping individual probes on the ClariomD/XTA chips, providing the user with differential expression analysis at the gene-level and the exon-level. By utilizing the direct probe-level intensities, the GCSscore algorithm was able to detect DEGs under stringent statistical criteria for all Clariom-based arrays. We demonstrate that for older 3'-IVT arrays, GCSscore produced very similar differential gene expression analysis results compared to the original Sscore method. However, GCSscore functioned well for both the ClariomS and ClariomD/XTA newer microarrays and outperformed existing analysis approaches insofar as the number of DEGs and cognate biological functions identified. This was particularly striking for analysis of the highly complex ClariomD/XTA based arrays. CONCLUSIONS: The GCSscore package represents a powerful new application for analysis of the newest generation of oligonucleotide microarrays such as the ClariomS and ClariomD/XTA arrays produced by Affymetrix/Fisher.
Authors: Richard E Kennedy; Robnet T Kerns; Xiangrong Kong; Kellie J Archer; Michael F Miles Journal: Bioinformatics Date: 2006-03-30 Impact factor: 6.937
Authors: Andrew D van der Vaart; Jennifer T Wolstenholme; Maren L Smith; Guy M Harris; Marcelo F Lopez; Aaron R Wolen; Howard C Becker; Robert W Williams; Michael F Miles Journal: Alcohol Date: 2016-11-01 Impact factor: 2.405
Authors: Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth Journal: Nucleic Acids Res Date: 2015-01-20 Impact factor: 16.971
Authors: Aaron R Wolen; Charles A Phillips; Michael A Langston; Alex H Putman; Paul J Vorster; Nathan A Bruce; Timothy P York; Robert W Williams; Michael F Miles Journal: PLoS One Date: 2012-04-12 Impact factor: 3.240
Authors: Dorothy E Grice; Ilkka Reenilä; Pekka T Männistö; Andrew I Brooks; George G Smith; Greg T Golden; Joseph D Buxbaum; Wade H Berrettini Journal: BMC Genomics Date: 2007-03-16 Impact factor: 3.969
Authors: Jennifer T Wolstenholme; Tariq Mahmood; Guy M Harris; Shahroze Abbas; Michael F Miles Journal: Front Mol Neurosci Date: 2017-09-26 Impact factor: 5.639