| Literature DB >> 28842509 |
Thomas Sbarrato1,2,3,4, Ruth V Spriggs5, Lindsay Wilson5, Carolyn Jones5, Kate Dudek5, Amandine Bastide5, Xavier Pichon5, Tuija Pöyry5, Anne E Willis5.
Abstract
Translational regulation plays a central role in the global gene expression of a cell, and detection of such regulation has allowed deciphering of critical biological mechanisms. Genome-wide studies of the regulation of translation (translatome) performed on microarrays represent a substantial proportion of studies, alongside with recent advances in deep-sequencing methods. However, there has been a lack of development in specific processing methodologies that deal with the distinct nature of translatome array data. In this study, we confirm that polysome profiling yields skewed data and thus violates the conventional transcriptome analysis assumptions. Using a comprehensive simulation of translatome array data varying the percentage and symmetry of deregulation, we show that conventional analysis methods (Quantile and LOESS normalizations) and statistical tests failed, respectively, to correctly normalize the data and to identify correctly deregulated genes (DEGs). We thus propose a novel analysis methodology available as a CRAN package; Internal Control Analysis of Translatome (INCATome) based on a normalization tied to a group of invariant controls. We confirm that INCATome outperforms the other normalization methods and allows a stringent identification of DEGs. More importantly, INCATome implementation on a biological translatome data set (cells silenced for splicing factor PSF) resulted in the best normalization performance and an improved validation concordance for identification of true positive DEGs. Finally, we provide evidence that INCATome is able to infer novel biological pathways with superior discovery potential, thus confirming the benefits for researchers of implementing INCATome for future translatome studies as well as for existing data sets to generate novel avenues for research.Entities:
Keywords: microarray analysis; polysome profiling; translational regulation; translatome
Mesh:
Year: 2017 PMID: 28842509 PMCID: PMC5648029 DOI: 10.1261/rna.060525.116
Source DB: PubMed Journal: RNA ISSN: 1355-8382 Impact factor: 4.942
FIGURE 1.Translatome studies violate method assumptions. (Ai) Polysome profile gradient traces showing differences between siCTRL (red) and siPSF (blue). (Aii) RNA concentration of pooled subpolysomal and polysomal RNA showing content imbalance. Quantified distributions are indicated in the corresponding table. (B) Study design. (C) Measure of skewness of unprocessed simulations at different PDE and SYM, of standard distributions and several translatome studies deposited in public repositories.
FIGURE 2.Novel INCATome normalization outperforms existing methods. (A) Boxplot for different normalization methods on simulated data at PDE 10 and 75% and Symmetry 0.1 and 0.9. (B) RMSD measures per normalization method for a set of given internal probes (spike-in or internal references ACTB/PABP). (C) Area under the curve for the ROC curves for each normalization method at each PDE and SYM of the simulation. (D) Statistical power (F1 Score) for each normalization method at each PDE and SYM of the simulation.
FIGURE 3.Identification of DEGs is hindered in cases of extreme deregulation. (A) Boxplot for different statistical methods on simulated data normalized with the internal reference-based INCATome (ACTB/PABP) approach at PDE 10 and 75% and SYM 0.1 and 0.9. (B) RMSD measures per statistical test for discovered PDE and SYM. (C) Area under the curve for the ROC curves for each statistical method at each PDE and SYM of the simulation. (D) Statistical power (F1 Score) for each statistical method at each PDE and SYM of the simulation.
FIGURE 4.INCATome implementation on PSF silenced data set provides the best validation concordance. (A) Boxplot for different normalization methods on PSF biological data set. Estimated PDE and SYM are indicated in the corresponding table. (B) RMSD measures per normalization method for internal references ACTB/PABP. (C) Venn Diagrams representing up-regulation and down-regulation identified in each method. Estimated PDE and SYM and averaged percentage of overlap are indicated in the corresponding tables. (D) Barplot of validation concordance between microarray and qPCR validation data. (E) Barplot of RMSD applied to fold change for both microarray data and qPCR validation data.
FIGURE 5.Biological inference enhanced by the implementation of INCATome. Barplots representing the logged P-value for the nonredundant gene ontology (GO) categories associated with the top 100 genes identified as up-regulated (A) or down-regulated (B) with each normalization method (in at least three statistical tests). Overlap in identification is represented by joined dots in the central table. Orange bars correspond to manually curated GOPubmed hits linking the SFPQ gene to the given GO category. Side barplot summarizes the number of nonredundant GO terms associated with each method.