Literature DB >> 25819671

Outlier detection at the transcriptome-proteome interface.

Yawwani Gunawardana1, Shuhei Fujiwara2, Akiko Takeda2, Jeongmin Woo3, Christopher Woelk3, Mahesan Niranjan1.   

Abstract

BACKGROUND: In high-throughput experimental biology, it is widely acknowledged that while expression levels measured at the levels of transcriptome and the corresponding proteome do not, in general, correlate well, messenger RNA levels are used as convenient proxies for protein levels. Our interest is in developing data-driven computational models that can bridge the gap between these two levels of measurement at which different mechanisms of regulation may act on different molecular species causing any observed lack of correlations. To this end, we build data-driven predictors of protein levels using mRNA levels and known proxies of translation efficiencies as covariates. Previous work showed that in such a setting, outliers with respect to the model are reliable candidates for post-translational regulation.
RESULTS: Here, we present and compare two novel formulations of deriving a protein concentration predictor from which outliers may be extracted in a systematic manner. The first approach, outlier rejecting regression, allows explicit specification of a certain fraction of the data as outliers. In a regression setting, this is a non-convex optimization problem which we solve by deriving a difference of convex functions algorithm (DCA). With post-translationally regulated proteins, one expects their concentrations to be affected primarily by disruption of protein stability. Our second algorithm exploits this observation by minimizing an asymmetric loss using quantile regression and extracts outlier proteins whose measured concentrations are lower than what a genome-wide regression would predict. We validate the two approaches on a dataset of yeast transcriptome and proteome. Functional annotation check on detected outliers demonstrate that the methods are able to identify post-translationally regulated genes with high statistical confidence.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25819671     DOI: 10.1093/bioinformatics/btv182

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Object Weighting: A New Clustering Approach to Deal with Outliers and Cluster Overlap in Computational Biology.

Authors:  Alexandre Gondeau; Zahia Aouabed; Mohamed Hijri; Pedro Peres-Neto; Vladimir Makarenkov
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2021-04-08       Impact factor: 3.710

Review 2.  The AI for Scientific Discovery Network.

Authors:  Samantha Kanza; Colin Leonard Bird; Mahesan Niranjan; William McNeill; Jeremy Graham Frey
Journal:  Patterns (N Y)       Date:  2021-01-08

3.  Genome-Wide Posttranscriptional Dysregulation by MicroRNAs in Human Asthma as Revealed by Frac-seq.

Authors:  Rocio T Martinez-Nunez; Hitasha Rupani; Manuela Platé; Mahesan Niranjan; Rachel C Chambers; Peter H Howarth; Tilman Sanchez-Elsner
Journal:  J Immunol       Date:  2018-05-16       Impact factor: 5.422

4.  Uncovering extensive post-translation regulation during human cell cycle progression by integrative multi-'omics analysis.

Authors:  Gregory M Parkes; Mahesan Niranjan
Journal:  BMC Bioinformatics       Date:  2019-10-29       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.