Literature DB >> 21258061

Recovering key biological constituents through sparse representation of gene expression.

Yosef Prat1, Menachem Fromer, Nathan Linial, Michal Linial.   

Abstract

MOTIVATION: Large-scale RNA expression measurements are generating enormous quantities of data. During the last two decades, many methods were developed for extracting insights regarding the interrelationships between genes from such data. The mathematical and computational perspectives that underlie these methods are usually algebraic or probabilistic.
RESULTS: Here, we introduce an unexplored geometric view point where expression levels of genes in multiple experiments are interpreted as vectors in a high-dimensional space. Specifically, we find, for the expression profile of each particular gene, its approximation as a linear combination of profiles of a few other genes. This method is inspired by recent developments in the realm of compressed sensing in the machine learning domain. To demonstrate the power of our approach in extracting valuable information from the expression data, we independently applied it to large-scale experiments carried out on the yeast and malaria parasite whole transcriptomes. The parameters extracted from the sparse reconstruction of the expression profiles, when fed to a supervised learning platform, were used to successfully predict the relationships between genes throughout the Gene Ontology hierarchy and protein-protein interaction map. Extensive assessment of the biological results shows high accuracy in both recovering known predictions and in yielding accurate predictions missing from the current databases. We suggest that the geometrical approach presented here is suitable for a broad range of high-dimensional experimental data.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21258061     DOI: 10.1093/bioinformatics/btr002

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

Review 1.  Computational solutions for omics data.

Authors:  Bonnie Berger; Jian Peng; Mona Singh
Journal:  Nat Rev Genet       Date:  2013-05       Impact factor: 53.242

2.  Computational Biology in the 21st Century: Scaling with Compressive Algorithms.

Authors:  Bonnie Berger; Noah M Daniels; Y William Yu
Journal:  Commun ACM       Date:  2016-08       Impact factor: 4.654

3.  CaSPIAN: a causal compressive sensing algorithm for discovering directed interactions in gene networks.

Authors:  Amin Emad; Olgica Milenkovic
Journal:  PLoS One       Date:  2014-03-12       Impact factor: 3.240

4.  Predicting gene expression in the human malaria parasite Plasmodium falciparum using histone modification, nucleosome positioning, and 3D localization features.

Authors:  David F Read; Kate Cook; Yang Y Lu; Karine G Le Roch; William Stafford Noble
Journal:  PLoS Comput Biol       Date:  2019-09-11       Impact factor: 4.475

5.  Efficient Generation of Transcriptomic Profiles by Random Composite Measurements.

Authors:  Brian Cleary; Le Cong; Anthea Cheung; Eric S Lander; Aviv Regev
Journal:  Cell       Date:  2017-11-16       Impact factor: 41.582

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.