Denoising large-scale biological data using network filters.

Andrew J Kavran^1,2, Aaron Clauset^3,4,5.

Abstract

BACKGROUND: Large-scale biological data sets are often contaminated by noise, which can impede accurate inferences about underlying processes. Such measurement noise can arise from endogenous biological factors, such as cell cycle and life history variation, and from exogenous technical factors, such as sample preparation and instrument variation.
RESULTS: We describe a general method for automatically reducing noise in large-scale biological data sets. This method uses an interaction network to identify groups of correlated or anti-correlated measurements that can be combined or "filtered" to better recover an underlying biological signal. Similar to the process of denoising an image, a single network filter may be applied to an entire system, or the system may first be decomposed into distinct modules and a different filter applied to each. Applied to synthetic data with known network structure and signal, network filters accurately reduce noise across a wide range of noise levels and structures. Applied to a machine learning task of predicting changes in human protein expression in healthy and cancerous tissues, network filtering prior to training increases accuracy by up to 43% compared to using unfiltered data.
CONCLUSIONS: Network filters are a general way to denoise biological data and can account for both correlation and anti-correlation between different measurements. Furthermore, we find that partitioning a network prior to filtering can significantly reduce errors in networks with heterogeneous data and correlation patterns, and this approach outperforms existing diffusion-based methods. Our results on proteomics data indicate the broad potential utility of network filters for applications in systems biology.
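
The filtering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `network_filter`, the toy graph, the measurement values, and the module labels are all hypothetical, and only the simplest case is shown, where each node's value is replaced by the mean (or median) of its own value and its neighbors' values. That corresponds to smoothing correlated measurements; the anti-correlated case described in the abstract is not covered here. Passing `modules` restricts each node to neighbors in the same module, a rough stand-in for partitioning the network before filtering.

```python
from statistics import mean, median

def network_filter(values, neighbors, combine=mean, modules=None):
    """Replace each node's value by combine() over itself and its neighbors.

    values:    dict node -> noisy measurement
    neighbors: dict node -> iterable of adjacent nodes
    combine:   aggregation function, e.g. statistics.mean or statistics.median
    modules:   optional dict node -> module label (assumed to come from some
               prior partitioning step); when given, only neighbors in the
               same module contribute to a node's filtered value.
    """
    filtered = {}
    for node, x in values.items():
        pool = [x]  # a node always contributes its own measurement
        for nb in neighbors.get(node, ()):
            if modules is None or modules[nb] == modules[node]:
                pool.append(values[nb])
        filtered[node] = combine(pool)
    return filtered

# Hypothetical toy data: path graph a - b - c - d with noisy measurements.
values = {"a": 1.0, "b": 3.0, "c": 2.0, "d": 10.0}
neighbors = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}

# One filter applied to the whole network.
smoothed = network_filter(values, neighbors)

# Same filter, but node d is placed in its own module, so it no longer
# pulls c upward and is itself left unchanged.
by_module = network_filter(
    values, neighbors, modules={"a": 0, "b": 0, "c": 0, "d": 1}
)

# A median filter is obtained by swapping the aggregation function.
median_smoothed = network_filter(values, neighbors, combine=median)
```

In this toy example, filtering the whole network drags node c toward the outlying value at d, whereas the module-aware call keeps the two regions separate, which is the intuition behind filtering heterogeneous networks module by module.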

Keywords: Denoising; Machine learning; Networks

Year: 2021 PMID: 33765911 PMCID: PMC7992843 DOI: 10.1186/s12859-021-04075-x

Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169

30 in total

1. Mixing patterns in networks.

Authors: M E J Newman
Journal: Phys Rev E Stat Nonlin Soft Matter Phys Date: 2003-02-27

2. Proteomics. Tissue-based map of the human proteome.

Authors: Mathias Uhlén; Linn Fagerberg; Björn M Hallström; Cecilia Lindskog; Per Oksvold; Adil Mardinoglu; Åsa Sivertsson; Caroline Kampf; Evelina Sjöstedt; Anna Asplund; IngMarie Olsson; Karolina Edlund; Emma Lundberg; Sanjay Navani; Cristina Al-Khalili Szigyarto; Jacob Odeberg; Dijana Djureinovic; Jenny Ottosson Takanen; Sophia Hober; Tove Alm; Per-Henrik Edqvist; Holger Berling; Hanna Tegel; Jan Mulder; Johan Rockberg; Peter Nilsson; Jochen M Schwenk; Marica Hamsten; Kalle von Feilitzen; Mattias Forsberg; Lukas Persson; Fredric Johansson; Martin Zwahlen; Gunnar von Heijne; Jens Nielsen; Fredrik Pontén
Journal: Science Date: 2015-01-23 Impact factor: 47.728

3. Smoothing gene expression data with network information improves consistency of regulated genes.

Authors: Guro Dørum; Lars Snipen; Margrete Solheim; Solve Saebo
Journal: Stat Appl Genet Mol Biol Date: 2011-08-09

4. Avoiding common pitfalls when clustering biological data. (Review)

Authors: Tom Ronan; Zhijie Qi; Kristen M Naegle
Journal: Sci Signal Date: 2016-06-14 Impact factor: 8.192

5. Compensatory phosphorylation and protein-protein interactions revealed by loss of function and gain of function mutants of multiple serine phosphorylation sites in endothelial nitric-oxide synthase.

Authors: Philip M Bauer; David Fulton; Yong Chool Boo; George P Sorescu; Bruce E Kemp; Hanjoong Jo; William C Sessa
Journal: J Biol Chem Date: 2003-02-18 Impact factor: 5.157

6. EMT Transition States during Tumor Progression and Metastasis. (Review)

Authors: Ievgenia Pastushenko; Cédric Blanpain
Journal: Trends Cell Biol Date: 2018-12-26 Impact factor: 20.808

7. PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.

Authors: Vamsi K Mootha; Cecilia M Lindgren; Karl-Fredrik Eriksson; Aravind Subramanian; Smita Sihag; Joseph Lehar; Pere Puigserver; Emma Carlsson; Martin Ridderstråle; Esa Laurila; Nicholas Houstis; Mark J Daly; Nick Patterson; Jill P Mesirov; Todd R Golub; Pablo Tamayo; Bruce Spiegelman; Eric S Lander; Joel N Hirschhorn; David Altshuler; Leif C Groop
Journal: Nat Genet Date: 2003-07 Impact factor: 38.330

8. Structure and inference in annotated networks.

9. HINT: High-quality protein interactomes and their applications in understanding human disease.

10. Proficiency testing in immunohistochemistry--experiences from Nordic Immunohistochemical Quality Control (NordiQC).

1. Multiscale topology characterizes dynamic tumor vascular networks.