Literature DB >> 33765911

Denoising large-scale biological data using network filters.

Andrew J Kavran1,2, Aaron Clauset3,4,5.   

Abstract

BACKGROUND: Large-scale biological data sets are often contaminated by noise, which can impede accurate inferences about underlying processes. Such measurement noise can arise from endogenous biological factors like cell cycle and life history variation, and from exogenous technical factors like sample preparation and instrument variation.
RESULTS: We describe a general method for automatically reducing noise in large-scale biological data sets. This method uses an interaction network to identify groups of correlated or anti-correlated measurements that can be combined or "filtered" to better recover an underlying biological signal. Similar to the process of denoising an image, a single network filter may be applied to an entire system, or the system may be first decomposed into distinct modules and a different filter applied to each. Applied to synthetic data with known network structure and signal, network filters accurately reduce noise across a wide range of noise levels and structures. Applied to a machine learning task of predicting changes in human protein expression in healthy and cancerous tissues, network filtering prior to training increases accuracy up to 43% compared to using unfiltered data.
CONCLUSIONS: Network filters are a general way to denoise biological data and can account for both correlation and anti-correlation between different measurements. Furthermore, we find that partitioning a network prior to filtering can significantly reduce errors in networks with heterogenous data and correlation patterns, and this approach outperforms existing diffusion based methods. Our results on proteomics data indicate the broad potential utility of network filters to applications in systems biology.

Entities:  

Keywords:  Denoising; Machine learning; Networks

Mesh:

Year:  2021        PMID: 33765911      PMCID: PMC7992843          DOI: 10.1186/s12859-021-04075-x

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  30 in total

1.  Mixing patterns in networks.

Authors:  M E J Newman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2003-02-27

2.  Proteomics. Tissue-based map of the human proteome.

Authors:  Mathias Uhlén; Linn Fagerberg; Björn M Hallström; Cecilia Lindskog; Per Oksvold; Adil Mardinoglu; Åsa Sivertsson; Caroline Kampf; Evelina Sjöstedt; Anna Asplund; IngMarie Olsson; Karolina Edlund; Emma Lundberg; Sanjay Navani; Cristina Al-Khalili Szigyarto; Jacob Odeberg; Dijana Djureinovic; Jenny Ottosson Takanen; Sophia Hober; Tove Alm; Per-Henrik Edqvist; Holger Berling; Hanna Tegel; Jan Mulder; Johan Rockberg; Peter Nilsson; Jochen M Schwenk; Marica Hamsten; Kalle von Feilitzen; Mattias Forsberg; Lukas Persson; Fredric Johansson; Martin Zwahlen; Gunnar von Heijne; Jens Nielsen; Fredrik Pontén
Journal:  Science       Date:  2015-01-23       Impact factor: 47.728

3.  Smoothing gene expression data with network information improves consistency of regulated genes.

Authors:  Guro Dørum; Lars Snipen; Margrete Solheim; Solve Saebo
Journal:  Stat Appl Genet Mol Biol       Date:  2011-08-09

Review 4.  Avoiding common pitfalls when clustering biological data.

Authors:  Tom Ronan; Zhijie Qi; Kristen M Naegle
Journal:  Sci Signal       Date:  2016-06-14       Impact factor: 8.192

5.  Compensatory phosphorylation and protein-protein interactions revealed by loss of function and gain of function mutants of multiple serine phosphorylation sites in endothelial nitric-oxide synthase.

Authors:  Philip M Bauer; David Fulton; Yong Chool Boo; George P Sorescu; Bruce E Kemp; Hanjoong Jo; William C Sessa
Journal:  J Biol Chem       Date:  2003-02-18       Impact factor: 5.157

Review 6.  EMT Transition States during Tumor Progression and Metastasis.

Authors:  Ievgenia Pastushenko; Cédric Blanpain
Journal:  Trends Cell Biol       Date:  2018-12-26       Impact factor: 20.808

7.  PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes.

Authors:  Vamsi K Mootha; Cecilia M Lindgren; Karl-Fredrik Eriksson; Aravind Subramanian; Smita Sihag; Joseph Lehar; Pere Puigserver; Emma Carlsson; Martin Ridderstråle; Esa Laurila; Nicholas Houstis; Mark J Daly; Nick Patterson; Jill P Mesirov; Todd R Golub; Pablo Tamayo; Bruce Spiegelman; Eric S Lander; Joel N Hirschhorn; David Altshuler; Leif C Groop
Journal:  Nat Genet       Date:  2003-07       Impact factor: 38.330

8.  Structure and inference in annotated networks.

Authors:  M E J Newman; Aaron Clauset
Journal:  Nat Commun       Date:  2016-06-16       Impact factor: 14.919

9.  HINT: High-quality protein interactomes and their applications in understanding human disease.

Authors:  Jishnu Das; Haiyuan Yu
Journal:  BMC Syst Biol       Date:  2012-07-30

10.  Proficiency testing in immunohistochemistry--experiences from Nordic Immunohistochemical Quality Control (NordiQC).

Authors:  Mogens Vyberg; Søren Nielsen
Journal:  Virchows Arch       Date:  2015-08-26       Impact factor: 4.064

View more
  1 in total

1.  Multiscale topology characterizes dynamic tumor vascular networks.

Authors:  Bernadette J Stolz; Jakob Kaeppler; Bostjan Markelc; Franziska Braun; Florian Lipsmeier; Ruth J Muschel; Helen M Byrne; Heather A Harrington
Journal:  Sci Adv       Date:  2022-06-10       Impact factor: 14.957

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.