Literature DB >> 26490630

Methods for biological data integration: perspectives and challenges.

Vladimir Gligorijević1, Nataša Pržulj2.   

Abstract

Rapid technological advances have led to the production of different types of biological data and enabled construction of complex networks with various types of interactions between diverse biological entities. Standard network data analysis methods were shown to be limited in dealing with such heterogeneous networked data and consequently, new methods for integrative data analyses have been proposed. The integrative methods can collectively mine multiple types of biological data and produce more holistic, systems-level biological insights. We survey recent methods for collective mining (integration) of various types of networked biological data. We compare different state-of-the-art methods for data integration and highlight their advantages and disadvantages in addressing important biological problems. We identify the important computational challenges of these methods and provide a general guideline for which methods are suited for specific biological problems, or specific data types. Moreover, we propose that recent non-negative matrix factorization-based approaches may become the integration methodology of choice, as they are well suited and accurate in dealing with heterogeneous data and have many opportunities for further development.
© 2015 The Author(s).

Keywords:  biological networks; data fusion; heterogeneous data integration; non-negative matrix factorization; omics data; systems biology

Mesh:

Year:  2015        PMID: 26490630      PMCID: PMC4685837          DOI: 10.1098/rsif.2015.0571

Source DB:  PubMed          Journal:  J R Soc Interface        ISSN: 1742-5662            Impact factor:   4.118


  141 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

3.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

4.  The large-scale organization of metabolic networks.

Authors:  H Jeong; B Tombor; R Albert; Z N Oltvai; A L Barabási
Journal:  Nature       Date:  2000-10-05       Impact factor: 49.962

5.  GenMAPP, a new tool for viewing and analyzing microarray data on biological pathways.

Authors:  Kam D Dahlquist; Nathan Salomonis; Karen Vranizan; Steven C Lawlor; Bruce R Conklin
Journal:  Nat Genet       Date:  2002-05       Impact factor: 38.330

Review 6.  Computational analysis of microarray data.

Authors:  J Quackenbush
Journal:  Nat Rev Genet       Date:  2001-06       Impact factor: 53.242

7.  DaliLite workbench for protein structure comparison.

Authors:  L Holm; J Park
Journal:  Bioinformatics       Date:  2000-06       Impact factor: 6.937

8.  Toward a protein-protein interaction map of the budding yeast: A comprehensive system to examine two-hybrid interactions in all possible combinations between the yeast proteins.

Authors:  T Ito; K Tashiro; S Muta; R Ozawa; T Chiba; M Nishizawa; K Yamamoto; S Kuhara; Y Sakaki
Journal:  Proc Natl Acad Sci U S A       Date:  2000-02-01       Impact factor: 11.205

9.  A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae.

Authors:  P Uetz; L Giot; G Cagney; T A Mansfield; R S Judson; J R Knight; D Lockshon; V Narayan; M Srinivasan; P Pochart; A Qureshi-Emili; Y Li; B Godwin; D Conover; T Kalbfleisch; G Vijayadamodar; M Yang; M Johnston; S Fields; J M Rothberg
Journal:  Nature       Date:  2000-02-10       Impact factor: 49.962

10.  The small world inside large metabolic networks.

Authors:  A Wagner; D A Fell
Journal:  Proc Biol Sci       Date:  2001-09-07       Impact factor: 5.349

View more
  62 in total

1.  Integrative construction of regulatory region networks in 127 human reference epigenomes by matrix factorization.

Authors:  Dianbo Liu; Jose Davila-Velderrain; Zhizhuo Zhang; Manolis Kellis
Journal:  Nucleic Acids Res       Date:  2019-08-22       Impact factor: 16.971

2.  Mapping Biological Networks from Quantitative Data-Independent Acquisition Mass Spectrometry: Data to Knowledge Pipelines.

Authors:  Erin L Crowgey; Andrea Matlock; Vidya Venkatraman; Justyna Fert-Bober; Jennifer E Van Eyk
Journal:  Methods Mol Biol       Date:  2017

3.  Integrated analysis of genomics, longitudinal metabolomics, and Alzheimer's risk factors among 1,111 cohort participants.

Authors:  Burcu F Darst; Qiongshi Lu; Sterling C Johnson; Corinne D Engelman
Journal:  Genet Epidemiol       Date:  2019-05-18       Impact factor: 2.135

Review 4.  Biomechanisms of Comorbidity: Reviewing Integrative Analyses of Multi-omics Datasets and Electronic Health Records.

Authors:  N Pouladi; I Achour; H Li; J Berghout; C Kenost; M L Gonzalez-Garay; Y A Lussier
Journal:  Yearb Med Inform       Date:  2016-11-10

5.  From homogeneous to heterogeneous network alignment via colored graphlets.

Authors:  Shawn Gu; John Johnson; Fazle E Faisal; Tijana Milenković
Journal:  Sci Rep       Date:  2018-08-21       Impact factor: 4.379

6.  AOP-DB: A database resource for the exploration of Adverse Outcome Pathways through integrated association networks.

Authors:  Maureen E Pittman; Stephen W Edwards; Cataia Ives; Holly M Mortensen
Journal:  Toxicol Appl Pharmacol       Date:  2018-02-14       Impact factor: 4.219

7.  [Reconstruction of tumor clonal haplotypes based on an improved spanning algorithm].

Authors:  Yu Geng; Zhongmeng Zhao; Jianye Liu
Journal:  Nan Fang Yi Ke Da Xue Xue Bao       Date:  2019-11-30

Review 8.  Revolution of Alzheimer Precision Neurology. Passageway of Systems Biology and Neurophysiology.

Authors:  Harald Hampel; Nicola Toschi; Claudio Babiloni; Filippo Baldacci; Keith L Black; Arun L W Bokde; René S Bun; Francesco Cacciola; Enrica Cavedo; Patrizia A Chiesa; Olivier Colliot; Cristina-Maria Coman; Bruno Dubois; Andrea Duggento; Stanley Durrleman; Maria-Teresa Ferretti; Nathalie George; Remy Genthon; Marie-Odile Habert; Karl Herholz; Yosef Koronyo; Maya Koronyo-Hamaoui; Foudil Lamari; Todd Langevin; Stéphane Lehéricy; Jean Lorenceau; Christian Neri; Robert Nisticò; Francis Nyasse-Messene; Craig Ritchie; Simone Rossi; Emiliano Santarnecchi; Olaf Sporns; Steven R Verdooner; Andrea Vergallo; Nicolas Villain; Erfan Younesi; Francesco Garaci; Simone Lista
Journal:  J Alzheimers Dis       Date:  2018       Impact factor: 4.472

9.  Improving drug response prediction by integrating multiple data sources: matrix factorization, kernel and network-based approaches.

Authors:  Betül Güvenç Paltun; Hiroshi Mamitsuka; Samuel Kaski
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

10.  Towards enhanced and interpretable clustering/classification in integrative genomics.

Authors:  Yang Young Lu; Jinchi Lv; Jed A Fuhrman; Fengzhu Sun
Journal:  Nucleic Acids Res       Date:  2017-11-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.