Literature DB >> 24297565

Matrix factorization-based data fusion for gene function prediction in baker's yeast and slime mold.

Marinka Zitnik1, Blaž Zupan.   

Abstract

The development of effective methods for the characterization of gene functions that are able to combine diverse data sources in a sound and easily-extendible way is an important goal in computational biology. We have previously developed a general matrix factorization-based data fusion approach for gene function prediction. In this manuscript, we show that this data fusion approach can be applied to gene function prediction and that it can fuse various heterogeneous data sources, such as gene expression profiles, known protein annotations, interaction and literature data. The fusion is achieved by simultaneous matrix tri-factorization that shares matrix factors between sources. We demonstrate the effectiveness of the approach by evaluating its performance on predicting ontological annotations in slime mold D. discoideum and on recognizing proteins of baker's yeast S. cerevisiae that participate in the ribosome or are located in the cell membrane. Our approach achieves predictive performance comparable to that of the state-of-the-art kernel-based data fusion, but requires fewer data preprocessing steps.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24297565      PMCID: PMC3902649     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  20 in total

1.  Hierarchical multi-label prediction of gene function.

Authors:  Zafer Barutcuoglu; Robert E Schapire; Olga G Troyanskaya
Journal:  Bioinformatics       Date:  2006-01-12       Impact factor: 6.937

Review 2.  Combining many interaction networks to predict gene function and analyze gene lists.

Authors:  Sara Mostafavi; Quaid Morris
Journal:  Proteomics       Date:  2012-05       Impact factor: 3.984

3.  PoGO: Prediction of Gene Ontology terms for fungal proteins.

Authors:  Jaehee Jung; Gangman Yi; Serenella A Sukno; Michael R Thon
Journal:  BMC Bioinformatics       Date:  2010-04-29       Impact factor: 3.169

4.  Conserved developmental transcriptomes in evolutionarily divergent species.

Authors:  Anup Parikh; Edward Roshan Miranda; Mariko Katoh-Kurasawa; Danny Fuller; Gregor Rot; Lan Zagar; Tomaz Curk; Richard Sucgang; Rui Chen; Blaz Zupan; William F Loomis; Adam Kuspa; Gad Shaulsky
Journal:  Genome Biol       Date:  2010-03-17       Impact factor: 13.583

5.  GOPred: GO molecular function prediction by combined classifiers.

Authors:  Omer Sinan Saraç; Volkan Atalay; Rengul Cetin-Atalay
Journal:  PLoS One       Date:  2010-08-31       Impact factor: 3.240

6.  Prediction of Drosophila melanogaster gene function using Support Vector Machines.

Authors:  Nicholas Mitsakakis; Zak Razak; Michael Escobar; J Timothy Westwood
Journal:  BioData Min       Date:  2013-04-02       Impact factor: 2.522

7.  Information-theoretic evaluation of predicted ontological annotations.

Authors:  Wyatt T Clark; Predrag Radivojac
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937

8.  A large-scale evaluation of computational protein function prediction.

Authors:  Predrag Radivojac; Wyatt T Clark; Tal Ronnen Oron; Alexandra M Schnoes; Tobias Wittkop; Artem Sokolov; Kiley Graim; Christopher Funk; Karin Verspoor; Asa Ben-Hur; Gaurav Pandey; Jeffrey M Yunes; Ameet S Talwalkar; Susanna Repo; Michael L Souza; Damiano Piovesan; Rita Casadio; Zheng Wang; Jianlin Cheng; Hai Fang; Julian Gough; Patrik Koskinen; Petri Törönen; Jussi Nokso-Koivisto; Liisa Holm; Domenico Cozzetto; Daniel W A Buchan; Kevin Bryson; David T Jones; Bhakti Limaye; Harshal Inamdar; Avik Datta; Sunitha K Manjari; Rajendra Joshi; Meghana Chitale; Daisuke Kihara; Andreas M Lisewski; Serkan Erdin; Eric Venner; Olivier Lichtarge; Robert Rentzsch; Haixuan Yang; Alfonso E Romero; Prajwal Bhat; Alberto Paccanaro; Tobias Hamp; Rebecca Kaßner; Stefan Seemayer; Esmeralda Vicedo; Christian Schaefer; Dominik Achten; Florian Auer; Ariane Boehm; Tatjana Braun; Maximilian Hecht; Mark Heron; Peter Hönigschmid; Thomas A Hopf; Stefanie Kaufmann; Michael Kiening; Denis Krompass; Cedric Landerer; Yannick Mahlich; Manfred Roos; Jari Björne; Tapio Salakoski; Andrew Wong; Hagit Shatkay; Fanny Gatzmann; Ingolf Sommer; Mark N Wass; Michael J E Sternberg; Nives Škunca; Fran Supek; Matko Bošnjak; Panče Panov; Sašo Džeroski; Tomislav Šmuc; Yiannis A I Kourmpetis; Aalt D J van Dijk; Cajo J F ter Braak; Yuanpeng Zhou; Qingtian Gong; Xinran Dong; Weidong Tian; Marco Falda; Paolo Fontana; Enrico Lavezzo; Barbara Di Camillo; Stefano Toppo; Liang Lan; Nemanja Djuric; Yuhong Guo; Slobodan Vucetic; Amos Bairoch; Michal Linial; Patricia C Babbitt; Steven E Brenner; Christine Orengo; Burkhard Rost; Sean D Mooney; Iddo Friedberg
Journal:  Nat Methods       Date:  2013-01-27       Impact factor: 28.547

9.  A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae).

Authors:  Olga G Troyanskaya; Kara Dolinski; Art B Owen; Russ B Altman; David Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-25       Impact factor: 12.779

10.  A critical assessment of Mus musculus gene function prediction using integrated genomic evidence.

Authors:  Lourdes Peña-Castillo; Murat Tasan; Chad L Myers; Hyunju Lee; Trupti Joshi; Chao Zhang; Yuanfang Guan; Michele Leone; Andrea Pagnani; Wan Kyu Kim; Chase Krumpelman; Weidong Tian; Guillaume Obozinski; Yanjun Qi; Sara Mostafavi; Guan Ning Lin; Gabriel F Berriz; Francis D Gibbons; Gert Lanckriet; Jian Qiu; Charles Grant; Zafer Barutcuoglu; David P Hill; David Warde-Farley; Chris Grouios; Debajyoti Ray; Judith A Blake; Minghua Deng; Michael I Jordan; William S Noble; Quaid Morris; Judith Klein-Seetharaman; Ziv Bar-Joseph; Ting Chen; Fengzhu Sun; Olga G Troyanskaya; Edward M Marcotte; Dong Xu; Timothy R Hughes; Frederick P Roth
Journal:  Genome Biol       Date:  2008-06-27       Impact factor: 13.583

View more
  14 in total

1.  Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.

Authors:  Marinka Zitnik; Francis Nguyen; Bo Wang; Jure Leskovec; Anna Goldenberg; Michael M Hoffman
Journal:  Inf Fusion       Date:  2018-09-21       Impact factor: 12.975

2.  GUDM: Automatic Generation of Unified Datasets for Learning and Reasoning in Healthcare.

Authors:  Rahman Ali; Muhammad Hameed Siddiqi; Muhammad Idris; Taqdir Ali; Shujaat Hussain; Eui-Nam Huh; Byeong Ho Kang; Sungyoung Lee
Journal:  Sensors (Basel)       Date:  2015-07-02       Impact factor: 3.576

3.  COMPUTING THERAPY FOR PRECISION MEDICINE: COLLABORATIVE FILTERING INTEGRATES AND PREDICTS MULTI-ENTITY INTERACTIONS.

Authors:  Sam Regenbogen; Angela D Wilkins; Olivier Lichtarge
Journal:  Pac Symp Biocomput       Date:  2016

Review 4.  Yeast as a cell factory: current state and perspectives.

Authors:  Martin Kavšček; Martin Stražar; Tomaž Curk; Klaus Natter; Uroš Petrovič
Journal:  Microb Cell Fact       Date:  2015-06-30       Impact factor: 5.328

5.  Gene Prioritization by Compressive Data Fusion and Chaining.

Authors:  Marinka Žitnik; Edward A Nam; Christopher Dinh; Adam Kuspa; Gad Shaulsky; Blaž Zupan
Journal:  PLoS Comput Biol       Date:  2015-10-14       Impact factor: 4.475

6.  Augmenting subnetwork inference with information extracted from the scientific literature.

Authors:  Sid Kiblawi; Deborah Chasman; Amanda Henning; Eunju Park; Hoifung Poon; Michael Gould; Paul Ahlquist; Mark Craven
Journal:  PLoS Comput Biol       Date:  2019-06-27       Impact factor: 4.475

7.  Heterogeneous Network Edge Prediction: A Data Integration Approach to Prioritize Disease-Associated Genes.

Authors:  Daniel S Himmelstein; Sergio E Baranzini
Journal:  PLoS Comput Biol       Date:  2015-07-09       Impact factor: 4.475

8.  Computational algorithms to predict Gene Ontology annotations.

Authors:  Pietro Pinoli; Davide Chicco; Marco Masseroli
Journal:  BMC Bioinformatics       Date:  2015-04-17       Impact factor: 3.169

9.  Integration of molecular network data reconstructs Gene Ontology.

Authors:  Vladimir Gligorijević; Vuk Janjić; Nataša Pržulj
Journal:  Bioinformatics       Date:  2014-09-01       Impact factor: 6.937

10.  Network enhancement as a general method to denoise weighted biological networks.

Authors:  Bo Wang; Armin Pourshafeie; Marinka Zitnik; Junjie Zhu; Carlos D Bustamante; Serafim Batzoglou; Jure Leskovec
Journal:  Nat Commun       Date:  2018-08-06       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.