Literature DB >> 23514123

Combining heterogeneous data sources for accurate functional annotation of proteins.

Artem Sokolov1, Christopher Funk, Kiley Graim, Karin Verspoor, Asa Ben-Hur.   

Abstract

Combining heterogeneous sources of data is essential for accurate prediction of protein function. The task is complicated by the fact that while sequence-based features can be readily compared across species, most other data are species-specific. In this paper, we present a multi-view extension to GOstruct, a structured-output framework for function annotation of proteins. The extended framework can learn from disparate data sources, with each data source provided to the framework in the form of a kernel. Our empirical results demonstrate that the multi-view framework is able to utilize all available information, yielding better performance than sequence-based models trained across species and models trained from collections of data within a given species. This version of GOstruct participated in the recent Critical Assessment of Functional Annotations (CAFA) challenge; since then we have significantly improved the natural language processing component of the method, which now provides performance that is on par with that provided by sequence information. The GOstruct framework is available for download at http://strut.sourceforge.net.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23514123      PMCID: PMC3584846          DOI: 10.1186/1471-2105-14-S3-S10

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  33 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption.

Authors:  M Y Galperin; E V Koonin
Journal:  In Silico Biol       Date:  1998

3.  Hierarchical multi-label prediction of gene function.

Authors:  Zafer Barutcuoglu; Robert E Schapire; Olga G Troyanskaya
Journal:  Bioinformatics       Date:  2006-01-12       Impact factor: 6.937

4.  Functional grouping based on signatures in protein termini.

Authors:  Iris Bahir; Michal Linial
Journal:  Proteins       Date:  2006-06-01

Review 5.  Predicting functions from protein sequences--where are the bottlenecks?

Authors:  P Bork; E V Koonin
Journal:  Nat Genet       Date:  1998-04       Impact factor: 38.330

6.  Low-complexity regions within protein sequences have position-dependent roles.

Authors:  Alain Coletta; John W Pinney; David Y Weiss Solís; James Marsh; Steve R Pettifer; Teresa K Attwood
Journal:  BMC Syst Biol       Date:  2010-04-13

7.  PILGRM: an interactive data-driven discovery platform for expert biologists.

Authors:  Casey S Greene; Olga G Troyanskaya
Journal:  Nucleic Acids Res       Date:  2011-06-07       Impact factor: 16.971

8.  STRING 8--a global view on proteins and their functional interactions in 630 organisms.

Authors:  Lars J Jensen; Michael Kuhn; Manuel Stark; Samuel Chaffron; Chris Creevey; Jean Muller; Tobias Doerks; Philippe Julien; Alexander Roth; Milan Simonovic; Peer Bork; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

9.  Learning statistical models for annotating proteins with function information using biomedical text.

Authors:  Soumya Ray; Mark Craven
Journal:  BMC Bioinformatics       Date:  2005-05-24       Impact factor: 3.169

10.  A critical assessment of Mus musculus gene function prediction using integrated genomic evidence.

Authors:  Lourdes Peña-Castillo; Murat Tasan; Chad L Myers; Hyunju Lee; Trupti Joshi; Chao Zhang; Yuanfang Guan; Michele Leone; Andrea Pagnani; Wan Kyu Kim; Chase Krumpelman; Weidong Tian; Guillaume Obozinski; Yanjun Qi; Sara Mostafavi; Guan Ning Lin; Gabriel F Berriz; Francis D Gibbons; Gert Lanckriet; Jian Qiu; Charles Grant; Zafer Barutcuoglu; David P Hill; David Warde-Farley; Chris Grouios; Debajyoti Ray; Judith A Blake; Minghua Deng; Michael I Jordan; William S Noble; Quaid Morris; Judith Klein-Seetharaman; Ziv Bar-Joseph; Ting Chen; Fengzhu Sun; Olga G Troyanskaya; Edward M Marcotte; Dong Xu; Timothy R Hughes; Frederick P Roth
Journal:  Genome Biol       Date:  2008-06-27       Impact factor: 13.583

View more
  22 in total

1.  Evaluating a variety of text-mined features for automatic protein function prediction with GOstruct.

Authors:  Christopher S Funk; Indika Kahanda; Asa Ben-Hur; Karin M Verspoor
Journal:  J Biomed Semantics       Date:  2015-03-18

2.  Predicting protein function via downward random walks on a gene ontology.

Authors:  Guoxian Yu; Hailong Zhu; Carlotta Domeniconi; Jiming Liu
Journal:  BMC Bioinformatics       Date:  2015-08-27       Impact factor: 3.169

3.  Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters.

Authors:  Christopher Funk; William Baumgartner; Benjamin Garcia; Christophe Roeder; Michael Bada; K Bretonnel Cohen; Lawrence E Hunter; Karin Verspoor
Journal:  BMC Bioinformatics       Date:  2014-02-26       Impact factor: 3.169

4.  Associating disease-related genetic variants in intergenic regions to the genes they impact.

Authors:  Geoff Macintyre; Antonio Jimeno Yepes; Cheng Soon Ong; Karin Verspoor
Journal:  PeerJ       Date:  2014-10-23       Impact factor: 2.984

5.  Integrating multiple networks for protein function prediction.

Authors:  Guoxian Yu; Hailong Zhu; Carlotta Domeniconi; Maozu Guo
Journal:  BMC Syst Biol       Date:  2015-01-21

Review 6.  Hierarchical ensemble methods for protein function prediction.

Authors:  Giorgio Valentini
Journal:  ISRN Bioinform       Date:  2014-05-04

7.  The role of ontologies in biological and biomedical research: a functional perspective.

Authors:  Robert Hoehndorf; Paul N Schofield; Georgios V Gkoutos
Journal:  Brief Bioinform       Date:  2015-04-10       Impact factor: 11.622

8.  A close look at protein function prediction evaluation protocols.

Authors:  Indika Kahanda; Christopher S Funk; Fahad Ullah; Karin M Verspoor; Asa Ben-Hur
Journal:  Gigascience       Date:  2015-09-14       Impact factor: 6.524

9.  Structure to function prediction of hypothetical protein KPN_00953 (Ycbk) from Klebsiella pneumoniae MGH 78578 highlights possible role in cell wall metabolism.

Authors:  Boon Aun Teh; Sy Bing Choi; Nasihah Musa; Few Ling Ling; See Too Wei Cun; Abu Bakar Salleh; Nazalan Najimudin; Habibah A Wahab; Yahaya M Normi
Journal:  BMC Struct Biol       Date:  2014-02-05

10.  Finding New Order in Biological Functions from the Network Structure of Gene Annotations.

Authors:  Kimberly Glass; Michelle Girvan
Journal:  PLoS Comput Biol       Date:  2015-11-20       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.