Literature DB >> 14534170

Predicting gene function in Saccharomyces cerevisiae.

A Clare1, R D King.   

Abstract

MOTIVATION: S.cerevisiae is one of the most important model organisms, and has has been the focus of over a century of study. In spite of these efforts, 40% of its open reading frames (ORFs) remain classified as having unknown function (MIPS: Munich Information Center for Protein Sequences). We wished to make predictions for the function of these ORFs using data mining, as we have previously successfully done for the genomes of M.tuberculosis and E.coli. Applying this approach to the larger and eukaryotic S.cerevisiae genome involves modifying the machine learning and data mining algorithms, as this is a larger organism with more data available, and a more challenging functional classification.
RESULTS: Novel extensions to the machine learning and data mining algorithms have been devised in order to deal with the challenges. Accurate rules have been learned and predictions have been made for many of the ORFs whose function is currently unknown. The rules are informative, agree with known biology and allow for scientific discovery. AVAILABILITY: All predictions are freely available from http://www.genepredictions.org, all datasets used in this study are freely available from http://www.aber.ac.uk/compsci/Research/bio/dss/yeastdataand software for relational data mining is available from http://www.aber.ac.uk/compsci/Research/bio/dss/polyfarm.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 14534170     DOI: 10.1093/bioinformatics/btg1058

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  Automated prediction of protein function and detection of functional sites from structure.

Authors:  Florencio Pazos; Michael J E Sternberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-29       Impact factor: 11.205

2.  Large datasets in biomedicine: a discussion of salient analytic issues.

Authors:  Anshu Sinha; George Hripcsak; Marianthi Markatou
Journal:  J Am Med Inform Assoc       Date:  2009-08-28       Impact factor: 4.497

3.  Mining SARS-CoV protease cleavage data using non-orthogonal decision trees: a novel method for decisive template selection.

Authors:  Zheng Rong Yang
Journal:  Bioinformatics       Date:  2005-03-29       Impact factor: 6.937

4.  PoGO: Prediction of Gene Ontology terms for fungal proteins.

Authors:  Jaehee Jung; Gangman Yi; Serenella A Sukno; Michael R Thon
Journal:  BMC Bioinformatics       Date:  2010-04-29       Impact factor: 3.169

5.  Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae.

Authors:  Yu Chen; Dong Xu
Journal:  Nucleic Acids Res       Date:  2004-12-07       Impact factor: 16.971

6.  The use of classification trees for bioinformatics.

Authors:  Xiang Chen; Minghui Wang; Heping Zhang
Journal:  Wiley Interdiscip Rev Data Min Knowl Discov       Date:  2011-01-06

7.  YAGM: a web tool for mining associated genes in yeast based on diverse biological associations.

Authors:  Wei-Sheng Wu; Chung-Ching Wang; Meng-Jhun Jhou; Yu-Cheng Wang
Journal:  BMC Syst Biol       Date:  2015-12-09

8.  Predicting functional upstream open reading frames in Saccharomyces cerevisiae.

Authors:  Christopher H Bryant; Graham J L Kemp; Janeli Sarv; Erik Kristiansson; Per Sunnerhagen
Journal:  BMC Bioinformatics       Date:  2009-12-30       Impact factor: 3.169

9.  Predicting gene function using hierarchical multi-label decision tree ensembles.

Authors:  Leander Schietgat; Celine Vens; Jan Struyf; Hendrik Blockeel; Dragi Kocev; Saso Dzeroski
Journal:  BMC Bioinformatics       Date:  2010-01-02       Impact factor: 3.169

10.  Knowledge-based classification of fine-grained immune cell types in single-cell RNA-Seq data.

Authors:  Xuan Liu; Sara J C Gosline; Lance T Pflieger; Pierre Wallet; Archana Iyer; Justin Guinney; Andrea H Bild; Jeffrey T Chang
Journal:  Brief Bioinform       Date:  2021-09-02       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.