Literature DB >> 27318209

New KEGG pathway-based interpretable features for classifying ageing-related mouse proteins.

Fabio Fabris1, Alex A Freitas1.   

Abstract

MOTIVATION: The incidence of ageing-related diseases has been constantly increasing in the last decades, raising the need for creating effective methods to analyze ageing-related protein data. These methods should have high predictive accuracy and be easily interpretable by ageing experts. To enable this, one needs interpretable classification models (supervised machine learning) and features with rich biological meaning. In this paper we propose two interpretable feature types based on Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways and compare them with traditional feature types in hierarchical classification (a more challenging classification task regarding predictive performance) and binary classification (a classification task producing easier to interpret classification models). As far as we know, this work is the first to: (i) explore the potential of the KEGG pathway data in the hierarchical classification setting, (i) use the graph structure of KEGG pathways to create a feature type that quantifies the influence of a current protein on another specific protein within a KEGG pathway graph and (iii) propose a method for interpreting the classification models induced using KEGG features.
RESULTS: We performed tests measuring predictive accuracy considering hierarchical and binary class labels extracted from the Mouse Phenotype Ontology. One of the KEGG feature types leads to the highest predictive accuracy among five individual feature types across three hierarchical classification algorithms. Additionally, the combination of the two KEGG feature types proposed in this work results in one of the best predictive accuracies when using the binary class version of our datasets, at the same time enabling the extraction of knowledge from ageing-related data using quantitative influence information.
AVAILABILITY AND IMPLEMENTATION: The datasets created in this paper will be freely available after publication. CONTACT: ff79@kent.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27318209     DOI: 10.1093/bioinformatics/btw363

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Integration of mechanistic immunological knowledge into a machine learning pipeline improves predictions.

Authors:  Anthony Culos; Amy S Tsai; Natalie Stanley; Martin Becker; Mohammad S Ghaemi; David R McIlwain; Ramin Fallahzadeh; Athena Tanada; Huda Nassar; Camilo Espinosa; Maria Xenochristou; Edward Ganio; Laura Peterson; Xiaoyuan Han; Ina A Stelzer; Kazuo Ando; Dyani Gaudilliere; Thanaphong Phongpreecha; Ivana Marić; Alan L Chang; Gary M Shaw; David K Stevenson; Sean Bendall; Kara L Davis; Wendy Fantl; Garry P Nolan; Trevor Hastie; Robert Tibshirani; Martin S Angst; Brice Gaudilliere; Nima Aghaeepour
Journal:  Nat Mach Intell       Date:  2020-10-12

2.  Machine learning-based predictions of dietary restriction associations across ageing-related genes.

Authors:  Gustavo Daniel Vega Magdaleno; Vladislav Bespalov; Yalin Zheng; Alex A Freitas; Joao Pedro de Magalhaes
Journal:  BMC Bioinformatics       Date:  2022-01-04       Impact factor: 3.169

Review 3.  A review of supervised machine learning applied to ageing research.

Authors:  Fabio Fabris; João Pedro de Magalhães; Alex A Freitas
Journal:  Biogerontology       Date:  2017-03-06       Impact factor: 4.277

4.  Gene prediction of aging-related diseases based on DNN and Mashup.

Authors:  Junhua Ye; Shunfang Wang; Xin Yang; Xianjun Tang
Journal:  BMC Bioinformatics       Date:  2021-12-17       Impact factor: 3.169

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.