Literature DB >> 32163118

forgeNet: a graph deep neural network model using tree-based ensemble classifiers for feature graph construction.

Yunchuan Kong1, Tianwei Yu1.   

Abstract

MOTIVATION: A unique challenge in predictive model building for omics data has been the small number of samples (n) versus the large amount of features (p). This 'n≪p' property brings difficulties for disease outcome classification using deep learning techniques. Sparse learning by incorporating known functional relationships between the biological units, such as the graph-embedded deep feedforward network (GEDFN) model, has been a solution to this issue. However, such methods require an existing feature graph, and potential mis-specification of the feature graph can be harmful on classification and feature selection.
RESULTS: To address this limitation and develop a robust classification model without relying on external knowledge, we propose a forest graph-embedded deep feedforward network (forgeNet) model, to integrate the GEDFN architecture with a forest feature graph extractor, so that the feature graph can be learned in a supervised manner and specifically constructed for a given prediction task. To validate the method's capability, we experimented the forgeNet model with both synthetic and real datasets. The resulting high classification accuracy suggests that the method is a valuable addition to sparse deep learning models for omics data.
AVAILABILITY AND IMPLEMENTATION: The method is available at https://github.com/yunchuankong/forgeNet. CONTACT: tianwei.yu@emory.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2020        PMID: 32163118      PMCID: PMC7267822          DOI: 10.1093/bioinformatics/btaa164

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  38 in total

1.  Missing value estimation methods for DNA microarrays.

Authors:  O Troyanskaya; M Cantor; G Sherlock; P Brown; T Hastie; R Tibshirani; D Botstein; R B Altman
Journal:  Bioinformatics       Date:  2001-06       Impact factor: 6.937

2.  Using GOstats to test gene lists for GO term association.

Authors:  S Falcon; R Gentleman
Journal:  Bioinformatics       Date:  2006-11-10       Impact factor: 6.937

3.  A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data.

Authors:  Yunchuan Kong; Tianwei Yu
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

4.  Risk classification of cancer survival using ANN with gene expression data from multiple laboratories.

Authors:  Yen-Chen Chen; Wan-Chi Ke; Hung-Wen Chiu
Journal:  Comput Biol Med       Date:  2014-02-22       Impact factor: 4.589

Review 5.  AMP-activated protein kinase and energy balance in breast cancer.

Authors:  Hong Zhao; Yelda C Orhan; Xiaoming Zha; Ecem Esencan; Robert T Chatterton; Serdar E Bulun
Journal:  Am J Transl Res       Date:  2017-02-15       Impact factor: 4.060

Review 6.  Fatty acid metabolism in breast cancer subtypes.

Authors:  Marie E Monaco
Journal:  Oncotarget       Date:  2017-04-25

7.  HINT: High-quality protein interactomes and their applications in understanding human disease.

Authors:  Jishnu Das; Haiyuan Yu
Journal:  BMC Syst Biol       Date:  2012-07-30

8.  Sparse logistic regression with a L1/2 penalty for gene selection in cancer classification.

Authors:  Yong Liang; Cheng Liu; Xin-Ze Luan; Kwong-Sak Leung; Tak-Ming Chan; Zong-Ben Xu; Hai Zhang
Journal:  BMC Bioinformatics       Date:  2013-06-19       Impact factor: 3.169

Review 9.  The role of Hippo signal pathway in breast cancer metastasis.

Authors:  Changran Wei; Ying Wang; Xiangqi Li
Journal:  Onco Targets Ther       Date:  2018-04-17       Impact factor: 4.147

Review 10.  Biological Roles of Aberrantly Expressed Glycosphingolipids and Related Enzymes in Human Cancer Development and Progression.

Authors:  Dinghao Zhuo; Xiang Li; Feng Guan
Journal:  Front Physiol       Date:  2018-05-03       Impact factor: 4.566

View more
  5 in total

1.  Deep learning uncovers distinct behavior of rice network to pathogens response.

Authors:  Ravi Kumar; Abhishek Khatri; Vishal Acharya
Journal:  iScience       Date:  2022-06-07

2.  Disease-Ligand Identification Based on Flexible Neural Tree.

Authors:  Bin Yang; Wenzheng Bao; Baitong Chen
Journal:  Front Microbiol       Date:  2022-06-06       Impact factor: 6.064

3.  Active disease-related compound identification based on capsule network.

Authors:  Bin Yang; Wenzheng Bao; Jinglong Wang
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

4.  Predicting residues involved in anti-DNA autoantibodies with limited neural networks.

Authors:  Rachel St Clair; Michael Teti; Mirjana Pavlovic; William Hahn; Elan Barenholtz
Journal:  Med Biol Eng Comput       Date:  2022-03-18       Impact factor: 3.079

5.  Alzheimer-Compound Identification Based on Data Fusion and forgeNet_SVM.

Authors:  Bin Yang; Wenzheng Bao; Shichai Hong
Journal:  Front Aging Neurosci       Date:  2022-07-25       Impact factor: 5.702

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.