Literature DB >> 29850911

A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data.

Yunchuan Kong1, Tianwei Yu1.   

Abstract

Motivation: Gene expression data represents a unique challenge in predictive model building, because of the small number of samples (n) compared with the huge amount of features (p). This 'n≪p' property has hampered application of deep learning techniques for disease outcome classification. Sparse learning by incorporating external gene network information could be a potential solution to this issue. Still, the problem is very challenging because (i) there are tens of thousands of features and only hundreds of training samples, (ii) the scale-free structure of the gene network is unfriendly to the setup of convolutional neural networks.
Results: To address these issues and build a robust classification model, we propose the Graph-Embedded Deep Feedforward Networks (GEDFN), to integrate external relational information of features into the deep neural network architecture. The method is able to achieve sparse connection between network layers to prevent overfitting. To validate the method's capability, we conducted both simulation experiments and real data analysis using a breast invasive carcinoma RNA-seq dataset and a kidney renal clear cell carcinoma RNA-seq dataset from The Cancer Genome Atlas. The resulting high classification accuracy and easily interpretable feature selection results suggest the method is a useful addition to the current graph-guided classification models and feature selection procedures. Availability and implementation: The method is available at https://github.com/yunchuankong/GEDFN. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29850911      PMCID: PMC6198851          DOI: 10.1093/bioinformatics/bty429

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  49 in total

1.  Emergence of scaling in random networks

Authors: 
Journal:  Science       Date:  1999-10-15       Impact factor: 47.728

2.  Incorporating gene networks into statistical tests for genomic data via a spatially correlated mixture model.

Authors:  Peng Wei; Wei Pan
Journal:  Bioinformatics       Date:  2007-12-14       Impact factor: 6.937

3.  Network-constrained regularization and variable selection for analysis of genomic data.

Authors:  Caiyan Li; Hongzhe Li
Journal:  Bioinformatics       Date:  2008-03-01       Impact factor: 6.937

4.  Group SCAD regression analysis for microarray time course gene expression data.

Authors:  Lifeng Wang; Guang Chen; Hongzhe Li
Journal:  Bioinformatics       Date:  2007-04-26       Impact factor: 6.937

5.  Network-based penalized regression with application to genomic data.

Authors:  Sunkyung Kim; Wei Pan; Xiaotong Shen
Journal:  Biometrics       Date:  2013-07-03       Impact factor: 2.571

Review 6.  Alterations in the Smad pathway in human cancers.

Authors:  Debangshu Samanta; Pran K Datta
Journal:  Front Biosci (Landmark Ed)       Date:  2012-01-01

7.  Protein networks as logic functions in development and cancer.

Authors:  Janusz Dutkowski; Trey Ideker
Journal:  PLoS Comput Biol       Date:  2011-09-29       Impact factor: 4.475

8.  Estrogen receptor α regulates non-canonical autophagy that provides stress resistance to neuroblastoma and breast cancer cells and involves BAG3 function.

Authors:  V Felzen; C Hiebel; I Koziollek-Drechsler; S Reißig; U Wolfrum; D Kögel; C Brandts; C Behl; T Morawe
Journal:  Cell Death Dis       Date:  2015-07-09       Impact factor: 8.469

9.  Collapsin response mediator protein-1 (CRMP1) acts as an invasion and metastasis suppressor of prostate cancer via its suppression of epithelial-mesenchymal transition and remodeling of actin cytoskeleton organization.

Authors:  G Cai; D Wu; Z Wang; Z Xu; K-B Wong; C-F Ng; F L Chan; S Yu
Journal:  Oncogene       Date:  2016-06-20       Impact factor: 9.867

10.  Response to angiotensin blockade with irbesartan in a patient with metastatic colorectal cancer.

Authors:  M R Jones; K A Schrader; Y Shen; E Pleasance; C Ch'ng; N Dar; S Yip; D J Renouf; J E Schein; A J Mungall; Y Zhao; R Moore; Y Ma; B S Sheffield; T Ng; S J M Jones; M A Marra; J Laskin; H J Lim
Journal:  Ann Oncol       Date:  2016-02-18       Impact factor: 32.976

View more
  23 in total

1.  Imputing missing RNA-sequencing data from DNA methylation by using a transfer learning-based neural network.

Authors:  Xiang Zhou; Hua Chai; Huiying Zhao; Ching-Hsing Luo; Yuedong Yang
Journal:  Gigascience       Date:  2020-07-01       Impact factor: 6.524

2.  forgeNet: a graph deep neural network model using tree-based ensemble classifiers for feature graph construction.

Authors:  Yunchuan Kong; Tianwei Yu
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

3.  Deep learning uncovers distinct behavior of rice network to pathogens response.

Authors:  Ravi Kumar; Abhishek Khatri; Vishal Acharya
Journal:  iScience       Date:  2022-06-07

4.  Deep learning-based microarray cancer classification and ensemble gene selection approach.

Authors:  Khosro Rezaee; Gwanggil Jeon; Mohammad R Khosravi; Hani H Attar; Alireza Sabzevari
Journal:  IET Syst Biol       Date:  2022-07-04       Impact factor: 1.468

Review 5.  Data analysis methods for defining biomarkers from omics data.

Authors:  Chao Li; Zhenbo Gao; Benzhe Su; Guowang Xu; Xiaohui Lin
Journal:  Anal Bioanal Chem       Date:  2021-12-24       Impact factor: 4.142

6.  Varmole: a biologically drop-connect deep neural network model for prioritizing disease risk variants and genes.

Authors:  Nam D Nguyen; Ting Jin; Daifeng Wang
Journal:  Bioinformatics       Date:  2021-07-19       Impact factor: 6.937

7.  Peel Learning for Pathway-Related Outcome Prediction.

Authors:  Yuantong Li; Fei Wang; Mengying Yan; Edward Cantu; Fan Nils Yang; Hengyi Rao; Rui Feng
Journal:  Bioinformatics       Date:  2021-05-27       Impact factor: 6.931

8.  A simple convolutional neural network for prediction of enhancer-promoter interactions with DNA sequence data.

Authors:  Zhong Zhuang; Xiaotong Shen; Wei Pan
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.931

9.  A Deep Neural Network Model using Random Forest to Extract Feature Representation for Gene Expression Data Classification.

Authors:  Yunchuan Kong; Tianwei Yu
Journal:  Sci Rep       Date:  2018-11-07       Impact factor: 4.379

10.  An Improved Method for Prediction of Cancer Prognosis by Network Learning.

Authors:  Minseon Kim; Ilhwan Oh; Jaegyoon Ahn
Journal:  Genes (Basel)       Date:  2018-10-02       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.