Literature DB >> 34368836

DeepFeature: feature selection in nonimage data using convolutional neural network.

Alok Sharma1, Artem Lysenko1, Keith A Boroevich1, Edwin Vans2, Tatsuhiko Tsunoda3.   

Abstract

Artificial intelligence methods offer exciting new capabilities for the discovery of biological mechanisms from raw data because they are able to detect vastly more complex patterns of association that cannot be captured by classical statistical tests. Among these methods, deep neural networks are currently among the most advanced approaches and, in particular, convolutional neural networks (CNNs) have been shown to perform excellently for a variety of difficult tasks. Despite that applications of this type of networks to high-dimensional omics data and, most importantly, meaningful interpretation of the results returned from such models in a biomedical context remains an open problem. Here we present, an approach applying a CNN to nonimage data for feature selection. Our pipeline, DeepFeature, can both successfully transform omics data into a form that is optimal for fitting a CNN model and can also return sets of the most important genes used internally for computing predictions. Within the framework, the Snowfall compression algorithm is introduced to enable more elements in the fixed pixel framework, and region accumulation and element decoder is developed to find elements or genes from the class activation maps. In comparative tests for cancer type prediction task, DeepFeature simultaneously achieved superior predictive performance and better ability to discover key pathways and biological processes meaningful for this context. Capabilities offered by the proposed framework can enable the effective use of powerful deep learning methods to facilitate the discovery of causal mechanisms in high-dimensional biomedical data.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Keywords:  Convolutional neural network; DeepInsight; Feature selection; Non-image data; Omics data

Mesh:

Year:  2021        PMID: 34368836      PMCID: PMC8575039          DOI: 10.1093/bib/bbab297

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  35 in total

1.  A top-r feature selection algorithm for microarray gene expression data.

Authors:  Alok Sharma; Seiya Imoto; Satoru Miyano
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2012 May-Jun       Impact factor: 3.710

2.  Molecular signatures database (MSigDB) 3.0.

Authors:  Arthur Liberzon; Aravind Subramanian; Reid Pinchback; Helga Thorvaldsdóttir; Pablo Tamayo; Jill P Mesirov
Journal:  Bioinformatics       Date:  2011-05-05       Impact factor: 6.937

3.  FEATS: feature selection-based clustering of single-cell RNA-seq data.

Authors:  Edwin Vans; Ashwini Patil; Alok Sharma
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

4.  Single-stranded and double-stranded DNA-binding protein prediction using HMM profiles.

Authors:  Ronesh Sharma; Shiu Kumar; Tatsuhiko Tsunoda; Thirumananseri Kumarevel; Alok Sharma
Journal:  Anal Biochem       Date:  2020-09-15       Impact factor: 3.365

Review 5.  The Emerging Role of Adhesion GPCRs in Cancer.

Authors:  Abanoub A Gad; Nariman Balenga
Journal:  ACS Pharmacol Transl Sci       Date:  2020-01-13

6.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

7.  Integrating single-cell transcriptomic data across different conditions, technologies, and species.

Authors:  Andrew Butler; Paul Hoffman; Peter Smibert; Efthymia Papalexi; Rahul Satija
Journal:  Nat Biotechnol       Date:  2018-04-02       Impact factor: 54.908

8.  Tumor gene expression data classification via sample expansion-based deep learning.

Authors:  Jian Liu; Xuesong Wang; Yuhu Cheng; Lin Zhang
Journal:  Oncotarget       Date:  2017-11-30

9.  Visualizing structure and transitions in high-dimensional biological data.

Authors:  Kevin R Moon; David van Dijk; Zheng Wang; Scott Gigante; Daniel B Burkhardt; William S Chen; Kristina Yim; Antonia van den Elzen; Matthew J Hirn; Ronald R Coifman; Natalia B Ivanova; Guy Wolf; Smita Krishnaswamy
Journal:  Nat Biotechnol       Date:  2019-12-03       Impact factor: 54.908

View more
  5 in total

1.  Deep learning for de-convolution of Smad2 versus Smad3 binding sites.

Authors:  Jeremy W K Ng; Esther H Q Ong; Lisa Tucker-Kellogg; Greg Tucker-Kellogg
Journal:  BMC Genomics       Date:  2022-07-20       Impact factor: 4.547

2.  Nucleosome positioning based on DNA sequence embedding and deep learning.

Authors:  Guo-Sheng Han; Qi Li; Ying Li
Journal:  BMC Genomics       Date:  2022-04-13       Impact factor: 3.969

3.  AI/ML-driven advances in untargeted metabolomics and exposomics for biomedical applications.

Authors:  Lauren M Petrick; Noam Shomron
Journal:  Cell Rep Phys Sci       Date:  2022-07-20

4.  Identifying Non-Math Students from Brain MRIs with an Ensemble Classifier Based on Subspace-Enhanced Contrastive Learning.

Authors:  Shuhui Liu; Yupei Zhang; Jiajie Peng; Tao Wang; Xuequn Shang
Journal:  Brain Sci       Date:  2022-07-12

5.  Development and validation of a deep learning-based protein electrophoresis classification algorithm.

Authors:  Nuri Lee; Seri Jeong; Kibum Jeon; Wonkeun Song; Min-Jeong Park
Journal:  PLoS One       Date:  2022-08-24       Impact factor: 3.752

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.