Literature DB >> 26799292

Deep Feature Selection: Theory and Application to Identify Enhancers and Promoters.

Yifeng Li1,2, Chih-Yu Chen2, Wyeth W Wasserman2.   

Abstract

Sparse linear models approximate target variable(s) by a sparse linear combination of input variables. Since they are simple, fast, and able to select features, they are widely used in classification and regression. Essentially they are shallow feed-forward neural networks that have three limitations: (1) incompatibility to model nonlinearity of features, (2) inability to learn high-level features, and (3) unnatural extensions to select features in a multiclass case. Deep neural networks are models structured by multiple hidden layers with nonlinear activation functions. Compared with linear models, they have two distinctive strengths: the capability to (1) model complex systems with nonlinear structures and (2) learn high-level representation of features. Deep learning has been applied in many large and complex systems where deep models significantly outperform shallow ones. However, feature selection at the input level, which is very helpful to understand the nature of a complex system, is still not well studied. In genome research, the cis-regulatory elements in noncoding DNA sequences play a key role in the expression of genes. Since the activity of regulatory elements involves highly interactive factors, a deep tool is strongly needed to discover informative features. In order to address the above limitations of shallow and deep models for selecting features of a complex system, we propose a deep feature selection (DFS) model that (1) takes advantages of deep structures to model nonlinearity and (2) conveniently selects a subset of features right at the input level for multiclass data. Simulation experiments convince us that this model is able to correctly identify both linear and nonlinear features. We applied this model to the identification of active enhancers and promoters by integrating multiple sources of genomic information. Results show that our model outperforms elastic net in terms of size of discriminative feature subset and classification accuracy.

Keywords:  deep feature selection; deep learning; enhancer; promoter

Mesh:

Year:  2016        PMID: 26799292     DOI: 10.1089/cmb.2015.0189

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  22 in total

1.  Prediction of condition-specific regulatory genes using machine learning.

Authors:  Qi Song; Jiyoung Lee; Shamima Akter; Matthew Rogers; Ruth Grene; Song Li
Journal:  Nucleic Acids Res       Date:  2020-06-19       Impact factor: 16.971

Review 2.  Automating drug discovery.

Authors:  Gisbert Schneider
Journal:  Nat Rev Drug Discov       Date:  2017-12-15       Impact factor: 84.694

Review 3.  A roadmap for multi-omics data integration using deep learning.

Authors:  Mingon Kang; Euiseong Ko; Tesfaye B Mersha
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

4.  A nonlinear sparse neural ordinary differential equation model for multiple functional processes.

Authors:  Yijia Liu; Lexin Li; Xiao Wang
Journal:  Can J Stat       Date:  2021-11-16       Impact factor: 0.758

5.  Robust clinical marker identification for diabetic kidney disease with ensemble feature selection.

Authors:  Xing Song; Lemuel R Waitman; Yong Hu; Alan S L Yu; David C Robbins; Mei Liu
Journal:  J Am Med Inform Assoc       Date:  2019-03-01       Impact factor: 4.497

6.  Predicting enhancer-promoter interaction from genomic sequence with deep neural networks.

Authors:  Shashank Singh; Yang Yang; Barnabás Póczos; Jian Ma
Journal:  Quant Biol       Date:  2019-06

7.  Effective Cancer Subtype and Stage Prediction via Dropfeature-DNNs.

Authors:  Zhong Chen; Wensheng Zhang; Hongwen Deng; Kun Zhang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2022-02-03       Impact factor: 3.710

8.  Predicting clinical outcomes from large scale cancer genomic profiles with deep survival models.

Authors:  Safoora Yousefi; Fatemeh Amrollahi; Mohamed Amgad; Chengliang Dong; Joshua E Lewis; Congzheng Song; David A Gutman; Sameer H Halani; Jose Enrique Velazquez Vega; Daniel J Brat; Lee A D Cooper
Journal:  Sci Rep       Date:  2017-09-15       Impact factor: 4.379

9.  DeepMAge: A Methylation Aging Clock Developed with Deep Learning.

Authors:  Fedor Galkin; Polina Mamoshina; Kirill Kochetov; Denis Sidorenko; Alex Zhavoronkov
Journal:  Aging Dis       Date:  2021-08-01       Impact factor: 6.745

Review 10.  Opportunities and obstacles for deep learning in biology and medicine.

Authors:  Travers Ching; Daniel S Himmelstein; Brett K Beaulieu-Jones; Alexandr A Kalinin; Brian T Do; Gregory P Way; Enrico Ferrero; Paul-Michael Agapow; Michael Zietz; Michael M Hoffman; Wei Xie; Gail L Rosen; Benjamin J Lengerich; Johnny Israeli; Jack Lanchantin; Stephen Woloszynek; Anne E Carpenter; Avanti Shrikumar; Jinbo Xu; Evan M Cofer; Christopher A Lavender; Srinivas C Turaga; Amr M Alexandari; Zhiyong Lu; David J Harris; Dave DeCaprio; Yanjun Qi; Anshul Kundaje; Yifan Peng; Laura K Wiley; Marwin H S Segler; Simina M Boca; S Joshua Swamidass; Austin Huang; Anthony Gitter; Casey S Greene
Journal:  J R Soc Interface       Date:  2018-04       Impact factor: 4.293

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.