Literature DB >> 29280033

A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities.

Mohammad Amin Valizade Hasanloei1, Razieh Sheikhpour2, Mehdi Agha Sarram3, Elnaz Sheikhpour4, Hamdollah Sharifi1.   

Abstract

Quantitative structure-activity relationship (QSAR) is an effective computational technique for drug design that relates the chemical structures of compounds to their biological activities. Feature selection is an important step in QSAR based drug design to select the most relevant descriptors. One of the most popular feature selection methods for classification problems is Fisher score which aim is to minimize the within-class distance and maximize the between-class distance. In this study, the properties of Fisher criterion were extended for QSAR models to define the new distance metrics based on the continuous activity values of compounds with known activities. Then, a semi-supervised feature selection method was proposed based on the combination of Fisher and Laplacian criteria which exploits both compounds with known and unknown activities to select the relevant descriptors. To demonstrate the efficiency of the proposed semi-supervised feature selection method in selecting the relevant descriptors, we applied the method and other feature selection methods on three QSAR data sets such as serine/threonine-protein kinase PLK3 inhibitors, ROCK inhibitors and phenol compounds. The results demonstrated that the QSAR models built on the selected descriptors by the proposed semi-supervised method have better performance than other models. This indicates the efficiency of the proposed method in selecting the relevant descriptors using the compounds with known and unknown activities. The results of this study showed that the compounds with known and unknown activities can be helpful to improve the performance of the combined Fisher and Laplacian based feature selection methods.

Entities:  

Keywords:  Feature selection; Fisher criterion; Graph Laplacian; QSAR models; Semi-supervised

Mesh:

Substances:

Year:  2017        PMID: 29280033     DOI: 10.1007/s10822-017-0094-6

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  14 in total

1.  Evaluation of mutual information and genetic programming for feature selection in QSAR.

Authors:  Vishwesh Venkatraman; Andrew Rowland Dalby; Zheng Rong Yang
Journal:  J Chem Inf Comput Sci       Date:  2004 Sep-Oct

2.  Development predictive QSAR models for artemisinin analogues by various feature selection methods: a comparative study.

Authors:  F Abbasitabar; V Zare-Shahabadi
Journal:  SAR QSAR Environ Res       Date:  2011-10-31       Impact factor: 3.000

3.  Consistency of QSAR models: Correct split of training and test sets, ranking of models and performance parameters.

Authors:  A Rácz; D Bajusz; K Héberger
Journal:  SAR QSAR Environ Res       Date:  2015-10-05       Impact factor: 3.000

4.  Application of artificial neural networks for predicting the aqueous acidity of various phenols using QSAR.

Authors:  Aziz Habibi-Yangjeh; Mohammad Danandeh-Jenagharad; Mahdi Nooshyar
Journal:  J Mol Model       Date:  2005-12-13       Impact factor: 1.810

5.  AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading.

Authors:  Oleg Trott; Arthur J Olson
Journal:  J Comput Chem       Date:  2010-01-30       Impact factor: 3.376

6.  Quantitative structure-activity relationship study of serotonin (5-HT7) receptor inhibitors using modified ant colony algorithm and adaptive neuro-fuzzy interference system (ANFIS).

Authors:  M Jalali-Heravi; M Asadollahi-Baboli
Journal:  Eur J Med Chem       Date:  2008-10-14       Impact factor: 6.514

7.  PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints.

Authors:  Chun Wei Yap
Journal:  J Comput Chem       Date:  2010-12-17       Impact factor: 3.376

8.  Semisupervised feature selection via spline regression for video semantic recognition.

Authors:  Yahong Han; Yi Yang; Yan Yan; Zhigang Ma; Nicu Sebe; Xiaofang Zhou
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2015-02       Impact factor: 10.451

9.  BINANA: a novel algorithm for ligand-binding characterization.

Authors:  Jacob D Durrant; J Andrew McCammon
Journal:  J Mol Graph Model       Date:  2011-01-19       Impact factor: 2.518

10.  Comparison of Multiple Linear Regressions and Neural Networks based QSAR models for the design of new antitubercular compounds.

Authors:  Cristina Ventura; Diogo A R S Latino; Filomena Martins
Journal:  Eur J Med Chem       Date:  2013-10-23       Impact factor: 6.514

View more
  3 in total

1.  Influence of feature rankers in the construction of molecular activity prediction models.

Authors:  Gonzalo Cerruela-García; José Pérez-Parra Toledano; Aída de Haro-García; Nicolás García-Pedrajas
Journal:  J Comput Aided Mol Des       Date:  2019-12-31       Impact factor: 3.686

2.  A Deep Learning-Based Quantitative Structure-Activity Relationship System Construct Prediction Model of Agonist and Antagonist with High Performance.

Authors:  Yasunari Matsuzaka; Yoshihiro Uesawa
Journal:  Int J Mol Sci       Date:  2022-02-15       Impact factor: 5.923

3.  Graph-Based Feature Selection Approach for Molecular Activity Prediction.

Authors:  Gonzalo Cerruela-García; José Manuel Cuevas-Muñoz; Nicolás García-Pedrajas
Journal:  J Chem Inf Model       Date:  2022-03-22       Impact factor: 4.956

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.