Literature DB >> 35348984

Ligand-based approaches to activity prediction for the early stage of structure-activity-relationship progression.

Itsuki Maeda1, Akinori Sato1, Shunsuke Tamura1, Tomoyuki Miyao2,3.   

Abstract

The retrospective evaluation of virtual screening approaches and activity prediction models are important for methodological development. However, for fair comparison, evaluation data sets must be carefully prepared. In this research, we compiled structure-activity-relationship matrix-based data sets for 15 biological targets along with many diverse inactive compounds, assuming the early stage of structure-activity-relationship progression. To use a large number of diverse inactive compounds and a limited number of active compounds, similarity profiles (SPs) are proposed as a set of molecular descriptors. Using these highly imbalanced data sets, we evaluated various approaches including SPs, under-sampling, support vector machine (SVM), and message passing neural networks. We found that for the under-sampling approaches, cluster-based sampling is better than random sampling. For virtual screening, SPs with inactive reference compounds and the under-sampling SVM also perform well. For classification, SPs with many inactive references performed as well as the under-sampling SVM trained on a balanced data set. Although the performance of SPs and the under-sampling SVM were comparable, SPs with many inactive references were preferable for selecting structurally distinct compounds from the active training compounds.
© 2022. The Author(s), under exclusive licence to Springer Nature Switzerland AG.

Entities:  

Keywords:  Activity prediction; Chemoinformatics; Imbalanced data; Ligand based-approaches; Machine learning; Virtual screening

Mesh:

Substances:

Year:  2022        PMID: 35348984     DOI: 10.1007/s10822-022-00449-2

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  13 in total

1.  Application of support vector machine to three-dimensional shape-based virtual screening using comprehensive three-dimensional molecular shape overlay with known inhibitors.

Authors:  Tomohiro Sato; Hitomi Yuki; Daisuke Takaya; Shunta Sasaki; Akiko Tanaka; Teruki Honma
Journal:  J Chem Inf Model       Date:  2012-03-27       Impact factor: 4.956

2.  SAR matrices: automated extraction of information-rich SAR tables from large compound data sets.

Authors:  Anne Mai Wassermann; Peter Haebel; Nils Weskamp; Jürgen Bajorath
Journal:  J Chem Inf Model       Date:  2012-06-15       Impact factor: 4.956

3.  Potency-directed similarity searching using support vector machines.

Authors:  Anne M Wassermann; Kathrin Heikamp; Jürgen Bajorath
Journal:  Chem Biol Drug Des       Date:  2010-11-29       Impact factor: 2.817

4.  Comparison of shape-matching and docking as virtual screening tools.

Authors:  Paul C D Hawkins; A Geoffrey Skillman; Anthony Nicholls
Journal:  J Med Chem       Date:  2007-01-11       Impact factor: 7.446

5.  Local structural changes, global data views: graphical substructure-activity relationship trailing.

Authors:  Mathias Wawer; Jürgen Bajorath
Journal:  J Med Chem       Date:  2011-03-28       Impact factor: 7.446

6.  Current Trends, Overlooked Issues, and Unmet Challenges in Virtual Screening.

Authors:  Dagmar Stumpfe; Jürgen Bajorath
Journal:  J Chem Inf Model       Date:  2020-02-03       Impact factor: 4.956

7.  ChEMBL: towards direct deposition of bioassay data.

Authors:  David Mendez; Anna Gaulton; A Patrícia Bento; Jon Chambers; Marleen De Veij; Eloy Félix; María Paula Magariños; Juan F Mosquera; Prudence Mutowo; Michal Nowotka; María Gordillo-Marañón; Fiona Hunter; Laura Junco; Grace Mugumbate; Milagros Rodriguez-Lopez; Francis Atkinson; Nicolas Bosc; Chris J Radoux; Aldo Segura-Cabrera; Anne Hersey; Andrew R Leach
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  Prediction of pharmacological activities from chemical structures with graph convolutional neural networks.

Authors:  Miyuki Sakai; Kazuki Nagayasu; Norihiro Shibui; Chihiro Andoh; Kaito Takayama; Hisashi Shirakawa; Shuji Kaneko
Journal:  Sci Rep       Date:  2021-01-12       Impact factor: 4.379

9.  Comparing predictive ability of QSAR/QSPR models using 2D and 3D molecular representations.

Authors:  Akinori Sato; Tomoyuki Miyao; Swarit Jasial; Kimito Funatsu
Journal:  J Comput Aided Mol Des       Date:  2021-01-04       Impact factor: 3.686

10.  PubChem in 2021: new data content and improved web interfaces.

Authors:  Sunghwan Kim; Jie Chen; Tiejun Cheng; Asta Gindulyte; Jia He; Siqian He; Qingliang Li; Benjamin A Shoemaker; Paul A Thiessen; Bo Yu; Leonid Zaslavsky; Jian Zhang; Evan E Bolton
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.