Literature DB >> 31659696

When global and local molecular descriptors are more than the sum of its parts: Simple, But Not Simpler?

Yoan Martínez-López1,2,3, Yovani Marrero-Ponce4,5, Stephen J Barigye6,7, Enrique Teran1, Oscar Martínez-Santiago1,3,8, Cesar H Zambrano9,8, F Javier Torres9,8.   

Abstract

In this report, we introduce a set of aggregation operators (AOs) to calculate global and local (group and atom type) molecular descriptors (MDs) as a generalization of the classical approach of molecular encoding using the sum of the atomic (or fragment) contributions. These AOs are implemented in a new and free software denominated MD-LOVIs ( http://tomocomd.com/md-lovis ), which allows for the calculation of MDs from atomic weights vector and LOVIs (local vertex invariants). This software was developed in Java programming language and employed the Chemical Development Kit (CDK) library for handling chemical structures and the calculation of atomic weights. An analysis of the complexities of the algorithms presented herein demonstrates that these aspects were efficiently implemented. The calculation speed experiments show that the MD-LOVIs software has satisfactory behavior when compared to software such as Padel, CDKDescriptor, DRAGON and Bluecal software. Shannon's entropy (SE)-based variability studies demonstrate that MD-LOVIs yields indices with greater information content when compared to those of popular academic and commercial software. A principal component analysis reveals that our approach captures chemical information orthogonal to that codified by the DRAGON, Padel and Mold2 software, as a result of the several generalizations in MD-LOVIs not used in other programs. Lastly, three QSARs were built using multiple linear regression with genetic algorithms, and the statistical parameters of these models demonstrate that the MD-LOVIs indices obtained with AOs yield better performance than those obtained when the summation operator is used exclusively. Moreover, it is also revealed that the MD-LOVIs indices yield models with comparable to superior performance when compared to other QSAR methodologies reported in the literature, despite their simplicity. The studies performed herein collectively demonstrated that MD-LOVIs software generates indices as simple as possible, but not simpler and that use of AOs enhances the diversity of the chemical information codified, which consequently improves the performance of traditional MDs.

Entities:  

Keywords:  Aggregation operator; Atom weight vector; MD-LOVIs software; Molecular descriptor; No free lunch theorem; PCA; QSP(A)R; Shannon entropy

Mesh:

Substances:

Year:  2019        PMID: 31659696     DOI: 10.1007/s11030-019-10002-3

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  3 in total

1.  Ensemble learning application to discover new trypanothione synthetase inhibitors.

Authors:  Juan I Alice; Carolina L Bellera; Diego Benítez; Marcelo A Comini; Pablo R Duchowicz; Alan Talevi
Journal:  Mol Divers       Date:  2021-07-15       Impact factor: 2.943

2.  IFPTML Mapping of Drug Graphs with Protein and Chromosome Structural Networks vs. Pre-Clinical Assay Information for Discovery of Antimalarial Compounds.

Authors:  Viviana Quevedo-Tumailli; Bernabe Ortega-Tenezaca; Humberto González-Díaz
Journal:  Int J Mol Sci       Date:  2021-12-02       Impact factor: 5.923

3.  Automatic construction of molecular similarity networks for visual graph mining in chemical space of bioactive peptides: an unsupervised learning approach.

Authors:  Longendri Aguilera-Mendoza; Yovani Marrero-Ponce; César R García-Jacas; Edgar Chavez; Jesus A Beltran; Hugo A Guillen-Ramirez; Carlos A Brizuela
Journal:  Sci Rep       Date:  2020-10-22       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.