Literature DB >> 25620721

IMMAN: free software for information theory-based chemometric analysis.

Ricardo W Pino Urias1, Stephen J Barigye, Yovani Marrero-Ponce, César R García-Jacas, José R Valdes-Martiní, Facundo Perez-Gimenez.   

Abstract

The features and theoretical background of a new and free computational program for chemometric analysis denominated IMMAN (acronym for Information theory-based CheMoMetrics ANalysis) are presented. This is multi-platform software developed in the Java programming language, designed with a remarkably user-friendly graphical interface for the computation of a collection of information-theoretic functions adapted for rank-based unsupervised and supervised feature selection tasks. A total of 20 feature selection parameters are presented, with the unsupervised and supervised frameworks represented by 10 approaches in each case. Several information-theoretic parameters traditionally used as molecular descriptors (MDs) are adapted for use as unsupervised rank-based feature selection methods. On the other hand, a generalization scheme for the previously defined differential Shannon's entropy is discussed, as well as the introduction of Jeffreys information measure for supervised feature selection. Moreover, well-known information-theoretic feature selection parameters, such as information gain, gain ratio, and symmetrical uncertainty are incorporated to the IMMAN software ( http://mobiosd-hub.com/imman-soft/ ), following an equal-interval discretization approach. IMMAN offers data pre-processing functionalities, such as missing values processing, dataset partitioning, and browsing. Moreover, single parameter or ensemble (multi-criteria) ranking options are provided. Consequently, this software is suitable for tasks like dimensionality reduction, feature ranking, as well as comparative diversity analysis of data matrices. Simple examples of applications performed with this program are presented. A comparative study between IMMAN and WEKA feature selection tools using the Arcene dataset was performed, demonstrating similar behavior. In addition, it is revealed that the use of IMMAN unsupervised feature selection methods improves the performance of both IMMAN and WEKA supervised algorithms. Graphic representation for Shannon's distribution of MD calculating software.

Entities:  

Mesh:

Year:  2015        PMID: 25620721     DOI: 10.1007/s11030-014-9565-z

Source DB:  PubMed          Journal:  Mol Divers        ISSN: 1381-1991            Impact factor:   2.943


  17 in total

1.  Variability of molecular descriptors in compound databases revealed by Shannon entropy calculations

Authors: 
Journal:  J Chem Inf Comput Sci       Date:  2000-05

2.  Chemical descriptors with distinct levels of information content and varying sensitivity to differences between selected compound databases identified by SE-DSE analysis.

Authors:  Jeffrey W Godden; Jürgen Bajorath
Journal:  J Chem Inf Comput Sci       Date:  2002 Jan-Feb

3.  Structure/response correlations and similarity/diversity analysis by GETAWAY descriptors. 2. Application of the novel 3D molecular descriptors to QSAR/QSPR studies.

Authors:  Viviana Consonni; Roberto Todeschini; Manuela Pavan; Paola Gramatica
Journal:  J Chem Inf Comput Sci       Date:  2002 May-Jun

4.  Differential Shannon entropy analysis identifies molecular property descriptors that predict aqueous solubility of synthetic compounds with high accuracy in binary QSAR calculations.

Authors:  Florence L Stahura; Jeffrey W Godden; Jürgen Bajorath
Journal:  J Chem Inf Comput Sci       Date:  2002 May-Jun

5.  An invariant form for the prior probability in estimation problems.

Authors:  H JEFFREYS
Journal:  Proc R Soc Lond A Math Phys Sci       Date:  1946

6.  Identification of descriptors capturing compound class-specific features by mutual information analysis.

Authors:  Anne Mai Wassermann; Britta Nisius; Martin Vogt; Jürgen Bajorath
Journal:  J Chem Inf Model       Date:  2010-10-20       Impact factor: 4.956

Review 7.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

8.  Quantitative structure-activity relationship studies of HIV-1 integrase inhibition. 1. GETAWAY descriptors.

Authors:  Liane Saíz-Urra; Maykel Pérez González; Yagamare Fall; Generosa Gómez
Journal:  Eur J Med Chem       Date:  2006-10-09       Impact factor: 6.514

9.  PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints.

Authors:  Chun Wei Yap
Journal:  J Comput Chem       Date:  2010-12-17       Impact factor: 3.376

10.  GETAWAY descriptors to predicting A(2A) adenosine receptors agonists.

Authors:  M P González; C Terán; M Teijeira; M J González-Moa
Journal:  Eur J Med Chem       Date:  2005-07-11       Impact factor: 6.514

View more
  14 in total

1.  ProtDCal-Suite: A web server for the numerical codification and functional analysis of proteins.

Authors:  Sandra Romero-Molina; Yasser B Ruiz-Blanco; James R Green; Elsa Sanchez-Garcia
Journal:  Protein Sci       Date:  2019-09       Impact factor: 6.725

2.  Undersampling: case studies of flaviviral inhibitory activities.

Authors:  Stephen J Barigye; José Manuel García de la Vega; Juan A Castillo-Garit
Journal:  J Comput Aided Mol Des       Date:  2019-11-26       Impact factor: 3.686

3.  Examining the predictive accuracy of the novel 3D N-linear algebraic molecular codifications on benchmark datasets.

Authors:  César R García-Jacas; Ernesto Contreras-Torres; Yovani Marrero-Ponce; Mario Pupo-Meriño; Stephen J Barigye; Lisset Cabrera-Leyva
Journal:  J Cheminform       Date:  2016-02-25       Impact factor: 5.514

4.  Exploring general-purpose protein features for distinguishing enzymes and non-enzymes within the twilight zone.

Authors:  Yasser B Ruiz-Blanco; Guillermin Agüero-Chapin; Enrique García-Hernández; Orlando Álvarez; Agostinho Antunes; James Green
Journal:  BMC Bioinformatics       Date:  2017-07-21       Impact factor: 3.169

5.  QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations.

Authors:  José R Valdés-Martiní; Yovani Marrero-Ponce; César R García-Jacas; Karina Martinez-Mayorga; Stephen J Barigye; Yasser Silveira Vaz d'Almeida; Hai Pham-The; Facundo Pérez-Giménez; Carlos A Morell
Journal:  J Cheminform       Date:  2017-06-07       Impact factor: 5.514

6.  Database fingerprint (DFP): an approach to represent molecular databases.

Authors:  Eli Fernández-de Gortari; César R García-Jacas; Karina Martinez-Mayorga; José L Medina-Franco
Journal:  J Cheminform       Date:  2017-02-06       Impact factor: 5.514

7.  Multi-Target In Silico Prediction of Inhibitors for Mitogen-Activated Protein Kinase-Interacting Kinases.

Authors:  Amit Kumar Halder; M Natália D S Cordeiro
Journal:  Biomolecules       Date:  2021-11-10

8.  Choquet integral-based fuzzy molecular characterizations: when global definitions are computed from the dependency among atom/bond contributions (LOVIs/LOEIs).

Authors:  César R García-Jacas; Lisset Cabrera-Leyva; Yovani Marrero-Ponce; José Suárez-Lezcano; Fernando Cortés-Guzmán; Mario Pupo-Meriño; Ricardo Vivas-Reyes
Journal:  J Cheminform       Date:  2018-10-25       Impact factor: 5.514

9.  Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods.

Authors:  Julio E Terán; Yovani Marrero-Ponce; Ernesto Contreras-Torres; César R García-Jacas; Ricardo Vivas-Reyes; Enrique Terán; F Javier Torres
Journal:  Sci Rep       Date:  2019-08-06       Impact factor: 4.379

10.  Automatic construction of molecular similarity networks for visual graph mining in chemical space of bioactive peptides: an unsupervised learning approach.

Authors:  Longendri Aguilera-Mendoza; Yovani Marrero-Ponce; César R García-Jacas; Edgar Chavez; Jesus A Beltran; Hugo A Guillen-Ramirez; Carlos A Brizuela
Journal:  Sci Rep       Date:  2020-10-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.