Literature DB >> 27353971

Using the gini coefficient to measure the chemical diversity of small-molecule libraries.

Iwona E Weidlich1, Igor V Filippov2.   

Abstract

Modern databases of small organic molecules contain tens of millions of structures. The size of theoretically available chemistry is even larger. However, despite the large amount of chemical information, the "big data" moment for chemistry has not yet provided the corresponding payoff of cheaper computer-predicted medicine or robust machine-learning models for the determination of efficacy and toxicity. Here, we present a study of the diversity of chemical datasets using a measure that is commonly used in socioeconomic studies. We demonstrate the use of this diversity measure on several datasets that were constructed to contain various congeneric subsets of molecules as well as randomly selected molecules. We also apply our method to a number of well-known databases that are frequently used for structure-activity relationship modeling. Our results show the poor diversity of the common sources of potential lead compounds compared to actual known drugs.
© 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

Entities:  

Keywords:  Diversity Genie; chemical databases; cheminformatics; molecular diversity

Year:  2016        PMID: 27353971     DOI: 10.1002/jcc.24423

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  3 in total

1.  Gini Coefficients as a Single Value Metric to Define Chemical Probe Selectivity.

Authors:  Andrei Ursu; Jessica L Childs-Disney; Alicia J Angelbello; Matthew G Costales; Samantha M Meyer; Matthew D Disney
Journal:  ACS Chem Biol       Date:  2020-07-09       Impact factor: 5.100

2.  A palette of fluorophores that are differentially accumulated by wild-type and mutant strains of Escherichia coli: surrogate ligands for profiling bacterial membrane transporters.

Authors:  Jesus Enrique Salcedo-Sora; Srijan Jindal; Steve O'Hagan; Douglas B Kell
Journal:  Microbiology (Reading)       Date:  2021-02       Impact factor: 2.777

3.  GeneGini: Assessment via the Gini Coefficient of Reference "Housekeeping" Genes and Diverse Human Transporter Expression Profiles.

Authors:  Steve O'Hagan; Marina Wright Muelas; Philip J Day; Emma Lundberg; Douglas B Kell
Journal:  Cell Syst       Date:  2018-02-07       Impact factor: 10.304

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.