Literature DB >> 16381957

SuperNatural: a searchable database of available natural compounds.

Mathias Dunkel1, Melanie Fullbeck, Stefanie Neumann, Robert Preissner.   

Abstract

Although tremendous effort has been put into synthetic libraries, most drugs on the market are still natural compounds or derivatives thereof. There are encyclopaedias of natural compounds, but the availability of these compounds is often unclear and catalogues from numerous suppliers have to be checked. To overcome these problems we have compiled a database of approximately 50,000 natural compounds from different suppliers. To enable efficient identification of the desired compounds, we have implemented substructure searches with typical templates. Starting points for in silico screenings are about 2500 well-known and classified natural compounds from a compendium that we have added. Possible medical applications can be ascertained via automatic searches for similar drugs in a free conformational drug database containing WHO indications. Furthermore, we have computed about three million conformers, which are deployed to account for the flexibilities of the compounds when the 3D superposition algorithm that we have developed is used. The SuperNatural Database is publicly available at http://bioinformatics.charite.de/supernatural. Viewing requires the free Chime-plugin from MDL (Chime) or Java2 Runtime Environment (MView), which is also necessary for using Marvin application for chemical drawing.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16381957      PMCID: PMC1347494          DOI: 10.1093/nar/gkj132

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The world is full of natural products, but only a few existing natural products are known and our understanding of the metabolome is fragmentary. Nature invented a universe of secondary metabolites as ‘defense compounds’ against enemies in predator–prey relationships. Concomitantly, strategies for handling xenobiotics evolved, such as the multidrug resistance efflux pump and the cytochrome P450 monooxygenases (1,2). Tulp and Bohlin (3) hypothesize that when a natural compound occurs in unrelated species, it must have an important biological function, e.g. addressing a specific target, because fortuitous production of a particular compound by totally unrelated species is extremely improbable (3). About 200 000 natural compounds are currently known and many more will prove to be more than just ‘secondary metabolites’ (3). Even though combinatorial synthesis is now producing molecules that are drug-like in terms of size and property, these molecules, in contrast to natural products, have not evolved to interact with biomolecules (4). Natural compounds such as brefelidin A, camptothecin, forskolin and immunophilins often interfere with protein–protein interaction sites (5). Analysis of the properties of synthetic and natural compounds compared to drugs revealed the distinctiveness of natural compounds, especially concerning the diversity of scaffolds and the large number of chiral centers (6). This may be one reason why ∼50% of the drugs introduced to the market during the last 20 years are derived directly or indirectly from natural compounds (7). Although most drugs on the market have a natural origin, their availability often remains unclear (8). The percentage of new non-synthetic chemical entities in the area of cancer remained at a yearly average of 62% over the period of 1981–2002 (9). Some marine natural products are either in or approaching Phase II/III clinical trials in cancer, analgesia, allergy and cognitive diseases (10). The chemical diversity of these compounds is tremendous and may offer inspiration for innovations in the fields of medicine, nutrition, agrochemical and life sciences (11).

THE DATABASE

Several commercial databases and databases of rare compounds exist (12–14), but the SuperNatural Database is the first public resource containing 3D structures and conformers of 45917 natural compounds, derivatives and analogues purchasable from different suppliers. Currently, data from eight suppliers are available, but we plan to add further suppliers, compounds from which will be added on request (see ‘List of Suppliers’ on the SuperNatural Database website). The 2D structure of each compound, provided by the suppliers, was used to generate 3D structures (Discovery Studio, Accelrys Inc., ). Using a chemistry development kit (), fingerprints (966 bits, MACCS Keys) were calculated; each bit of a fingerprint represents functional groups (structural fingerprint). As a measure of 2D similarity we used the Tanimoto coefficient (15), which compares the bits of the structural fingerprints of two compounds. A Tanimoto coefficient of ≥0.85 indicates that a molecule has activities similar to a lead compound (16). For better coverage of the compounds and to ensure their flexibility during usage of the 3D-superposition algorithm, about three million conformers were evaluated (MedChem Explorer, Accelrys Inc., ). As a threshold for conformer generation, 20 kcal/mol as a relative maximum energy was set. This spacious threshold allows the user to find the best 3D superposition of two compounds even if they contain several rotatable bonds. The pre-computed fingerprints are stored in a MySQL-database on a web server, which is accessible via browser (see FAQ on the website for the database schema). Owing to the immense structural diversity of natural compounds compared to synthetic compounds, an increased spectrum of therapeutic activities can be covered. Natural compounds can be classified by different criteria (see the classification list at ‘Search via known compounds’ on the SuperNatural Database website): Classification by structural characteristics: alkaloid, amino acid, fatty acid, etc. Classification by functional aspects: vitamin, hormone, enzyme, etc. To find desired natural compounds, a number of search options were implemented: About 300 natural compounds from the SuperNatural Database are identical to active ingredients of drugs, and 8% (3600) of the natural compounds are similar to essential marketed drugs with Tanimoto coefficients >0.85. For each natural compound, information on different structural and chemical properties (DS Viewer, Property Calculator, ) such as number of chiral centers, estimated logp, surface area, etc. are precalculated and given in a separate ‘FULL INFO’ window (Figure 1D). For molecular visualization of the compounds, the user needs the free Chime-Plugin from MDL (available for Windows, SGI, Mac) or the Java2 Runtime Environment. Atomic coordinates of single or superimposed compounds are available for saving in Mol-format.
Figure 1

Screenshots of the web-interface of the SuperNatural Database. (A) Navigation frame and text query options for performing a search via known natural compounds. (B) Query results with the option for a 3D superposition. The 2D similarity query shows two compounds, which have a 2D similarity of 100.00 and 87.41 to the lead-structure. The compounds can be rotated (left mouse button), different display styles are available (right mouse button) and more detailed information concerning the properties of each structure can be obtained by use of the Properties button. Both compounds are available from the supplier MicroSource. (C) Screenshot of the Java applet Marvin, which allows upload or drawing of own structures for similarity searches in the SuperNatural Database. (D) Calculated properties for one structure. (E) Results of a 3D superposition. All conformations of both structures are superimposed and the best superposition is displayed. The table separately depicts the structures and the superposition of the corresponding conformations in the middle. The (superimposed) 3D structures can be saved by right clicking on the molecule. Also, information is given about the number of superimposed atoms and the root mean square distance.

As a starting point for screenings we compiled a searchable compendium of about 2500 well-known natural compounds characterized by a CAS-number (Chemical Abstracts), which is useful to cross-referencing other databases. This compendium contains systematic names, classification codes, empiric formulae, mixtures and synonyms (Figure 1A). Similarity searches based on fingerprints and Tanimoto coefficients are implemented in the SuperNatural Database (Figure 1B). Another way to perform a similarity search is the Marvin Applet, which allows the user to build or import a molecular structure and compare it with compounds of the SuperNatural Database (Figure 1C). Furthermore, an algorithm developed in our group enables 3D-superpositions of two compounds to be made. The algorithm compares all conformers of two compounds to find the best structural alignment (17) (Figure 1E). To identify possible applications, the user can search for similar drugs in the free drug database (SuperDrug Database) containing medical indications assigned by WHO (18).

PRACTICAL APPROACHES USING THE SIMILARITY SCREENING FUNCTION OF THE SuperNatural DATABASE

A detailed review of various approaches to similarity searching was given by Willet et al. (19). Screenings for new bioactive natural compounds on the basis of chemical similarity to a known ligand depend on the similar property principle of Johnson and Maggiora (20). As an example, we performed a similarity screening in the SuperNatural Database with natural compounds that are known drugs, from clinical trials or lead compounds for drug development (Tables 1 and 2 and Supplementary Data) (21). Our investigations showed that the database contains compounds that have already been investigated in clinical trials for different diseases (Table 1 and 2 and Supplementary Data) and a great number of compounds with calculated 2D similarities of ≥0.85 to the lead compounds. The SuperNatural Database contains 289 natural compounds, which are already known as drugs. Owing to the immense structural and chemical variety of natural compounds, the coverage of a great spectrum of diseases is possible, which is confirmed by the ATC classifications of the drugs (see ATC classification in the category statistics on the SuperNatural website). There are 73 different ATC classes (three letter abbreviations) covered by these 289 natural compounds. The results show that the SuperNatural Database is an excellent source for finding bioactive natural products.
Table 1

Well-known natural compounds (drugs, lead compounds for drugs or compounds in clinical trials) with antibacterial, antifungal, antiparasitic and antiviral effects and similar compounds (tanimoto ≥0.85) from the SuperNatural Database

*Anatomical Therapeutic Chemical (ATC) classification code generated by the World Health Organization (WHO) describes the therapeutic subgroup (25).

Table 2

Well-known natural compounds (drugs, lead compounds for drugs or compounds in clinical trials) used in areas of neurological diseases, immunological or inflammatory processes and oncological diseases and similar compounds (tanimoto ≥0.85) from the SuperNatural Database

*Anatomical Therapeutic Chemical (ATC) classification code generated by the World Health Organization (WHO) describes the therapeutic subgroup (25).

AVAILABILITY

The database is publicly available at . The data will be updated twice a year.

CONCLUSIONS AND FUTURE DIRECTIONS

The chemical diversity and unique properties of natural compounds provide a promising starting-point for developing innovations for scientific, medical and nutritional applications. The SuperNatural Database is a free resource with embedded screening functions for bioactive natural compounds. The extension of the database allows the scientific community simple access to a growing number of available natural compounds.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  23 in total

Review 1.  Fungal transporters involved in efflux of natural toxic compounds and fungicides.

Authors:  G Del Sorbo; H Schoonbeek; M A De Waard
Journal:  Fungal Genet Biol       Date:  2000-06       Impact factor: 3.495

2.  A marine natural product database.

Authors:  Jing Lei; Jiaju Zhou
Journal:  J Chem Inf Comput Sci       Date:  2002 May-Jun

3.  A 3D structure database of components from Chinese traditional medicinal herbs.

Authors:  Xuebin Qiao; Tingjun Hou; Wei Zhang; SenLi Guo; Xiaojie Xu
Journal:  J Chem Inf Comput Sci       Date:  2002 May-Jun

Review 4.  What can a chemist learn from nature's macrocycles?--a brief, conceptual view.

Authors:  Ludger A Wessjohann; Eelco Ruijter; Daniel Garcia-Rivera; Wolfgang Brandt
Journal:  Mol Divers       Date:  2005       Impact factor: 2.943

5.  SuperDrug: a conformational drug database.

Authors:  Andrean Goede; Mathias Dunkel; Nina Mester; Cornelius Frommel; Robert Preissner
Journal:  Bioinformatics       Date:  2005-02-02       Impact factor: 6.937

Review 6.  Natural products to drugs: natural product derived compounds in clinical trials.

Authors:  Mark S Butler
Journal:  Nat Prod Rep       Date:  2005-03-08       Impact factor: 13.423

7.  Rediscovery of known natural compounds: nuisance or goldmine?

Authors:  Martin Tulp; Lars Bohlin
Journal:  Bioorg Med Chem       Date:  2005-09-01       Impact factor: 3.641

Review 8.  [Caspofungin: mode of action and therapeutic applications].

Authors:  A Datry; E Bart-Delabesse
Journal:  Rev Med Interne       Date:  2005-06-27       Impact factor: 0.728

Review 9.  Antimalarial drugs: current status and new developments.

Authors:  Dharmendar Rathore; Thomas F McCutchan; Margery Sullivan; Sanjai Kumar
Journal:  Expert Opin Investig Drugs       Date:  2005-07       Impact factor: 6.206

Review 10.  Interfacial inhibition of macromolecular interactions: nature's paradigm for drug discovery.

Authors:  Yves Pommier; Jacqueline Cherfils
Journal:  Trends Pharmacol Sci       Date:  2005-03       Impact factor: 14.819

View more
  30 in total

1.  How "drug-like" are naturally occurring anti-cancer compounds?

Authors:  Fidele Ntie-Kang; Lydia L Lifongo; Philip N Judson; Wolfgang Sippl; Simon M N Efange
Journal:  J Mol Model       Date:  2014-01-24       Impact factor: 1.810

Review 2.  Review of natural product databases.

Authors:  Tao Xie; Sicheng Song; Sijia Li; Liang Ouyang; Lin Xia; Jian Huang
Journal:  Cell Prolif       Date:  2015-05-25       Impact factor: 6.831

Review 3.  Receptor-ligand molecular docking.

Authors:  Isabella A Guedes; Camila S de Magalhães; Laurent E Dardenne
Journal:  Biophys Rev       Date:  2013-12-21

4.  Identification of five structurally unrelated quorum-sensing inhibitors of Pseudomonas aeruginosa from a natural-derivative database.

Authors:  Sean Yang-Yi Tan; Song-Lin Chua; Yicai Chen; Scott A Rice; Staffan Kjelleberg; Thomas E Nielsen; Liang Yang; Michael Givskov
Journal:  Antimicrob Agents Chemother       Date:  2013-09-03       Impact factor: 5.191

5.  Biophysical and In-Silico Studies of Phytochemicals Targeting Chorismate Synthase from Drug-Resistant Moraxella Catarrhalis.

Authors:  Neetu Neetu; Monica Sharma; Jai Krishna Mahto; Pravindra Kumar
Journal:  Protein J       Date:  2020-10-10       Impact factor: 2.371

Review 6.  What lies underneath: conserving the oceans' genetic resources.

Authors:  Jesús M Arrieta; Sophie Arnaud-Haond; Carlos M Duarte
Journal:  Proc Natl Acad Sci U S A       Date:  2010-09-13       Impact factor: 11.205

7.  Virtual Screening with AutoDock: Theory and Practice.

Authors:  Sandro Cosconati; Stefano Forli; Alex L Perryman; Rodney Harris; David S Goodsell; Arthur J Olson
Journal:  Expert Opin Drug Discov       Date:  2010-06-01       Impact factor: 6.098

8.  Computer-aided identification of recognized drugs as Pseudomonas aeruginosa quorum-sensing inhibitors.

Authors:  Liang Yang; Morten Theil Rybtke; Tim Holm Jakobsen; Morten Hentzer; Thomas Bjarnsholt; Michael Givskov; Tim Tolker-Nielsen
Journal:  Antimicrob Agents Chemother       Date:  2009-04-13       Impact factor: 5.191

9.  NPACT: Naturally Occurring Plant-based Anti-cancer Compound-Activity-Target database.

Authors:  Manu Mangal; Parul Sagar; Harinder Singh; Gajendra P S Raghava; Subhash M Agarwal
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

10.  CamMedNP: building the Cameroonian 3D structural natural products database for virtual screening.

Authors:  Fidele Ntie-Kang; James A Mbah; Luc Meva'a Mbaze; Lydia L Lifongo; Michael Scharfe; Joelle Ngo Hanna; Fidelis Cho-Ngwa; Pascal Amoa Onguéné; Luc C Owono Owono; Eugene Megnassan; Wolfgang Sippl; Simon M N Efange
Journal:  BMC Complement Altern Med       Date:  2013-04-16       Impact factor: 3.659

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.