| Literature DB >> 26097510 |
Rowan Hatherley1, David K Brown1, Thommas M Musyoka1, David L Penkler1, Ngonidzashe Faya1, Kevin A Lobb2, Özlem Tastan Bishop1.
Abstract
BACKGROUND: Natural products (NPs) are important to the drug discovery process. NP research efforts are expanding world-wide and South Africa is no exception to this. While freely-accessible small molecule databases, containing compounds isolated from indigenous sources, have been established in a number of other countries, there is currently no such online database in South Africa. DESCRIPTION: The current research presents a South African natural compound database, named SANCDB. This is a curated and fully-referenced database containing compound information for 600 natural products extracted directly from journal articles, book chapters and theses. There is a web interface to the database, which is simple and easy to use, while allowing for compounds to be searched by a number of different criteria. Being fully referenced, each compound page contains links to the original referenced work from which the information was obtained. Further, the website provides a submission pipeline, allowing researchers to deposit compounds from their own research into the database.Entities:
Keywords: Chemical databases; Natural products; South Africa
Year: 2015 PMID: 26097510 PMCID: PMC4471313 DOI: 10.1186/s13321-015-0080-8
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Figure 1Simplified layout of the SANCDB database schema. Five tables from the database are shown. The links between tables represent many-to-many relationships.
Figure 2User interface for the search and download of compounds from the SANCDB website. The Search/Browse section represents the different criteria that can be used to query the database. Matching queries are returned on the same page in a tabulated format, displaying the compound’s SANC ID and the criteria matched, as well as a 2D image of the compound (not shown here). Compounds can be selected for download from this page in one of the chemical formats shown. The SANC ID of each compound links to a summary page, divided as follows. A Basic compound information and links to external databases; B toggle between compound 2D image and 3D representation; C additional compound information and links to references.
Figure 3Snapshop of the compound summary page. The interface for the SANCDB website is shown, with a screenshot of the part of the compound summary page for entry SANC00238, as explained in Figure 2.
Figure 4Compound uses within SANCDB. Chart showing the distribution of the 284 entries for tested uses of compounds within the database. Uses that comprised less than 1% of entries are grouped in the section labelled ‘Other’.
Figure 5Physicochemical properties of compounds in the database, based on Lipinski’s “rule of five”. a Bar graph of compounds grouped by number of hydrogen bond acceptors; b bar graph of compounds grouped by number of hydrogen bond donors; c scatter plot of the LogP of each compound, plotted against their mass. The red dotted lines indicate the cut-off Lipinski values for each axis (LogP ≤5 and mass ≤500 Da); d tabulated grouping of compounds in the database with their number of violations of Lipinski’s “rule of five”.