| Literature DB >> 29718389 |
Sunghwan Kim1, Paul A Thiessen1, Tiejun Cheng1, Bo Yu1, Evan E Bolton1.
Abstract
PubChem (https://pubchem.ncbi.nlm.nih.gov) is one of the largest open chemical information resources available. It currently receives millions of unique users per month on average, serving as a key resource for many research fields such as cheminformatics, chemical biology, medicinal chemistry, and drug discovery. PubChem provides multiple programmatic access routes to its data and services. One of them is PUG-REST, a Representational State Transfer (REST)-like web service interface to PubChem. On average, PUG-REST receives more than a million requests per day from tens of thousands of unique users. The present paper provides an update on PUG-REST since our previous paper published in 2015. This includes access to new kinds of data (e.g. concise bioactivity data, table of contents headings, etc.), full implementation of synchronous fast structure search, support for assay data retrieval using accession identifiers in response to the deprecation of NCBI's GI numbers, data exchange between PUG-REST and NCBI's E-Utilities through the List Gateway, implementation of dynamic traffic control through throttling, and enhanced usage policies. In addition, example Perl scripts are provided, which the user can easily modify, run, or translate into another scripting language.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29718389 PMCID: PMC6030920 DOI: 10.1093/nar/gky294
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Monthly average of the number of PUG-REST requests per day and unique IP addresses per day.
Figure 2.Syntax of a PUG-REST web service request URL, with an example that retrieves (in a text file) compounds tested to be active in an assay.
Figure 3.Specification of the input, operation, and output parts of a PUG-REST request.
Figure 4.Construction of PUG-REST requests for chemical structure search and molecular formula search in synchronous and asynchronous approaches.
Figure 5.Schematic diagram for the function of the List Gateway.
Figure 6.Comparison of the identifiers and lengths of the target protein sequences for AIDs 38693 and 1159673.