| Literature DB >> 29202222 |
Jonathan B Baell1,2, J Willem M Nissink3.
Abstract
Pan-Assay Interference Compounds (PAINS) are very familiar to medicinal chemists who have spent time fruitlessly trying to optimize these nonprogressible compounds. Electronic filters formulated to recognize PAINS can process hundreds and thousands of compounds in seconds and are in widespread current use to identify PAINS in order to exclude them from further analysis. However, this practice is fraught with danger because such black box treatment is simplistic. Here, we outline for the first time all necessary considerations for the appropriate use of PAINS filters.Entities:
Mesh:
Year: 2017 PMID: 29202222 PMCID: PMC5778390 DOI: 10.1021/acschembio.7b00903
Source DB: PubMed Journal: ACS Chem Biol ISSN: 1554-8929 Impact factor: 5.100
Figure 1Epoxide 1, aziridine 2, and nitroalkene 3 unrecognized by PAINS electronic filters because such compounds were not included in the inaugural WEHI HTS library. The dicyanoalkene 4, however, is a recognized PAINS chemotype, but the corresponding electronic filter ene_cyano_A would not recognize consequently plausible PAIN 5 simply because 5 was a substructure not represented by any compound in the initial HTS library from which PAINS were defined. Other reactive compounds such as β-aminoketone 6, isothiazolones 7, and toxoflavins like 8 are not recognized by PAINS filters because their PAINS behavior was only subsequently identified after filter definition.
Figure 2Simplified ontology of hits and false-positives. Red boxes indicate potential approaches to identify different types of hits in a cascade of assays. The type and order of assays used in the cascade needs to be considered based on the expected hit rates and achievable throughputs of the relevant assays. The term “frequent hitter” in a sense lies outside this system as it presumes a body of associated historical screening data.
Figure 3Interference by alkylanilines such as 9 with AlphaScreen signaling, probably through reaction with singlet oxygen, routinely returning an apparent IC50 value of around 3 μM.
Figure 4SMARTS implementations of the original SLN PAINS filter may inappropriately identify non-PAINS, shown here for 10 and 11 inappropriately identified as belonging to hzone_phenol_B and dyes5a PAINS classes, respectively.
Some of the Most Common PAINS Generally Recognized by Other Measures of Promiscuitya
These are in order according to the original Family_Filter_A. PAINS are characterized by an enrichment factor, defined as the number of analogues of a given class that registered as active in between 2 and 6 of the 6 HTS campaigns analyzed, expressed as a percentage of the number of analogues of that class that did not register as active in any of the 6 HTS campaigns. The AstraZeneca approach (AZ incidence) interrogates the AZ corporate database and reports the incidence of bioactivity of any compound relative to that expected from a random selection (6.5%). We have arbitrarily selected <10%, <15%, and ≥15% as the criteria for color coding green (benign), orange (raised), red (promiscuous). Badapple requires input of a compound of interest and then undertakes a hierarchical scaffold analysis and reports on the promiscuity score (pScore) of the different scaffolds that make up the compound of interest, where a pScore of 0–100 suggests no indication (green), 100–300 is a moderate score suggesting weak indication of promiscuity (orange), and >300 is a high pScore with a strong suggestion of promiscuity (red). Because Badapple is scaffold-centric, some substructures where substituents are part of the definition cannot be sensibly analyzed (no pScore shown) or are incompletely analyzed (pScore*).
Substructures Defined in PAINS Filters Not Generally Recognized As Promiscuous by Other Measuresa
Refer to Table .
Figure 5An example of successful scaffold hop during optimization of a PAIN to a non-PAIN chemotype.