| Literature DB >> 28165734 |
Stephen J Capuzzi1, Eugene N Muratov1, Alexander Tropsha1.
Abstract
The use of substructural alerts to identify Pan-Assay INterference compoundS (PAINS) has become a common component of the triage process in biological screening campaigns. These alerts, however, were originally derived from a proprietary library tested in just six assays measuring protein-protein interaction (PPI) inhibition using the AlphaScreen detection technology only; moreover, 68% (328 out of the 480 alerts) were derived from four or fewer compounds. In an effort to assess the reliability of these alerts as indicators of pan-assay interference, we performed a large-scale analysis of the impact of PAINS alerts on compound promiscuity in bioassays using publicly available data in PubChem. We found that the majority (97%) of all compounds containing PAINS alerts were actually infrequent hitters in AlphaScreen assays measuring PPI inhibition. We also found that the presence of PAINS alerts, contrary to expectations, did not reflect any heightened assay activity trends across all assays in PubChem including AlphaScreen, luciferase, beta-lactamase, or fluorescence-based assays. In addition, 109 PAINS alerts were present in 3570 extensively assayed, but consistently inactive compounds called Dark Chemical Matter. Finally, we observed that 87 small molecule FDA-approved drugs contained PAINS alerts and profiled their bioassay activity. Based on this detailed analysis of PAINS alerts in nonproprietary compound libraries, we caution against the blind use of PAINS filters to detect and triage compounds with possible PAINS liabilities and recommend that such conclusions should be drawn only by conducting orthogonal experiments.Mesh:
Year: 2017 PMID: 28165734 PMCID: PMC5411023 DOI: 10.1021/acs.jcim.6b00465
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956
Figure 1Probing the extrapolative power of PAINS alerts. A histogram showing the distribution of the number of PAINS alerts (amounting to 480 total) as the function of the number of compounds used to derive each alert. Note that 190 PAINS alerts were derived from one representative compound only whereas only 18 PAINS alerts were derived from samples including more than 100 compounds per alert.
Figure 2Study design for examining compounds in PubChem tested with AlphaScreen assay technology. Six assays targeting PPIs and using AlphaScreen were identified in PubChem. Only those compounds that were tested in all six assays were considered. Compounds were binned into two categories according to the number of active calls. Compounds in each category were then queried for PAINS alerts. The PubChem-wide bioassay activity of all compounds was then investigated.
Lack of Pan-Assay Activity for Compounds with PAINS Alerts in PubChema
| compound categories | luciferase | β-lactamase | fluorescence | all assays | |
|---|---|---|---|---|---|
| FH-PAINS | 208 | 12% (93) | 4% (9) | 7% (312) | 10% (546) |
| FH-NoPAINS | 694 | 6% (95) | 2% (9) | 3% (320) | 5% (550) |
| IH-PAINS | 6413 | 3% (93) | 1% (10) | 2% (323) | 2% (550) |
| IH-NoPAINS | 21500 | 1.5% (95) | 0.5% (9) | 1% (326) | 1% (555) |
| Random-PAINS | 14611 | 3% (95) | 1% (12) | 2% (329) | 3% (562) |
| Random-NoPAINS | 58722 | 2% (93) | 0.6% (13) | 0.8% (321) | 1% (550) |
| Drugs-PAINS | 87 | 9% (71) | 7% (40) | 6% (223) | 24% (602) |
| Drug-NoPAINS | 1373 | 5% (59) | 5% (33) | 3% (183) | 15% (458) |
The average fraction of activity calls for PAINS and non-PAINS (defined as containing or lacking PAINS alerts, respectively) across both detection technology-specific assays and all assays in PubChem. The average number of assays in which the compounds were tested are shown in parentheses.
Defined by the compound profile in PPI assays utilizing AlphaScreen.
Defined by presence or absence of PAINS alerts.
Figure 3Compounds with multiple PAINS alerts. Two representative compounds from the IH-PAINS and FH-PAINS categories that contain multiple PAINS alerts.
PAINS Enrichment in Six PubChem Assays Employing AlphaScreena
Forty alerts were present in both FH-PAINS and IH-PAINS. Two alerts showed EVs greater or equal to 25% and were found in 10 or more total compounds. Six alerts had EVs below 1.0%.
Figure 4Random-PAINS displaying pan-assay activity. Tanespimycin and dihydrexidine are active in 85% and 50% of all assays in PubChem, respectively.
PAINS alerts enriched in Dark Chemical Mattera
Ten alerts are present in 100 or more in Dark Chemical Matter compounds (NDCM).
PAINS Alerts Displaying Elevated Assay Activity in PubChema
Twelve alerts have more than 10 compounds (NPubChem) and greater than 10% assay activity in either all assays, luciferase-, β-lactamase-, or fluorescence-based assays (bold). Six alerts were also present in DCM (NDCM). The average numbers of assays in which the compounds were tested are shown in parentheses.
PAINS Alerts Not Displaying Elevated Assay Activity in PubChema
These 6 alerts are present in more than 1000 compounds (NPubChem) and have less than 10% assay activity in either all assays, luciferase-, β-lactamase-, or fluorescence-based assays. All six alerts were also present in DCM (NDCM). The average numbers of assays in which the compounds were tested are shown in parentheses.
Global Assay Activity of Compounds in PubChem Possessing the Top 16 PAINS Alertsa
The bioassay activities of 14 out of 16 alerts have been profiled with more compounds than were used to derive the alerts originally.[6] Two alerts were found only in DCM (NDCM). The average numbers of assays in which the compounds were tested are shown in parentheses.