| Literature DB >> 21247425 |
Plamen Dragiev1, Robert Nadon, Vladimir Makarenkov.
Abstract
BACKGROUND: High-throughput screening (HTS) is a key part of the drug discovery process during which thousands of chemical compounds are screened and their activity levels measured in order to identify potential drug candidates (i.e., hits). Many technical, procedural or environmental factors can cause systematic measurement error or inequalities in the conditions in which the measurements are taken. Such systematic error has the potential to critically affect the hit selection process. Several error correction methods and software have been developed to address this issue in the context of experimental HTS 1234567. Despite their power to reduce the impact of systematic error when applied to error perturbed datasets, those methods also have one disadvantage - they introduce a bias when applied to data not containing any systematic error 6. Hence, we need first to assess the presence of systematic error in a given HTS assay and then carry out systematic error correction method if and only if the presence of systematic error has been confirmed by statistical tests.Entities:
Mesh:
Year: 2011 PMID: 21247425 PMCID: PMC3034671 DOI: 10.1186/1471-2105-12-25
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Systematic error in experimental HTS data. Hit distribution surfaces for the McMaster (cases (a) and (b) - 1250 plates - [18]) and Princeton (cases (c) and (d) - 164 plates - [19]) Universities experimental HTS assays. Values deviating from the plate means for more than 2 standard deviations - cases (a) and (c), and for more than 3 standard deviations - cases (b) and (d) were selected as hits. The well, row and column positional effects are shown (the wells containing controls are not presented).
Figure 2Simulation 1, Plate Size: 96 wells - Cohen's Kappa vs Error Size. Systematic error size: 10% (at most 2 columns and 2 rows affected). First column: (a) - (c): α = 0.01; Second column: (d) - (f): α = 0.1. Systematic Error Detection Tests: (◆) t-test and (■) K-S test.
Figure 4Simulation 1, Plate Size: 1536 wells - Cohen's Kappa vs Error Size. Systematic error size: 10% (at most 8 columns and 8 rows affected). First column: (a) - (c): α = 0.01; Second column: (d) - (f): α = 0.1. Systematic Error Detection Tests: (◆) t-test and (■) K-S test.
Five types of HTS datasets containing different kinds of systematic and/or random error generated and tested in this study
| Error type | Generation of error-affected measurements |
|---|---|
where: is the error-affected value in well i,j (row i, column j) of plate p.
xis the original value in well i,j of plate p in the error-free dataset.
ris the systematic error in row i (constant over all plates); it had a normal distribution with the parameters ~N(0, C).
xis the systematic error in column j (constant over all plates); it had a normal distribution with the parameters ~N(0, C).
wis the systematic error that affects well i,j (row i, column j) and is the same for all plates; it had a normal distribution with the parameters ~N(0, C).
ris the systematic error in row i of plate p; it had a normal distribution with the parameters ~N(0, C).
cis the systematic error in column i of plate p; it had a normal distribution with the parameters ~N(0, C).
Randis the random error affecting well i,j (row i, column j) of plate p; it had a normal distribution with the parameters ~N(0, 0.3×SD).
Datasets for C = 0, 0.6×SD, 1.2×SD, 1.8×SD, 2.4×SD and 3×SD were generated and tested, where μ is the mean and SD is the standard deviation of the error-free dataset.
Figure 3Simulation 1, Plate Size: 384 wells - Cohen's Kappa vs Error Size. Systematic error size: 10% (at most 4 columns and 4 rows affected). First column: (a) - (c): α = 0.01; Second column: (d) - (f): α = 0.1. Systematic Error Detection Tests: (◆) t-test and (■) K-S test.
Figure 5Simulation 2, Plate Size: 96 wells, Cohen's Kappa vs Hit Percentage. Systematic error size: 10% (at most 2 columns and 2 rows affected). First column: cases (a) - (b): α = 0.01; Second column: cases (c) - (d): α = 0.1. Systematic Error Detection Tests: (◆) t-test, (■) K-S test and (▲)χgoodness-of-fit test.
Figure 6Simulation 2, Plate Size: 384 wells, Cohen's Kappa vs Hit Percentage. Systematic error size: 10% (at most 4 columns and 4 rows affected). First column: cases (a) - (b): α = 0.01; Second column: cases (c) - (d): α = 0.1. Systematic Error Detection Tests: (◆) t-test, (■) K-S test and (▲)χgoodness-of-fit test.
Figure 7Simulation 2, Plate Size: 1536 wells, Cohen's Kappa vs Hit Percentage. Systematic error size: 10% (at most 8 columns and 8 rows affected). First column: cases (a) - (b): α = 0.01; Second column: cases (c) - (d): α = 0.1. Systematic Error Detection Tests: (◆) t-test, (■) K-S test and (▲)χgoodness-of-fit test.
Number of rows, columns and plates (where at least one row or column contains systematic error) of McMaster Test assay in which the t-test reported the presence of systematic error, depending on the α parameter
| Plates | Rows | Rows % | Columns | Columns % | |
|---|---|---|---|---|---|
| 0.01 | 159 | 76 | 0.76% | 94 | 0.75% |
| 0.05 | 814 | 575 | 5.76% | 606 | 4.86% |
| 0.1 | 1121 | 1148 | 11.50% | 1296 | 10.38% |
| 0.2 | 1241 | 2242 | 22.46% | 2583 | 20.70% |
Only 8 rows and 10 columns of McMaster Test assay were examined because the first and twelfth columns of the (8 by 12) plates were used for controls.
Number of hits selected in McMaster Test assay for the μ-3SD threshold after the application of the B-score correction, depending on the α parameter
| Original hits | Obtained hits | Preserved hits | Added hits | Removed hits | |
|---|---|---|---|---|---|
| 0.01 | 96 | 123 | 57 | 66 | 39 |
| 0.05 | 96 | 125 | 55 | 70 | 41 |
| 0.1 | 96 | 126 | 52 | 74 | 44 |
| 0.2 | 96 | 130 | 55 | 75 | 41 |
The t-test was carried out to detect systematic error.
Number of hits selected in McMaster Test assay for the μ-2.29SD threshold (i.e., threshold used by the McMaster competition organizers to select the 96 original average hits) after the application of the B-score correction, depending on the α parameter
| Original hits | Obtained hits | Preserved hits | Added hits | Removed hits | |
|---|---|---|---|---|---|
| 0.01 | 96 | 357 | 79 | 278 | 17 |
| 0.05 | 96 | 419 | 79 | 340 | 17 |
| 0.1 | 96 | 411 | 79 | 332 | 17 |
| 0.2 | 96 | 417 | 76 | 341 | 20 |
The t-test was carried out to detect systematic error.
Number of hits selected in McMaster Test assay for the μ-3SD and μ-2.29SD thresholds after the application of the Well Correction method
| Threshold | Original hits | Obtained hits | Preserved hits | Added hits | Removed hits |
|---|---|---|---|---|---|
| 96 | 26 | 26 | 0 | 70 | |
| 96 | 102 | 72 | 30 | 24 | |
Figure 8Intersections between the original set of hits (96 hits in total) and the sets of hits obtained after the application of the . The μ- 2.29SD hit selection threshold was used to select hits.