Literature DB >> 24902483

Exposure time independent summary statistics for assessment of drug dependent cell line growth inhibition.

Steffen Falgreen¹, Maria Bach Laursen, Julie Støve Bødker, Malene Krag Kjeldsen, Alexander Schmitz, Mette Nyegaard, Hans Erik Johnsen, Karen Dybkær, Martin Bøgsted.

Abstract

BACKGROUND: In vitro generated dose-response curves of human cancer cell lines are widely used to develop new therapeutics. The curves are summarised by simplified statistics that ignore the conventionally used dose-response curves' dependency on drug exposure time and growth kinetics. This may lead to suboptimal exploitation of data and biased conclusions on the potential of the drug in question. Therefore we set out to improve the dose-response assessments by eliminating the impact of time dependency.
RESULTS: First, a mathematical model for drug induced cell growth inhibition was formulated and used to derive novel dose-response curves and improved summary statistics that are independent of time under the proposed model. Next, a statistical analysis workflow for estimating the improved statistics was suggested consisting of 1) nonlinear regression models for estimation of cell counts and doubling times, 2) isotonic regression for modelling the suggested dose-response curves, and 3) resampling based method for assessing variation of the novel summary statistics. We document that conventionally used summary statistics for dose-response experiments depend on time so that fast growing cell lines compared to slowly growing ones are considered overly sensitive. The adequacy of the mathematical model is tested for doxorubicin and found to fit real data to an acceptable degree. Dose-response data from the NCI60 drug screen were used to illustrate the time dependency and demonstrate an adjustment correcting for it. The applicability of the workflow was illustrated by simulation and application on a doxorubicin growth inhibition screen. The simulations show that under the proposed mathematical model the suggested statistical workflow results in unbiased estimates of the time independent summary statistics. Variance estimates of the novel summary statistics are used to conclude that the doxorubicin screen covers a significant diverse range of responses ensuring it is useful for biological interpretations.
CONCLUSION: Time independent summary statistics may aid the understanding of drugs' action mechanism on tumour cells and potentially renew previous drug sensitivity evaluation studies.

Entities: CellLine Chemical Disease Gene Species

Mesh：

Substances：
Growth Inhibitors

Year: 2014 PMID： 24902483 PMCID： PMC4127655 DOI： 10.1186/1471-2105-15-168

Source DB: PubMed Journal: BMC Bioinformatics ISSN： 1471-2105 Impact factor: 3.169

Background

An essential part of discovery and development of anticancer drugs is to assess the induced growth inhibition in a biologically broad range of tumour derived cell lines by dose-response experiments [1,2]. The three large cell line screens NCI60 [3,4], JFCR39 [5,6], and CMT1000 [2,7] are among the most well-known high throughput cell line drug screens. The approach used in CMT1000 and several other studies [8-10] is currently the standard approach for conducting dose-response experiments. The experiments are performed by challenging exponentially growing cell lines with a serial dilution of drug concentrations and estimating growth inhibition by relative cell counts between the treated and untreated cell line. Then, a summary statistic of drug efficiency (50% growth inhibition) is obtained by estimating the concentration at which the relative cell count is 50% after a fixed period of time. Hence, neither drug exposure time nor varying cell line growth rates are considered. The method is easily comprehended and implemented, however, as illustrated in Figure 1 this assessment of growth inhibition leads to summary statistics that are difficult to interpret. Panels A and B illustrate generated growth curves for two cell line models with doubling times 60 and 30 hours, respectively. The cell line models are treated with 6 increasing concentrations C1,…,C6 of a potent drug for which the effect is assumed constant through time, resulting in 6 growth/decay curves. For concentration C4 cell line model 1 is in the decay phase and cell line model 2 is in the growth phase suggesting that cell line model 1 is the more sensitive of the two.

Figure 1

Illustration of growth inhibition assessed by relative cell counts. Panels A and B show growth curves for two cell line models with doubling times of 60 and 30 hours, respectively. The cell line models are treated with 6 increasing concentrations C1,…,C6 and growth curves for each concentration are shown. The red line illustrates total growth inhibition (TGI). Dose-response curves calculated by relative cell counts for time points 25, 49, and 73 hours are shown in Panel C. Panel C illustrates dose-response curves calculated at three time points: 25, 49, and 73 hours, for the two cell line models. Because of the fast growth rate of cell line model 2, the summary statistic is obtained at a lower concentration for this cell line model than for cell line model 1 for each of the three time points. This indicates that cell line model 2 is evaluated as the more sensitive of the two. Hence, this assessment of growth inhibition generates summary statistics that are incomparable between cell lines with different growth rates. The dose-response experiments performed for the NCI60 and JFCR39 screens are summarised by comparing net differences between cell counts at observation time and the initial cell counts for the treated and untreated cell lines. As we illustrate later this method only partially solves the problem of growth rate dependency. The concept behind the present work is that modelling the growth of a cell line exposed to a drug by a simplified differential equation will allow us to derive dose-response curves and summary statistics that are independent of time under the proposed model. For estimation of the improved summary statistics a statistical workflow is suggested consisting of 1) pre-processing of absorbance measurements to account for multiplicative errors originating from e.g. cell line seeding [11] and correcting for background absorbance caused by the drug [12], 2) isotonic regression for modelling the dose-response curve which is robust against outliers and model misspecifications [13,14], and 3) a bootstrap method for estimation of confidence intervals for summary statistics [9]. We also aim to illustrate a transformation of the model used in the cell line screen NCI60, which accounts for each cell line’s doubling time and enables a reanalysis of existing dose-response data. Finally, the adequacy of the differential equation for modelling real data is tested using a doxorubicin screen. The screen is also used to investigate the applicability of the proposed statistical analysis workflow by providing variance estimates for obtained exposure time independent summary statistics.

Methods

The mathematical model

To analyse dose-response experiments rigorously we formulate a model of how the growth of a cell line is influenced by a given drug. The growth inhibition is modelled by the compartment models illustrated in Figures 2A and B. Panel A shows a compartment model for drugs that induce cell cycle arrest followed by death. For a cell line treated with drug concentration c ≥ 0 the number of unaffected cells at time t is denoted N0 (t,c). The growth of this cell population is assumed exponential with doubling time T0 or equivalently a growth rate of 1/T0. The concentration dependent rate for cells going into cell cycle arrest is likewise assumed exponential with halving time , and N1 (t,c) denotes the cell count for this population. Finally, the death rate is assumed exponential and concentration dependent with halving time .

Figure 2

Illustration of the proposed compartment models. Panel A illustrates a compartment model where the drug is assumed to induce cell cycle arrest with halving time and for cells in cell cycle arrest the death rate is assumed exponential with halving time . Panel B illustrates a simplified compartment model in which the drug is assumed to induce cell death with halving time . In both models the cell line grows with doubling time T0. Panel C illustrates growth curves according to Model B for a cell line with T0 = 60 hours and N0 = 30,000 cells treated with three different concentrations of a potent drug. The concentrations correspond to the summary statistics GI50 = c1, TGI =c2, and LC48 = c3. We focus on dose-response experiments where the cell count is estimated indirectly by cell metabolism. However, cells in cell cycle arrest have a very low metabolism and such cells are indistinguishable from dead cells in these experiments [15]. Because of this we use the simplified compartment model illustrated in Figure 2B for drugs that induce cell cycle arrest as well as drugs that lead directly to cell death. In this model N (t,c) denotes the cell count of metabolising cells at time t for a cell line treated with drug concentration c ≥ 0. The growth rate is assumed to be exponential with doubling time T0. Similarly, the death rates are assumed to be exponential with halving times that decrease concurrently with increasing drug concentrations c. For drugs that induce cell cycle arrest the halving time is equal to the rate with which cells go into cell cycle arrest when treated with drug concentration c. Compartment model B gives rise to the following differential equation model The differential equation has the following solution where the initial condition N0 = N (0,c), with c dropped for short, denotes the cell count at t = 0, and Tcorresponds to the net observed doubling or halving time at concentration c. The posed differential equation model can be summarised by the following statistics where GI50 (50% growth inhibition) denotes the concentration at which the cell line grows with a doubling time twice as long as the same cell line untreated, TGI (total growth inhibition) denotes the concentration at which the cell line has no net growth, and LC(lethal concentration t) denotes the concentration at which the cell count decays with a halving time of t hours. For example LC48 is the concentration at which N (48,c) = N0/2. The growth inhibition induced by these drug concentrations is illustrated in Figure 2C for a cell line model with doubling time T0 = 60 hours and N0 = 30,000. At the concentration corresponding to GI50 the doubling time for the cell line is doubled to 120, TGI the halving time such that the growth of the cell line is completely halted, and LC48 the halving time for the cell line is 48 hours. This leads us to suggest the following growth based dose-response model denoted by G for evaluating a dose-response experiment It is noteworthy that the G-model is independent of the duration of the dose-response experiment. The model is summarised by the statistics GI50, TGI, and LC48 at which the G (t,c) equals 0.5, 0, and -1/48, respectively. In general we define GIand LCto be the concentrations where G (t,c) = (100-x)/100 and G (t,c) = -1/t. The cell line screens NCI60 [3,4] and JFCR39 [5,6] apply an alternative dose-response model denoted by D, which is based on net differences between the cell count at time t, N (t,c), and the initial cell count, N0. The model is defined as The cell counts which the D-model is based upon are illustrated by the triangle and circles in Figure 2C for t = 48 hours. For this model x% growth inhibition and y% lethal concentration are attained at concentrations c1 and c2 where D(t,c1) = (100-x)/100 and D (t,c2) = -(100-y)/100. The dose-response model is usually summarised for a fixed t by , , and TGIthe latter of which is attained at the concentration c where D (t,c) = 0. The large cell line screen CMT1000 [2,7] utilises another commonly used dose-response model based on relative cell counts which is defined as The cell counts which the R-model is based upon are illustrated by circles in Figure 2C for t = 48 hours. For this model x% growth inhibition is attained at concentration c where R (t,c) = (100-x)/100. The R-model is usually summarised by , , and . For a fixed t, the graph of a dose-response model, say G, {(c,G (t,c)):c > 0} is denoted the dose-response curve of G. As the D- and R-models suggest, the corresponding dose-response curves are dependent on the time t, whereas the dose-response curve of G is not. Notice it is possible to define a fourth summary statistic AUC(area under curve) which is the area above a specified value q and below the dose-response curve [16]. Thus, for the dose-response models G and D, AUC0 is the area under the dose-response curve for which the cell count is still increasing with time. It is possible to circumvent the time dependency of models D and R by letting the drug exposure time t equal the cell line specific doubling time T0 or a multiple hereof, i.e. t = kT0. Furthermore, the summary statistics for the G-model are then related with the D-model by and for the R-model by Even if the drug exposure time is not kT0 it is still possible to obtain an estimate of the G-model by the following transformation of the D-model Similarly, an estimate of the G-model can be obtained by the following transformation of the R-model Note, however, that both transformations require access to the cell line specific doubling time T0.

Estimation of cell count

Absorbance measurements can be utilised as surrogates for the cell count N(t,c) and thereby used to estimate the three dose-response models. This is generally done using an MTS assay (3-(4,5-dimethylthiazol-2-yl)-5-(3-carboxymethoxyphenyl)-2-(4-sulfophenyl)-2H-tetrazolium) that exploits the mitochondrial reduction of tetrazolium to an aqueous soluble formazan product by the dehydrogenase enzyme in viable cells at 37°C. The amount of produced formazan is directly proportional to the cell count N (t,c) and can be quantified colourimetrically by measuring absorbance at 492 nm [12]. The time line for dose-response experiments utilising such assays is outlined in Figure 3. At time each cell line is seeded into two 96 well plates in which they incubate until time where C decreasing concentrations of the tested drug are added to each plate in L replicates. This time point marks the start of the dose-response experiment. For plate 1 the MTS assay is added immediately after drug exposure i.e. at time and for plate 2 the assay is added at time a pre-specified time after the addition of drug. Following the addition of the reagent each plate incubates for a fixed time tafter which the absorbance is measured. The metabolic reduction occurs from the instant the reagent is added until the absorbance is measured, thus the amount of formazan is related to the cell count during this time interval. Since the number of living cells may differ substantially throughout this period, the measured absorbance is assumed to estimate the cell count at the centre of the interval, i.e. at time and .

Figure 3

Time line for dose-response experiments. The red boxes mark the two time points that are used in the statistical analysis.

Time line for dose-response experiments. The red boxes mark the two time points that are used in the statistical analysis. By seeding cells in an appropriate medium at predetermined concentrations the relationship between the cell count and the absorbance measure can be assumed linear [12], i.e. where γ is a proportionality factor and αis the absorbance at time t, for a cell line exposed to drug concentration c, i = 1,…,I where 0 = c0 tetrazolium into the coloured formazan product. In order to ensure reproducible results the experiment is repeated K times for each cell line. The measured absorbance level for the l’th well, treated with concentration c, within the k’th cell line replicate, is assumed equal to where δis the inter-plate variation assumed to be multiplicative, βis the plate specific background absorbance which is assumed to be additive, and, finally, the technical variation εis assumed to be additive and normally distributed with mean zero and following a heteroscedastic variance model .

Statistical analysis workflow

The proposed statistical workflow has been implemented in the statistical software R version 3.0.1. The parameter estimation is performed using the function gnls from the package nlme[17]. Isotonic regression is implemented by the function isoreg from the library stats, and the area under the curve is calculated by the function trapz from the library pracma.

Conventionally pre-processing is performed through the following steps: 1) the background wells containing only medium are used to measure the plate specific background absorbance, which are averaged and subtracted from the absorbance measures of all other wells, 2) normalisation is done indirectly through calculation of growth inhibition by either of the dose-response models D or R, and 3) summation is done by an average of the obtained values. In this article, we propose a model-based approach for the pre-processing. The pre-processing of the dose-response experiments consists of outlier detection, background correction, normalisation, and summation. These steps are done simultaneously by estimating the coefficients β, δ, and αin absorbance model (11). In order to do this, the absorbance model is reformulated as a nonlinear regression model: where s, b = 1,…,n, is an indicator variable equal to 0 if Yis a background measurement and 1 otherwise, ϕ= Aβ, and The nonlinear regression model corresponds to the formalism in Chapter 7 of [17] and can be estimated by the methods herein. The components of the heteroscedastic variance is estimated by an iteratively reweighted scheme, see page 207 of [17]. The I + 1 coefficients are the summarised absorbance measures for each concentration c. Since negative absorbance measures are meaningless, all absorbance estimates below a pre-specified value are replaced by this value. We use the value 0.025 as the cut point in the current study. One of the favourable features of the model-based approach to pre-processing dose-response experiments is outlier detection based on residuals. The residuals are the difference between the observed values and the values estimated by the regression model. First, the regression model is fitted to all data. Absorbance measures with residuals greater than a pre-specified number of standard deviations are regarded as outliers and removed. Based on the remaining absorbance measures the model is fitted again and outliers are detected and removed. This process is iterated until no outliers are detected or until a pre-specified maximum number of iterations is reached. In the current study we use 3 standard deviations and iterate the process twice. When estimating the dose-response curve by the G-model (5) we need estimates of T0 and the T’s. This can be done by estimating the coefficients N0, T0, and in the solution to the proposed differential equation (2). The resulting estimates of the absorbance measures obtained by the pre-processing method are used as estimates of γN (t,c). Thus we consider the following nonlinear model where α0 is an estimate of γN0, sis an indicator function equal to 0 when i = 0 and 1 otherwise. This nonlinear model is also formulated in concordance with Chapter 7 in [17]. This gives the nonlinear regression function where ϕ= Aβ, and The additive error term εis assumed to be normally distributed with mean zero and variance . The estimates of T0 and are calculated as . The parameterisations T0 = 1/T, are used to ensure that is positive. Finally, the estimates of the ’s are obtained as for i = 1,…,I. In order to make the estimation method robust against outliers and model misspecifications we suggest to use isotonic regression [13,14] for which the dose-response curve is found by the piecewise linear and decreasing function that in square norm is closest to the data. Because the G-model is a piecewise smooth function with a singularity at the TGI-value a supplementary function is introduced to circumvent biased estimates. First we define the function Γ (t,c) = 1/Twhich is an analogue to the G-model in (5) without the multiplication by T0. Then we estimate the function Γ at concentrations c, i = 1,…,I, by the values which minimises , subject to γ1 ≥ γ2 ≥ ⋯ ≥ γ. The Γ-function is estimated by linear interpolation in the following way and the G-model is then estimated by plugging and into (5) The dose-response model D is similarly estimated pointwise by (6) with cell counts and estimated by the absorbance measures and . The approach used for the G-model is also recommended for estimating the dose-response curve for the D-model which also is a piecewise smooth function with a singularity at the TGI-value. In this case the function Γ (t,c) is replaced by the function Δ(t,c) = (N(t,c)-N0)/N0, and the D-model is subsequently estimated by The dose-response model R is estimated pointwise by (7) with . The dose-response curve for the R-model is obtained using isotonic regression and linear interpolation between the pointwise estimates. The estimates for GI50, TGI, and LC48 are obtained by the concentration c where , equals 0.5, 0, and -1/48, respectively. The summary statistic AUC0 is estimated by the area under the curve where . To compute confidence intervals for the summary statistics the following parametric bootstrap algorithm is applied with the number of iterations equal to J 1. For j in 1:J 1) Generate K plate sets on basis of the pre-processing model fitted to each cell line. 2) Fit the pre-processing model without outlier detection. 3) Fit the growth model to the pre-processed absorbance measurements. 4) Calculate the growth inhibition on basis of the G-model. 5) Estimate the summary statistics GI50, TGI, LC48, and AUC0. 2. Estimate a confidence interval for each summary statistic by use of the 2.5% and 97.5% percentiles of the J estimates obtained in step 5). A similar approach can be used to estimate summary statistics for the dose-response models D and R with summary statistics obtained by the concentrations where and equal e.g. 0.5, 0, and -0.5 and 0.75, 0.5, and 0.25, respectively. When the drug under investigation is coloured like e.g. doxorubicin or interacts with the MTS assay, the background absorbance measures are elevated for increasing drug concentrations. This elevation necessitates correction when estimating the cell count [12]. One method is to include a background control for each concentration of the drug. Such an approach, however, requires a large number of wells. Alternative one may create a number of background plates with a setup similar to the one used for evaluating the cell count but without seeding cells into them. Next, these plates are pre-processed as described in Model-based pre-processing, which results in measurements of the absorbance caused by the drug. Finally, the excessive absorbance caused by each concentration of the drug is subtracted from the raw absorbance. This is done as an initial step before the Statistical analysis workflow.

The simulation study

We compare the three dose-response models G, D, and R with a simulation study where the cell count is estimated through absorbance measurements based on nine simulated cell line models specified in Table 1. For each cell line this table includes the doubling time T0 and the summary statistics GI50, TGI, and LC48.

Table 1

Characteristics of the used cell line models

Cell line	T ₀	GI ₅₀	TGI	LC ₄₈
Cell 1	60	-8.40	-8.03	-7.57
Cell 2	30	-8.18	-7.78	-7.47
Cell 3	15	-7.95	-7.49	-7.27
Cell 4	15	-7.72	-7.27	-7.05
Cell 5	30	-7.50	-7.11	-6.80
Cell 6	60	-7.28	-6.91	-6.44
Cell 7	15	-7.05	-6.59	-6.37
Cell 8	60	-6.82	-6.46	-5.99
Cell 9	30	-6.60	-6.21	-5.90

Doubling time T0 and summary statistics GI50, TGI, and LC48 for the nine simulated cell line models.

Characteristics of the used cell line models Doubling time T0 and summary statistics GI50, TGI, and LC48 for the nine simulated cell line models. The drug is added in triplicates for C = 18 decreasing concentrations in 96 well culture plates. Further, three wells are used as untreated controls and three wells are used as background controls. In order to illustrate the time dependence the cell count is estimated at four time points with 24-hour intervals. The net growth of the cell lines are modelled using the differential equation (1) with modelled by the following five parameter logistic function: where a = 2, b = 1/5, d = 0, f = 1. The parameter e is specified such that the summary statistics shown in Table 1 are obtained. Each cell line model is simulated with N0 = 10,000 cells seeded into each well for all four plates at time t = 0 together with the drug. In order to imitate laboratory conditions the MTS assay is assumed added at time points hours and the absorbance is measured after t= 2 hours. Consequently, the cell counts are generated at . In order to attain absorbance measurements the proportionality factor γ in (10) is set equal to 0.4/10,000 for all nine cell line models. In order to investigate the dose-response model G’s capability of estimating the summary statistics GI50, TGI, and LC48, noise is added to the system. To mimic real data this is done according to absorbance model (11) with parameters chosen in concordance with the estimates obtained for the B-cell cancer cell line panel introduced later. For each cell line model the plate specific background absorbance βis drawn from a lognormal distribution with mean μ= -0.8 and standard deviation σ= 0.13; the plate specific multiplicative error δis likewise drawn from a lognormal distribution with mean μ= 0 and standard deviation σ= 0.38. The parameters μ, μ, σ, and σare respectively chosen as the mean and standard deviation of the log transformed estimates for β and δ obtained for the B-cell cancer cell line panel. Finally, the technical variation εis drawn from a mean zero normal distribution with heteroscedastic variance where ξ = 1.42 and σ= 0.074 are chosen as the medians of the estimates for ξ and σobtained for the B-cell cancer cell line panel. The Statistical analysis workflow is used to obtain estimates of the summary statistics GI50, TGI, and LC48 associated with each cell line for 1,000 simulated datasets. Finally, the mean bias, standard deviation, and mean square error (MSE) are calculated for each cell line model and time point.

The NCI60 cancer cell line panel

The cell line screen NCI60 is utilised to quantify the effect of a cell line’s doubling time on the GI50-value obtained by the dose-response model D in real data. Pharmacological data generated in the screen and modelled by the D-model is available online for 49,450 different compounds: http://dtp.nci.nih.gov. In this study we apply all compounds available in the September 2012 release that are tested at least three times on more than half the cell lines and for which half of the tested cell lines are affected by the drug. These criteria are satisfied for 1,699 different compounds. The growth inhibition data is averaged for all experiments that are not already summarised by the mean. Next, the G-model is calculated by use of the transformation (8). For the dose-response models G and D the summary statistics GI50 and are estimated using isotonic regression. The association between the cell lines’ doubling time T0 and the summary statistics GI50 and is determined using Pearson’s correlation coefficient for all 1,699 compounds. In order to determine whether or not the transformation causes a significant reduction in the correlation a paired t-test is used. The reduction in the aforementioned correlation is further illustrated for doxorubicin and the drug with the greatest change. For these compounds the summary statistics GI50 and are plotted against the doubling time T0, and the Pearson’s correlation coefficient is calculated.

The B-cell cancer cell line panel

A doxorubicin dose-response screen of 14 Diffuse Large B-cell Lymphoma and 12 Multiple Myeloma cell lines is used to illustrate the proposed Statistical analysis workflow. The origin of the cell lines is as listed: KMM-1 and KMS-11 were obtained from JCRB (Japanese Collection of Research Bioresources). AMO-1, DB, KMS-12-PE, KMS-12-BM, LP-1, MC-116, MOLP-8, NCI-H929, NU-DHL-1, NU-DUL-1, OPM-2, RPMI-8226, SU-DHL-4, SU-DHL-5, and U-266 were purchased from DSMZ (German Collection of Microorganisms and Cell Cultures). FARAGE, HBL-1, OCI-Ly3, OCI-Ly7, OCI-Ly19, RIVA, SU-DHL-8, and U2932 were kindly provided by Dr. Jose A. Martinez-Climent (Molecular Oncology Laboratory University of Navarra, Pamplona, Spain). Finally, Dr. Steven T. Rosen generously provided MM1S. The identity of the cell lines was verified by DNA barcoding performed every time a cell line was thawed and brought into culture. In brief, PCR analysis of left over traces of genomic DNA in 0.2 ng/μl extracted RNA from cell lines was used as template in a multiplex PCR amplifying specific repeated DNA regions using the AmpFISTR Identifiler PCR amplification kit (Applied Biosystems, CA, USA). A fragment analysis of the amplified PCR products was performed by capillary electrophoresis (Eurofins Medigenomix GmbH, Applied Genetics, Germany). The resulting FSA file was analysed using the Osiris software (http://ncbi.nlm.nih.gov/projects/SNP/osiris) confirming the identity of the cell lines. The cell lines were cultured under standard conditions at 37° C; in a humidified atmosphere of 95% air and 5% CO 2 with the appropriate medium (RPMI1640 or IMDM), fetal bovine serum (FBS), and 1% penicillin/streptomycin. The cell lines were maintained for a maximum of 20 passages to minimize any long-term culturing effects. Penicillin/streptomycin 1%, RPMI1640, IMDM and FBS were purchased from Invitrogen. Doxorubicin was added in quadruplicates for C = 18 decreasing concentrations using the 96 well plate setup shown in Figure 4. All wells marked with a C in the table were seeded with cells and doxorubicin was added 24 hours later. Proliferation assays were performed using the CellTiter 96®; AQueous one Solution Reagent Cat no. G3580 (Promega, Madison WI, USA). The plates were incubated for t= 2 hours and absorbance was estimated at 492 nM using the Optima-Fluostar (BMG LABTECH). For plate 1, the reagent was added immediately after drug exposure and for plate 2, 48 hours later. With this approach the estimates of the cell count were obtained at approximately t1 = 1 hour, and t2 = 49 hours. To achieve high reproducibility, the entire experiment was repeated at least thrice. In order to avoid border effects only non-border wells were used for the subsequent analysis, whereby absorbance measurements are available in triplicates.

Figure 4

Culture plate layout. Wells labelled: M contain medium alone, C0 contain cell culture with salt water added at time , C1,…,C18 contain cell culture with drug dilutions added at time , and B contain only medium with salt water added at time . The drug dilutions are given as 2-fold dilutions from C 18 = 10 μg/ml, C 17 = 5 μg/ml down to C 1 = 763 · 10-7μg/ml. Doxorubicin is a coloured agent which was accounted for according to Correction of background absorbance. Using the corrected absorbance measurements, the dose-response model G and time independent summary statistics GI50, TGI, LC48, and AUC0 were estimated according to the established Statistical analysis workflow. The outlined bootstrap algorithm was applied to estimate 95% confidence intervals for the summary statistics with the number of iterations J = 200.

Model check

Since different drugs have different action mechanisms one should investigate whether or not the proposed differential equation models data adequately well. This is, however, not possible if the dose response experiment has not been conducted for more than two time points. Here the experiment was expanded to include five time points , , , , . Each of the five plates are configured with the same setup as that described in section Dose-response experiments with t= 2 hours. This approach gives estimates of the cell counts at approximately t1 = 1, t2 = 13, t3 = 25, t4 = 37, t5 = 49 hours. The experiment was repeated thrice. The differential equation model is fitted to the data using only t1 and t5. By plotting the data for all time points together with the fitted model it is possible to observe whether or not the model fits adequately well or whether a more advanced model is necessary.

Results

Dose-response curves for the nine cell line models, described in Methods, are shown in Figure 5 for the dose-response models G (5), D (6), and R (7) and time points t1 = 25, t2 = 49, and, t3 = 73 hours.

Figure 5

The dose-response curves for the three dose-response models. The dose-response curves for the R-model are shown in Panels A, D, and G, for the time points 25, 49, and 73 hours, respectively. Similarly for the D- and G-models Panels B, E, and H and C, F, and I show the dose-response curves obtained for time points 25, 49, and 73 hours, respectively. The effect of the drug was modelled to be constant through time. However, according to both the R- and D-models the cell lines’ sensitivity toward the drug increased with time. More specifically, for the R-model this is illustrated in Panels A, D, and G of Figure 5 which depict the obtained dose-response curves for the three time points. For each cell line model the dose-response curves had the same sigmoidal shape for all time points. As shown in (7) the R-model is indifferent towards the cell line doubling time which entailed that the drug sensitivity increased equivalently for each cell line model as the drug exposure time was prolonged. Furthermore, cell line model 1 was simulated as the most sensitive with a GI50-value of -8.4 log10(mmol/ml). However, for all time points, the R-model evaluated it as the fourth most sensitive, surpassed by cell line models 2, 3, and 4 which were simulated with GI50-values of -8.18, -7.95, and, -7.72 log10(mmol/ml), respectively. For the D-model the increase in sensitivity was related to the growth rate of the cell line such that the order of the cell lines’ sensitivity interchanged through time. This is illustrated in Panels B, E, and H where the dose-response curves obtained by the D-model are shown for the three time points. The cell line models 3, 4, and 7 and 1, 6, and 8 were respectively fast and slowly growing. Accordingly, the increase in sensitivity through time was much more pronounced for cell line models 3, 4, and 7 than for 1, 6, and 8. In particular the fast growth rate of cell line model 7 and slow growth rate of cell line model 6 caused the obtained -values to interchange throughout the three time points. The cell lines 1, 2, and 3 were simulated with GI50-values of -8.40, -8.18, and, -7.95 log10(mmol/ml), respectively; however, the -values obtained by the D-model were indistinguishable in Panel H. Additionally, the sensitivity level for cell line models 6 and 7 were reversed such that cell line 7 was evaluated to be more sensitive to the drug than cell line 6. This implied that the summary statistics obtained by the R- and D-models were biased and relative to the cell lines’ sensitivity they were ordered incorrectly. Time independent summary statistics obtained by the G-model equalled those shown in Table 1. The dose-response models G and D are continuous everywhere and differentiable everywhere except at the TGI-value. The latter results in the singularity occurring at that value for both functions. Since the R-model is continuous and differentiable everywhere such singularities do not occur for this model. For each cell line model, 1,000 independent datasets were simulated with a culture plate and noise setup as outlined in Methods. The mean bias, standard deviation, and MSE for the summary statistics GI50, TGI, and LC48 obtained by the outlined statistical analysis workflow applied to the G-model are shown in Table 2 for the nine cell line models and three time points. The Statistical analysis workflow combined with the G-model was capable of producing unbiased estimates of the time independent summary statistics. The standard deviation and MSE were generally smallest for fast growing cell lines and decreased as the drug exposure time was prolonged.

Table 2

Summary of the simulation study

	Mean bias			Standard deviation			MSE
	GI ₅₀	TGI	LC ₄₈	GI ₅₀	TGI	LC ₄₈	GI ₅₀	TGI	LC ₄₈
Cell line 1, T ₀ = 60
Time 25	0.01	0.04	0.07	0.49	0.49	0.22	0.24	0.24	0.05
Time 49	-0.07	-0.06	0.00	0.36	0.36	0.14	0.13	0.14	0.02
Time 73	-0.05	-0.05	-0.01	0.25	0.25	0.11	0.06	0.07	0.01
Cell line 2, T ₀ = 30
Time 25	-0.10	-0.08	-0.01	0.41	0.41	0.24	0.17	0.17	0.06
Time 49	-0.04	-0.04	-0.02	0.18	0.19	0.13	0.03	0.04	0.02
Time 73	-0.02	-0.02	-0.01	0.11	0.12	0.09	0.01	0.01	0.01
Cell line 3, T ₀ = 15
Time 25	-0.05	-0.04	-0.02	0.21	0.25	0.21	0.05	0.06	0.05
Time 49	-0.01	-0.01	0.00	0.08	0.11	0.10	0.01	0.01	0.01
Time 73	-0.02	-0.01	0.00	0.07	0.09	0.08	0.01	0.01	0.01
Cell line 4, T ₀ = 15
Time 25	-0.02	0.00	0.01	0.20	0.22	0.20	0.04	0.05	0.04
Time 49	-0.01	0.00	0.00	0.09	0.11	0.10	0.01	0.01	0.01
Time 73	-0.04	-0.04	-0.03	0.10	0.11	0.11	0.01	0.01	0.01
Cell line 5, T ₀ = 30
Time 25	-0.08	-0.06	0.00	0.43	0.43	0.23	0.19	0.19	0.05
Time 49	-0.03	-0.03	-0.01	0.17	0.18	0.12	0.03	0.03	0.01
Time 73	-0.03	-0.03	-0.01	0.12	0.13	0.09	0.02	0.02	0.01
Cell line 6, T ₀ = 60
Time 25	-0.07	0.00	0.07	0.69	0.63	0.23	0.47	0.40	0.06
Time 49	-0.08	-0.06	0.00	0.41	0.38	0.14	0.18	0.15	0.02
Time 73	-0.04	-0.03	0.00	0.25	0.23	0.10	0.06	0.06	0.01
Cell line 7, T ₀ = 15
Time 25	-0.05	-0.03	-0.02	0.22	0.24	0.20	0.05	0.06	0.04
Time 49	-0.02	-0.01	0.00	0.09	0.11	0.10	0.01	0.01	0.01
Time 73	-0.10	-0.11	-0.04	0.22	0.23	0.10	0.06	0.06	0.01
Cell line 8, T ₀ = 60
Time 25	-0.10	-0.03	0.06	0.73	0.65	0.22	0.54	0.42	0.05
Time 49	-0.14	-0.09	-0.02	0.49	0.41	0.15	0.26	0.18	0.02
Time 73	-0.06	-0.05	-0.01	0.29	0.27	0.11	0.09	0.08	0.01
Cell line 9, T ₀ = 30
Time 25	-0.09	-0.06	0.00	0.46	0.44	0.23	0.22	0.20	0.05
Time 49	-0.04	-0.03	-0.01	0.17	0.19	0.12	0.03	0.04	0.02
Time 73	-0.02	-0.02	-0.01	0.11	0.12	0.08	0.01	0.01	0.01

The mean bias, standard deviation, and mean squared error of the summary statistics GI50, TGI, and LC48 obtained by the G-model for the 1,000 simulated datasets and the three time points.

Summary of the simulation study The mean bias, standard deviation, and mean squared error of the summary statistics GI50, TGI, and LC48 obtained by the G-model for the 1,000 simulated datasets and the three time points. To check if the proposed differential equation models the dose response data adequately a time experiment was conducted. As an example of the model check the model-based pre-processed absorbance data for five different time points are shown in Figure 6 for the cell line SU-DHL-4. Model (12) was fitted using only the t1 = 1 and t2 = 49 hour time points. In this instance the model was found to fit the data adequately and that restricting the model fit to two time points yielded satisfactory results. However, it seems that the growth inhibition was underestimated for the large concentrations. This was the case for most cell lines and was a consequence of only using the 1 and 49 hour time points for estimating model (12).

Figure 6

The adequacy of the proposed differential equation model is checked. Absorbance measurements Growth curves for the cell line SU-DHL-4 for five time points: t1 = 1, t2 = 13, t3 = 25, t4 = 37, and t5 = 49 hours are shown for the control C0 and under influence of the ten strongest concentrations of doxorubicin C9,…,C18. The growth curves are fitted using only the time points t1 and t49. The points correspond to the model-based pre-processed absorbance measurements. In the last panel the fitted growth curves for the cell line untreated (green) and for all ten concentrations (grey) are shown. In this panel the blue, red, and black curves correspond to the estimated growth curves at the summary statistics GI50, TGI, and LC48. The NCI60 dose-response screen was used to illustrate how data obtained by the D-model can be transformed into the G-model by use of (8) and thereby be corrected for the cell line doubling time T0 and duration of the experiment. For the 1,699 compounds that satisfied the selection criteria the association between growth inhibition and growth rate was determined by Pearson’s correlation coefficient between T0 and the summary statistics GI50 and estimated by the dose-response models G and D, respectively. Kernel estimated density functions of the correlations obtained using the two models are shown in Figure 7A. Because the bias associated with the D-model renders fast growing cell lines overly sensitive, the correlation between T0 and the summary statistic was biased upwards. The average decrease of the aforementioned correlation, by adjusting for the cell line doubling time, was 0.145 (95 % CI: (0.14;0.15), p-value <0.001). More specifically, a significant negative correlation was found for 50 compounds using the D-model and 123 compounds using the G-model. In contrast, a significant positive correlation was found for 705 compounds using the D-model and only 278 using the G-model.

Figure 7

Correction of the summary statistic . Panel A: Kernel estimated density functions of correlations between T0 and summary statistics and GI50 for 1699 compounds obtained by the D-model (blue) and the G-model (green), respectively. Panel B: Correlation between T0 and the uncorrected summary statistic obtained by the D-model for the single agent doxorubicin. Panel C: Correlation between T0 and the time independent summary statistic GI50 obtained by the G-model for doxorubicin. Panel D: Correlation between T0 and the uncorrected summary statistic obtained by the D-model for NSC = 624806. Panel E: Correlation between T0 and the time independent summary statistic GI50 obtained by the G-model for NSC = 624806. In Figure 7B to 7E the considered correlation is illustrated for the single agent doxorubicin and the drug giving rise to the greatest change in correlation by plotting the summary statistics and GI50 against T0. For doxorubicin the correlation was 0.16, (95% CI: (-0.1,0.4)) for the uncorrected D-model and -0.03, (95% CI: (-0.28,0.22)) for the G-model. The drug with NSC number 624806 gave rise to the greatest change in correlation, specifically, from 0.41, (95% CI: (0.15,0.62)) for the uncorrected D-model to -0.26, (95% CI: (-0.5,0.02)) for the G-model. The ten drugs with the greatest negative correlation between T0 and GI50 for model G have NSC numbers: 38721, 343513, 338720, 638850, 637404, 624807, 59729, 630684, 698673, and 353882. For model D the drugs were: 637404, 638850, 19994, 698673, 627666, 637603, 626734, 630684, 690134, and 37364. Out of these drugs four were found through both model G and D. A doxorubicin dose-response screen was used to illustrate the proposed Statistical analysis workflow. Doxorubicin is a coloured agent that elevates the absorbance measurements with increasing concentrations. In Figure 8 the background absorbance associated with each concentration is plotted. This was corrected for according to Correction of background absorbance prior to the application of the suggested pre-processing procedure.

Figure 8

Background absorbance measures as a function of the concentrations of doxorubicin. The bars depict 95% CI’s associated with the absorbance.

Background absorbance measures as a function of the concentrations of doxorubicin. The bars depict 95% CI’s associated with the absorbance. In Figure 9 the result of the pre-processing procedure is illustrated for the cell line SU-DHL-4. Panels A and B show the raw absorbance measures for the four replicated experiments whereas the effect of the colour correction is shown in Panels C and D. In Panels E and F the results of the conventionally applied background correction are depicted. Finally, the result of the Model-based pre-processing is illustrated in Panels G and H. When comparing panels E and F to G and H we noticed that the mean absorbance was estimated with a considerable lower variance when the model-based pre-processing was used. A cross marks the measurements that are found to be outliers and for example two of the un-treated control measurements for plate 2 were deemed to be outliers as illustrated in Panel H. In panel H these measurements were clearly extreme values, however, prior to the model-based pre-processing this was not the case.For each cell line residual plots of the final pre-processing models were investigated to ensure that the absorbance model fitted the data reasonably well. For cell line KMS-12-BM the residual plot is illustrated in Figure 10. Panel A shows the residual plot obtained when the heteroscedasticity of the variance was ignored, whilst Panel B shows the residual plot when the variance model was fitted. These plots confirm that the variance model was capable of fitting the heteroscedastic variance observed in dose-response experiments.

Figure 9

Figure 10

Residual absorbance for cell line KMS-12-BM at 49 hours plotted against the pre-processed absorbance measure. The three colours represent the triplicates. The triangles and circles represent background values and all other values, respectively. Panel A shows the result of ignoring the heteroscedasticity of the variance and Panel B shows the result of using the variance model.

The result of the pre-processing procedure is illustrated for the cell line SU-DHL-4. The circles represent absorbance measures for the particular concentration at which it is plotted, the triangles represent the un-treated controls, the plusses represent background absorbance measurements, and, finally, the crosses illustrate outliers. The figure is divided into eight panels, where Panels A, C, E, and G show the results for time t1 = 1 hour and Panels B, D, F, and H for time t2 = 49 hours. Panels A and B show the raw absorbance measures for the four replicated experiments. The effect of the colour correction is shown in Panels C and D. Panels E and F illustrate the result of the conventionally applied background correction. Finally, the result of the model-based pre-processing procedure is illustrated in Panels G and H. Residual absorbance for cell line KMS-12-BM at 49 hours plotted against the pre-processed absorbance measure. The three colours represent the triplicates. The triangles and circles represent background values and all other values, respectively. Panel A shows the result of ignoring the heteroscedasticity of the variance and Panel B shows the result of using the variance model. The dose-response curves obtained for the 14 DLBCL and 12 MM cancer cell lines are illustrated in Panels A and B of Figure 11. The first quadrant of the plots depicts the percentage growth for the treated cell line compared to the same cell line un-treated, e.g. the values 75, 50, and 25 were attained at the concentrations where the doubling time for the control was 75, 50, and 25% of that for the treated cell line, or equivalently, the growth rate of the treated cell line was 75, 50, and 25% of that for the un-treated cell line. The fourth quadrant depicts cell decay, e.g. the values -1/48, -1/24, and -1/16 were attained at the concentrations where the cell line population was halved in 48, 24, and 16 hours, respectively. None of the curves contained points where the treated cell line outgrew the controls, i.e. values greater than 100. This was an effect of forcing to be positive which was of great importance for the summary statistic AUC0. The estimated cell line doubling time and summary statistics GI50, TGI, LC 48, and AUC0 are shown in Table 3 with associated 95% confidence intervals. The bootstrapped summary statistics GI50, TGI, LC 48, and AUC0 are also illustrated by box plots in Panels C, D, E, and F of Figure 11.

Figure 11

Illustration of dose-response curves for the B-cell cancer cell line panel. Panels A and B illustrate dose-response curves obtained by the G-model for the 14 DLBCL and 12 MM cancer cell lines. Panels C, D, E, and F depict boxplots of the bootstrapped summary statistics GI50, TGI, LC 48, and AUC0, respectively. The green and blue colours are used for DLBCL and MM cell lines, respectively.

Table 3

Summary statistics for the B-cell cancer cell line panel

Cell Line	T ₀	G -Model
	Hours	GI ₅₀	TGI	LC ₄₈	AUC ₀
DLBCL
OCI-Ly7	24 (21;26)	-6.23 (-6.31;-6.18)	-5.96 (-6.05;-5.84)	-5.57 (-5.74;-5.31)	325 (310;332)
RIVA	68 (55;91)	-6.35 (-6.39;-6.27)	-5.87 (-6.18;-5.79)	-4.97 (-5.36;-4.82)	327 (316;337)
U2932	34 (33;35)	-6.53 (-6.56;-6.51)	-6.28 (-6.29;-6.25)	-5.60 (-5.68;-5.48)	295 (285;299)
DB	37 (35;39)	-6.58 (-6.65;-6.52)	-6.19 (-6.25;-6.15)	-5.71 (-5.76;-5.65)	276 (260;285)
SU-DHL-4	36 (34;39)	-6.59 (-6.64;-6.52)	-6.33 (-6.39;-6.24)	-5.92 (-6.05;-5.73)	289 (277;294)
MC-116	50 (41;70)	-6.64 (-6.87;-6.48)	-6.29 (-6.38;-6.20)	-5.57 (-5.62;-5.53)	272 (233;300)
HBL-1	46 (42;51)	-6.83 (-6.89;-6.78)	-6.46 (-6.55;-6.37)	-5.76 (-5.83;-5.68)	272 (266;277)
OCI-Ly3	91 (69;119)	-7.02 (-7.06;-6.94)	-6.89 (-6.96;-6.81)	-6.67 (-6.73;-6.59)	253 (244;259)
NU-DUL-1	48 (40;63)	-7.27 (-7.33;-7.21)	-7.13 (-7.14;-7.11)	-7.02 (-7.04;-7.00)	224 (191;231)
SU-DHL-8	85 (53;175)	-7.32 (-7.39;-7.21)	-7.19 (-7.32;-7.10)	-7.01 (-7.07;-6.93)	222 (207;231)
NU-DHL-1	29 (27;32)	-7.39 (-7.49;-7.29)	-6.94 (-6.98;-6.89)	-6.66 (-6.70;-6.63)	201 (184;213)
SU-DHL-5	32 (30;35)	-7.42 (-7.44;-7.40)	-7.29 (-7.32;-7.26)	-7.12 (-7.16;-7.08)	202 (189;209)
FARAGE	58 (49;72)	-7.79 (-7.93;-7.49)	-6.93 (-7.23;-6.80)	-6.44 (-6.52;-6.37)	180 (158;198)
OCI-Ly19	39 (36;45)	-7.87 (-7.95;-7.79)	-7.13 (-7.25;-6.83)	-6.39 (-6.46;-6.31)	167 (157;178)
MM
KMS-11	48 (45;51)	-5.91 (-5.94;-5.88)	-5.65 (-5.68;-5.63)	-5.46 (-5.48;-5.44)	356 (340;361)
KMM-1	44 (40;52)	-6.03 (-6.13;-5.95)	-5.87 (-5.91;-5.85)	-5.70 (-5.74;-5.66)	346 (316;350)
KMS-12-PE	24 (20;29)	-6.04 (-6.24;-5.91)	-5.73 (-5.82;-5.61)	-5.50 (-5.61;-5.32)	340 (310;353)
LP-1	33 (30;36)	-6.07 (-6.12;-6.02)	-5.82 (-5.85;-5.80)	-5.60 (-5.62;-5.59)	316 (294;336)
U-266	48 (43;54)	-6.13 (-6.20;-6.06)	-5.86 (-5.91;-5.80)	-5.42 (-5.47;-5.37)	325 (299;339)
OPM-2	57 (47;71)	-6.24 (-6.33;-6.13)	-5.98 (-6.07;-5.91)	-5.61 (-5.64;-5.57)	329 (313;339)
KMS-12-BM	47 (42;57)	-6.25 (-6.33;-6.16)	-5.92 (-6.02;-5.85)	-5.59 (-5.61;-5.56)	331 (307;337)
RPMI-8226	30 (29;32)	-6.42 (-6.45;-6.40)	-6.11 (-6.14;-6.09)	-5.87 (-5.89;-5.85)	297 (292;306)
NCI-H929	28 (25;31)	-6.50 (-6.54;-6.48)	-6.39 (-6.42;-6.34)	-6.25 (-6.31;-6.17)	291 (270;300)
AMO-1	32 (30;34)	-6.74 (-6.76;-6.72)	-6.61 (-6.64;-6.58)	-6.30 (-6.40;-6.17)	269 (265;283)
MOLP-8	34 (21;48)	-6.78 (-6.93;-6.62)	-6.52 (-6.67;-6.27)	-6.23 (-6.37;-6.10)	253 (229;290)
MM1S	37 (26;51)	-7.20 (-7.29;-7.10)	-7.05 (-7.12;-6.95)	-6.87 (-6.94;-6.60)	227 (208;239)

The doubling time T0 and summary statistics GI50, TGI, LC48, and AUC0 for the dose-response model G.

Discussion

In the present study a differential equation that models drug induced growth inhibition of human tumour cell lines was established. Based on this equation a novel model for summarising dose-response experiments was produced, that in combination with a statistical workflow, is capable of generating unbiased time independent summary statistics. To determine if the differential equation is adequate for modelling real data a time experiment based on doxorubicin was conducted. The experiment included five time points of which only the first and last were used to fit the differential equation. The differential equation was found to model the data adequately, albeit the use of only two time points may lead to an underestimated drug efficiency for large doses. Since the differential equation was found adequate, a simulation study was performed to document the potential bias when using existing methods, and the robustness of the workflow. We deduced that under the proposed differential equation these summary statistics are biased estimators of growth inhibition so that the drug effect is amplified concurrently with increasing growth rate of the cell lines. In Kondoh et al. [18] 40 representative anticancer drugs from the NCI60 screen were used to illustrate the association between cell line growth rate and drug sensitivity assessed by the D-model. They found the growth rate was positively correlated with drug sensitivity. We propose that this finding is partly caused by systematic bias induced by the experimental setup of the cell line screens. Since the difference between the treated and un-treated cell line will increase with time, the effect of the drug will seem greater for fast growing cell lines. We showed that by transforming data obtained by the D-model into the G-model the correlation between doubling time and drug resistance decreased significantly. We do not argue against drug resistance being associated with growth rate as the authors successfully discover and validate a potential new anticancer drug, we merely suggest that removing the design-based bias may lead to a range of new potential drugs to be investigated. In order to illustrate the suggested workflow for dose-response experiments, a study of 26 cell lines tested for drug resistance at 18 different concentrations of doxorubicin was presented. The results illustrate that it is possible to gain realistic estimates of the variance of the growth inhibition characteristics, which is of great value in the application of dose-response studies.

Practical considerations

Since the establishment of NCI60, dose-response screens of human tumour cell lines have been one of the most commonly used methods for discovering new anticancer drugs [2]. The approach has mostly been used to discover drugs that are potent in a considerable part of the tested cell lines originating from various tumour types. With this purpose in mind, the bias introduced by analysing cell lines with different doubling times has little or no influence on the conclusions. More recently, the cell line screens have been used to discover treatments that are only potent in a small proportion of the tested cell lines and hence in a small proportion of the cancer patients. Ignoring the doubling time of the cell lines may reduce the capability of discovering such drugs since slowly growing cell lines may appear resistant to the drug. The issue of growth rate bias may be remedied by using cell lines with approximately the same doubling time or alternatively by conducting the experiments using individual time spans corresponding to a given number of doubling times for each cell line. Based on the latter approach Bracht et al. [19] took the doubling times into account by conducting the dose-response experiments such that each of the 77 cell lines was exposed to the drug for three cell line specific doubling times. For large cell line screens it is neither feasible to generate diverse panels consisting of cancer cell lines with similar doubling times nor is it practical to conduct each experiment for different time spans. The latter option is further complicated since it may not be possible to keep slowly growing cell lines in the exponential growth phase for several doubling times throughout the experiment. The models used in cell line screens NCI60 [3], JFCR39 [5], and CMT100 [2] are based on fixed drug exposure times. We established transformations of these models so that each cell line’s doubling time can be accounted for.

Methodological considerations

Modelling the growth of a cell line exposed to an anticancer drug by the simple differential equation (1) facilitated a meticulous analysis of existing summary statistics for cell line based dose-response studies of growth inhibition. It may be possible to establish a differential equation that leads to either the D or R model. However, the authors have not been able to do so in an unblemished fashion. It is thus difficult to determine which assumptions must be met for the results of these models to be unbiased. The differential equation was based on exponential cell growth which seems a reasonably assumption since all drug response assays strive toward using the exponential growth phase of the cell lines for the out-read window. Similarly, the rate for cells going into cell cycle arrest or death is assumed exponential and concentration dependent, partly due to computational convenience and partly because no obvious alternative is present. It should be emphasized the assumption of an exponential rate for cells going into cell cycle arrest or death induce a constant drug efficiency throughout the experiment. However, since different drugs induce growth inhibition by different mechanisms, the established differential equation (1) is oversimplified and may therefore model the growth of a cell line exposed to a drug inadequately. It would be interesting to establish more complex systems of differential equations of cell culture growth in combination with more measurements during drug exposure time [20-25]. This would allow estimation of drug induced growth inhibition with improved precision and hence increased biological understanding. A model-based approach to pre-processing based on a nonlinear regression model was introduced. This model efficiently and simultaneously addresses a number of issues such as background absorbance correction, multiplicative seeding effects and heteroscedastic variance of absorbance measures. All are well-known nuisance effects in cellular/bacterial growth studies [11,26]. The modelling approach also facilitated outlier detection by residual analysis and standard model checks from regression theory [17]. The dose-response relationship was modelled by the growth curves arising from the solution to the posed differential equation. This lead to pointwise estimates of the dose-response curve of the G-model and interpolation of the curve was done by isotonic regression which is robust against outliers and model misspecifications [14,27]. Providing precision estimates of the growth inhibition characteristics in this complex setting is not straightforward, so parametric bootstrap of the nonlinear model of the absorbance measurements was used [28]. Alternatively the statistical delta method could have been applied [11,29]. Although feasible, this would have required complicated approximations by Taylor series expansions, and bootstrapping is generally considered to have superior small sample properties [30]. The dose-response model R (7) based on relative cell counts is very appealing due to its simplicity. Moreover, it is a smooth function so it is possible to fit parametric models to the dose-response curve of R which facilitates extrapolation. When extrapolation is necessary it is possible to fit a parametric model to the dose-response curve of R[31,32] and subsequently transform the result into the dose-response curve of G using (9). This approach facilitates estimation of time independent summary statistics by extrapolation.

Conclusions

In this study we have shown that conventionally used dose-response models can give rise to biased summary statistics erroneously correlated to the growth rate of the cell lines. We have developed novel summary statistics of dose-response experiments that are applicable on existing data and independent of time under the proposed differential equation. Consequently, we expect that the present approach will be able to improve future drug evaluation studies.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors conceived the project. SF and MB designed the differential equation model and algorithms for estimating the associated coefficients with confidence intervals. The laboratory studies were designed by SF, MBL, JSB, MKK, KD, and MB and performed by MBL, JSB, and MKK. AS, MN, HEJ, and KD contributed reagents and materials used in the study. SF implemented algorithms and performed the computational experiments. All authors analyzed the results, were involved in manuscript preparation, and read and approved the final manuscript.

22 in total

1. Analysis of resonance chemotherapy in leukemia treatment via multi-staged population balance models.

Authors: E Sherer; R E Hannemann; A Rundell; D Ramkrishna
Journal: J Theor Biol Date: 2006-01-23 Impact factor: 2.691

2. The sensitivity and specificity of the MTS tetrazolium assay for detecting the in vitro cytotoxicity of 20 chemicals using human cell lines.

Authors: G Malich; B Markovic; C Winder
Journal: Toxicology Date: 1997-12-31 Impact factor: 4.221

Review 3. The NCI60 human tumour cell line anticancer drug screen.

Authors: Robert H Shoemaker
Journal: Nat Rev Cancer Date: 2006-10 Impact factor: 60.716

Review 4. Cell line-based platforms to evaluate the therapeutic efficacy of candidate anticancer agents.

Authors: Sreenath V Sharma; Daniel A Haber; Jeff Settleman
Journal: Nat Rev Cancer Date: 2010-03-19 Impact factor: 60.716

5. Suppression of microtubule dynamic instability and turnover in MCF7 breast cancer cells by sulforaphane.

Authors: Olga Azarenko; Tatiana Okouneva; Keith W Singletary; Mary Ann Jordan; Leslie Wilson
Journal: Carcinogenesis Date: 2008-10-23 Impact factor: 4.944

Review 6. Models, mechanisms and clinical evidence for cancer dormancy.

Authors: Julio A Aguirre-Ghiso
Journal: Nat Rev Cancer Date: 2007-11 Impact factor: 60.716

7. A random effect multiplicative heteroscedastic model for bacterial growth.

Authors: Ricardo Cao; Mario Francisco-Fernández; Emiliano J Quinto
Journal: BMC Bioinformatics Date: 2010-02-08 Impact factor: 3.169

8. Identification of genotype-correlated sensitivity to selective kinase inhibitors by using high-throughput tumor cell line profiling.

Authors: Ultan McDermott; Sreenath V Sharma; Lori Dowell; Patricia Greninger; Clara Montagut; Jennifer Lamb; Heidi Archibald; Raul Raudales; Angela Tam; Diana Lee; S Michael Rothenberg; Jeffrey G Supko; Raffaella Sordella; Lindsey E Ulkus; A John Iafrate; Shyamala Maheswaran; Ching Ni Njauw; Hensin Tsao; Lisa Drew; Jeff H Hanke; Xiao-Jun Ma; Mark G Erlander; Nathanael S Gray; Daniel A Haber; Jeffrey Settleman
Journal: Proc Natl Acad Sci U S A Date: 2007-12-06 Impact factor: 11.205

9. Evaluation of action mechanisms of toxic chemicals using JFCR39, a panel of human cancer cell lines.

Authors: Noriyuki Nakatsu; Tomoki Nakamura; Kanami Yamazaki; Soutaro Sadahiro; Hiroyasu Makuuchi; Jun Kanno; Takao Yamori
Journal: Mol Pharmacol Date: 2007-08-16 Impact factor: 4.436

10. Mechanistic mathematical modelling of mercaptopurine effects on cell cycle of human acute lymphoblastic leukaemia cells.

Authors: J C Panetta; W E Evans; M H Cheok
Journal: Br J Cancer Date: 2006-01-16 Impact factor: 7.640

10 in total

1. MicroRNA-155 controls vincristine sensitivity and predicts superior clinical outcome in diffuse large B-cell lymphoma.

Authors: Hanne Due; Anna Amanda Schönherz; Laura Ryø; Maria Nascimento Primo; Ditte Starberg Jespersen; Emil Aagaard Thomsen; Anne Stidholt Roug; Min Xiao; Xiaohong Tan; Yuyang Pang; Ken H Young; Martin Bøgsted; Jacob Giehm Mikkelsen; Karen Dybkær
Journal: Blood Adv Date: 2019-04-09

Review 2. The myeloma stem cell concept, revisited: from phenomenology to operational terms.

Authors: Hans Erik Johnsen; Martin Bøgsted; Alexander Schmitz; Julie Støve Bødker; Tarec Christoffer El-Galaly; Preben Johansen; Peter Valent; Niklas Zojer; Els Van Valckenborgh; Karin Vanderkerken; Mark van Duin; Pieter Sonneveld; Martin Perez-Andres; Alberto Orfao; Karen Dybkær
Journal: Haematologica Date: 2016-11-10 Impact factor: 9.941

3. Diffuse large B-cell lymphoma classification system that associates normal B-cell subset phenotypes with prognosis.

Authors: Karen Dybkær; Martin Bøgsted; Steffen Falgreen; Julie S Bødker; Malene K Kjeldsen; Alexander Schmitz; Anders E Bilgrau; Zijun Y Xu-Monette; Ling Li; Kim S Bergkvist; Maria B Laursen; Maria Rodrigo-Domingo; Sara C Marques; Sophie B Rasmussen; Mette Nyegaard; Michael Gaihede; Michael B Møller; Richard J Samworth; Rajen D Shah; Preben Johansen; Tarec C El-Galaly; Ken H Young; Hans E Johnsen
Journal: J Clin Oncol Date: 2015-03-23 Impact factor: 44.544

4. A bendamustine resistance gene signature in diffuse large B-cell lymphoma and multiple myeloma.

Authors: Issa Ismail Issa; Rasmus Froberg Brøndum; Hanne Due; Linnéa Schmidt; Martin Bøgsted; Karen Dybkær
Journal: Cancer Drug Resist Date: 2021-03-19

5. A multiple myeloma classification system that associates normal B-cell subset phenotypes with prognosis.

Authors: Julie Støve Bødker; Rasmus Froberg Brøndum; Alexander Schmitz; Anna Amanda Schönherz; Ditte Starberg Jespersen; Mads Sønderkær; Charles Vesteghem; Hanne Due; Caroline Holm Nørgaard; Martin Perez-Andres; Mehmet Kemal Samur; Faith Davies; Brian Walker; Charlotte Pawlyn; Martin Kaiser; David Johnson; Uta Bertsch; Annemiek Broyl; Mark van Duin; Rajen Shah; Preben Johansen; Martin Agge Nørgaard; Richard J Samworth; Pieter Sonneveld; Hartmut Goldschmidt; Gareth J Morgan; Alberto Orfao; Nikhil Munshi; Hans Erik Johnson; Tarec El-Galaly; Karen Dybkær; Martin Bøgsted
Journal: Blood Adv Date: 2018-09-25

6. Predicting response to multidrug regimens in cancer patients using cell line experiments and regularised regression models.

Authors: Steffen Falgreen; Karen Dybkær; Ken H Young; Zijun Y Xu-Monette; Tarec C El-Galaly; Maria Bach Laursen; Julie S Bødker; Malene K Kjeldsen; Alexander Schmitz; Mette Nyegaard; Hans Erik Johnsen; Martin Bøgsted
Journal: BMC Cancer Date: 2015-04-08 Impact factor: 4.430

7. The CXCR4 antagonist plerixafor enhances the effect of rituximab in diffuse large B-cell lymphoma cell lines.

Authors: Linn Reinholdt; Maria Bach Laursen; Alexander Schmitz; Julie Støve Bødker; Lasse Hjort Jakobsen; Martin Bøgsted; Hans Erik Johnsen; Karen Dybkær
Journal: Biomark Res Date: 2016-06-14

8. Molecular classification of tissue from a transformed non-Hogkin's lymphoma case with unexpected long-time remission.

Authors: Julie Støve Bødker; Marianne Tang Severinsen; Tarec Christoffer El-Galaly; Rasmus Froberg Brøndum; Maria Bach Laursen; Steffen Falgreen; Mette Nyegaard; Alexander Schmitz; Lasse Hjort Jakobsen; Anna Amanda Schönherz; Hanne Due; Linn Reinholdt; Martin Bøgsted; Karen Dybkær; Hans Erik Johnsen
Journal: Exp Hematol Oncol Date: 2017-01-11

9. Modelling changes in glutathione homeostasis as a function of quinone redox metabolism.

Authors: Ross A Kelly; Joseph Leedale; Dominic Calleja; Steven J Enoch; Andy Harrell; Amy E Chadwick; Steven Webb
Journal: Sci Rep Date: 2019-04-19 Impact factor: 4.379

10. Hsp90 inhibition sensitizes DLBCL cells to cisplatin.

Authors: Linnéa Schmidt; Issa Ismail Issa; Hulda Haraldsdóttir; Jonas Laugård Hald; Alexander Schmitz; Hanne Due; Karen Dybkær
Journal: Cancer Chemother Pharmacol Date: 2022-02-21 Impact factor: 3.333

10 in total