| Literature DB >> 23231059 |
Sandra Plancade1, Yves Rozenholc, Eiliv Lund.
Abstract
BACKGROUND: Illumina BeadArray technology includes non specific negative control features that allow a precise estimation of the background noise. As an alternative to the background subtraction proposed in BeadStudio which leads to an important loss of information by generating negative values, a background correction method modeling the observed intensities as the sum of the exponentially distributed signal and normally distributed noise has been developed. Nevertheless, Wang and Ye (2012) display a kernel-based estimator of the signal distribution on Illumina BeadArrays and suggest that a gamma distribution would represent a better modeling of the signal density. Hence, the normal-exponential modeling may not be appropriate for Illumina data and background corrections derived from this model may lead to wrong estimation.Entities:
Mesh:
Year: 2012 PMID: 23231059 PMCID: PMC3599453 DOI: 10.1186/1471-2105-13-329
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Normal-exponential fit. Normal-Exponential estimation for one array from (E1) after removal of imperfectly designed probes: irregular density histogram of all regular probe intensities and the plug-in normexp density of the regular probes with MLE, RMA and NP parameter estimates.
Figure 2Normal-exponential and normal-gamma fit. Normal-Gamma estimation for one array from (E1) after removal of imperfectly designed probes: irregular density histogram of all regular probe intensities, plug-in normexp density with RMA estimate and plug-in normal-gamma density with MLE estimate.
Deviation between reconstructed intensities and observation histogram
| | |||
|---|---|---|---|
| nexp MLE | 7.09 | 5.14 | 4.83 |
| nexp RMA | 2.96 | 3.18 | 2.71 |
| nexp NP | 7.69 | 5.50 | 5.29 |
| Abs Dev normgam | 0.17 | 0.21 | 0.20 |
Average deviation between normexp reconstructed density and histogram divided by the deviation between normal-gamma reconstructed density and histogram (First row: RMA estimator; second row: MLE normexp estimator; third row: NP normexp estimator). The fourth row gives the mean deviation of the normal-gamma estimator as a reference. The mean is computed over the ten arrays from (E1) with (first column) or without (second column) the non specific binding probes and over the four arrays from (E2) (third column).
Relative-error for each parameter in the normal-gamma model using MLE estimates
| set 1 | 7.1e-4 | 5.6e-3 | 9.3e-3 | 1.7e-2 |
| set 2 | 1.3e-3 | 5.5e-3 | 1.0e-2 | 1.8e-2 |
| set 3 | 3.5e-3 | 1.6e-2 | 6.9e-3 | 8.3e-3 |
| set 4 | 4.5e-3 | 1.3e-2 | 8.9e-3 | 9.8e-3 |
| set 5 | 2.1e-3 | 7.6e-3 | 2.6e-2 | 1.7e-2 |
| set 6 | 3.5e-3 | 7.2e-3 | 3.9e-2 | 2.4e-2 |
Error in parameter estimation
| set 3 | 1.1 | 1.0 | 1.6 |
| set 4 | 1.2 | 1.0 | 1.8 |
| set 5 | 2.1 | 1.0 | 2.3 |
| set 6 | 1.9 | 1.0 | 1.9 |
Ratio between the relative L1errors of the MLE estimation in the normal-gamma and in the normexp models for (μ,σ,θ) from normexp data.
Excess risk ratio of background corrected raw-scale intensities
| set 1 | 1.00 | 4.16 | 1.77 | 1.52 | 1.16 | 2.34 |
| set 2 | 1.00 | 4.10 | 1.90 | 1.66 | 1.20 | 11.7 |
| set 3 | 1.00 | 1.00 | 4.69 | 1.00 | 1.00 | 4.57 |
| set 4 | 1.00 | 1.00 | 3.71 | 1.00 | 1.02 | 31.4 |
| set 5 | 1.00 | 1.00 | 2.11 | 1.00 | 1.15 | 2.95 |
| set 6 | 1.00 | 1.00 | 1.46 | 1.00 | 1.35 | 17.2 |
Mean Absolute Deviation (MAD) of the background corrected intensities for methods , j=1,…,5 divided by the MAD for the theoretical normal-gamma BgC with the true parameters (method ), from the simulation data set (S1). Column 1: normal-gamma, column 2: normexp-MLE, column 3: normexp-RMA, column 4: normexp-NP, column 5: background subtraction. The MAD of the theoretical normal-gamma deconvolution with the true parameters is given as reference in column 6.
Excess risk ratio of background corrected log-transformed intensities
| set 1 | 1.00 | 1.32 | 1.18 | 1.17 |
| set 2 | 1.00 | 1.28 | 1.16 | 1.16 |
| set 3 | 1.00 | 1.00 | 2.98 | 1.00 |
| set 4 | 1.00 | 1.00 | 2.45 | 1.00 |
| set 5 | 1.00 | 1.00 | 1.81 | 1.00 |
| set 6 | 1.00 | 1.00 | 1.39 | 1.00 |
Mean Absolute Deviation (MAD) of the background corrected intensities for methods , j=1,…,5 divided by the MAD for the theoretical normal-gamma BgC with the true parameters (method ), from the simulation data set (S1). Column 1: normal-gamma, column 2: normexp-MLE, column 3: normexp-RMA, column 4: normexp-NP.
Figure 3Absolute deviation of the signal estimation on simulated data. Logarithm of the Absolute Deviation of estimated signal on raw scale (first row), Absolute Deviation of log-transformed estimated signal (second row) and signal log-density (third row). Normal-gamma BgC (purple) and normexp BgC with MLE (pink), RMA (blue) and NP (green) parameters.
Figure 4Operating characteristics of the BgC methods on spike-in and simulated data. Row 1: average spike intensities (left) and standard deviation of spike replicates (right) for all non-zero spike concentrations. Row 2 to 4: average intensity (left) and standard deviation of replicates (right) as a function of signal intensity. Row 2: normal-gamma simulation in data set (S3) (parameter set 7); Row 3: gamma signal and empirical background noise distribution (data set (S4)); Row 4 normal-exponential simulation in data set (S3) (parameter set 9).
Innate offset and operating characteristics
| normexp MLE | 23.4 | 0.095 | 0.74 |
| normexp NP | 12.4 | 0.100 | 0.80 |
| normexp RMA | 6.9 | 0.110 | 0.86 |
| normal-gamma | 1.5 | 0.200 | 0.99 |
Innate offset, average standard deviation of spike replicates and slope of the linear regression of the spike average intensity on the log-concentration.
Figure 5AUC as a function of added offset. AUC from moderated t-test for mixed sample differential analysis in data set (E4) (proportion 25%/75% and 75%/25%) for different values of offset.
AUC from Spearman correlation test
| | ||||
|---|---|---|---|---|
| 250ng | 0.9778 | 0.9812 | 0.9813 | 0.9820 |
| 100ng | 0.9774 | 0.9807 | 0.9809 | 0.9808 |
| 50ng | 0.9805 | 0.9834 | 0.9832 | 0.9841 |
| 10ng | 0.9782 | 0.9818 | 0.9787 | 0.9816 |
AUC from Spearman correlation test between the proportion and the intensity in the dilution data set (E5), for the four BgC methods, and the four RNA starting concentrations.