| Literature DB >> 28458954 |
Bernard Rosner1,2, Shelley Tworoger1,3, Weiliang Qiu1.
Abstract
Diagnostic biomarkers are used frequently in epidemiologic and clinical work. The ability of a diagnostic biomarker to discriminate between subjects who develop disease (cases) and subjects who do not (controls) is often measured by the area under the receiver operating characteristic curve (AUC). The diagnostic biomarkers are usually measured with error. Ignoring measurement error can cause biased estimation of AUC, which results in misleading interpretation of the efficacy of a diagnostic biomarker. Several methods have been proposed to correct AUC for measurement error, most of which required the normality assumption for the distributions of diagnostic biomarkers. In this article, we propose a new method to correct AUC for measurement error and derive approximate confidence limits for the corrected AUC. The proposed method does not require the normality assumption. Both real data analyses and simulation studies show good performance of the proposed measurement error correction method.Entities:
Keywords: AUC; Biomarkers; Non-normal distributions
Year: 2015 PMID: 28458954 PMCID: PMC5409172 DOI: 10.4172/2155-6180.1000270
Source DB: PubMed Journal: J Biom Biostat
Bias, mean square error (MSE), and coverage for AUC (μ) from simulation I **.
| λ | μY | μX | AUCtrue | MW | R | P | |
|---|---|---|---|---|---|---|---|
| 0 | 0 | 0.25 | 0.570 | Bias(×103) 95% Cl | -13 (-17, -9) | 0 (-4,4) | 0 (-4,5) |
| MSE(×104) 95% Cl | 18 (16,20) | 19 (18,22) | 25 (22,28) | ||||
| Coverage (%) 95% Cl | 93.7 (91.8,95.6) | 95.1 (93.4, 96.9) | 94.9 (93.0, 96.8) | ||||
| 0 | 0 | 0.5 | 0.638 | Bias(×103) 95% Cl | -25 (-28, -21) | 0 (-4,4) | 0 (-4,5) |
| MSE(×104) 95% Cl | 22 (19,24) | 18 (16,20) | 23 (20,26) | ||||
| Coverage(%) 95% Cl | 90.3 (87.6, 93.0) | 95.2 (93.5, 96.8) | 94.8 (92.9, 96.7) | ||||
| 0 | 0 | 1 | 0.760 | Bias(×103) 95% Cl | -42 (-45, -39) | 0 (-4,3) | 0 (-3,4) |
| MSE(×104) 95% Cl | 31 (27,34) | 14 (12,16) | 17 (15,20) | ||||
| Coverage (%) 95% Cl | 76.6 (73.0, 80.2) | 95.2 (93.3, 97.0) | 94.4 (92.4, 96.5) |
MW: Mann-Whitney estimate (i.e., AUC); R: Reiser's (2000) method; P: Probit method.
Simulation I was run 100 times. Each time, we generated 1000 simulated data sets. Each data set consists of 100 cases and 100 controls. Each subject provides two replicate biomaker scores. Both true values and random errors are assumed to come from normal distributions with .
Bias, mean square error (MSE), and coverage for AUC (μ) from simulation III**.
| A | μY | μX | AUCtrue | MW | R | P | |
|---|---|---|---|---|---|---|---|
| 0 | 0 | 0.25 | 0.570 | Bias(×103) 95% Cl | - 26 (-29, -22) | - 14 (-19, -10) | 0 (-6,5) |
| MSE(×104) 95% Cl | 23 (20, 26) | 30 (26,34) | 41 (35,46) | ||||
| Coverage (%) 95% Cl | 90.3 (87.5,93.1) | 94.5 (92.6, 96.5) | 95.3 (93.5, 97.1) | ||||
| 0 | 0 | 0.5 | 0.638 | Bias(×103) 95% Cl | - 53 (-56, -51) | - 31 (-34, -28) | 2 (-2,7) |
| MSE(×104) 95% Cl | 45 (40,49) | 38 (33,43) | 37 (40,53) | ||||
| Coverage (%) 95% Cl | 72.5 (68.3,76.8) | 91.4 (88.8, 93.9) | 95.7 (93.9, 97.6) | ||||
| 0 | 0 | 1 | 0.76 | Bias(×103) 95% Cl | - 111 (-114, -108) | - 72 (-76, -67) | 17 (11,24) |
| MSE(×104) 95% Cl | 138 (131, 146) | 83 (75,90) | 67 (58,75) | ||||
| Coverage (%) 95% Cl | 12.9 (10.3, 15.6) | 66.1 (61.8, 70.5) | 96.9 (95.2, 98.7) |
MW: Mann-Whitney estimate (i.e., AUC); R: Reiser's (2000) method; P: probit method.
Simulation III was run 100 times. Each time, we generated 1000 simulated data sets. Each data set consists of 100 cases and 100 controls. each subject provides two replicate biomaker scores. Both true values and random errors were generated from log normal distributions with .
Figure 1Histograms of the NAPAP values. The upper panel: cases (left) and controls (right) measured at the clinic visit; The middle panel: cases (left) and controls (right) measured at the first home collection; The bottom panel: cases (left) and controls (right) measured at the second home collection.
Estimate of AUC and its 95% confidence interval for the NAPAP data.
| MW | R | P | |
|---|---|---|---|
| 0.589 [0.557,0.663] | 0.611 [0.557,0.663] | 0.618 [0.549,0.684] |
Bias, mean square error (MSE), and coverage for AUC (μ) from simulation II**.
| λ | μY | μX | AUCtrue | MW | R | P | |
|---|---|---|---|---|---|---|---|
| 0 | 0 | 0.25 | 0.57 | Bias(×103) 95% Cl | -23 (-26, -19) | 15 (-18, -11) | 4 (-9,1) |
| MSE(×104) 95% Cl | 22 (19, 25) | 20 (17,22) | 32 (28,26) | ||||
| Coverage (%) 95% Cl | 91.2 (88.5,93.9) | 93.8 (91.6, 95.6) | 94.8 (92.8, 96.8) | ||||
| 0 | 0 | 0.50 | 0.638 | Bias(×103) 95% Cl | - 49 (-52, -45) | -32 (-35, -28) | -7 (-12, -2) |
| MSE(×104) 95% Cl | 39 (35,44) | 26 (23,29) | 36 (31,41) | ||||
| Coverage (%) 95% Cl | 76.3 (72.1,80.5) | 89.6 (86.9, 92.2) | 94.7 (92.7, 96.8) | ||||
| 0 | 0 | 1.0 | 0.760 | Bias(×103) 95% Cl | -104 (-107, -101) | -74 (-77, -70) | 2 (-4, 8) |
| MSE(×104) 95% Cl | 122 (115, 130) | 69 (64,74) | 53 (46, 61) | ||||
| Coverage (%) 95% Cl | 17.4 (14.6, 20.3) | 52.3 (47.6, 56.9) | 95.3 (93.1, 97.5) |
MW: Mann-Whitney estimate (i.e., AUC); R: Reiser's (2000) method; P: Probit method.
Simulation II was run 100 times. Each time, we generated 1000 simulated data sets. Each data set consists of 100 cases and 100 controls. Each subject provides two replicate biomaker scores. True values were generated from log normal distributions and random errors were generated from normal distributions with .