| Literature DB >> 24602132 |
Xiaohong Li1, Patricia L Blount, Brian J Reid, Thomas L Vaughan.
Abstract
BACKGROUND: With the rapid development of "-omic" technologies, an increasing number of purported biomarkers have been identified for cancer and other diseases. The process of identifying those that are most promising and validating them for use at the population level for prevention and early detection is a critical next step in achieving significant health benefits.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24602132 PMCID: PMC3996972 DOI: 10.1186/1472-6947-14-15
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Numerical illustration for calculating RPB of hypothetical binary markers using data of three cancers as examples
| 1.5 | 0.1 | 0.05 | 0.001 | 0.999 | -0.000 | -0.001 | -0.001 | -0.000 | -0.000 | -0.000 |
| 2 | 0.1 | 0.10 | 0.002 | 0.999 | -0.000 | -0.001 | -0.001 | -0.000 | -0.000 | -0.000 |
| 4 | 0.1 | 0.29 | 0.004 | 0.999 | 0.000 | -0.000 | -0.001 | 0.000 | -0.000 | -0.000 |
| 10 | 0.1 | 0.81 | 0.009 | 0.999 | 0.002 | 0.001 | -0.000 | 0.000 | 0.000 | -0.000 |
| 20 | 0.1 | 1.56 | 0.017 | 0.999 | 0.004 | 0.002 | 0.000 | 0.000 | 0.000 | 0.000 |
| 50 | 0.1 | 3.19 | 0.033 | 0.999 | 0.008 | 0.004 | 0.001 | 0.000 | 0.000 | 0.000 |
| 1.5 | 1 | 0.49 | 0.015 | 0.990 | -0.004 | -0.006 | -0.008 | -0.000 | -0.000 | -0.001 |
| 2 | 1 | 0.97 | 0.020 | 0.990 | -0.002 | -0.006 | -0.008 | 0.000 | -0.000 | -0.001 |
| 4 | 1 | 2.81 | 0.038 | 0.990 | 0.002 | -0.003 | -0.007 | 0.000 | -0.000 | -0.001 |
| 10 | 1 | 7.61 | 0.085 | 0.991 | 0.015 | 0.004 | -0.003 | 0.001 | 0.000 | -0.001 |
| 20 | 1 | 13.93 | 0.148 | 0.991 | 0.031 | 0.014 | 0.001 | 0.001 | 0.001 | 0.000 |
| 50 | 1 | 26.33 | 0.271 | 0.993 | 0.063 | 0.033 | 0.010 | 0.002 | 0.002 | 0.002 |
| 1.5 | 10 | 4.70 | 0.142 | 0.900 | -0.039 | -0.065 | -0.084 | -0.002 | -0.004 | -0.013 |
| 2 | 10 | 8.94 | 0.180 | 0.901 | -0.029 | -0.059 | -0.082 | -0.001 | -0.004 | -0.013 |
| 4 | 10 | 22.54 | 0.303 | 0.902 | 0.003 | -0.040 | -0.073 | 0.000 | -0.003 | -0.012 |
| 10 | 10 | 46.05 | 0.514 | 0.904 | 0.058 | -0.008 | -0.057 | 0.002 | -0.001 | -0.009 |
| 20 | 10 | 63.92 | 0.675 | 0.906 | 0.100 | 0.017 | -0.046 | 0.004 | 0.001 | -0.007 |
| 50 | 10 | 81.79 | 0.836 | 0.907 | 0.141 | 0.041 | -0.034 | 0.006 | 0.003 | -0.005 |
| 1.5 | 30 | 12.90 | 0.390 | 0.701 | -0.125 | -0.200 | -0.256 | -0.005 | -0.014 | -0.041 |
| 2 | 30 | 22.80 | 0.460 | 0.702 | -0.107 | -0.189 | -0.251 | -0.004 | -0.013 | -0.040 |
| 4 | 30 | 46.84 | 0.628 | 0.703 | -0.064 | -0.163 | -0.238 | -0.003 | -0.011 | -0.038 |
| 10 | 30 | 72.43 | 0.807 | 0.705 | -0.017 | -0.136 | -0.225 | -0.001 | -0.009 | -0.036 |
| 20 | 30 | 84.69 | 0.893 | 0.706 | 0.005 | -0.123 | -0.219 | 0.000 | -0.009 | -0.035 |
| 50 | 30 | 93.44 | 0.954 | 0.707 | 0.021 | -0.114 | -0.215 | 0.001 | -0.008 | -0.034 |
| 1.5 | 70 | 25.71 | 0.777 | 0.301 | -0.327 | -0.486 | -0.606 | -0.013 | -0.034 | -0.096 |
| 2 | 70 | 40.89 | 0.823 | 0.301 | -0.316 | -0.480 | -0.603 | -0.013 | -0.033 | -0.096 |
| 4 | 70 | 67.46 | 0.902 | 0.302 | -0.295 | -0.467 | -0.597 | -0.012 | -0.032 | -0.095 |
| 10 | 70 | 86.14 | 0.958 | 0.303 | -0.280 | -0.459 | -0.593 | -0.011 | -0.032 | -0.094 |
| 20 | 70 | 92.92 | 0.979 | 0.303 | -0.275 | -0.456 | -0.591 | -0.011 | -0.032 | -0.094 |
| 50 | 70 | 97.13 | 0.991 | 0.303 | -0.272 | -0.454 | -0.591 | -0.011 | -0.031 | -0.094 |
ca= cancer. EA= esophageal adenocarcinoma. f= loss adjustment factor of quality-adjusted life year. NB= net benefit. OR= odds ratio.
The table shows numerical relationship between Odds ratio, Marker prevalence, PAR% of binary markers and their RPB based on the cancer data of three studies. NB were also calculated.
The table assumes 1% disease prevalence in general population. For PAR%, sensitivity and specificity similar patterns are observed with other disease prevalence values less than about 10%. Disease prevalence affects RPB more directly (see text).
Figure 1Hypothetical distribution patterns of continuous markers with different relative risks, and thresholds for risk prediction. Two normal distributions (mean = 0 and standard deviation = 0.5) are used to represent the distribution of a continuous marker in disease (solid curved line) and non-disease (dashed curved line) populations for six different ORs. The locations of the means for the disease population are consistent with the logit model Pr(D = 1|X) = α + βX, in which one unit increase corresponds to the OR shown in the figure. The three vertical bars (solid, dotted, and dashed) correspond to different thresholds (cut off value ‘c’) for positive-negative calls of a disease with a continuous distribution marker. Specifically, the solid bar represents the threshold value c such that the sensitivity is kept for 0.95 for various OR values in the plot; the dotted bar represents the threshold value c such that the sensitivity and specificity are equal for various OR values in the plots; and the dashed bar represents the threshold value c such that the specificity is kept for 0.95 for various OR values in the plot. The examples of using the continuous marker for disease classification or prevention are shown in Figure 2; and corresponding sensitivity, specificity and PAR% of various thresholds (three bars in this Figure) are shown in Table 2 and Figure 2 (the cross, circle, and triangle in Figure 2 correspond to solid, dashed, and solid vertical bars in Figure 1).
Figure 2Disease prediction performance evaluated by ROC curves for the hypothetical continuous markers with different relative risks. ROC curves for continuous risk marker with different odds ratios (from bottom to top OR = 1.5, 2, 4, 10, 20, 50), which corresponding to the distribution plots of continuous markers shown in Figure 1. The crosses correspond to a fixed sensitivities; the circles to a fixed specificities; and triangles to equal sensitivities and specificities. Their corresponding PAR% values are shown in Table 2.
Numerical illustration for calculating RPB of hypothetical continuous markers using data of three cancers as examples
| Fixed sensitivity (95%) | 1.5 | 33.22 | 0.95 | 0.07 | -0.456 | -0.659 | -0.812 | -0.018 | -0.046 | -0.129 |
| | 2 | 48.88 | 0.95 | 0.09 | -0.440 | -0.641 | -0.792 | -0.017 | -0.045 | -0.126 |
| | 4 | 71.30 | 0.95 | 0.16 | -0.389 | -0.583 | -0.728 | -0.015 | -0.040 | -0.115 |
| | 10 | 84.46 | 0.95 | 0.29 | -0.287 | -0.466 | -0.601 | -0.011 | -0.032 | -0.095 |
| | 20 | 89.01 | 0.95 | 0.43 | -0.187 | -0.352 | -0.475 | -0.007 | -0.024 | -0.075 |
| | 50 | 92.25 | 0.95 | 0.61 | -0.051 | -0.196 | -0.305 | -0.002 | -0.014 | -0.048 |
| Fixed specificity (95%) | 1.5 | 2.59 | 0.08 | 0.95 | -0.019 | -0.032 | -0.043 | -0.001 | -0.002 | -0.007 |
| | 2 | 4.95 | 0.10 | 0.95 | -0.013 | -0.029 | -0.041 | -0.001 | -0.002 | -0.007 |
| | 4 | 12.67 | 0.17 | 0.95 | 0.006 | -0.018 | -0.036 | 0.000 | -0.001 | -0.006 |
| | 10 | 27.40 | 0.31 | 0.95 | 0.041 | 0.002 | -0.028 | 0.002 | 0.000 | -0.004 |
| | 20 | 41.15 | 0.44 | 0.95 | 0.074 | 0.021 | -0.019 | 0.003 | 0.001 | -0.003 |
| | 50 | 60.16 | 0.62 | 0.95 | 0.119 | 0.047 | -0.008 | 0.005 | 0.003 | -0.001 |
| Balanced sensitivity & specificityc | 1.5 | 14.83 | 0.54 | 0.54 | -0.208 | -0.316 | -0.397 | -0.008 | -0.022 | -0.063 |
| | 2 | 23.89 | 0.57 | 0.57 | -0.178 | -0.285 | -0.366 | -0.007 | -0.020 | -0.058 |
| | 4 | 42.57 | 0.64 | 0.63 | -0.114 | -0.222 | -0.304 | -0.005 | -0.015 | -0.048 |
| | 10 | 60.28 | 0.72 | 0.72 | -0.030 | -0.137 | -0.218 | -0.001 | -0.010 | -0.035 |
| | 20 | 70.68 | 0.78 | 0.77 | 0.024 | -0.085 | -0.166 | 0.001 | -0.006 | -0.026 |
| 50 | 80.17 | 0.84 | 0.84 | 0.088 | -0.020 | -0.101 | 0.003 | -0.001 | -0.016 | |
Numerical relationships between Odds ratio and PAR% of continuous markers and their RPB based on the data of three studies.
ca= cancer. EA= esophageal adenocarcinoma. f = loss-adjustment factor of quality-adjusted life year. NB= net benefit.
OR= odds ratio. Sen.= sensitivity, Spe.= specificity.
The disease prevalence of population used for the table = 0.01. As shown in the formula of RPB for continuous markers, disease prevalence w will directly affect the value of RPB.
aHypothetical continuous biomarker/exposure with assumed distributions as described in Figure 1.
bThreshold used for continuous biomarkers positive are the thresholds shown in Figure 1 (three vertical bars).
cUsing a threshold that lead to sensitivity and specificity closest to the upper left corner of ROC curve coordinates for cutoff.