| Literature DB >> 25962444 |
Vivienne Breen1, Nikola Kasabov2, Ashish M Kamat3, Elsie Jacobson4, James M Suttie5, Paul J O'Sullivan6, Laimonis Kavalieris7, David G Darling8.
Abstract
BACKGROUND: Comparing the relative utility of diagnostic tests is challenging when available datasets are small, partial or incomplete. The analytical leverage associated with a large sample size can be gained by integrating several small datasets to enable effective and accurate across-dataset comparisons. Accordingly, we propose a methodology for a holistic comparative analysis and ranking of cancer diagnostic tests through dataset integration and imputation of missing values, using urothelial carcinoma (UC) as a case study.Entities:
Mesh:
Year: 2015 PMID: 25962444 PMCID: PMC4494166 DOI: 10.1186/s12874-015-0036-8
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
UC diagnostic test datasets used in the analysis
| Dataset | Study/publication | Original dataset, n | Data analyzed (UC/non-UC) | Cytology | NMP22 | FISH | Cxbladder Detect |
|---|---|---|---|---|---|---|---|
| 1 | Pacific Edge Limited, NZ [ | 476, Primary detection | 63/411 | • | • | • | |
| 2 | Canterbury Urology Research Trust, Canterbury, NZ (Pacific Edge Limited, Unpublished data) | 94, Primary detection | 6/74 | • | • | ||
| 3 | North Shore Hospital, Takapuna, NZ (Pacific Edge Limited, Unpublished data) | 84, Primary detection | 5/63 | • | • | ||
| 4 | Kamat, USA [ | 200, Secondary monitoring | 6/187 | • | • | • | |
| 5 | Clinical Trials USA (Pacific Edge Limited, Unpublished data) | 124, Secondary monitoring | 9/115 | • | • | • | • |
The closed symbol (•) indicates that the test was carried out in the study. A gap indicates that the test was not carried out. Data analyzed differs from the study population as any patient samples either without diagnosis or where only one test result was available were discarded. Primary detection means the study population was composed of patients presenting with hematuria prior to UC diagnosis. Secondary monitoring means that patients were presenting after primary UC diagnosis and treatment
Imputation process, in order of execution, for each of the datasets in the integrated dataset
| Imputation step | Model inputs | Imputed variable output | |||
|---|---|---|---|---|---|
| Datasets | Variables | Dataset | Variable | Number of samples imputed | |
| 1 | 5 | Age, gender, cytology, FISH, Cxbladder Detect | 5 | NMP22 | 2 |
| 2 | 5 | Age, gender, NMP22, FISH, Cxbladder Detect | 5 | Cytology | 3 |
| 3 | 1, 4, 5 | Age, gender, cytology, NMP22 | 4 | Cxbladder Detect | 193 |
| 4 | 1, 2, 3, 4, 5 | Age, gender, cytology, Cxbladder Detect | 2 | NMP22 | 80 |
| 3 | NMP22 | 80 | |||
| 5 | 3, 4, 5 | Age, gender, cytology, Cxbladder Detect | 3 | FISH | 68 |
| 6 | 2, 4, 5 | Age, gender, cytology, Cxbladder Detect | 2 | FISH | 80 |
| 7 | 1, 4, 5 | Age, gender, cytology, Cxbladder Detect | 1 | FISH | 474 |
Note: all imputations maintain at least a 70 % level of known data
Measured and published sensitivity and specificity for each test in the integrated dataset before imputation, mean and 95 % CIs
| Measured | Published | |||
|---|---|---|---|---|
| Sensitivity, % (95 % CI) | Specificity, % (95 % CI) | Sensitivity, % (95 % CI) | Specificity, % (95 % CI) | |
| Cytology | 45.5 (40.6–50.4) | 96.3 (94.5–97.9) | 56.1 (43.3–68.3) [ | 94.5 (91.9–96.5) [ |
| NMP22 | 44.9 (37.4–52.3) | 89.0 (86.5–91.5) | 50.0 (37.4–62.6) [ | 88.0 (84.6–91.0) [ |
| FISH | 40.0 (22.7–52.3) | 87.3 (83.7–91.6) | 72 (69–75) [ | 83 (82–85) [ |
| 61.9 [ | 89.7 [ | |||
| 18 [ | 90 [ | |||
| Cxbladder Detect | 79.5 (71.1–87.8) | 82.2 (79.2–85.0) | 81.8 [ | 85.1 (fixed) [ |
Sensitivity and specificity for each test from the Bayesian estimate of conditional distribution of parameters and missing observations given observed data, mean and 95 % CIs
| Sensitivity, % (95 % CI) | Specificity, % (95 % CI) | |
|---|---|---|
| Cytology | 46.0 (36.3–55.8) | 95.3 (93.7–96.6) |
| NMP22 | 45.9 (35.9–56.3) | 88.0 (85.5–90.2) |
| FISH | 47.7 (31.5–63.3) | 87.7 (84.7–90.3) |
| Cxbladder Detect | 73.6 (65.1–81.7) | 81.7 (78.7–84.4) |
Fig. 1Ranking of tests in a univariate mode using SNR on the integrated dataset before imputation
Sensitivity and specificity of tests measured on the integrated, imputed dataset using different imputation methods
| Supervised imputation | Unsupervised imputation | |||
|---|---|---|---|---|
| Sensitivity, % | Specificity, % | Sensitivity, % | Specificity, % | |
|
| ||||
| Cytology | 44.94 | 96.35 | 44.94 | 96.35 |
| NMP22 | 41.57 | 88.82 | 41.57 | 88.94 |
| FISH | 32.58 | 85.24 | 38.20 | 85.95 |
| Cxbladder Detect | 80.90 | 82.12 | 77.53 | 79.76 |
|
| ||||
| Cytology | 44.94 | 96.35 | 44.94 | 96.35 |
| NMP22 | 43.82 | 90.26 | 39.33 | 90.24 |
| FISH | 29.21 | 91.90 | 31.46 | 91.55 |
| Cxbladder Detect | 80.90 | 83.80 | 77.53 | 80.47 |
|
| ||||
| Cytology | 44.94 | 96.49 | 44.94 | 96.35 |
| NMP22 | 43.82 | 90.59 | 39.33 | 90.35 |
| FISH | 29.21 | 91.07 | 23.60 | 90.00 |
| Cxbladder Detect | 80.90 | 85.53 | 78.65 | 82.59 |
|
| ||||
| Cytology | 44.94 | 96.35 | 44.94 | 96.56 |
| NMP22 | 42.70 | 90.82 | 39.33 | 90.82 |
| FISH | 47.19 | 93.69 | 47.19 | 93.33 |
| Cxbladder Detect | 80.90 | 77.29 | 77.53 | 84.71 |
|
| ||||
| Cytology | 44.94 | 96.35 | 44.94 | 96.35 |
| NMP22 | 39.33 | 90.82 | 39.33 | 90.82 |
| FISH | 49.44 | 93.81 | 47.19 | 93.33 |
| Cxbladder Detect | 78.65 | 85.18 | 77.53 | 84.71 |
Cross-validation of methods and difference between sensitivity and specificity obtained before and after imputation
| Supervised imputation | Unsupervised imputation | |||||||
|---|---|---|---|---|---|---|---|---|
| Imputation method | Leave-one-out cross-validation accuracy of the imputation, % | Sensitivity difference before and after imputation, % | Specificity difference before and after imputation, % | Average absolute difference before and after imputation, % | Leave-one-out cross-validation accuracy of the imputation, % | Sensitivity difference before and after imputation, % | Specificity difference before and after imputation,% | Average absolute difference before and after imputation, % |
|
| ||||||||
| Cytology | 96.70 | 2.68 | 0.02 | 1.35 | 96.70 | 2.68 | 0.02 | 1.35 |
| NMP22 | 83.44 | 3.30 | 0.41 | 1.86 | 83.19 | 3.30 | 0.29 | 1.80 |
| FISH | 83.17 | 7.42. | 1.70 | 4.56 | 85.49 | 1.80 | 0.99 | 1.40 |
| Cxbladder Detect | 81.44 | −1.38 | 0.67 | 1.03 | 80.44 | 1.99 | 3.03 | 2.51 |
| Mean for method | 86.19 | 3.01 | 0.70 | 2.20 | 86.46 | 2.44 | 1.08 | 1.76 |
|
| ||||||||
| Cytology | 96.69 | 2.68 | 0.02 | 1.35 | 96.69 | 2.68 | 0.02 | 1.35 |
| NMP22 | 84.20 | 1.05 | −1.03 | 1.04 | 84.07 | 5.54 | −1.01 | 3.28 |
| FISH | 84.54 | 10.79 | −4.96 | 7.88 | 85.80 | 8.54 | −4.61 | 6.58 |
| Cxbladder Detect | 81.94 | −1.38 | −1.01 | 1.20 | 81.10 | 1.99 | 2.32 | 2.16 |
| Mean for method | 86.84 | 3.29 | −1.75 | 2.87 | 86.92 | 4.69 | −0.82 | 3.34 |
|
| ||||||||
| Cytology | 96.69 | 2.68 | 0.02 | 1.35 | 96.69 | 2.68 | 0.02 | 1.35 |
| NMP22 | 86.09 | 1.05 | −1.36 | 1.21 | 85.84 | 5.54 | −1.12 | 3.33 |
| FISH | 86.12 | 10.79 | −4.13 | 7.46 | 85.80 | 16.40 | −3.06 | 9.73 |
| Cxbladder Detect | 82.44 | −1.38 | −2.74 | 2.06 | 81.77 | 0.87 | 0.20 | 0.53 |
| Mean for method | 87.84 | 3.29 | −2.05 | 3.02 | 87.53 | 6.37 | −0.99 | 3.74 |
|
| ||||||||
| Cytology | 96.69 | 2.68 | 0.02 | 1.35 | 96.69 | 2.68 | 0.02 | 1.35 |
| NMP22 | 85.59 | 2.17 | −1.59 | 1.88 | 85.21 | 5.54 | −1.59 | 3.56 |
| FISH | 89.58 | −7.19 | −6.75 | 6.97 | 89.58 | −7.19 | −6.39 | 6.79 |
| Cxbladder Detect | 84.11 | −1.38 | 5.50 | 3.44 | 84.20 | 1.99 | −1.92 | 1.95 |
| Mean for method | 88.99 | −0.93 | −0.70 | 3.41 | 88.92 | 0.75 | −2.47 | 3.42 |
|
| ||||||||
| Cytology | 95.87 | 1.55 | 0.02 | 0.79 | 95.87 | 2.68 | 0.02 | 1.35 |
| NMP22 | 86.35 | 5.54 | −1.59 | 3.56 | 83.19 | 5.54 | −1.59 | 3.56 |
| FISH | 89.25 | −9.44 | −6.87 | 8.16 | 88.60 | −7.19 | −6.39 | 6.79 |
| Cxbladder Detect | 82.78 | 0.87 | −2.39 | 1.63 | 82.47 | 1.99 | −1.92 | 1.95 |
| Mean for method | 88.56 | −0.37 | −2.71 | 3.53 | 87.53 | −2.47 | −2.47 | 3.42 |
Difference = measured – imputed
Fig. 2Rankings of tests for the integrated dataset after a supervised and b unsupervised imputation
Fig. 3Comparisons after a supervised and b unsupervised imputation in two-dimensional contour plots of sensitivity and specificity