Junfeng Wang1, Patrick Bossuyt1, Ronald Geskus1, Aeilko Zwinderman1, Madeleine Dolleman2, Simone Broer2, Frank Broekmans2, Ben Willem Mol3, Mariska Leeflang4. 1. Department of Clinical Epidemiology, Biostatistics & Bioinformatics, Academic Medical Center, PO Box 22700, 1100 DE Amsterdam, The Netherlands. 2. Department of Reproductive Medicine, University Medical Center Utrecht, PO Box 85500, 3508 GA Utrecht, The Netherlands. 3. Department of Obstetrics and Gynaecology, Women's and Children's Hospital, University of Adelaide, 72 King William Road, SA 5006 Adelaide, Australia. 4. Department of Clinical Epidemiology, Biostatistics & Bioinformatics, Academic Medical Center, PO Box 22700, 1100 DE Amsterdam, The Netherlands. Electronic address: m.m.leeflang@amc.uva.nl.
Abstract
OBJECTIVES: In comparative systematic reviews of diagnostic accuracy, inconsistencies between direct and indirect comparisons may lead to bias. We investigated whether using individual patient data (IPD) can adjust for this form of bias. STUDY DESIGN AND SETTING: We included IPD of 3 ovarian reserve tests from 32 studies. Inconsistency was defined as a statistically significant difference in relative accuracy or different comparative results between the direct and indirect evidence. We adjusted for the effect of threshold and reference standard, as well as for patient-specific variables. RESULTS: Anti-Müllerian hormone (AMH) and follicle stimulation hormone (FSH) differed significantly in sensitivity (-0.1563, P = 0.04). AMH and antral follicle count (AFC) differed significantly in sensitivity (0.1465, P < 0.01). AMH and AFC differed significantly in specificity (-0.0607, P = 0.02). The area under the curve (AUC) differed significantly between AFC and FSH (0.0948, P < 0.01) in the direct comparison but not (0.0678, P = 0.09) in the indirect comparison. The AUCs of AFC and AMH differed significantly (-0.0830, P < 0.01) in the indirect comparison but not (-0.0176, P = 0.29) in the direct comparison. These differences remained after adjusting for indirectness. CONCLUSION: Estimates of comparative accuracy obtained through indirect comparisons are not always consistent with those obtained through direct comparisons. Using IPD to adjust for indirectness did not successfully remove the bias in this case study.
OBJECTIVES: In comparative systematic reviews of diagnostic accuracy, inconsistencies between direct and indirect comparisons may lead to bias. We investigated whether using individual patient data (IPD) can adjust for this form of bias. STUDY DESIGN AND SETTING: We included IPD of 3 ovarian reserve tests from 32 studies. Inconsistency was defined as a statistically significant difference in relative accuracy or different comparative results between the direct and indirect evidence. We adjusted for the effect of threshold and reference standard, as well as for patient-specific variables. RESULTS: Anti-Müllerian hormone (AMH) and follicle stimulation hormone (FSH) differed significantly in sensitivity (-0.1563, P = 0.04). AMH and antral follicle count (AFC) differed significantly in sensitivity (0.1465, P < 0.01). AMH and AFC differed significantly in specificity (-0.0607, P = 0.02). The area under the curve (AUC) differed significantly between AFC and FSH (0.0948, P < 0.01) in the direct comparison but not (0.0678, P = 0.09) in the indirect comparison. The AUCs of AFC and AMH differed significantly (-0.0830, P < 0.01) in the indirect comparison but not (-0.0176, P = 0.29) in the direct comparison. These differences remained after adjusting for indirectness. CONCLUSION: Estimates of comparative accuracy obtained through indirect comparisons are not always consistent with those obtained through direct comparisons. Using IPD to adjust for indirectness did not successfully remove the bias in this case study.