Siamak Sabour1, Matin Shokrgozar2. 1. Department of Clinical Epidemiology, School of Health and Safety and Safety Promotions and Injury Prevention Research Centre, Shahid Beheshti University of Medical Sciences, Tehran, I.R. Iran. s.sabour@sbmu.ac.ir 2. Department of Clinical Epidemiology, School of Health and Safety, Shahid Beheshti University of Medical Sciences, Tehran, I.R. Iran.
To the Editor,We read the paper from Kim et al. [1] published in J Gynecol Oncol. The aim of the study was to compare the diagnostic accuracy of dilatation and curettage (D&C) versus endometrial aspiration biopsy in follow-up evaluation of patients treated with progestin for endometrial hyperplasia (EH). In a prospective multicenter study, the authors reported kappa value to compare.Although we appreciate this significant study, we would like to raise some methodological issues that can affect the interpretation of results. First, it is important to note that the estimate reported by the authors cannot provide all the information needed for diagnostic accuracy and decision-making in clinical practice, since prevalence of the concordance cells dramatically affect the value of kappa [234]. Therefore, an alternative approach is to determine sensitivity, specificity, positive and negative predictive values, the likelihood ratios (LR+, ranging from 1 to infinity; the higher the LR+, the more accurate the test; and LR−, ranging from 0 to 1; the lower the LR−, the more accurate the test) and odds ratio (ratio of true to false results), which provides a more accurate estimate [256]. Second, the authors estimated the diagnostic accuracy by agreement. They mentioned that aspiration biopsy is less accurate than D&C and might not be a reliable method. It should be noted that agreement is typically used to determine reliability (precision). There is a methodological difference between accuracy and reliability. Third, in diagnostic accuracy research, it is essential to evaluate the diagnostic added value, since a diagnostic accuracy of a single test might be excellent, however for clinical purposes it can be worthless. Like evaluating discrimination, it would be possible to estimate the diagnostic added value by receiver operating characteristic (ROC) [278].Eventually, any decision in clinical practice needs to evaluate both reliability and accuracy of the test. Without knowledge about the reliability of the test, any judgment would be wrong. Hence, we suggest the authors to estimate both accuracy and reliability of the test by an appropriate method.Siamak Sabour (https://orcid.org/0000-0002-1928-992X)Department of Clinical Epidemiology, School of Health and Safety and Safety Promotions and Injury Prevention Research Centre, Shahid Beheshti University of Medical Sciences, Tehran, I.R. Iran. s.sabour@sbmu.ac.irMatin Shokrgozar (https://orcid.org/0000-0002-9151-7077)Department of Clinical Epidemiology, School of Health and Safety, Shahid Beheshti University of Medical Sciences, Tehran, I.R. Iran.
Authors: Mi Kyoung Kim; Seok Ju Seong; Dong Choon Park; Jin Hwa Hong; Ju Won Roh; Soon Beom Kang Journal: J Gynecol Oncol Date: 2020-02-07 Impact factor: 4.401