Literature DB >> 27514486

Estimating risk of malignancy in adnexal masses: external validation of the ADNEX model and comparison with other frequently used ultrasound methods.

E M J Meys1,2, L S Jeelof1, N M J Achten1, B F M Slangen1,2, S Lambrechts1,2, R F P M Kruitwagen1,2, T Van Gorp1,2.   

Abstract

OBJECTIVES: To validate externally the performance of the Assessment of Different NEoplasias in the adneXa (ADNEX) model and compare this model with other frequently used models in the differentiation between benign and malignant adnexal masses.
METHODS: In this retrospective diagnostic accuracy study, we assessed data collected prospectively from patients with adnexal pathology who underwent real-time transvaginal or transrectal ultrasound by a single expert ultrasonographer in a tertiary care hospital between July 2011 and July 2015. The presence of a malignancy was determined by subjective assessment and use of four prediction models: the ADNEX model, simple ultrasound-based rules (simple rules), Logistic Regression model 2 (LR2) and the Risk of Malignancy Index (RMI), of which three different variants were assessed. Pathology was the clinical reference standard.
RESULTS: In total, 851 consecutive patients underwent ultrasound examination for an adnexal mass. For 326 patients (128 premenopausal and 198 postmenopausal), pathology results were available (211 (64.7%) benign; 115 (35.3%) malignant) and these were included in the analysis. The area under the receiver-operating characteristics curve (AUC) of the ADNEX model for the discrimination between benign and malignant tumors was 0.93 (95% CI, 0.89-0.95). AUCs for the subtypes of malignancy (i.e. borderline, Stage I-IV and metastatic adnexal tumors) ranged between 0.60 and 0.90. Only subjective assessment (AUC, 0.96 (95% CI, 0.93-0.98)) was superior to the ADNEX model (P = 0.01) in differentiating malignant from benign tumors. AUCs for the other models were 0.92 (95% CI, 0.89-0.95) for LR2, 0.85 (95% CI, 0.81-0.89) for RMI-I, 0.82 (95% CI, 0.77-0.86) for RMI-II and 0.84 (95% CI, 0.80-0.88) for RMI-III. At the proposed cut-off of ≥ 10%, the ADNEX model had the highest sensitivity (0.98 (95% CI, 0.93-1.00)) but the lowest specificity (0.62 (95% CI, 0.55-0.68)) compared with the other models. Both subjective assessment (sensitivity, 0.90 (95% CI, 0.83-0.95); specificity 0.91 (95% CI, 0.86-0.94)) and the simple rules model with inconclusive cases classified by subjective assessment (sensitivity, 0.89 (95% CI, 0.81-0.94); specificity, 0.90 (95% CI, 0.85-0.94)) had lower sensitivity, but their sensitivity and specificity were better balanced.
CONCLUSIONS: Although the test performance of subjective assessment by an expert remains superior, the ADNEX model can help in the differentiation between benign and malignant ovarian tumors. The advantage of the ADNEX model as a polytomous model remains to be shown.
© 2016 The Authors. Ultrasound in Obstetrics & Gynecology published by John Wiley & Sons Ltd on behalf of the International Society of Ultrasound in Obstetrics and Gynecology. © 2016 The Authors. Ultrasound in Obstetrics & Gynecology published by John Wiley & Sons Ltd on behalf of the International Society of Ultrasound in Obstetrics and Gynecology.

Entities:  

Keywords:  ADNEX model; LR2; RMI; diagnostic test accuracy; ovarian carcinoma; simple rules; subjective assessment

Mesh:

Year:  2017        PMID: 27514486      PMCID: PMC5488216          DOI: 10.1002/uog.17225

Source DB:  PubMed          Journal:  Ultrasound Obstet Gynecol        ISSN: 0960-7692            Impact factor:   7.299


INTRODUCTION

Adnexal masses are common. Although only a minority are malignant, their management, and therefore both patient morbidity and mortality, depends on their correct preoperative differentiation. For benign masses, conservative management or laparoscopic and fertility‐sparing surgery is preferred. Laparoscopy is associated with reduced morbidity and lower cost when compared with laparotomy1. In the case of malignancy, however, more extensive surgery is necessary, preferably performed in an oncology center. This is essential to optimize care and thereby survival of the patient2, 3. Ultrasound examination, more specifically subjective assessment by an expert examiner, is considered the best way to differentiate malignant from benign adnexal masses prior to surgery4. However, an expert examiner is not always available. Various ultrasound‐based prediction models and scoring systems have been developed to support the diagnosis of adnexal masses by less experienced examiners. The Risk of Malignancy Index (RMI) is one such scoring system5, 6, 7, and is currently recommended by many national guidelines. However, performance of this model is poor4, 8. Other models, with better test accuracy, include the International Ovarian Tumour Analysis (IOTA) simple ultrasound‐based rules (‘simple rules’)9 and IOTA Logistic Regression model 2 (LR2)10. Another model with excellent test performance was developed recently: the Assessment of Different NEoplasias in the adneXa (ADNEX) model11. This model predicts not only whether a mass is malignant, but also, to a certain extent, the type of malignancy. Insight into the specific tumor type makes it possible to optimize treatment, which may reduce morbidity and enhance the chances of survival3. For example, the distinction between malignancy and borderline malignancy is relevant for the treatment of premenopausal women in the context of fertility preservation. The aim of this study was to validate externally the performance of the ADNEX model, and compare it with that of other frequently used models, in the differentiation between benign and malignant adnexal masses.

METHODS

Study design and setting

This was a retrospective, single‐center, diagnostic accuracy study, conducted at a tertiary care hospital using data collected prospectively between July 2011 and July 2015. A single ultrasonographer (T.V.G.) with more than 10 years' experience in gynecological ultrasound (Level‐3 examiner) assessed all consecutively recruited patients with adnexal pathology12. All women underwent transvaginal or transrectal grayscale and color Doppler ultrasound examination, using a Voluson E8 (GE Healthcare Ultrasound, Milwaukee, WI, USA) ultrasound machine. If the mass was too large to be seen entirely by transvaginal ultrasound, or if malignancy was suspected, transabdominal ultrasound was also performed. The operator assessed the sonographic tumor morphology based on the nomenclature of the IOTA Group13, recording the ultrasound findings in a secure electronic data‐collection system (Astraia version 1.23.6, Astraia Software GmbH, Munich, Germany) together with demographic data, tumor markers and tumor diagnosis based on subjective assessment. The complete list of prospectively collected ultrasound features is shown in Table S1. Along with the subjective classification as benign or malignant, the ultrasound examiner noted his level of confidence (certain, probable or uncertain). All assessments were done prior to obtaining the histological diagnosis. Patients were excluded when no pathology result was obtained, when the pathology result was known before the ultrasound examination (from transabdominal biopsy in the case of metastasis), when pathology was obtained > 120 days after the ultrasound examination and when a patient had previously undergone a bilateral oophorectomy. Patients with a previous hysterectomy who were 50 years of age or older and patients with amenorrhea of more than 1 year were defined as postmenopausal. Pathology was the clinical reference standard used for all patients in this study. Results were obtained by either surgery or biopsy of a metastasis and added to the database. The pathologist was unaware of the results of the ultrasound examination. Tumors were classified according to the World Health Organization International Classification of Ovarian Tumors14. Tumor stage was defined according to the International Federation of Gynecologists and Obstetricians (FIGO) 2012 classification15. The study was approved by the Medical Research Ethics Committee of the Maastricht University Medical Center in The Netherlands. According to Dutch law, this study was not subject to formal ethics committee assessment and therefore no informed consent of patients was required. STARD guidelines16 were followed for the conduct, analysis and reporting of the study.

Prediction models

Risk of malignancy was determined by four prediction models and subjective assessment by the expert ultrasonographer. The ADNEX model11 includes nine variables: age (years), serum CA 125 level (U/mL), type of center (oncology center/other hospital), maximum diameter of the lesion (mm), proportion of solid tissue (%), number of papillary projections (0/1/2/3/> 3), more than 10 cyst locules (yes/no), acoustic shadow (yes/no) and ascites (yes/no). The formula for the risk calculation can be found in the original article11; for use in clinical practice, an application is available (http://www.iotagroup.org/adnexmodel). The outcome of this model is an absolute risk estimate (expressed as a percentage) for five different types of adnexal pathology: benign, borderline, Stage‐I invasive, Stage‐II–IV invasive and secondary metastatic. Furthermore, a risk estimate for the overall risk of malignancy is given (which is the sum of the estimates for all subtypes of malignancy). A cut‐off of ≥ 10% for the overall risk of malignancy was used to predict malignancy11, 17. The IOTA simple rules model (Table 1)9 includes five ultrasound features suggestive of benignity (B‐features) and five features suggestive of malignancy (M‐features). If one or more B‐features are present in the absence of M‐features, the mass is classified as benign, and vice versa. If both B‐ and M‐features exist or if none of the 10 features is present, the simple rules yields an inconclusive result. Two different approaches were used for these difficult‐to‐diagnose masses: use of subjective assessment by the expert ultrasonographer as a second‐stage test, and classification of all inconclusive masses as malignant.
Table 1

International Ovarian Tumour Analysis (IOTA) simple ultrasound‐based rules9 for prediction of malignancy in adnexal mass, divided into five benign (B)‐features and five malignant (M)‐features

B1 Unilocular tumor M1 Irregular solid tumor
B2Solid component with largest diameter < 7 mmM2Presence of ascites
B3Presence of acoustic shadowM3≥ 4 papillary projections
B4Smooth multilocular tumor with largest diameter < 100 mmM4Irregular multilocular solid tumor with largest diameter ≥ 100 mm
B5No blood flow (color score 1)M5Very strong blood flow (color score 4)
International Ovarian Tumour Analysis (IOTA) simple ultrasound‐based rules9 for prediction of malignancy in adnexal mass, divided into five benign (B)‐features and five malignant (M)‐features The LR2 model10 uses six variables to estimate the probability of malignancy: (a) age (years); (b) presence of ascites (yes = 1, no = 0); (c) presence of blood flow within a papillary projection (yes = 1, no = 0); (d) maximum diameter of the solid component (mm; capped at 50 mm); (e) irregular internal cyst walls (yes = 1, no = 0); and (f) presence of acoustic shadow (yes = 1, no = 0). The estimated probability of malignancy for an adnexal tumor is calculated by LR2 as: 1/(1 exp(−z)), where z = −5.3718 + 0.0354a + 1.6159b + 1.1768c + 0.0697d + 0.9586e – 2.9486f. A cut‐off of ≥ 0.1 (≥ 10%) was used to predict malignancy. The RMI scoring system5, 6, 7 combines the ultrasound features of the mass (U), the menopausal status of the patient (M) and the serum CA 125 level (U/mL) into a risk score (U × M × serum CA 125). The ultrasound features are multilocularity, solid areas, bilaterality, ascites and intra‐abdominal metastases. Three principal variants of the RMI were applied (RMI‐I, RMI‐II and RMI‐III), which differed according to the points attributed to the different ultrasound variables and the menopausal status of the patient (Table 2). A total score of ≥ 200 was used as a cut‐off for malignancy.
Table 2

Characteristics of three variants of Risk of Malignancy Index (RMI) scoring system for prediction of malignancy in adnexal mass5, 6, 7

RMI variantUltrasound score (U)Menopausal status (M)
CharacteristicScoreCharacteristicScore
RMI‐INo features present0Premenopausal1
1 feature present1Postmenopausal3
≥ 2 features present3
RMI‐II≤ 1 feature present1Premenopausal1
≥ 2 features present4Postmenopausal4
RMI‐III≤ 1 feature present1Premenopausal1
≥ 2 features present3Postmenopausal3

Ultrasound score (U) includes five features: multilocular cyst, solid areas, bilateral cysts, ascites and intra‐abdominal metastases.

Total scores for U and M are inserted into the following formula to calculate RMI: U × M × serum CA 125.

Characteristics of three variants of Risk of Malignancy Index (RMI) scoring system for prediction of malignancy in adnexal mass5, 6, 7 Ultrasound score (U) includes five features: multilocular cyst, solid areas, bilateral cysts, ascites and intra‐abdominal metastases. Total scores for U and M are inserted into the following formula to calculate RMI: U × M × serum CA 125.

Statistical analysis

All data analyses were performed with IBM SPSS statistics v20 (IBM Corp, Los Angeles, CA, USA) and MedCalc v16.1 (MedCalc Software, Mariakerke, Belgium). For statistical purposes, borderline tumors were considered malignant. In women with bilateral tumors, only the tumor with the most complex ultrasound morphology was included in the statistical analysis. If both masses had the same morphology, the mass with the largest size was used. We calculated sensitivity, specificity, positive and negative predictive values (PPV and NPV) and positive and negative likelihood ratios (LR+ and LR–) for the cut‐off points proposed in the original publications for each model. We also performed a subgroup analysis for pre‐ and postmenopausal patients. Multiple imputation (fully conditional specification) was used to deal with missing values of serum CA 12511, 18. Predictive mean matching regression was applied, using variables from our dataset related to the level of CA 125 (i.e. values included in the ADNEX model and others such as pathology results, previous hysterectomy and parity), or the unavailability of this tumor marker (i.e. a binary indicator with value 1 if results from CA 125 were missing and value 0 if they were not). Receiver–operating characteristics (ROC) curves were derived for the ADNEX model, subjective assessment, LR2 and RMI, and summarized by calculating the area under the curve (AUC) with 95% CI using exact methods based on the binomial distribution. To calculate the AUC for subjective assessment, six levels of diagnostic confidence were used (certainly benign; probably benign; uncertain, but most likely benign; uncertain, but most likely malignant; probably malignant; certainly malignant). The method described by DeLong et al.19 was used to calculate statistical significance of differences between AUCs. The McNemar test was used to test the statistical significance of differences in sensitivity and specificity between the various models, when an AUC could not be calculated (i.e. the two different variants of the simple rules). P < 0.05 was considered statistically significant for all comparisons.

RESULTS

Between July 2011 and July 2015 a total of 851 patients visited our hospital to undergo adnexal ultrasound examination by an expert, and pathology results were obtained for 424 of them. The final cohort consisted of 326 consecutive patients who fulfilled our inclusion criteria, involving 128 (39.3%) premenopausal and 198 (60.7%) postmenopausal patients. A detailed overview of patient inclusion is shown in Figure 1. Patient characteristics and data for the ultrasound features used in the different models are shown in Table 3.
Figure 1

Flow diagram summarizing inclusion of patients in the study. US, ultrasound examination.

Table 3

Descriptive statistics for patient characteristics and ultrasound features according to tumor type in 326 patients with adnexal mass

VariableBenign (n = 211)Borderline (n = 27)Stage I (n = 18)Stage II–IV (n = 56)Metastatic (n = 14)
Age (years)53.250.663.167.764.6
(16.1–87.2)(36.9–65.8)(50.3–68.5)(32.3–87.0)(20.0–87.1)
CA 125 (U/mL)* 26.061.9109.5456.078.6
[16.5–27.0][27.5–295.0][16.8–361.5][170.8–1175.0][27.5–260.8]
Missing values for CA 12531 (14.7)1 (3.7)0 (0.0)0 (0.0)0 (0.0)
Menopausal status
Premenopausal97 (46.0)15 (55.6)6 (33.3)7 (12.5)3 (21.4)
Postmenopausal114 (54.0)12 (44.4)12 (66.7)49 (87.5)11 (78.6)
Patient pregnant2 (0.9)1 (3.7)0 (0.0)1 (1.8)0 (0.0)
Family history of ovarian cancer6 (2.8)1 (3.7)0 (0.0)3 (5.3)0 (0.0)
Laterality of tumor
Unilateral169 (80.1)21 (77.8)16 (88.9)35 (62.5)10 (71.4)
Bilateral42 (19.9)6 (22.2)2 (11.1)21 (37.5)4 (28.6)
Maximum diameter of lesion (mm)80.0155.0122.571.5105.0
[59.0–115.0][123.0–229.0][92.0–214.0][50.3–102.8][63.0–133.5]
Type of tumor
Unilocular65 (30.8)0 (0.0)0 (0.0)0 (0.0)1 (7.1)
Multilocular71 (33.6)6 (22.2)0 (0.0)0 (0.0)0 (0.0)
Unilocular‐solid19 (9.0)5 (19.5)2 (11.1)4 (7.1)0 (0.0)
Multilocular‐solid35 (16.6)16 (59.3)11 (61.1)21 (37.5)4 (28.6)
Solid20 (9.5)0 (0.0)5 (27.8)31 (55.4)9 (64.3)
Unclassifiable1 (0.5)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Solid tissue1 (0.5)0 (0.0)0 (0.0)0 (0.0)0 (0.0)
Presence of solid tissue78 (37.0)21 (77.8)18 (100.0)55 (98.2)14 (100.0)
Maximum diameter of solid tissue (mm)0.050.352.558.060.5
[0.0–16.0][25.0–60.5][29.8–93.5][33.5–80.0][34.5–123.5]
Proportion of solid tissue if present (%)28.927.955.9100.0100.0
[15.7–100.0][18.4–46.9][20.9–100.0][56.1–100.0][54.6–100.0]
Number of locules
021 (10.0)0 (0.0)5 (27.8)31 (55.4)9 (64.3)
1–4134 (63.5)12 (44.4)3 (16.7)7 (12.5)3 (21.4)
5–1029 (13.7)4 (14.8)4 (22.2)13 (23.2)1 (7.1)
> 1027 (12.8)11 (40.7)6 (33.3)5 (8.9)1 (7.1)
Number of papillary projections
0167 (79.1)13 (48.1)13 (72.2)46 (82.1)12 (85.7)
122 (10.4)3 (11.1)1 (5.6)5 (8.9)1 (7.1)
28 (3.8)1 (3.7)0 (0.0)1 (1.8)0 (0.0)
31 (0.5)1 (3.7)0 (0.0)1 (1.8)0 (0.0)
> 313 (6.2)9 (33.3)4 (22.2)3 (5.4)1 (7.1)
Blood flow in papillary projections: color Doppler score
1106 (50.2)3 (11.1)1 (5.6)3 (5.4)2 (14.3)
275 (35.5)11 (40.7)3 (16.7)11 (19.6)2 (14.3)
324 (11.4)12 (44.4)11 (61.1)24 (42.9)4 (28.6)
46 (2.8)1 (3.7)3 (16.7)18 (32.1)6 (42.9)
Irregular cyst wall78 (37.0)20 (74.1)13 (72.2)51 (91.1)13 (92.9)
Metastases2 (0.9)3 (11.1)1 (5.6)34 (60.7)
Acoustic shadow78 (37.0)4 (14.8)2 (11.1)5 (8.9)1 (7.1)
Ascites13 (6.2)6 (22.2)3 (16.7)35 (62.5)2 (14.3)

Data are given as n (%), median (range) or median [interquartile range].

Results based on multiple imputation of missing values.

Flow diagram summarizing inclusion of patients in the study. US, ultrasound examination. Descriptive statistics for patient characteristics and ultrasound features according to tumor type in 326 patients with adnexal mass Data are given as n (%), median (range) or median [interquartile range]. Results based on multiple imputation of missing values. The median interval between ultrasound examination and obtaining the pathology results was 21 days. Results were benign for 211 (64.7%) masses and malignant for 115 (35.3%) masses (Table 4). The most common benign pathologies were cystadenoma, endometrioma, mature teratoma, fibroma and cystadenofibroma. Six benign masses consisted of mixed pathology (two or more different histological subtypes) and therefore could not be categorized into a specific subtype.
Table 4

Pathology results of 326 adnexal masses

Pathology n (%)
Benign211 (64.7)
Cystadenoma 82 (25.2)
Endometriotic cyst 39 (12.0)
Mature teratoma 29 (8.9) 
Fibroma 23 (7.1) 
Cystadenofibroma 15 (4.6) 
Salpingitis  6 (1.8) 
Functional cyst  4 (1.2) 
Parasalpingeal cyst  2 (0.6) 
Struma ovarii  2 (0.6) 
Pseudocyst  2 (0.6) 
Unknown type  1 (0.3) 
Mixed  6 (1.8) 
Borderline 27 (8.3) 
Serous 11 (3.4) 
Mucinous 13 (4.0) 
Other  3 (0.9) 
Malignant 88 (27.0)
Epithelial ovarian cancer 70 (21.5)
Stage
Stage I 14 (20.0)
Stage II  8 (11.4)
Stage III 27 (38.6)
Stage IV 21 (30.0)
Differentiation grade
Grade 1  8 (11.4)
Grade 2 12 (17.1)
Grade 3 46 (65.7)
Unknown  4 (5.7) 
Granulosa cell carcinoma  3 (0.9) 
Yolk‐sac tumor  1 (0.3) 
Metastatic tumor 10 (3.1) 
Non‐primary ovarian carcinoma  4 (1.2) 
Pathology results of 326 adnexal masses The majority (84.3% (97/115)) of malignancies consisted of epithelial ovarian carcinomas. Almost a quarter of all malignant masses were borderline tumors. Furthermore, 14 patients were diagnosed with extraovarian primary tumors; 10 of these were extraovarian tumors (mainly of gastrointestinal or endometrial origin) with metastases to the ovaries, while the others were primary tumors of rectosigmoid or endometrial origin, mimicking a primary tumor of the ovary.

Validation of ADNEX model

The ADNEX model, at a cut‐off ≥ 10%, had a sensitivity of 0.98 (95% CI, 0.93–1.00) and a specificity of 0.62 (95% CI, 0.55–0.68) (Table 5). The AUC for the overall discrimination between benign and malignant tumors was 0.93 (95% CI, 0.89–0.95). AUCs for discrimination between different tumor subgroups ranged between 0.60 and 0.97 (Table 6). The model was particularly able to distinguish benign from Stage‐II–IV tumors, benign from secondary metastatic cancer and borderline from secondary metastatic cancer. In contrast, discrimination between borderline and Stage‐I tumors and between Stage‐II–IV tumors and secondary metastatic cancer was mediocre (Table 6).
Table 5

Diagnostic performance indices for subjective assessment (SA) and four prediction models for differentiation between benign and malignant adnexal masses, in whole study population (n = 326) and in premenopausal (n = 128) and postmenopausal (n = 198) subgroups

Assessment methodSensitivitySpecificityPPVNPVLR+LR–
All patients
ADNEX0.980.620.580.982.560.03
(0.93–1.00)(0.55–0.68)(0.51–0.65)(0.94–0.99)(2.15–3.04)(0.01–0.11)
SA0.900.910.830.959.540.11
(0.83–0.95)(0.86–0.94)(0.76–0.90)(0.90–0.97)(6.26–14.54)(0.06–0.19)
IOTA‐SR + mal0.930.680.610.932.510.11
(0.86–0.97)(0.61–0.70)(0.54–0.69)(0.87–0.97)(2.07–3.06)(0.06–0.22)
IOTA‐SR + SA0.890.900.830.948.910.13
(0.81–0.94)(0.85–0.94)(0.75–0.89)(0.89–0.96)(5.91–13.44)(0.08–0.21)
LR20.930.790.710.954.460.09
(0.86–0.97)(0.73–0.84)(0.63–0.78)(0.91–0.98)(3.41–5.83)(0.04–0.17)
RMI‐I0.710.790.650.833.340.36
(0.62–0.79)(0.72–0.84)(0.56–0.73)(0.77–0.88)(2.52–4.44)(0.27–0.49)
RMI‐II0.740.730.600.842.740.36
(0.65–0.81)(0.66–0.79)(0.51–0.68)(0.77–0.89)(2.14–3.50)(0.26–0.49)
RMI‐III0.710.810.670.843.760.35
(0.62–0.79)(0.75–0.86)(0.58–0.75)(0.78–0.88)(2.78–5.09)(0.26–0.47)
Premenopausal
ADNEX1.000.710.531.003.460.00
(0.86–1.00)(0.61–0.80)(0.39–0.66)(0.93–1.00)(2.53–4.73)(0–NA)
SA0.840.960.870.9520.300.17
(0.66–0.94)(0.89–0.99)(0.68–0.96)(0.88–0.98)(7.70–53.75)(0.08–0.38)
IOTA‐SR + mal0.940.760.560.973.950.08
(0.77–0.99)(0.66–0.84)(0.41–0.69)(0.89–1.00)(2.73–5.70)(0.02–0.33)
IOTA‐SR + SA0.870.960.870.9621.120.13
(0.69–0.96)(0.89–0.99)(0.69–0.96)(0.89–0.99)(8.01–55.66)(0.05–0.34)
LR20.830.920.760.9510.170.18
(0.66–0.94)(0.84–0.96)(0.58–0.89)(0.87–0.98)(5.14–20.10)(0.08–0.39)
RMI‐I0.420.940.680.836.780.62
(0.25–0.61)(0.86–0.97)(0.43–0.86)(0.75–0.90)(2.82–16.32)(0.46–0.84)
RMI‐II0.450.920.640.845.480.60
(0.28–0.64)(0.84–0.96)(0.41–0.82)(0.75–0.90)(2.54–11.81)(0.43–0.82)
RMI‐III0.390.950.710.837.510.65
(0.22–0.58)(0.88–0.98)(0.44–0.89)(0.74–0.89)(2.87–19.65)(0.49–0.86)
Postmenopausal
ADNEX0.980.540.610.972.100.04
(0.91–1.00)(0.44–0.63)(0.52–0.69)(0.88–0.99)(1.72–2.56)(0.01–0.18)
SA0.930.860.830.946.620.08
(0.86–0.97)(0.78–0.92)(0.74–0.90)(0.87–0.98)(4.19–10.46)(0.04–0.18)
IOTA‐SR + mal0.930.610.640.922.410.12
(0.85–0.97)(0.52–0.70)(0.55–0.72)(0.83–0.97)(1.89–3.06)(0.05–0.25)
IOTA‐SR + SA0.890.850.820.925.990.13
(0.80–0.95)(0.77–0.91)(0.72–0.89)(0.84–0.96)(3.84–9.34)(0.07–0.23)
LR20.960.680.690.963.050.05
(0.89–0.99)(0.59–0.77)(0.60–0.77)(0.89–0.99)(2.32–4.01)(0.02–0.16)
RMI‐I0.820.660.640.832.400.27
(0.72–0.89)(0.56–0.74)(0.54–0.73)(0.74–0.90)(1.83–3.16)(0.17–0.43)
RMI‐II0.850.570.590.831.970.27
(0.75–0.91)(0.47–0.66)(0.50–0.68)(0.73–0.90)(1.56–2.48)(0.16–0.45)
RMI‐III0.830.690.670.852.710.24
(0.73–0.90)(0.60–0.77)(0.57–0.75)(0.76–0.91)(2.03–3.63)(0.15–0.39)

Values in parentheses are 95% CI.

Prediction models: Assessment of Different NEoplasias in the adneXa (ADNEX) model11; subjective assessment (SA); International Ovarian Tumour Analysis simple ultrasound‐based rules9 (IOTA‐SR), applied both with inconclusive results being considered to be malignant (IOTA‐SR + mal) and with inconclusive results diagnosed by subjective assessment (IOTA‐SR + SA); IOTA Logistic Regression model 210 (LR2); and three variants of Risk of Malignancy Index (RMI)5, 6, 7.

For both ADNEX and LR2 models, cut‐off value of ≥ 0.1 (≥ 10%) was used; for variants of RMI model, cut‐off value of ≥ 200 was used.

LR–, negative likelihood ratio; LR+, positive likelihood ratio; NPV, negative predictive value; PPV, positive predictive value.

Table 6

Performance of Assessment of Different NEoplasias in the adneXa (ADNEX) model11 for five tumor types, expressed as area under the receiver–operating characteristics curve (AUC)

Tumor typeAUC (95% CI)
Benign vs borderline0.81 (0.75–0.86)
Benign vs Stage I0.87 (0.84–0.91)
Benign vs Stage II–IV0.97 (0.94–0.99)
Benign vs metastatic0.93 (0.89–0.96)
Borderline vs Stage I0.60 (0.44–0.74)
Borderline vs Stage II–IV0.87 (0.78–0.93)
Borderline vs metastatic0.90 (0.77–0.97)
Stage I vs Stage II–IV0.82 (0.71–0.90)
Stage I vs metastatic0.72 (0.53–0.86)
Stage II–IV vs metastatic0.67 (0.55–0.78)
Diagnostic performance indices for subjective assessment (SA) and four prediction models for differentiation between benign and malignant adnexal masses, in whole study population (n = 326) and in premenopausal (n = 128) and postmenopausal (n = 198) subgroups Values in parentheses are 95% CI. Prediction models: Assessment of Different NEoplasias in the adneXa (ADNEX) model11; subjective assessment (SA); International Ovarian Tumour Analysis simple ultrasound‐based rules9 (IOTA‐SR), applied both with inconclusive results being considered to be malignant (IOTA‐SR + mal) and with inconclusive results diagnosed by subjective assessment (IOTA‐SR + SA); IOTA Logistic Regression model 210 (LR2); and three variants of Risk of Malignancy Index (RMI)5, 6, 7. For both ADNEX and LR2 models, cut‐off value of ≥ 0.1 (≥ 10%) was used; for variants of RMI model, cut‐off value of ≥ 200 was used. LR–, negative likelihood ratio; LR+, positive likelihood ratio; NPV, negative predictive value; PPV, positive predictive value. Performance of Assessment of Different NEoplasias in the adneXa (ADNEX) model11 for five tumor types, expressed as area under the receiver–operating characteristics curve (AUC)

ADNEX model vs other methods

When comparing overall test performance, expressed as AUC, subjective assessment performed significantly better than did the ADNEX model (P = 0.01), with AUCs of 0.96 (95% CI, 0.93–0.98) and 0.93 (95% CI, 0.89–0.95), respectively (Table 7 and Figure 2). The difference between the ADNEX model and LR2 (AUC, 0.92 (95% CI, 0.89–0.95)) was not significant (P = 0.60). The AUCs of all variants of the RMI were significantly lower than those of the other methods in our comparison (all P < 0.001).
Table 7

Pairwise receiver–operating characteristics (ROC) curve comparisons expressed as differences in area under the curve (AUC) and P‐values calculated for whole study population

SALR2RMI‐IRMI‐IIRMI‐III
ADNEX0.027 (0.008–0.047)* 0.005 (−0.015–0.025) 0.075 (0.040–0.109) 0.108 (0.066–0.149) 0.088 (0.049–0.127)
P = 0.01 P = 0.5968 P < 0.0001 P < 0.0001 P < 0.0001
SA0.033 (0.007–0.058) 0.102 (0.062–0.141) 0.135 (0.089–0.182) 0.115 (0.072–0.159)
P = 0.0119 P < 0.0001 P < 0.0001 P < 0.0001
LR20.069 (0.029–0.110) 0.103 (0.057–0.148) 0.082 (0.039–0.126)
P = 0.0009 P < 0.0001 P = 0.0002
RMI‐I0.033 (0.016–0.051) 0.013 (0.004–0.022)
P = 0.0003 P = 0.0041
RMI‐II0.020 (0.004–0.036)
P = 0.0123
RMI‐III

Prediction models: Assessment of Different NEoplasias in the adneXa (ADNEX) model11; subjective assessment (SA); IOTA Logistic Regression model 210 (LR2); and three variants of Risk of Malignancy Index (RMI)5, 6, 7.

Methods in left column are used as reference standard for comparisons:

model in upper row outperforms corresponding model in left column;

model in left column outperforms corresponding model in upper row.

Values in parentheses are 95% CI.

Figure 2

Receiver–operating characteristics curves for detection of malignant disease (including borderline ovarian tumors) for the Assessment of Different NEoplasias in the adneXa (ADNEX) model11, subjective assessment (SA), International Ovarian Tumour Analysis (IOTA) Logistic Regression model 210 (LR2) and three variants of the Risk of Malignancy Index5, 6, 7 (RMI) in the whole population (n = 326) (a) and in premenopausal (n = 128) (b) and postmenopausal (n = 198) (c) subgroups. AUC, area under the curve.

Pairwise receiver–operating characteristics (ROC) curve comparisons expressed as differences in area under the curve (AUC) and P‐values calculated for whole study population Prediction models: Assessment of Different NEoplasias in the adneXa (ADNEX) model11; subjective assessment (SA); IOTA Logistic Regression model 210 (LR2); and three variants of Risk of Malignancy Index (RMI)5, 6, 7. Methods in left column are used as reference standard for comparisons: model in upper row outperforms corresponding model in left column; model in left column outperforms corresponding model in upper row. Values in parentheses are 95% CI. Receiver–operating characteristics curves for detection of malignant disease (including borderline ovarian tumors) for the Assessment of Different NEoplasias in the adneXa (ADNEX) model11, subjective assessment (SA), International Ovarian Tumour Analysis (IOTA) Logistic Regression model 210 (LR2) and three variants of the Risk of Malignancy Index5, 6, 7 (RMI) in the whole population (n = 326) (a) and in premenopausal (n = 128) (b) and postmenopausal (n = 198) (c) subgroups. AUC, area under the curve. For the study population as a whole, among all the methods assessed, the sensitivity of the ADNEX model (at cut‐off ≥ 10%) was highest, although the specificity was lowest (Table 5). The sensitivity and specificity of subjective assessment differed significantly from those of the ADNEX model (P = 0.01 and P < 0.0001, respectively). The sensitivity and specificity of the simple rules, using subjective assessment in case of inconclusive test results, were comparable to those of subjective assessment and, as for subjective assessment, differed significantly from those of the ADNEX model (P = 0.03 and P < 0.001 for sensitivity and specificity, respectively). When all masses yielding inconclusive results using the simple rules were classified as malignant instead of using subjective assessment, the specificity dropped significantly (P < 0.0001), while the sensitivity remained high (P = 0.06). The three different variants of the RMI (cut‐off ≥ 200) performed worst of all the methods, with sensitivities as low as 0.71 for both RMI‐I and RMI‐III (95% CI, 0.62–0.79 for both), resulting in the largest differences in sensitivity from that of the ADNEX model (P < 0.0001 for both).

Optimal cut‐off

We calculated optimal cut‐off values for all models with a cut‐off (i.e. ADNEX model, LR2, RMI‐I, RMI‐II and RMI‐III) at a fixed sensitivity of 90% (Table S2). The optimal cut‐off values for both the ADNEX model and LR2 were higher in our population than the values applied in the original articles: ≥ 26.1% for the ADNEX model, with a specificity of 0.76 (95% CI, 0.66–0.85) and ≥ 16.5% for LR2, with a specificity of 0.82 (95% CI, 0.68–0.89). Optimal cut‐off values for RMI‐I, RMI‐II and RMI‐III, on the other hand, were lower in our population (≥ 63.7%, ≥ 51.3% and ≥ 64.3%, respectively).

Pre‐ and postmenopausal subgroups

Malignant masses occurred more frequently in postmenopausal than in premenopausal women (42.4% and 24.2%, respectively). Subjective assessment had the highest diagnostic accuracy for differentiating between benign and malignant adnexal masses in both pre‐ and postmenopausal subgroups (Table 5 and Figure 2). Nonetheless, the differences between the AUC for subjective assessment and that for the ADNEX model and for LR2 were not significant (P = 0.65 and P = 0.08, respectively) for premenopausal women (Table S3), while in the postmenopausal subgroup this difference for the ADNEX model was significant (P = 0.02) (Table S4).

DISCUSSION

We have shown that the ADNEX model has good overall performance in the differentiation between benign and malignant adnexal masses, with an AUC of 0.93 (95% CI, 0.89–0.95). At the recommended cut‐off of ≥ 10%, the model had high sensitivity; however, this was at the expense of specificity. In our population, the optimal cut‐off of ≥ 26.1% gave somewhat more balanced results for sensitivity and specificity. The model is particularly good at differentiating benign from Stage‐II–IV or secondary metastatic tumors and borderline from secondary metastatic cancer. However, other tumor types could be distinguished less easily. Furthermore, our study suggests that subjective assessment remains superior to the ADNEX model. In the original article11, validation AUCs for the ADNEX model were slightly higher than ours, especially for differentiating between various types of malignancy. The model showed better discrimination of borderline from Stage‐I tumors (AUC, 0.75 in the original vs 0.60 in the current study) and Stage‐II–IV from metastatic tumors (AUC, 0.82 in the original vs 0.67 in the current study). This could be due to the distribution of tumor types in each dataset. For example, in the present study, the number of borderline tumors amounted to almost a quarter (23.4%) of all malignancies, and borderline tumors are generally known to be difficult to diagnose20. This also resulted in a slightly lower‐than‐expected test accuracy of subjective assessment. Furthermore, the number of inconclusive results when applying the IOTA simple rules was higher than usual (26.3% in the present study vs 19% in a recent review4). However, general malignancy rate in our study (27%) was similar to that in the original publication (33%). Moreover, this study was conducted in an oncology center, while the original study was performed in both second‐ and third‐level hospitals. Although type of center is the weakest predictor in this model, this could mean that results from our study might not be generalizable11. The poorest performance was seen for RMI, yet this method is advocated by many guidelines. Although a sensitivity of ≥ 90% is generally considered most important in the preoperative diagnosis of ovarian carcinoma, the sensitivity of RMI‐III was only 71% in our study. This is in accordance with the sensitivity of 0.71 (95% CI, 0.67–0.75) for RMI‐III reported in a recent review of 18 studies validating RMI4. Thus, more than a quarter of ovarian carcinomas will be missed, leading to incorrect treatment of these masses and subsequently to deterioration of the prognosis of these patients3, 21. This external validation study compared the ADNEX model with other frequently used models to evaluate its added value. It is a strength of our study that the collection of clinical and ultrasound data was meticulous and prospective, in accordance with IOTA nomenclature and measurement techniques, based on real‐time ultrasound and with blinding to pathology results. Although data analysis was done retrospectively, this was not regarded a limitation since previous research on the effects of design‐related biases in studies assessing diagnostic tests showed that a retrospective design is not associated with overestimation or underestimation of diagnostic accuracy22. Overestimation may occur in diagnostic accuracy studies that use different reference tests or with inadequate blinding; this was not the case in our study. A limitation of this study is that CA 125 values were missing in 32 (9.8%) patients. When an adnexal mass gives the impression of being completely benign from the overall clinical picture and morphology on ultrasound, clinicians may be less inclined to determine the CA 125 level preoperatively. Had we excluded these patients, we would have introduced selection bias, since all but one of the cases in which CA 125 data were missing were benign (the exception concerned a mucinous borderline tumor). Another potential limitation is that we included both pregnant patients (n = 4) and non‐primary ovarian carcinomas (n = 4). The level of CA 125 can rise during pregnancy, which could lead to an overestimation of the risk of malignancy by models incorporating CA 12523. However, an analysis performed on all patients except the pregnant patients confirmed that pregnancy hardly influenced the results of the ADNEX model (data not shown). The non‐primary ovarian carcinomas were included only if the ultrasound findings were suspicious for ovarian pathology. This is in accordance with daily clinical practice, because the risk of malignancy is estimated after ultrasound and before surgery, and therefore before pathology results revealing non‐primary ovarian carcinoma become available. Finally, ultrasound examinations in our study were performed by an expert ultrasonographer. It remains to be shown if the ADNEX model retains its performance when applied by non‐experts. Using the ADNEX model, absolute risk estimates for benign tumors and four types of malignancy can be obtained with acceptable diagnostic performance. However, how to use the model clinically is not straightforward, as also observed by Van Calster et al.17. Two options are available. First, a cut‐off can be used, such as the one applied in this study. However, rigid use of a cut‐off may result in suboptimal and even unethical judgment, according to Van Calster et al.17. Furthermore, this can only be used to distinguish between benign and malignant masses, thereby losing the advantage of a polytomous model (i.e. a model differentiating between more than two subgroups). Second, an assessment per tumor type can be made to estimate how the predicted risk per type relates to the baseline risk. This requires certain calculations (not supplied by the IOTA application for the ADNEX model) and can also be difficult to interpret. In this study the ADNEX model was used as a single test, but it can also be applied as a two‐step triage test. For example, when results of the ADNEX model are between certain values (e.g. 5–25%), subjective assessment can be used as a second‐line test to increase diagnostic accuracy. The same kind of triage test could be performed with the other models investigated. In conclusion, the ADNEX model can be used as a good alternative to subjective assessment in the estimation of risk of malignancy of adnexal masses. However, the advantage of the ADNEX model as a polytomous model for the differentiation between various subtypes of malignancy was modest in our study. More guidance on how to use the ADNEX model in clinical practice would be useful. Table S1 List of prospectively collected ultrasound features (per adnexal mass) Table S2 Optimal cut‐off and corresponding performance indices for models with a cut‐off value at a fixed sensitivity of 90% Tables S3 and S4 Pairwise receiver–operating characteristics (ROC) curve comparisons expressed as differences in the area under the curve (AUC) and P‐values calculated for premenopausal (Table S3) and postmenopausal (Table S4) patients Click here for additional data file.
  22 in total

1.  Staging classification for cancer of the ovary, fallopian tube, and peritoneum.

Authors:  Jaime Prat
Journal:  Int J Gynaecol Obstet       Date:  2013-10-22       Impact factor: 3.561

Review 2.  Subjective assessment versus ultrasound models to diagnose ovarian cancer: A systematic review and meta-analysis.

Authors:  E M J Meys; J Kaijser; R F P M Kruitwagen; B F M Slangen; B Van Calster; B Aertgeerts; J Y Verbakel; D Timmerman; T Van Gorp
Journal:  Eur J Cancer       Date:  2016-02-27       Impact factor: 9.162

3.  Committee Opinion No. 477: the role of the obstetrician-gynecologist in the early detection of epithelial ovarian cancer.

Authors: 
Journal:  Obstet Gynecol       Date:  2011-03       Impact factor: 7.661

4.  A risk of malignancy index incorporating CA 125, ultrasound and menopausal status for the accurate preoperative diagnosis of ovarian cancer.

Authors:  I Jacobs; D Oram; J Fairbanks; J Turner; C Frost; J G Grudzinskas
Journal:  Br J Obstet Gynaecol       Date:  1990-10

5.  A novel approach to predict the likelihood of specific ovarian tumor pathology based on serum CA-125: a multicenter observational study.

Authors:  Ben Van Calster; Lil Valentin; Caroline Van Holsbeke; Jing Zhang; Davor Jurkovic; Andrea Alberto Lissoni; Antonia Carla Testa; Artur Czekierdowski; Daniela Fischerová; Ekaterini Domali; Gregg Van de Putte; Ignace Vergote; Sabine Van Huffel; Tom Bourne; Dirk Timmerman
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2011-09-09       Impact factor: 4.254

6.  Laparoscopic surgical staging of early ovarian cancer.

Authors:  Sarah Weber; Christopher K McCann; David M Boruta; John O Schorge; Whitfield B Growdon
Journal:  Rev Obstet Gynecol       Date:  2011

Review 7.  Towards complete and accurate reporting of studies of diagnostic accuracy: the STARD initiative.

Authors:  Patrick M Bossuyt; Johannes B Reitsma; David E Bruns; Constantine A Gatsonis; Paul P Glasziou; Les M Irwig; Jeroen G Lijmer; David Moher; Drummond Rennie; Henrica C W de Vet
Journal:  BMJ       Date:  2003-01-04

8.  Simple ultrasound-based rules for the diagnosis of ovarian cancer.

Authors:  D Timmerman; A C Testa; T Bourne; L Ameye; D Jurkovic; C Van Holsbeke; D Paladini; B Van Calster; I Vergote; S Van Huffel; L Valentin
Journal:  Ultrasound Obstet Gynecol       Date:  2008-06       Impact factor: 7.299

9.  Physiologic variations of serum tumor markers in gynecological malignancies during pregnancy: a systematic review.

Authors:  Sileny N Han; Anouk Lotgerink; Mina Mhallem Gziri; Kristel Van Calsteren; Myriam Hanssens; Frédéric Amant
Journal:  BMC Med       Date:  2012-08-08       Impact factor: 8.775

Review 10.  Practical guidance for applying the ADNEX model from the IOTA group to discriminate between different subtypes of adnexal tumors.

Authors:  B Van Calster; K Van Hoorde; W Froyman; J Kaijser; L Wynants; C Landolfo; C Anthoulakis; I Vergote; T Bourne; D Timmerman
Journal:  Facts Views Vis Obgyn       Date:  2015
View more
  21 in total

Review 1.  Multimodality imaging review of complex pelvic lesions in female pelvis.

Authors:  Anuradha Chandramohan; Tameem Ahmed Bhat; Reetu John; Betty Simon
Journal:  Br J Radiol       Date:  2020-09-04       Impact factor: 3.039

2.  Ultrasound features of immature ovarian teratomas: Case series and review of literature.

Authors:  S Abdullahi Idle; K Hayes; J A Ross
Journal:  Ultrasound       Date:  2020-02-11

3.  Response to letter to the editor concerning validation of IOTA ADNEX model.

Authors:  Sebastian Szubert; Andrzej Wojtowicz; Patryk Zywica
Journal:  Gynecol Oncol Rep       Date:  2016-11-01

4.  Validation of the Performance of International Ovarian Tumor Analysis (IOTA) Methods in the Diagnosis of Early Stage Ovarian Cancer in a Non-Screening Population.

Authors:  Wouter Froyman; Laure Wynants; Chiara Landolfo; Tom Bourne; Lil Valentin; Antonia Testa; Povilas Sladkevicius; Dorella Franchi; Daniela Fischerova; Luca Savelli; Ben Van Calster; Dirk Timmerman
Journal:  Diagnostics (Basel)       Date:  2017-06-02

5.  Validation of IOTA-ADNEX Model in Discriminating Characteristics of Adnexal Masses: A Comparison with Subjective Assessment.

Authors:  Soo Young Jeong; Byung Kwan Park; Yoo Young Lee; Tae-Joong Kim
Journal:  J Clin Med       Date:  2020-06-26       Impact factor: 4.241

Review 6.  ESGO/ISUOG/IOTA/ESGE Consensus Statement on pre-operative diagnosis of ovarian tumors.

Authors:  Dirk Timmerman; François Planchamp; Tom Bourne; Chiara Landolfo; Andreas du Bois; Luis Chiva; David Cibula; Nicole Concin; Daniela Fischerova; Wouter Froyman; Guillermo Gallardo Madueño; Birthe Lemley; Annika Loft; Liliana Mereu; Philippe Morice; Denis Querleu; Antonia Carla Testa; Ignace Vergote; Vincent Vandecaveye; Giovanni Scambia; Christina Fotopoulou
Journal:  Int J Gynecol Cancer       Date:  2021-06-10       Impact factor: 3.437

7.  Validation of models to diagnose ovarian cancer in patients managed surgically or conservatively: multicentre cohort study.

Authors:  Ben Van Calster; Lil Valentin; Wouter Froyman; Chiara Landolfo; Jolien Ceusters; Antonia C Testa; Laure Wynants; Povilas Sladkevicius; Caroline Van Holsbeke; Ekaterini Domali; Robert Fruscio; Elisabeth Epstein; Dorella Franchi; Marek J Kudla; Valentina Chiappa; Juan L Alcazar; Francesco P G Leone; Francesca Buonomo; Maria Elisabetta Coccia; Stefano Guerriero; Nandita Deo; Ligita Jokubkiene; Luca Savelli; Daniela Fischerová; Artur Czekierdowski; Jeroen Kaijser; An Coosemans; Giovanni Scambia; Ignace Vergote; Tom Bourne; Dirk Timmerman
Journal:  BMJ       Date:  2020-07-30

8.  Diagnostic Accuracy of the ADNEX Model for Ovarian Cancer at the 15% Cut-Off Value: A Systematic Review and Meta-Analysis.

Authors:  Xiaotong Huang; Ziwei Wang; Meiqin Zhang; Hong Luo
Journal:  Front Oncol       Date:  2021-06-17       Impact factor: 6.244

9.  Factors Influencing the Discordancy Between Intraoperative Frozen Sections and Final Paraffin Pathologies in Ovarian Tumors.

Authors:  Hung Shen; Heng-Cheng Hsu; Yi-Jou Tai; Kuan-Ting Kuo; Chia-Ying Wu; Yen-Ling Lai; Ying-Cheng Chiang; Yu-Li Chen; Wen-Fang Cheng
Journal:  Front Oncol       Date:  2021-07-01       Impact factor: 6.244

10.  Comparison of the Diagnostic Performances of Ultrasound-Based Models for Predicting Malignancy in Patients With Adnexal Masses.

Authors:  Le Qian; Qinwen Du; Meijiao Jiang; Fei Yuan; Hui Chen; Weiwei Feng
Journal:  Front Oncol       Date:  2021-06-01       Impact factor: 6.244

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.