Literature DB >> 34693032

Endometriosis classification, staging and reporting systems: a review on the road to a universally accepted endometriosis classification.

Nathalie Vermeulen1, Mauricio S Abrao2,3, Jon I Einarsson4, Andrew W Horne5, Neil P Johnson6, Ted T M Lee7, Stacey Missmer8,9,10, John Petrozza11, Carla Tomassetti12, Krina T Zondervan13,14, Grigoris Grimbizis15, Rudy Leon De Wilde16.   

Abstract

STUDY QUESTION: Which endometriosis classification, staging and reporting systems have been published and validated for use in clinical practice? SUMMARY ANSWER: Of the 22 endometriosis classification, staging and reporting systems identified in this historical overview, only a few have been evaluated, in 46 studies, for the purpose for which they were developed. WHAT IS KNOWN ALREADY: In the field of endometriosis, several classification, staging and reporting systems have been developed. PARTICIPANTS/MATERIALS SETTING
METHODS: A systematic PUBMED literature search was performed. Data were extracted and summarized. MAIN RESULTS AND THE ROLE OF CHANCE: Twenty-two endometriosis classification, staging and reporting systems have been published between 1973 and 2021, each developed for specific, and different, purposes. There still is no international agreement on how to describe the disease. Studies evaluating the different systems are summarized showing a discrepancy between the intended and the evaluated purpose, and a general lack of validation data confirming a correlation with pain symptoms or quality of life for any of the current systems. A few studies confirm the value of the ENZIAN system for surgical description of deep endometriosis. With regards to infertility, the endometriosis fertility index has been confirmed valid for its intended purpose. LARGE SCALE DATA: NA. LIMITATIONS REASONS FOR CAUTION: The literature search was limited to PUBMED. Unpublished classification, staging or reporting systems, or those published in books were not considered. WIDER IMPLICATIONS OF THE
FINDINGS: It can be concluded that there is no international agreement on how to describe endometriosis or how to classify it, and that most classification/staging systems show no or very little correlation with patient outcomes. This overview of existing systems is a first step in working toward a universally accepted endometriosis classification. STUDY FUNDING/COMPETING INTERESTS: The meetings and activities of the working group were funded by the American Association of Gynecologic Laparoscopists, European Society for Gynecological Endoscopy, European Society of Human Reproduction and Embryology and World Endometriosis Society. A.W.H. reports grant funding from the MRC, NIHR, CSO, Wellbeing of Women, Roche Diagnostics, Astra Zeneca, Ferring, Charles Wolfson Charitable Trust, Standard Life, Consultancy fees from Roche Diagnostics, AbbVie, Nordic Pharma and Ferring, outside the submitted work. In addition, A.W.H. has a patent Serum biomarker for endometriosis pending. N.P.J. reports personal fees from Abbott, Guerbet, Myovant Sciences, Vifor Pharma, Roche Diagnostics, outside the submitted work; he is also President of the World Endometriosis Society and chair of the trust board. S.M. reports grants and personal fees from AbbVie, and personal fees from Roche outside the submitted work. C.T. reports grants, non-financial support and other from Merck SA, non-financial support and other from Gedeon Richter, non-financial support from Ferring Pharmaceuticals, outside the submitted work and without private revenue. K.T.Z. reports grants from Bayer Healthcare, MDNA Life Sciences, Roche Diagnostics Inc, Volition Rx, outside the submitted work; she is also a Board member (Secretary) of the World Endometriosis Society and World Endometriosis Research Foundation, Research Advisory Board member of Wellbeing of Women, UK (research charity), and Chair, Research Directions Working Group, World Endometriosis Society. The other authors had nothing to disclose. TRIAL REGISTRATION NUMBER: NA.
© The Author(s) 2021. Published by Oxford University Press on behalf of European Society of Human Reproduction and Embryology.

Entities:  

Keywords:  classification; endometriosis; infertility; reporting; staging; surgery

Year:  2021        PMID: 34693032      PMCID: PMC8530712          DOI: 10.1093/hropen/hoab025

Source DB:  PubMed          Journal:  Hum Reprod Open        ISSN: 2399-3529


What does this mean for patients? Since 1973, clinicians have proposed classification systems for endometriosis, and so far 22 different systems have been developed. Some of these systems focus on symptoms, while others have been developed to document the surgical observations, or predict the outcomes after treatment. Ideally, classification systems are evaluated in a research project to confirm it is useful in clinical management. We found that of the 22 classification systems, few have been evaluated for the purpose for which they were developed. From this review, it can be concluded that there is no international agreement on how to describe endometriosis or how to classify it.

Introduction

Endometriosis is an inflammatory estrogen-dependent disease associated with chronic pelvic pain and/or infertility that is characterized by lesions of endometrial-like tissue outside of the uterus (Johnson ). The disease is usually confined to the abdominal cavity but, rarely, extra-abdominal lesions have been detected in the lungs, brain and even in the eye. Within the pelvic cavity, the variety of presentations is extensive with lesions detected on the peritoneum, within the ovaries (endometrioma), around the uterus, but also affecting the urinary tract, bowel, and vagina. Most definitions, but not all, consider adenomyosis (similar lesions arising within the myometrium) as a separate disease (Zegers-Hochschild ). Traditionally, three phenotypes of endometriosis lesions are recognized; peritoneal, ovarian (endometrioma) and deep endometriosis (DE) (Working group of ESGE ESHRE and WES ,b; Working group of ESGE ESHRE and WES ,b). Symptoms include chronic pelvic pain (dysmenorrhea, acyclic pelvic pain, dyspareunia, dyschezia, dysuria) with severity ranging from mild to debilitating, infertility, and non-specific symptoms (fatigue), but endometriosis can also be asymptomatic (Zondervan ). Treatment options for pain include different medical and hormonal treatments or surgery, while for infertility, surgery and/or ART have been used. Since the first descriptions of endometriosis, this spectrum of lesions and symptoms has urged clinicians to attempt to classify the disease into informative subgroups or hierarchical stages. By definition, classification entails a systematic arrangement of similar entities on the basis of certain differing characteristics (Miller-Keane and O’Toole, 2005). When disease classification can be related to treatment outcomes or prognosis, the system is considered a staging system. In the field of endometriosis, several classification, staging and reporting systems have been developed. The current paper provides, based on an assessment of published studies, a historical overview of these different systems. Validation studies and published reports on the implementation of the different classification, staging and reporting systems have been summarized to highlight the uptake, benefits and drawbacks of published systems for endometriosis.

Materials and methods

A literature review was performed collecting studies and reports focusing on ‘endometriosis’ and ‘classification, staging, or scoring’. PUBMED/MEDLINE was searched, and studies were included from inception (1966) up to 08 May 2020; all retrieved references were checked for relevance. Non-English language studies, animal studies and papers not focusing on endometriosis, including those focusing specifically on adenomyosis, were excluded from the retrieved references. Papers and classifications systems focusing on endometriosis but including adenomyosis were not excluded. For the remaining references, the full-text papers were collected and assessed. Inclusion criteria included original studies focusing on endometriosis and classification, staging or reporting systems. The results of the literature search are summarized in a PRISMA flowchart (Fig. 1). The details of the final set of papers are summarized in evidence tables. The draft paper was published for stakeholder review by all societies involved; 81 comments were tabulated in a review report and, where relevant, incorporated in the final version of the paper.
Figure 1.

PRISMA flow diagram for the selection of studies describing endometriosis classification, staging and reporting systems.

PRISMA flow diagram for the selection of studies describing endometriosis classification, staging and reporting systems.

Results

The literature review retrieved 1305 references; one reference was added at a later stage. After applying the exclusion criteria, 154 full papers were assessed, of which 84 papers were excluded for the following reasons: full-text papers could not be retrieved (n = 9), not written in English (n = 4), inappropriate publication types (case report, expert opinion, editorial) (n = 28) and relevant patients and/or intervention/outcomes are not assessed (not endometriosis or not classification) (n = 43). Seventy papers were included for either describing a classification, staging or reporting system in endometriosis (n = 24) or evaluating one (n = 46) (Fig. 1). The systems in endometriosis described in this paper have been published as classification, staging or reporting systems, even though some were developed for stratification or subgrouping rather than classification. Table I provides an overview of the 22 classification, staging or reporting systems identified in the literature and included in this report. The 46 studies reporting an evaluation of the different systems are listed in Table II.
Table I

Historical overview of endometriosis classification/staging systems.

Endometriosis classification/ staging systemPublication yearClassification/staging based onIntended purpose
DetailsReference
Diagnosis/pre-operativeassessmentDescriptionStagingTreatment selectionPrediction of difficultyof surgeryPrediction of pain remediation/improved QoLPrediction of conception
#ENZIAN classification 2021Surgical observation or imagingNon-invasive and surgical description system for endometriosis Keckstein et al. (2021)
Adhesion scoring system 2020USPre-operative prediction of the pelvic adhesion status Ichikawa et al. (2020)
ENDOGRAM 2019Disease markers in biopsy sampleAnalysis of endometriotic tissues supporting therapeutic decisions Bouquet de Joliniere et al. (2019)
ENDORECT 2019Clinical examination, US, MRIPreoperative score to predict rectosigmoid involvement Chattot et al. (2019)
Bowel Endometriosis Syndrome (BENS) score 2017SymptomsIdentify bowel endometriosis syndrome based on patient reported symptoms and QoL Riiskjær et al. (2017)
Preoperative ultrasound-based endometriosis staging system (UBESS) 2016USPre-operative staging and prediction of the level of complexity of laparoscopic surgery. Menakaya et al. (2016)
Classification of ureteral endometriosis 2015Surgical observationClinical classification of urinary tract endometriosis Knabben et al. (2015)
EPHect SSF—EPHect MSF (surgical form) 2014Surgical observationRecording of surgical phenotypic information and related sample collections obtained at laparoscopy Becker et al. (2014)
Clinical score 2014SymptomsPredict DE presence before endometrioma surgery Lafay Pillet et al. (2014)
LSD/MURO Classification 2013Modified Virtual ColonoscopyDescriptive imaging classification, with implied severity for rectogenital and disseminated endometriosis van der Wat et al. (2013)
ECO system 2012Extent, symptoms and objectivesDetermine most appropriate management Lasmar et al. (2012), Lasmar et al. (2015)
Deep endometriosis staging form 2011USStaging system for DE based on ultrasonographic finding Coccia and Rizzello (2011)
Endometriosis Fertility Index (EFI) 2010Surgical observation + Patient parametersPrediction of (non-IVF) pregnancy after surgery Adamson and Pasta (2010)
ENZIAN classification 2005Surgical observation (or MRI)1Surgical classification for DE Keckstein et al. (2021), Tuttlies et al. (2005)
Chapron classification 2003Surgical observationSurgical classification for DE with suggested operative procedure Chapron et al. (2003a), Chapron et al. (2003b)
Revised ASRM classification 1997Surgical observationAdapted from rAFS score American Society for Reproductive Medicine (1997)
TOP classification 1993Surgical observationEvaluate the severity of endometriosis by site, i.e., fallopian tubes (T), ovaries (O) and the peritoneum (P) and impact on PR Kurata et al. (1993)
Revised American Fertility Society (rAFS) classification 1985Surgical observationPoint system to determine stage/degree of endometriosis involvement American Fertility Society (1985)
American Fertility Society (AFS) classification 1979Surgical observation American Fertility Society (1979)
Buttram classification 1978Surgical observationClassification of endometriosis in the infertile female (expanded from Acosta et al. (1973)) Buttram (1978)
Kistner classification 1977Surgical observationClassification as a tool to link pregnancy rates with the presence/extent of disease Kistner et al. (1977)
Acosta classification 1973Surgical observationClassify the extent of disease and relationship with pregnancy rate Acosta et al. (1973)

ENZIAN, the recently updated ENZIAN classification, which incorporates all types of endometriosis; DE, deep endometriosis; ECO, disease extent complaints, objectives system; PR, pregnancy rate; QoL, quality of life; US, ultrasound.

In case of ENZIAN score based on MRI.

Table II

Overview of replication, validation and clinical value of published systems.

Diagnosis/pre-operative assessmentDescriptionStagingTreatment selectionPrediction of difficulty of surgery/complicationPrediction of pain remediation/QoLPrediction of conceptionFeasibilityInterobserver agreementAim of the studySample sizeAge mean (range or SD)Population sourceEndometriosis case definitionMain resultsReference
ENZIAN

+

length of hospital stays

Correlation with Clavien-Dindo complication grading40134.8 years (SD 8.73)Single centerHistologic confirmationENZIAN A2, C1, C3 and FA were risk factors for the length of hospital stay. Nicolaus et al. (2020)

+

ENZIAN (MRI)

Correlation with intraoperative findings

6333.5 years (22–49)2 centersSurgical confirmationSensitivity and NPV of MRI confirmed by surgery were 95.2% and 91.7% (lesions in the vaginal/rectovaginal space), 78.4% and 56% (utero-sacral ligaments), 91.4% and 89.7% (rectum/sigmoid colon), 57.1% and 94.1% (myometrium), 85.7% and 98.3% (bladder), and 73.3% and 92.2% (intestine), respectively. Burla et al. (2019)

No analysisApplication of the rENZIAN system6030.5 years (28.6–32.3)Single centerLaparoscopic diagnosisMedial compartment was found as the most affected one in 80% of the cases (mainly ovarian endometriomas), followed by posterior compartment in 65% and less frequent, anterior compartment. Morgan-Ortiz et al. (2018)

+

ENZIAN (MRI)

Accuracy of the score compared to surgical-pathologic findings

11536 years (20.3–48)Single centerHistologic confirmationThe sensitivity, specificity, accuracy, PPV and NPV of MRI were 94%, 97%, 95%, 99%, 86%, respectively. The highest accuracy was for adenomyosis (100%) and endometriosis of utero-sacral ligaments (98%), slightly lower for vagina-rectovaginal septum and colorectal walls (96%), and the lowest for bladder endometriosis (92%). The concordance with histopathology was excellent. Di Paola et al. (2015)
ENZIAN

+

Operating time

Preoperative estimation of laparoscopic operating time151

31 years

(19–53 years)

Single centerHistologic confirmationAn ENZIAN-based model for estimating operating time for DE, assuming complication-free procedures (model’s predictive power: P < 0.001). The error of estimation for the operating time prediction is 0 ± 35.35 min (range 83 to +117 min). Haas et al. (2013a)

+ Identification of duplicate classifications of the same lesions219Not reportedSingle centerHistologic confirmation Comparison to rAFS: The severity of DE according to ENZIAN was as follows: grade 1: 45%; grade 2: 26%; grade 3: 19%; grade 4: 10%. Fifty-eight patients were classified according to ENZIAN although they did not fulfill the criteria of DE and had previously been classified according to the rAFS classification. Adaptation of the ENZIAN score would reduce the diagnoses of DE by 36% (95% CI: 29–44%). Haas et al. (2011)
UBESS
+ Correlation with RCOG laparoscopic level of complexity 1, 2, and 3293Not reportedMulti-centerhistory of chronic pelvic pain, or endometriosis.

Strengths: UBESS predicted the requirement for RCOG level 3 laparoscopic surgical skills (accuracy, 89.4–95.4%).

Limitation: misclassification of women who require surgical ureterolysis in the absence of bowel DE.

Espada et al. (2020)

Correlation with the difficulty of the surgery3332.8 years (SD 7.7)Single centerHistologic confirmationWeak concordance between pre-operative UBESS score and the difficulty of the surgery (RCOG, concordance Kendall Tau 0.22) and between UBESS and CHI (concordance 0.30). Chaabane et al. (2019)
UBESS

+

surgical skills

Validation for predicting the correct RANZCOG/AGES’ laparoscopic skill level.15532.7 years (SD 8.6)Multi-centerhistory of chronic pelvic pain and/or endometriosisThe accuracy, sensitivity, specificity, PPV and NPV, and positive and negative likelihood ratios of the UBESS I to predict the RANZCOG/AGES surgical skill levels 1/2 were 99.4%, 98.9%, 100%, 100%, 98.5%, not applicable, and 0.011; those of UBESS II to predict surgical skill levels 3/4 were: 98.1%, 96.8%, 98.4%, 93.8%, 99.2%, 60 and 0.033, and those for UBESS III to predict surgical skill level 6 were: 98.7%, 97.2%, 99.2%, 97.2%, 99.2%, 115.7, and 0.028. The rate of correctly predicting the exact level of skills needed was 98.1%, and Cohen’s kappa statistic for the agreement between UBESS prediction and levels of training required at surgery was 0.97, indicating almost perfect agreement. Tompsett et al. (2019)
EFI
+ Accuracy for the prediction of non-ART pregnancy4598NANACumulative non-ART pregnancy rate at 36 months increased from 10% (95% CI: 3, 16%) in women with EFI score 0–2 to 69% (95% CI: 58, 79%) in women with EFI score 9–10, with a significant increase for each score category (0–2, 3–4, 5–6, 7–8, 9–10)

Meta-analysis

Vesaliet al. (2020)

EFI
AcceptableReproducibility among three experts82Reproductive age as inclusion criteriumSingle-centerSurgical confirmationA near ‘inter-expert’ clinical agreement rate (1.000, 95% CI 0.956–1.000; P = 0.0149) was observed. The numerical agreement between two experts was also high (0.988, 95% CI 0.934–1.000); similarly, high agreement rates were observed for both ‘junior-expert’ comparisons (clinical 0.963, 95% CI 0.897–0.992; numerical 0.988, 95% CI 0.934–1.000) and ‘intra-expert’ comparisons (clinical 0.988, 95% CI 0.934–1.000; numerical 1.000, 95% CI 0.956–1.000). Tomassetti et al. (2020)

+

Conception rate

Accuracy for the prediction of pregnancy12332.4 years (no range)Single-centerSurgical confirmation8 (40%) patients with low, 20 (58.82%) with moderate, and 26 (96.29%) with high EFI conceived. EFI score showed statistically significant positive correlation with pregnancy outcome P = 0.001. Patients conceived spontaneously, after ovulation induction (±IUI) or after IVF. Negi et al. (2019)

+ Accuracy for the prediction of non-ART pregnancy in recurrent endometriosis10731.1 years (SD 0.39)Single-centerSurgical confirmation—recurrent endometriosisCumulative pregnancy rates (CPR) during the first 2 years were 51.86% in women with EFI ≥5, and 26.00% in women with EFI <5. At 3- and 5-year post-surgery, the CPR increased further in women with EFI ≥5, but not in women with EFI <5. The EFI score had good predictive power for postoperative pregnancy in women with recurrent endometriosis. Zhou et al. (2019)
EFI
+ Accuracy for the prediction of non-ART pregnancy68XXSingle-centerSurgical confirmationThe mean EFI scores of 68 women who were not pregnant and pregnant were 5.43 ± 0.36 and 6.88 ± 0.28, respectively. The relation between EFI and natural pregnancy was significant (cumulative overall PR, p = 0.006), whereas rAFS stage was not (univariate logistics, P = 0.853). The cut-point for maximum natural pregnancy outcomes was 6 (area under ROC curve = 0.710, 95% CI 0.586–0.835). Kim et al. (2019) *

+ Accuracy for the prediction of non-ART pregnancy109729.8 years (20–46)Single-centerSurgical confirmationThe difference in cumulative pregnancy incidence among EFI scores 10, 7–9, 4–6, and 2–3 was statistically significant (Kaplan–Meier survival analysis). A significant relationship was found between EFI and time to achieving pregnancy. Zhang et al. (2018) *

+ Accuracy for the prediction of non-ART pregnancy23534 years (20–47)2 centersHistologic confirmationThe EFI was highly associated with live births (P < 0.001): for EFI of 0–2, the estimated cumulative non-ART LBR at 5 years was 0% and steadily increased up to 91% with an EFI of 9–10, while the proportion of women who attempted ART and had a live birth, steadily increased from 38% to 71% among the same EFI strata (P = 0.1). A low least function score was the most significant predictor of failure, followed by having had a previous resection or incomplete resection, being older than 40 compared to <35 years, and having leiomyomas. Maheux-Lacroix et al. (2017) *
EFI
+ Accuracy for the prediction of non-ART pregnancy and ART pregnancy19632.3 years (SD 4.8)Single centerSurgical confirmationThe cumulative PR was 76%. The PR, non-ART PR and ART PR for EFI ≤4 were 42.3%, 0% and 50%; for EFI 5–6, 67.9%, 30.5% and 60.6%; and for EFI ≥7, 87.7%48.2% and 80.3%, respectively. The benefit of ART was inversely correlated with the mean EFI score. On multivariate analysis, the EFI score was significantly associated with non-ART pregnancy (OR 1.629, 95% CI 1.235–2.150). Boujenah et al. (2017) *

+

ART outcome

+ Accuracy for the prediction of non-ART pregnancy + use for treatment selection (Surgery vs surgery + IVF-ET)34532.2 years (22.0–45.0)Single centerHistologic confirmationSignificant differences in spontaneous PRs among different EFI scores were identified (chi2 = 29.945, P < 0.05). The least function score was proved to be the most important factor for EFI. In patients with an EFI score ≥5 after 12 months from surgery, the cumulative PRs of those who received both surgery and IVF-ET were much higher than the spontaneous PRs of those who received surgery alone (chi2 = 4.16, ns). Li et al. (2017) *

+

ART outcome

+ Accuracy for the prediction of non-ART pregnancy + use for treatment selection (Surgery vs surgery + IVF-ET)41232.5 years (SD 4.6)Single centerHistologic confirmationA significant relationship between EFI and spontaneous PR was observed at 12 months (P = 0.001). The least function score and complete removal of endometriotic lesions and pelvic adhesions were significantly associated with spontaneous pregnancy (P = 0.006). Cumulative PR at 18 months was 78.8%. ART benefits were higher for patients with poor EFI. Boujenah et al. (2015) *
EFI

+

non-ART/ART outcome

Accuracy for the prediction of non-ART pregnancy and ART pregnancy10434.5 years (SD 4.5)Single centerSurgical confirmationDifferences in time to non-ART pregnancy for the six EFI groups were statistically significant (log-rank, P = 1.4 × 10(4)). The AUC for EFI as ART outcome predictor was 0.75 (95% CI 0.61–0.89, P = 6.2 × 10(3)), while the best cut-point for pregnancy was 5.5. Garavaglia et al. (2015) *

+ Accuracy for the prediction of non-ART pregnancy16132.08 years (22–40)Single centerSurgical confirmation Comparison to rAFS: Significant differences in cumulative PRs were observed among EFI scores (EFI score 0–3, 8.3%; EFI score 4–7 41.2%, and EFI score 8–10 60.9%; chi2 = 16.254, P < 0.001). EFI scores, but not rAFS stage, predict PRs in patients with endometriosis-associated infertility. Zeng et al. (2014) *

+

ART outcome

Ability of the EFI score and rAFS classification for predicting IVF outcomes19932.0 years (SD 4.2)Single centerHistologic confirmation Comparison to rAFS: The AUC of the EFI score (AUC = 0.641, standard error (SE) = 0.039, 95% CI = 0.564–0.717, cutoff score = 6) was significantly larger than that of the r-AFS classification (AUC = 0.445, SE = 0.041, and 95% CI = 0.364–0.526). The antral follicle count, estradiol level on day of hCG, number of oocytes retrieved, number of oocytes fertilised, number of cleaved embryos, implantation rate, CPR, and cumulative pregnancy rate were greater in the ≥6 EFI score group compared to the ≤5 EFI score group. EFI has more predictive power for IVF outcomes than r-AFS. Wang et al. (2013)
EFI
+ Accuracy for the prediction of non-ART pregnancy23331.3 years (SD 3.9)Single centerSurgical confirmationHighly significant relationship between EFI and the time to non-ART pregnancy (P = 0.0004), with the K-M estimate of cumulative overall PR at 12 months after surgery equal to 45.5% (95% CI 39.47–49.87) ranging from 16.67% (95% CI 5.01–47.65) for EFI scores 0–3, to 62.55% (95% CI 55.18–69.94) for EFI scores 9–10. For each increase of 1 point in the EFI score, the relative risk of becoming pregnant increased by 31% (95% CI 16–47%; i.e. HR 1.31). The ‘least function score’ was found to be the most important contributor to the total EFI score. Tomassetti et al. (2013) *
ECO system
+ Validation16634.0 years (SD 7.2)2 centersHistologic confirmationAmong patients, 78 (47.0%) were medically treated and 88 (53.0%) underwent therapeutic laparoscopy. All three patients scoring two had undergone hormonal treatment. Among 51 patients scoring 3, 49 (96.1%) were clinically managed and 2 (3.9%) underwent surgery. Among 52 patients scoring 4, 26 (50.0%) had undergone medical treatment and 26 (50.0%) surgical treatment. All 56 patients who scored 5 and the 4 patients who scored 6 underwent surgery. Lasmar et al. (2015)
rASRM/rAFS/AFS
Accuracy for the prediction of non-ART pregnancy16132.08 years (22–40)Single centerSurgical confirmation Comparison to EFI: The cumulative PR 36 months after surgery was 46.6% (stage I, 53.6%; stage II, 36.0%; stage III, 51.7%, and stage IV, 41.7%; chi2 = 4.143, P = 0.246). In the 1st year, PRs significantly differed between patients with rAFS stage IV and those with stages I–III (chi2 = 6.024, P = 0.014). rAFS stage did not predict PR in patients with endometriosis-associated infertility. Zeng et al. (2014)

ART

Ability of rAFS (vs EFI) to predict IVF outcomes19932.0 years (SD 4.2)Single centerHistologic confirmation Comparison to EFI: The AUC of the EFI score was significantly larger than that of the r-AFS classification (AUC = 0.445, SE = 0.041, and 95% CI = 0.364–0.526). Wang et al. (2013)

+ Correlation with Clavien-Dindo complication grading40134.8 years (SD 8.73)Single centerHistologic confirmationrASRM IV was a risk factors for the length of hospital stay. Clavien-Dindo Grade III complications were significantly associated with rASRM stage IV. Nicolaus et al. (2020)
rASRM/rAFS/AFS
AcceptableInter-observer agreement14832.0 years (SD 6.7)Single center105 women with and 43 women without a postoperative endometriosisSurgeons and expert reviewers had substantial agreement on diagnosis and staging after viewing digital images (n = 148; mean j = 0.67, range 0.61–0.69; mean j = 0.64, range 0.53–0.78, respectively) and after additionally viewing operative reports (n = 148; mean j = 0.88, range 0.85–0.89; mean j = 0.85, range 0.84–0.86, respectively). Although additionally viewing MRI findings (n = 36) did not greatly impact agreement, agreement substantially decreased after viewing histological findings (n = 67), with expert reviewers changing their assessment from a positive to a negative diagnosis in up to 20% of cases. Schliep et al. (2017)

+ Prognostic value of individual adhesion scores for recurrence37931.8 years (SD 6.7)Single centerhistologic confirmationIn endometriosis of advanced stage, younger age at the time of surgery, bilateral ovarian cysts at the time of diagnosis, a rAFS ovarian adhesion score >24, and complete cul-de-sac obliteration were independent risk factors of poor outcomes, and a rAFS ovarian adhesion score >24 had the highest risk of recurrence [hazard ratio = 2.948 (95% CI: 1.116–7.789), P = 0.029]. Yun et al. (2015)
rASRM/rAFS/AFS

ns

AFC, FSH

Correlation with the number of follicles, the level of FSH3928.7 years (22–34)Single centerSurgical confirmationNo statistically significant correlation between the AFC, the level of FSH and the stage of endometriosis was found. Posadzka et al. (2014)

+

ART outcome

Prediction of IVF outcome4034.7 years (SD 4.3)Single centerSurgical confirmationHigher cancelation rates, higher total gonadotropin requirements, and lower oocyte yield were found in women with endometriosis Stage III and IV compared with both the Stage I/II and control groups. The fertilization rate was higher in Stage III/IV endometriosis compared to Stage I/II. CPR and LBR were comparable between patients with endometriosis Stage I/II and control group, whereas they were significantly lower in patients with endometriosis Stage III/IV compared to other two groups. Pop-Trajkovic et al. (2014)

ART outcome

Correlation of rASRM stage with outcome ART treatment1764 (11)Not applicableNot applicableNot reportedComparison of women with Stage-III/IV vs Stage-I/II endometriosis: LBR, RR = 0.94 (95% CI, 0.80–1.11); CPR, RR = 0.90 (95% CI, 0.82–1.00); miscarriage, RR = 0.99 (95% CI, 0.73–1.36); number of oocytes retrieved, MD = 1.03 (95% CI, 1.67 to 0.39). No relevant difference between Stage-III/IV and Stage-I/II in LBR following ART.

Meta-analysis

Barbosa et al. (2014)

rASRM/rAFS/AFS
Acceptable (surgeons)Interrater and intrarater reliability (8 experts)148Not reportedSingle centerNot reportedThe interrater reliability for endometriosis diagnosis among the 8 surgeons was substantial: Fleiss kappa = 0.69 (95% CI 0.64–0.74). Surgeons agreed on revised ASRM endometriosis staging criteria after experienced assessment in a majority of cases (mean 61%, range 52–75%) with moderate interrater reliability: Fleiss kappa = 0.44 (95% CI 0.41–0.47). Schliep et al. (2012)

Correlation with symptoms319Age categories reportedSingle centerSurgical confirmation, histologic confirmation in 72.9%A correlation between endometriosis stage and severity of symptoms was observed only for dysmenorrhea (chi2 = 5.14, P = 0.02) and non-menstrual pain (chi2 = 5.63, P = 0.018). However, the point estimates of ORs were very close to unity (respectively, 1.33, 95% CI 1.04–1.71, and 1.01, 95% CI 1.00–1.03). The association between endometriosis stage and severity of pelvic symptoms was marginal and inconsistent. Vercellini et al. (2007)
rASRM/rAFS/AFS

pain recur rence

+ relapse

pregnancy

Predictive value for response to surgical treatment537Age categories reportedSingle centerHistologic confirmationThe cumulative probability of pregnancy at 3 years from surgery was 47% (51% at stage I, 45% at stage II, 46% at stage III and 44% at stage IV; chi2 = 1.50, ns). The cumulative probability of moderate or severe dysmenorrhoea recurrence in 425 symptomatic subjects was 24% (32% at stage I, 24% at stage II, 21% at stage III and 19% at stage IV; chi2 = 6.39, ns). The cumulative probability of disease relapse was 12% (3% at stage I, 11% at stage II, 11% at stage III and 23% at stage IV; chi2 = 24.95, P = 0.0001). Vercellini et al. (2006)

+ +

+

pain

Association with type and severity of pain, and with symptoms after laparoscopic surgery95Not reportedSingle centerSurgical confirmationIn patients with AFS ≥16; preoperative pain scores were significantly higher for dysmenorrhea (P = 0.0022) and deep dyspareunia (P < 0.0001) but not for non-menstrual pain. After surgery, dysmenorrhea improved in 43% of cases in patients with AFS <16 vs 66% with AFS ≥16 (P = 0.0037). For deep dyspareunia, improvement was reported by 33% and 67%, respectively (ns). Improvement in non-menstrual pain was not significantly different (67% vs 56%). Cases with advanced disease benefit the most from laparoscopy. Milingos et al. (2006)
rASRM/rAFS/AFS
Impact of treatments on pain + association pain scores181Not reportedSingle centerHistologic confirmationNo correlation was found between the stage of endometriosis according to R-AFS score and the severity of CPP. Szendei et al. (2005)

VariableComparison of laparoscopic and laparotomic scoring84Not reportedSingle centerSurgical confirmationThere was considerable variability in laparoscopic vs laparotomic scoring by the same observer, with largest variability in ovarian endometriosis and cul-de-sac obliteration subscores, and least variability for peritoneum endometriosis. The inter-method variation was sufficient to alter the staging in 34.5% of patients, with a difference of two stages in 3.6% of patients. In general, there was fair-to-good agreement (kappa coefficient 0.49). Lin et al. (1998)

ART pregnancy

Impact of severity of endometriosis on the outcome of IVF61

33.6 years (SD 3.0) and

34.4 years (SD 4.0)

Single centerSurgical confirmationResponse to COH and the number, maturity, and quality of the oocytes was comparable between stages. Fertilization rates for oocytes of patients with stages III/IV were significantly impaired compared to those in stage I/II (P = 0.004). The implantation rate, CPR, and miscarriage rate were comparable between stages I/II and stages III/IV. Pal et al. (1998)
rASRM/rAFS/AFS
VariableIntraobserver and interobserver variability—5 experts20Not reportedSingle centerNot reportedThe grand total score varied with an SD of 13.44 when the videotape of a single patient was rated twice by the same observer and varied with an SD of 17.12 when rated by two observers. The greatest variability occurred in endometriosis of the ovary and cul-de-sac obliteration, with less variability for peritoneum endometriosis and for ovarian and tubal adhesions. Comparison of intraobserver and interobserver scores resulted in a change in endometriosis stage in 38% and 52% of patients, respectively. Hornstein et al. (1993)

AcceptableReproducibility—2 experts315Not reportedSingle centerNot reportedGood to fair agreement scoring endometriosis between the investigator and the blinded reviewer was noted. Rock (1995)

No analysisFeasibility of AFS and adnexal score89Not reportedSingle centerSurgical confirmationSuggestion to split class IV in class IV and class V (with higher rate of bilateral adnexal disease/adhesions). Canis et al. (1992)

age, symptoms

Relation with endometriosis-associated symptoms and patients’ age.20630 years (18–44)Single centerSurgical confirmation

No significant differences were found in total endometriosis scores, in active scores or in adhesion scores in different age groups. There was no significant difference in prevalence rate of symptoms for different aspects of endometriosis (implants, cysts or adhesions).

AFS score does not reflect the intensity of symptoms.

Marana et al. (1991)
rASRM/rAFS/AFS
VariableFeasibility of measuring endometrioma5229.5 years (24–39)Single centerNACyst diameter was calculated using the geometric formula radius = 3 square root of 3 V/4 pi where V = volume of liquid aspirated. Eight patients with apparently normal pelvis had endometriosis, and 14 with apparent minimal or mild endometriotic lesions were restaged. Laparoscopic ovarian puncture of enlarged ovaries was important for correct diagnosis and staging of endometriosis. Candiani et al. (1990)

pregnancy

Relation with pregnancy after therapy214Not reportedSingle centerSurgical confirmationThe AFS scale poorly specifies the relation between severity of disease and pregnancy outcome after therapy. A nonparametric monotonic estimator, generating a relationship between AFS score and pregnancy following treatment is shown to improve the discriminatory power of the AFS scale. Guzick et al. (1982)

+

pregnancy

(+ Kistner, Buttram) Prediction of pregnancy21428.6 years (17–37)Single centerSurgical confirmationThe AFS score revealed significant differences in pregnancy rate only if categories were combined (mild plus moderate versus severe plus extensive, P ≤ 0.05). The AFS system revealed that pregnancy success was significantly reduced if an ovarian endometrioma was greater than 3 cm or had ruptured (P ≤ 0.01). Rock et al. (1981)

The symbols should be interpreted as follows; + indicates a significant positive result in a correlation (or similar) test, − indicates a significant negative result in a correlation (or similar) test, ns indicates a non-conclusive/non-significant result in a correlation (or similar) test. The highlighted columns represent the intended purpose of the classification/staging system (as in Table I).

AFC, antral follicle count; AFS, American Fertility Society; AGES, Australasian Gynaecological Endoscopy and Surgery; COH, controlled ovarian hyperstimulation; CPP, chronic pelvic pain; CPR, clinical pregnancy rate; DE, deep endometriosis; EFI: endometriosis fertility index; HR, hazard ratio; IVF-ET, IVF embryo transfer; K-M, Kaplan–Meier; LBR, live birth rate; NPV, negative predictive value; PPV, positive predictive value; RANZCOG, Royal Australian and New Zealand College of Obstetricians and Gynaecologists; RR, relative risk.

Study included in meta-analysis (Vesali et al., 2020).

Historical overview of endometriosis classification/staging systems. ENZIAN, the recently updated ENZIAN classification, which incorporates all types of endometriosis; DE, deep endometriosis; ECO, disease extent complaints, objectives system; PR, pregnancy rate; QoL, quality of life; US, ultrasound. In case of ENZIAN score based on MRI. Overview of replication, validation and clinical value of published systems. + length of hospital stays ENZIAN (MRI) Correlation with intraoperative findings ENZIAN (MRI) Accuracy of the score compared to surgical-pathologic findings + Operating time 31 years (19–53 years) Strengths: UBESS predicted the requirement for RCOG level 3 laparoscopic surgical skills (accuracy, 89.4–95.4%). Limitation: misclassification of women who require surgical ureterolysis in the absence of bowel DE. + surgical skills Meta-analysis Vesali + Conception rate + ART outcome + ART outcome + non-ART/ART outcome + ART outcome ART ns AFC, FSH + ART outcome ART outcome Meta-analysis Barbosa pain recur rence + relapse pregnancy + pain ART pregnancy 33.6 years (SD 3.0) and 34.4 years (SD 4.0) age, symptoms No significant differences were found in total endometriosis scores, in active scores or in adhesion scores in different age groups. There was no significant difference in prevalence rate of symptoms for different aspects of endometriosis (implants, cysts or adhesions). AFS score does not reflect the intensity of symptoms. pregnancy + pregnancy The symbols should be interpreted as follows; + indicates a significant positive result in a correlation (or similar) test, − indicates a significant negative result in a correlation (or similar) test, ns indicates a non-conclusive/non-significant result in a correlation (or similar) test. The highlighted columns represent the intended purpose of the classification/staging system (as in Table I). AFC, antral follicle count; AFS, American Fertility Society; AGES, Australasian Gynaecological Endoscopy and Surgery; COH, controlled ovarian hyperstimulation; CPP, chronic pelvic pain; CPR, clinical pregnancy rate; DE, deep endometriosis; EFI: endometriosis fertility index; HR, hazard ratio; IVF-ET, IVF embryo transfer; K-M, Kaplan–Meier; LBR, live birth rate; NPV, negative predictive value; PPV, positive predictive value; RANZCOG, Royal Australian and New Zealand College of Obstetricians and Gynaecologists; RR, relative risk. Study included in meta-analysis (Vesali et al., 2020).

Classification and staging systems

In the 1970s, the first ‘classification’ system for endometriosis originated from a study attempting to describe the results of conservative surgical treatment of endometriosis and hereby classify the extent of the disease and its relationship with the pregnancy rate (Acosta ). Later, this classification system was further expanded and submitted for consideration to the American Fertility Society (AFS) (Buttram, 1978). Similarly, a system published by Kistner et al., (1977) was submitted for endorsement by AFS and the International Federation of Fertility Societies (IFFS). In 1979, AFS published a classification system on behalf of a group of experts including the leading authors of the previous systems (American Fertility Society, 1979). The AFS classification for endometriosis, and later published revised AFS (rAFS) and revised American Society for Reproductive Medicine (rASRM) classification, have been the main standard for classifying endometriosis ever since (American Fertility Society, 1979, 1985; American Society for Reproductive Medicine, 1997). The different versions of the AFS/ASRM classification system reflect the progress made in the knowledge on endometriosis. Later attempts of surgical disease description or staging have focused on disease location—such as urinary tract endometriosis (Knabben )—or subtypes of the disease—such as DE (Chapron ; Tuttlies ; Coccia and Rizzello, 2011): the latter group includes the ENZIAN-Score for classifying DE (Tuttlies ). The recently updated #ENZIAN classification extends the previous ENZIAN score to incorporate all types of endometriosis (Keckstein ). The EPHect standard recommended (SSF) and minimum required (MSF) were developed for recording of surgical phenotypic information on endometriosis (Becker ). While these classification systems mainly focused on describing the extent of disease during surgery, some attempted to link these observations to outcomes, such as pregnancy rates, after surgery (American Fertility Society, 1979, 1985; Kurata ; American Society for Reproductive Medicine, 1997), or indicators for disease management (Chapron ). Another group of classification systems focused on pre-operative assessment of the extent of the disease (van der Wat ; Lafay Pillet ; Knabben ; Menakaya ; Riiskjær ; Chattot ; Ichikawa ), based on either patient-reported symptoms or pre-operative imaging, or a combination of both. The ultrasound-based endometriosis staging system (UBESS) additionally aimed at predicting the complexity of endometriosis surgery (Menakaya ), as does the adhesion scoring system in case of pelvic adhesions (Ichikawa ). Two systems aimed specifically at outcome prediction for endometriosis: the ‘disease extent, complaints, objectives (ECO)-system’, aiming to select the most appropriate management based on reported symptoms (Lasmar , 2015); and the endometriosis fertility index (EFI), aiming to predict the probability of natural conception after surgery (Adamson and Pasta, 2010). Finally, a recently published study ‘Endogram’ sets out to ‘profile’ endometriosis heterogeneity, based on the assessment of several disease markers in a biopsy sample, with the ultimate aim of guiding therapeutic options (Bouquet de Joliniere ).

Replication, validation and clinical value of published systems

We retrieved 46 studies, mostly observational, reporting an evaluation of the different classification, staging or reporting systems (Table II). The aims and outcomes of the different studies varied significantly. Of the included studies, eight reported on the practical aspects of the classification systems, being either the feasibility, or the inter-observer and intra-observer variability. Of these, seven studies focused on the rASRM classification system (Candiani ; Canis ; Hornstein ; Rock, 1995; Lin ; Schliep , 2017), while the most recent one evaluated the reproducibility of the EFI (Tomassetti ). Early studies (1990s) reported significant variability in rAFS classification by five independent experts reviewing surgery recordings, specifically with regards to endometriosis of the ovary and cul-de-sac obliteration (Hornstein ), although another study from the same period reported good to fair agreement in scoring endometriosis between two experts using photographs or recordings (Rock, 1995). In more recent studies, the rASRM classification system was found to have acceptable inter-observer agreement and inter-rater reliability among surgeons and experts reviewing surgical photographs and/or recordings (Schliep , 2017). Studies have also focused on the feasibility of specific aspects of the AFS/rAFS/rASRM classification, specifically classifying bilateral adnexal disease (Canis ), measuring cyst diameter (Candiani ), or the reliability of laparoscopic versus laparotomic scoring (Lin ). For the EFI, a near perfect clinical agreement rate between two independent experts (1.000, 95% CI 0.956–1.000) and high agreement between two assessments by the same expert (0.988, 95% CI 0.934–1.000) has been reported (Tomassetti ). The remaining studies (n = 37) applied the classification or staging systems to a cohort of patients, evaluating whether the system was reliable with regards to its proposed aim, or evaluating whether the classification could be used for other purposes. The latter was mainly the case for the AFS/rAFS/rASRM classification system, which was developed for surgical staging, but has been evaluated for predicting symptom relief and recurrence after surgery (Milingos ; Vercellini ), complications after surgery (Nicolaus ), ovarian reserve (Posadzka ), time to non-ART pregnancy (Yun ), pregnancy outcomes (Rock ; Guzick ) and the outcomes of ART treatment (Pal ; Barbosa ; Pop-Trajkovic ). Furthermore, correlation of the AFS/rAFS/rASRM classification system with symptoms before surgery was evaluated (Marana ; Szendei ; Vercellini ). To our knowledge, there are no studies specifically evaluating the feasibility or reliability of the AFS/rAFS/rASRM classification system for its proposed aim, being a descriptive system of surgical documentation of disease. The EFI, a 10-point scoring system grouped into five categories of risk, has been assessed in 12 studies and one review. It has been mainly assessed for its intended purpose, being prediction of the probability of natural conception after surgery (Tomassetti ; Wang ; Zeng ; Boujenah ; Garavaglia ; Boujenah ; Li ; Maheux-Lacroix ; Zhang ; Kim ; Negi ; Zhou ). Interestingly, in some of these studies, an evaluation of the prognostic value of the different factors included in the EFI score was also performed. A meta-analysis summarized these validation studies and evaluated the performance of the EFI score for predicting non-ART pregnancy after endometriosis surgery, observing good predictive value with a pooled estimate for AUC of 0.71 (95%CI 0.65-0.80) (Vesali ). Some authors have (additionally) evaluated whether its purpose can be extended to guide patient management, by using it to select patients that would benefit from ART treatments (Boujenah ; Li ), and/or predicting the chances of pregnancy from ART treatments (Wang ; Garavaglia ). The ECO system has been validated for prediction of management (surgery or medical treatment) in a single study, by the same authors that developed the tool (Lasmar ). The UBESS system, developed for pre-operative staging and prediction of the complexity of surgery, was evaluated in three studies reporting on the latter purpose, i.e. difficulty of surgery (Chaabane ; Espada ) and prediction of surgical skill levels (Tompsett ). Finally, the ENZIAN classification system, developed as a descriptive system for surgical staging of DE, was evaluated for its purpose in two studies (Haas ; Morgan-Ortiz ). Another evaluation reported on the correlation between the ENZIAN classification and complications after surgery, classified according to the Clavien-Dindo complication grading (Nicolaus ). The use of the ENZIAN classification system was further extrapolated for its use in pre-operative assessment with imaging. Two studies evaluated this MRI-based ENZIAN system (Di Paola ; Burla ), and a third study reported on a model to predict operation time based on the MRI-based ENZIAN classification (Haas ). In general, published classification or staging systems have been developed with various intended purposes, ranging from diagnosis (including symptoms) and pre-operative assessment, surgical description or staging, to prediction of surgical difficulty and treatment outcomes (both for pain and infertility). The studies summarized above confirm the surgical value of the ENZIAN system for description and pre-operative assessment of DE, and of UBESS for predicting laparoscopic difficulty. However, most classification/staging systems show no or very little correlation with patient outcomes. The exception is the EFI, which has been consistently shown to provide good predictive value for natural conception after endometriosis surgery. It is notable that the development of the EFI was data driven, whereas the development of most other classification/staging systems was based on expert opinion.

Discussion

The current paper provides an overview of currently available and published classification, staging and/or reporting systems for endometriosis. We include 22 systems published between 1973 and 2021. Each of the systems was developed for a specific and different purpose. The first systems tried to classify the various forms of endometriosis that were encountered (at the time), and this remains the purpose of more recent systems as there still is no international agreement on how to describe the disease. Next, we summarize published studies evaluating the different classification, staging or reporting systems. From this, we show a discrepancy between the intended and the evaluated purpose, and a general lack of validation data confirming correlation with pain symptoms or quality of life for any of the current endometriosis classification systems. With regards to infertility, the EFI has been confirmed valid for its intended purpose of predicting the probability of natural conception after surgery. Classification and staging systems are widely used in medicine and have been shown to be valuable in guiding clinical management. Examples include the American Joint Committee on Cancer (AJCC) tumor-node-metastasis (TNM) staging systems for cancer, the Gleason score for prostate cancer, the Braak Staging for Parkinson’s disease, and the ACR/EULAR Classification Criteria for Rheumatoid Arthritis. The ACR/EULAR Classification Criteria for Rheumatoid Arthritis were developed based on data analysis of 3115 patients followed by a consensus process in which determinants for risk of rheumatoid arthritis were selected and grouped into a classification system, which was further refined, and the feasibility was optimized (Aletaha ). A review published 2 years afterwards identified 17 articles (total 6816 patients) and 17 meeting abstracts (total 4004 patients) investigating the classification criteria. Only a minority of the articles aimed to validate the system in the intended population, while the other studies extended the target population, used different reference standards or adapted the criteria in the system (Radner ). The review findings are similar to the findings of the current review, although in a different field of medicine. The TNM staging system for cancer was developed in the early 1950s, aiming to guide clinical classification of cancer cases by anatomical extent. The philosophy and technique of TNM staging were developed by Professor Denoix and later adopted by international societies (Denoix, 1952; Sellers, 1971). The system is currently at its eighth edition (Edge ). The system is revised in a 6- or 8-year cycle and changes are implemented based on high-level evidence collected through large datasets. Specifications are available for different types of cancer, and the system has been complemented with a summary staging or classification linked to prognosis and used for treatment planning. In the TNM system for lung cancer, as an example, TNM staging adaptations included the removal of rare findings from the system, and corrections in stage grouping based on survival outcomes (Lim ). In addition, the TNM system has been increasingly complemented by molecular marker data that more accurately stratify risk in patients and guide appropriate treatment options. The longevity and update systems applied for the TNM staging, and the value of additional molecular subtype identification, are likely to be important guides for the design of future endometriosis classification and staging systems that correlate with relevant patient outcomes. Specifically, for endometriosis, previous reviews have summarized and commented on existing classification systems, mainly rASRM, ENZIAN and EFI. It has previously been concluded that the rASRM system has poor correlation with pain, fertility outcomes or prognosis, and that the ENZIAN system has poor correlation with symptoms and infertility (Haas ; Johnson ; Andres ). The EFI system needs further evaluation with regards to the importance of the different parameters and whether to include the completeness of surgical treatment (Maheux-Lacroix ). The conclusion of previous reviews of classification systems and our overview is consistently phrased as a need for a generally accepted classification with a clear goal/purpose (Adamson, 2011; Haas ; Johnson ; Andres ; Rolla, 2019). Yet, as presented in this paper, the goal and purpose of published classification, staging or reporting systems for endometriosis is often ignored when evaluating classification or staging systems, limiting the value of the evaluation studies and of the systems in general. To our knowledge, this is the first report comparing the outcomes assessed in the studies with the intended purposes of the classification systems. Indeed, we show that the rASRM system has been widely evaluated, often with negative conclusions, but we found no studies evaluating the system for its intended goal, which is descriptive surgical staging. ENZIAN and EFI have been evaluated for their intended purpose, but studies have also evaluated whether they can be applied more widely and for other outcomes. Apart from these three systems, only two other classification systems (UBESS and ECO) have been evaluated for their intended purpose, with no evaluations of the remaining 17 classification systems, preventing them from further dissemination and uptake. The current review provides an overview of published classification systems and studies evaluating them, but no detailed assessment of all positive and negative aspects of the classification systems, so as not to repeat previous reviews (Johnson ). In addition, we have restricted our overview to classification systems published in peer-reviewed papers and available through PUBMED/MEDLINE. Although locally used and/or unpublished systems are available and can be valuable, the relevance of including them in the current review was considered low, as they would not be widely applied, nor evaluated by (independent) researchers. For universal use of a classification system, it is pivotal that the system is accessible, validated, reliable and reproducible. Our report includes a summary of evaluation studies assessing these aspects in the different classification systems. Even though we retrieved 46 studies, the value of these evaluations is limited. Apart from the EFI score, the current classification systems have not been thoroughly assessed for validity, feasibility and reproducibility. Moreover, a significant proportion of the evaluation studies have examined the classification systems for purposes other than the one for which they were designed and initially evaluated. Endometriosis is a challenging disease to classify, as it is known to have different phenotypes and presentations (both with regards to the type of lesions and their location), and various symptoms without a clear link to phenotype or presentation. Moreover, the natural progression of the disease is unknown. There is a perceived need for a validated classification or descriptive system for endometriosis that could support further progress in defining subgroups and more importantly guiding the therapeutic options for women with pain and/or infertility. Such a system would certainly also progress endometriosis research by unifying patient subgroups and facilitating the development of prognostic and predictive tools. From this overview, it can be concluded that several classification, staging and reporting systems have been developed for endometriosis. A universally accepted categorization of the disease using the experience from the already existing proposals seems to be needed for clinical and research purposes.

Data availability

All data are incorporated into the article.
  81 in total

1.  Validation of a score to guide endometriosis therapy for the non-specialized gynecologist.

Authors:  Ricardo B Lasmar; Bernardo P Lasmar; Roger Keller Celeste; Angelika Larbig; Rudy L De Wilde
Journal:  Int J Gynaecol Obstet       Date:  2015-07-06       Impact factor: 3.561

2.  Incidence of bilateral adnexal disease in severe endometriosis (revised American Fertility Society [AFS], stage IV): should a stage V be included in the AFS classification?

Authors:  M Canis; J L Pouly; A Wattiez; H Manhes; G Mage; M A Bruhat
Journal:  Fertil Steril       Date:  1992-03       Impact factor: 7.329

3.  Suggested classification for endometriosis: relationship to infertility.

Authors:  R W Kistner; A M Siegler; S J Behrman
Journal:  Fertil Steril       Date:  1977-09       Impact factor: 7.329

4.  World Endometriosis Society consensus on the classification of endometriosis.

Authors:  Neil P Johnson; Lone Hummelshoj; G David Adamson; Jörg Keckstein; Hugh S Taylor; Mauricio S Abrao; Deborah Bush; Ludwig Kiesel; Rulla Tamimi; Kathy L Sharpe-Timms; Luk Rombauts; Linda C Giudice
Journal:  Hum Reprod       Date:  2016-12-05       Impact factor: 6.918

5.  Classification of endometriosis. The American Fertility Society.

Authors: 
Journal:  Fertil Steril       Date:  1979-12       Impact factor: 7.329

6.  Use of the endometriosis fertility index in daily practice: A prospective evaluation.

Authors:  J Boujenah; I Cedrin-Durnerin; C Herbemont; A Bricou; C Sifer; C Poncelet
Journal:  Eur J Obstet Gynecol Reprod Biol       Date:  2017-10-10       Impact factor: 2.435

7.  Comparison of revised American Fertility Society and ENZIAN staging: a critical evaluation of classifications of endometriosis on the basis of our patient population.

Authors:  Dietmar Haas; Radek Chvatal; Alwin Habelsberger; Peter Wurm; Wolfgang Schimetta; Peter Oppelt
Journal:  Fertil Steril       Date:  2011-04       Impact factor: 7.329

8.  Endometriosis fertility index predicts live births following surgical resection of moderate and severe endometriosis.

Authors:  S Maheux-Lacroix; E Nesbitt-Hawes; R Deans; H Won; A Budden; D Adamson; J A Abbott
Journal:  Hum Reprod       Date:  2017-11-01       Impact factor: 6.918

9.  Recommendations for the surgical treatment of endometriosis-part 1: ovarian endometrioma.

Authors:  Ertan Saridogan; Christian M Becker; Anis Feki; Grigoris F Grimbizis; Lone Hummelshoj; Joerg Keckstein; Michelle Nisolle; Vasilios Tanos; Uwe A Ulrich; Nathalie Vermeulen; Rudy Leon De Wilde
Journal:  Gynecol Surg       Date:  2017-12-19

10.  Recommendations for the Surgical Treatment of Endometriosis. Part 1: Ovarian Endometrioma.

Authors:  Ertan Saridogan; Christian M Becker; Anis Feki; Grigoris F Grimbizis; Lone Hummelshoj; Joerg Keckstein; Michelle Nisolle; Vasilios Tanos; Uwe A Ulrich; Nathalie Vermeulen; Rudy Leon De Wilde
Journal:  Hum Reprod Open       Date:  2017-12-19
View more
  3 in total

1.  Endometriosis and Impaired Placentation: A Prospective Cohort Study Comparing Uterine Arteries Doppler Pulsatility Index in Pregnancies of Patients with and without Moderate-Severe Disease.

Authors:  Noemi Salmeri; Antonio Farina; Massimo Candiani; Carolina Dolci; Giulia Bonavina; Caterina Poziello; Paola Viganò; Paolo Ivo Cavoretto
Journal:  Diagnostics (Basel)       Date:  2022-04-19

2.  Endometriosis classification systems: an international survey to map current knowledge and uptake.

Authors:  Krina T Zondervan; Stacey Missmer; Mauricio S Abrao; Jon I Einarsson; Andrew W Horne; Neil P Johnson; Ted T M Lee; John Petrozza; Carla Tomassetti; Nathalie Vermeulen; Grigoris Grimbizis; Rudy Leon De Wilde
Journal:  Hum Reprod Open       Date:  2022-03-01

3.  A 10-Year Journey to Diagnosis With Endometriosis: An Autobiographical Case Report.

Authors:  Lexi R Frankel
Journal:  Cureus       Date:  2022-01-17
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.