Cytologic diagnosis of parotid gland Warthin tumor: Systematic review and meta-analysis.

Roie Fisher1, Ohad Ronen1,2.   

Abstract

It is important to define the accuracy of fine-needle aspiration cytology (FNAC) in the diagnosis of Warthin tumor (WT). This systematic review and meta-analysis evaluated the accuracy of FNAC in the diagnosis of WT in the parotid gland and WT growth rate. For determination of FNAC accuracy, 17 studies, encompassing 1710 cases, were included. Pooled random effects model estimates of sensitivity, specificity, PPV, and NPV were 93.7% (95%CI: 92.1, 95.3), 97.9% (95%CI: 97, 98.9), 93.3% (95%CI: 91.5, 95.1), and 97.4% (95%CI: 96.4, 98.4), respectively. FNAC is highly reliable for the diagnosis of WT of the parotid. The high PPV suggests that patients with a cytological diagnosis of WT of the parotid may be assigned to active surveillance.
© 2022 The Authors. Head & Neck published by Wiley Periodicals LLC.

Keywords:  Warthin tumor; diagnosis; fine-needle aspiration; parotid gland

Year:  2022        PMID: 35586869      PMCID: PMC9545504          DOI: 10.1002/hed.27099

Source DB:  PubMed          Journal:  Head Neck        ISSN: 1043-3074            Impact factor:   3.821


Abbreviations: FN, false negative; FNAC, fine‐needle aspiration cytology; FP, false positive; NPV, negative predictive value; PPV, positive predictive value; TN, true negative; TP, true positives; WT, Warthin tumor.

INTRODUCTION

Warthin tumor (WT), also known as papillary cystadenoma lymphomatosum or adenolymphoma, is a benign neoplasm that arises almost exclusively in the parotid gland, which is the origin of most salivary gland tumors. It comprises 15% of all parotid tumors and is the second most frequent neoplasm in the parotid gland, after pleomorphic adenoma. WT is more common in Caucasians in the 6th and 7th decades of life, smokers, and males, although a narrowing of the gender gap has recently been observed, likely due to increased smoking among women. Several etiological factors have been suggested, including Epstein–Barr virus (EBV) infection, autoimmune diseases, radiation, chronic inflammation, and most importantly, cigarette smoking. In the last few years, there has been an increasing trend in the diagnosis of WT in comparison to other parotid tumors; in some studies, WT was found to be more common than pleomorphic adenoma. One study showed that this trend cannot be explained by changes in smoking patterns. The same study suggested metabolic syndrome and obesity as two central risk factors. The typical clinical manifestation of WT is a painless firm swelling in the upper neck, but some cases are asymptomatic, and others show symptoms of facial nerve branch irritation, ear pain, tinnitus, and hearing impairment. In general, WT grows slowly, and malignant transformations are rare, occurring at a rate of less than 0.1%. The malignant transformation can arise from the epithelial or lymphoid cells of WT. Synchronous or metachronous tumors, some of which are malignant, in proximity to WT of the parotid, occur rarely. Preoperative assessment of WT with fine‐needle aspiration (FNA) cytological analysis (FNAC) typically identifies a combination of necrotic debris, lymphocytes, and oncocytic epithelial clusters. Despite the reports of slow growth, some studies reported cases in which WT doubled in size within 1 year. It is therefore of prime importance to identify the patients at risk of rapid WT growth. However, the literature is scarce and does not include important demographic and clinical data.

Several studies evaluated the performance of FNA for the diagnosis of WT but showed considerably conflicting results. So et al. reported 95.8% sensitivity and 97.2% positive predictive value (PPV), and Viguer et al. reported 90.4% and 98.1%, respectively, whereas other studies demonstrated a rate of false diagnosis of about 25%–40%. This systematic review and meta‐analysis considered publications that include a comparison between preoperative FNAC (index test) and postsurgical histopathological diagnosis (reference test) of WT in the parotid gland. In addition, the mean growth rate of the tumor as measured by imaging modalities was calculated.

METHODS

Search strategy

This systematic review and meta‐analysis followed the Preferred Reporting Items for Systematic Reviews and Meta‐Analyses (PRISMA) extension for diagnostic test accuracy guidelines. A comprehensive search of PubMed and Scopus was conducted on July 25, 2021 to identify relevant publications. The search terms used were “fine needle aspiration” OR “fine needle sampling” OR “FNA,” AND “Warthin tumor” OR “adenolymphoma” OR “parotid” AND “tumor” OR “neoplasm” OR “mass” as well as “growth rate.” No date restriction was applied. To expand the search, the “similar articles” function in PubMed and the “related articles” function in Google Scholar were used. In addition, the reference lists of selected articles were screened.

Eligibility criteria

Articles that used FNAC for the diagnosis of a parotid gland lesion and histopathological assessment for a final postoperative diagnosis were included. Cases with a nondiagnostic FNAC result were not included for the assessment of FNAC accuracy. Case reports, letters or comments to the authors, articles not in English, and cases or articles that did not address WT were excluded. To calculate WT growth rate, we included articles that diagnosed WT using histopathology or cytology. WT size evaluations were based on articles that reported CT‐ or MRI‐based tumor size evaluation at least twice with a minimum time interval of 3 months, reported a measured dimension, and mentioned follow‐up duration. If more than two size evaluations were available, only the first and the last were included. In cases where several articles used the same database in overlapping years, only the most recently published article was included to avoid data duplication.

Data extraction, processing, and synthesis

The following data were extracted for each case: method of needle guidance, needle size, patient characteristics, the time range (years) of the data, FNAC diagnosis, and final histopathological diagnosis. Data regarding follow‐up duration for each lesion, imaging modality used, and initial and final size of WT were also extracted. The following parameters were calculated: true positive (TP): both the FNAC and histopathological diagnoses are WT; false positive (FP): FNAC result is WT, but histopathological diagnosis is not; false negative (FN): FNAC result is a different lesion (not WT), but histopathological result is WT; true negative (TN): both FNAC and histopathology results are not WT. Our goal was to conduct a meta‐analysis of individual FNAC estimates in the diagnosis of WT; for this reason, all the included studies contain cases of FP, FN, and TP, and some studies also contain TN cases. Studies lacking FP, FN, or TP cases were excluded. All the false positive and false negative results were classified as malignant, benign, or normal. Cytodiagnostic or histopathologic results that were classified as “probably malignant,” “suspected malignant,” “suspicious for malignancy,” or “cannot exclude malignancy” were included in the malignant group.
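The four diagnostic estimates follow directly from the TP, FP, FN, and TN definitions above. The following is a minimal sketch of that arithmetic; the function name `diagnostic_estimates` and the example counts are illustrative assumptions, not the authors' code. Specificity and NPV are returned only when TN is available, mirroring the fact that some included studies did not report TN.

```python
def diagnostic_estimates(tp, fp, fn, tn=None):
    """Return per-study diagnostic estimates as percentages.

    Sensitivity and PPV need only TP, FP, and FN.
    Specificity and NPV are added only when TN is known.
    """
    estimates = {
        "sensitivity": 100.0 * tp / (tp + fn),  # share of true WT that FNAC called WT
        "ppv": 100.0 * tp / (tp + fp),          # share of FNAC "WT" calls that were WT
    }
    if tn is not None:
        estimates["specificity"] = 100.0 * tn / (tn + fp)  # non-WT correctly called non-WT
        estimates["npv"] = 100.0 * tn / (tn + fn)          # FNAC "not WT" calls that were right
    return estimates


# Illustrative counts (hypothetical single study).
example = diagnostic_estimates(tp=9, fp=1, fn=2, tn=111)
rounded = {name: round(value, 1) for name, value in example.items()}
print(rounded)
```

With these counts the sketch gives a sensitivity of 81.8%, a PPV of 90.0%, a specificity of 99.1%, and an NPV of 98.2%.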

Quality assessment

The included studies were assessed for quality of methodology based on the Quality Assessment of Diagnostic Accuracy Studies‐2 (QUADAS‐2) tool. The risk of bias was rated as “low,” “high,” or “unclear,” corresponding to a score of “2,” “1,” and “0,” respectively. A study awarded a cumulative score ≥6 was considered of high quality.
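The scoring rule can be sketched as follows. The text does not state which domains were summed; this sketch assumes the four QUADAS‐2 risk‐of‐bias domains (patient selection, index test, reference standard, flow and timing), which gives a maximum score of 8. The names `RATING_SCORE`, `quality_score`, and `is_high_quality` are hypothetical.

```python
# Score assigned to each risk-of-bias rating, as described in the text.
RATING_SCORE = {"low": 2, "high": 1, "unclear": 0}


def quality_score(ratings):
    """Sum the scores of all rated domains for one study."""
    return sum(RATING_SCORE[rating] for rating in ratings.values())


def is_high_quality(ratings, threshold=6):
    """A study with a cumulative score >= threshold counts as high quality."""
    return quality_score(ratings) >= threshold


# Hypothetical study rated on four assumed domains.
study = {
    "patient_selection": "low",
    "index_test": "low",
    "reference_standard": "low",
    "flow_and_timing": "unclear",
}
print(quality_score(study), is_high_quality(study))
```

Under this assumption, a study can miss the threshold with as few as two domains rated "unclear".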

Statistical analysis

IBM SPSS Statistics for Windows, Version 27.0 was used for data analysis. R software and related packages were used for the meta‐analysis. The pooled sensitivity, specificity, PPV, and NPV were calculated to assess the diagnostic value of FNAC. Mean percent diameter change for the entire population and percent diameter change for subgroups were calculated to assess WT growth rate. Dependent variables were assessed for normality using the Kolmogorov–Smirnov test and by graphically comparing the frequency distribution to a bell shape. The t test was used to compare the means of subgroups; a p‐value <0.05 was considered statistically significant. Between‐study heterogeneity was evaluated using the Cochran Q test and the Higgins I² statistic, where I² > 50% was taken to indicate significant heterogeneity. Random and fixed effects models were used for pooled estimates. Publication bias was evaluated by visual inspection of the symmetry of the funnel plot.
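The pooling and heterogeneity steps can be illustrated with a short sketch. The text states only that R and related packages were used, so the choices here are assumptions: untransformed proportions, inverse‐variance weights, a 0.5 continuity correction, and the DerSimonian–Laird estimator for the random effects model. The function name `pool_proportions` is hypothetical, and the sketch is not expected to reproduce the published pooled values.

```python
import math


def pool_proportions(events, totals, z=1.96):
    """Pool per-study proportions (e.g. TP out of TP + FN for sensitivity).

    Returns the fixed effect estimate, the DerSimonian-Laird random effects
    estimate with its confidence interval, Cochran's Q, and I^2 in percent.
    """
    if len(events) != len(totals) or len(events) < 2:
        raise ValueError("need matching lists with at least two studies")

    # Per-study proportion and variance; the 0.5 continuity correction
    # (an assumption) keeps the variance finite when a proportion is 0 or 1.
    props, variances = [], []
    for x, n in zip(events, totals):
        p = (x + 0.5) / (n + 1.0)
        props.append(p)
        variances.append(p * (1.0 - p) / (n + 1.0))

    # Fixed effect model: inverse-variance weighted mean.
    weights = [1.0 / v for v in variances]
    sum_w = sum(weights)
    fixed = sum(w * p for w, p in zip(weights, props)) / sum_w

    # Cochran's Q and Higgins' I^2.
    q = sum(w * (p - fixed) ** 2 for w, p in zip(weights, props))
    df = len(props) - 1
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0

    # DerSimonian-Laird between-study variance, truncated at zero.
    c = sum_w - sum(w * w for w in weights) / sum_w
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # Random effects model: weights include the between-study variance.
    re_weights = [1.0 / (v + tau2) for v in variances]
    sum_re = sum(re_weights)
    pooled = sum(w * p for w, p in zip(re_weights, props)) / sum_re
    half_width = z / math.sqrt(sum_re)

    return {
        "fixed": fixed,
        "random": pooled,
        "ci": (pooled - half_width, pooled + half_width),
        "q": q,
        "i2": i2,
        "tau2": tau2,
    }


# Hypothetical data: two studies that disagree strongly.
result = pool_proportions(events=[9, 1], totals=[10, 10])
print(round(result["random"], 3), round(result["i2"], 1))
```

When the studies agree, the between‐study variance is zero and the two models coincide; when they disagree, the random effects weights become more equal across studies, which is one reason fixed and random effects estimates can differ noticeably.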

RESULTS

The literature search yielded 451 records, of which 17 articles met the inclusion criteria (Figure 1). Table 1 presents key study design elements of all the included articles. The reports were published between 1996 and 2019, conducted in 12 countries, and encompassed a total of 1710 cases, with study sample sizes ranging between 5 and 223 cases. Needle type and needle guidance technique were mentioned occasionally, patient sex was reported in 9 studies, and patient age was reported in 10 studies. The year range of the presented data was unavailable for two articles. All the studies were retrospective and involved a medical records database search.
FIGURE 1

Flow diagram for identification of studies for assessment of fine‐needle aspiration cytology accuracy in diagnosing Warthin tumor in the parotid gland [Color figure can be viewed at wileyonlinelibrary.com]

TABLE 1

Summary of studies reviewed to determine fine‐needle aspiration cytology accuracy

| Article, year published | Years range | Location | Study design | Needle type | Needle guidance | Age (range) | Males |
|---|---|---|---|---|---|---|---|
| Al‐Khafaji et al. 27 | 1986–1996 | USA | Retrospective | N/A | N/A | Mean: 56 (5–90) | 50.6% |
| Altin et al. 16 | 2008–2017 | Turkey | Retrospective | 23 G | N/A | Mean: 47.5 (7–82) | 54.6% |
| Atula et al. 28 | 1984–1991 | Finland | Retrospective | 23 G | N/A | Unknown | Unknown |
| Edizer et al. 29 | 2005–2013 | Turkey | Retrospective | 23 or 25 G | US | Unknown | Unknown |
| Huang et al. 30 | N/A | Taiwan | Retrospective | N/A | US | Unknown | Unknown |
| Jafari et al. 31 | 2000–2006 | France | Retrospective | 27 G | US or palpation | Mean: 50.5 (17–87) | 60% |
| Jayaram et al. 32 | N/A | Malaysia | Retrospective | 22 G | N/A | Unknown | Unknown |
| Jechova et al. 33 | 2006–2016 | Czechia | Retrospective | N/A | US | Median: 57 (12–96) | 42.6% |
| Raymond et al. 34 | 1992–2000 | Canada | Retrospective | N/A | N/A | Mean: 60.2 (14–88) | 1.4:1 |
| So et al. 18 | 2006–2017 | Canada | Retrospective | N/A | N/A | Mean: 63.2 (SD 10.4) | Unknown |
| Suzuki et al. 35 | 1999–2017 | Japan | Retrospective | 21 or 22 G | US or free hand technique | Unknown | Unknown |
| Zbären et al. 19 | 1990–1998 | Switzerland | Retrospective | 22 G | N/A | Mean: 55 (16–97) | 46% |
| Akbas et al. 36 | 1994–2000 | Turkey | Retrospective | 25 G | US | Unknown | Unknown |
| Behzatoglu et al. 37 | 1997–2002 | Turkey | Retrospective | 22 G | N/A | Mean: 44 (12–80) | 54.6% |
| Ali et al. 38 | 2002–2010 | Pakistan | Retrospective | 22 G | Free hand technique | Mean: 44 (15–78) | 56.5% |
| Riley et al. 39 | 1996–2000 | New Zealand | Retrospective | 24 G | N/A | Unknown | Unknown |
| Weinberger et al. 40 | 1985–1989 | USA | Retrospective | 22 G | Free hand technique | Mean: 57 (SD 12.9) | 77.7% |

Abbreviations: N/A, not available; US, ultrasound.

In seven studies, the TN rate was not reported, as the publication focused on evaluation of the concordance between FNA and histopathology in the diagnosis of WT. Study data, individual diagnostic estimates, and pooled estimates are summarized in Table 2. Individual and pooled estimates are also presented as forest plots in Figures 2 and 3. The pooled sensitivity and PPV were calculated based on cases from 17 studies. The random effects model of the 17 studies showed a pooled sensitivity of 93.7% (95%CI: 92.1, 95.3) and pooled PPV of 93.3% (95%CI: 91.5, 95.2). Pooled specificity and NPV were calculated based on the 10 studies in which TN data were reported. The random effects model of these 10 studies showed a pooled specificity of 97.9% (95%CI: 97, 98.9) and a pooled NPV of 97.4% (95%CI: 96.4, 98.4). Heterogeneity assessments showed that PPV and specificity estimates were homogeneous (Q = 20.2, p = 0.210, I² = 20.8% for PPV, and Q = 10.9, p = 0.282, I² = 17.4% for specificity). Sensitivity and NPV estimates were found to be heterogeneous (Q = 74.3, p < 0.0001, I² = 78.5% for sensitivity, and Q = 32.3, p = 0.0002, I² = 72.1% for NPV). Visual assessment of the funnel plots for each of the four FNAC estimates showed no asymmetrical distribution (Figure 4). All included studies were of high quality (Figure 5).
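The reported I² values can be checked against the reported Q statistics, since I² = max(0, (Q − df)/Q) with df equal to the number of studies minus one. The sketch below applies that standard relation to the Q values quoted above, using 17 studies for PPV and sensitivity and 10 for specificity and NPV; the function name `i_squared` is an illustrative assumption.

```python
def i_squared(q, n_studies):
    """Higgins' I^2 (percent) from Cochran's Q and the number of studies."""
    df = n_studies - 1
    if q <= 0:
        return 0.0
    return max(0.0, (q - df) / q) * 100.0


# (Q, number of studies) as reported in the text.
reported = {
    "PPV": (20.2, 17),
    "sensitivity": (74.3, 17),
    "specificity": (10.9, 10),
    "NPV": (32.3, 10),
}

checked = {name: round(i_squared(q, k), 1) for name, (q, k) in reported.items()}
print(checked)
```

The recomputed values (20.8, 78.5, 17.4, and 72.1) agree with the I² figures given in the text.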
TABLE 2

Summary of the fine‐needle aspiration accuracy analysis

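| Article | NPV | PPV | Specificity | Sensitivity | TN | FN | FP | TP |
|---|---|---|---|---|---|---|---|---|
| Al‐Khafaji et al. 27 | 97.7 | 83.3 | 97.7 | 83.3 | 129 | 3 | 3 | 15 |
| Altin et al. 16 | 93.3 | 78.6 | 93.8 | 71.7 | 137 | 13 | 9 | 33 |
| Atula et al. 28 | | 68.3 | | 87.5 | | 13 | 4 | 28 |
| Edizer et al. 29 | | 93.0 | | 95.2 | | 3 | 2 | 40 |
| Huang et al. 30 | 78.3 | 74.2 | 90.6 | 88.5 | 29 | 8 | 3 | 23 |
| Jafari et al. 31 | 100 | 91.7 | 96.7 | 100 | 59 | 0 | 2 | 22 |
| Jayaram et al. 32 | | 100.0 | | 80.0 | | 1 | 0 | 4 |
| Jechova et al. 33 | | 93.1 | | 96.6 | | 7 | 15 | 201 |
| Raymond et al. 34 | | 89.2 | | 89.2 | | 4 | 4 | 33 |
| So et al. 18 | | 97.2 | | 95.8 | | 3 | 2 | 69 |
| Suzuki et al. 35 | | 94.5 | | 92.6 | | 11 | 8 | 137 |
| Zbären et al. 19 | 91.2 | 84.4 | 97 | 62.8 | 167 | 15 | 5 | 27 |
| Akbas et al. 36 | 100 | 94.7 | 98.4 | 100.0 | 62 | 0 | 1 | 18 |
| Behzatoglu et al. 37 | 98.4 | 100 | 100 | 75.0 | 63 | 1 | 0 | 3 |
| Ali et al. 38 | 98.2 | 90.0 | 99.1 | 81.8 | 111 | 2 | 1 | 9 |
| Riley et al. 39 | 95.7 | 66.7 | 97.8 | 50.0 | 90 | 4 | 2 | 4 |
| Weinberger et al. 40 | 90.4 | 60.0 | 95 | 42.9 | 38 | 4 | 2 | 3 |
| Total (1710 cases) | | | | | 885 | 93 | 63 | 669 |
| Pooled value [95% CI]: random effects model | 97.4 [96.4, 98.4] | 93.3 [91.5, 95.2] | 97.9 [97.0, 98.9] | 93.7 [92.1, 95.3] | | | | |
| Pooled value [95% CI]: fixed effect model | 94.0 [92.1, 95.8] | 86.6 [81.8, 91.4] | 96.5 [95.0, 98.0] | 79.5 [74.4, 84.6] | | | | |

Abbreviations: FN, false negative; FP, false positive; NPV, negative predictive value; PPV, positive predictive value; TN, true negative; TP, true positive.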

FIGURE 2

Forest plot of (A) positive predictive value (PPV) and (B) negative predictive value (NPV). Dashed line depicts value of random model estimate. FE, fixed model estimate; RE, random model estimate

FIGURE 3

Forest plot of (A) sensitivity and (B) specificity. Dashed line depicts value of random model estimate. FE, fixed model estimate; RE, random model estimate

FIGURE 4

Funnel plots for assessment of publication bias. Each point represents a separate study. (A) PPV, (B) sensitivity, (C) specificity, (D) NPV. Horizontal axis represents the effect of the studies, vertical axis represents study size, vertical dashed line indicates effect summary, white triangle shape depicts the values extending 1.96 standard errors around the effect summary, this area should include 95% of studies. Studies with a larger sample size and hence, higher precision, are located at the top, studies with higher estimates are located at the right. When publication bias occurs, one expects asymmetry in the scatter around the effect summary, with more studies showing a positive as opposed to a negative result. NPV, negative predictive value

FIGURE 5

Risk‐of‐bias and applicability concerns summary for each domain of the Quality Assessment of Diagnostic Accuracy Studies‐2 (QUADAS‐2) tool for each included study. (A) Risk‐of‐bias table. (B) Risk‐of‐bias graph [Color figure can be viewed at wileyonlinelibrary.com]

A summary of all cases falsely diagnosed by FNAC is presented in Figure 6. The total FP rate was 3.6% (63 out of 1710 patients), and the FP rate of malignant tumors was 2% (35 out of 1710 patients). When considering all positive FNAC results (n = 732), the rate of malignant FP was 4.7% (35 out of 732 patients). Most of the cases in the FP category were classified as malignant (55.5%, n = 35), and the leading FP diagnosis was adenoid cystic carcinoma (n = 12), followed by mucoepidermoid carcinoma (n = 11). The total FN rate was 5.4% (93 out of 1710 patients); 21 (22.5%) were classified as malignant.
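The rates in this paragraph are simple ratios of the pooled counts. The sketch below recomputes them; it assumes the percentages were truncated, not rounded, to one decimal place, which is an inference from the numbers and is not stated in the text. The helper name `truncated_percent` is hypothetical.

```python
def truncated_percent(numerator, denominator):
    """Percentage truncated (not rounded) to one decimal place.

    Integer arithmetic avoids floating-point surprises at the cut-off.
    """
    return (1000 * numerator // denominator) / 10


TOTAL_CASES = 1710
FP, FN = 63, 93
MALIGNANT_FP, MALIGNANT_FN = 35, 21
POSITIVE_FNAC = 732  # all FNAC results reported as WT (TP + FP)

rates = {
    "FP rate": truncated_percent(FP, TOTAL_CASES),
    "malignant FP rate": truncated_percent(MALIGNANT_FP, TOTAL_CASES),
    "malignant FP among positives": truncated_percent(MALIGNANT_FP, POSITIVE_FNAC),
    "malignant share of FP": truncated_percent(MALIGNANT_FP, FP),
    "FN rate": truncated_percent(FN, TOTAL_CASES),
    "malignant share of FN": truncated_percent(MALIGNANT_FN, FN),
}
print(rates)
```

Under the truncation assumption, the sketch gives 3.6, 2.0, 4.7, 55.5, 5.4, and 22.5, matching the figures in the text; conventional rounding would give slightly higher values for several of them (for example, 3.7% for the total FP rate).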
FIGURE 6

False negative (A) and false positive (B) results of fine‐needle aspiration cytology diagnosis of Warthin tumor of the parotid gland in 17 studies. Blue and orange colors represent benign + normal and malignant cases, respectively. ACC, acinic cell carcinoma; adenoCA, adenocarcinoma; CCA, cribriform cystadenocarcinoma; DBCL, diffuse B cell lymphoma; MEC, mucoepidermoid carcinoma; Neg. MAL, negative for malignancy; PA, pleomorphic adenoma; SCC, squamous cell carcinoma; Sus. CA, suspicion of carcinoma; Sus. MAL, suspicion of malignancy; Sus. SCC, suspicion of squamous cell carcinoma [Color figure can be viewed at wileyonlinelibrary.com]


DISCUSSION

This systematic review and meta‐analysis evaluated the accuracy of FNAC in the diagnosis of WT and investigated WT growth rate. The study found FNAC to have a high specificity (97.9% [95%CI: 97, 98.9]) and PPV (93.3% [95%CI: 91.5, 95.1]), yet a variable sensitivity (93.7% [95%CI: 92.1, 95.3]) and NPV (97.4% [95%CI: 96.4, 98.4]). Although FNAC is highly specific in the diagnosis of WT, the review found that 35/732 (4.7%) positive results proved malignant on postoperative histopathology; hence, patients choosing an observational approach based on a preoperative FNAC diagnosis of WT should be followed up with caution. False FNAC results involving WT are a well‐known phenomenon. The falsely diagnosed FNAC cases may be the result of sampling error, when WT cysts with acellular fluid are sampled. In addition, WT oncocytes tend to undergo necrosis and to change to squamous or mucinous epithelium, which may lead to the diagnosis of a malignant tumor. WT necrosis can also cause cyst spillage and subsequent inflammation and reactive changes, thus challenging cytodiagnosis. Other sources of sampling errors are mixed tumors of WT and synchronous benign and malignant lesions. Both FN and FP cases may harbor malignant tumors and should raise concern of progression of malignant disease. Yet, characteristics of these cases were not available for review; thus, future research is warranted to identify the features of falsely diagnosed cases. Considering the overall high PPV, relying on a positive FNAC diagnosis of WT can be a reasonable option in selected cases with close follow‐up. The cited malignant transformation rate of WT diagnosed by histopathology is 0.1%. Yet, in the case of FNAC diagnosis, false results are mostly due to sampling error. Although malignant transformation can contribute to a false diagnosis, we think it has a small effect overall. Additionally, cases of malignant transformation were not included in the review, and this subject is beyond the scope of the current study.

We chose not to present data regarding WT growth rate in this review. The data obtained in appraised publications were very heterogeneous. Moreover, factors influencing growth are many and unknown, all of which may lead to imprecise results. We suggest that future studies conduct more comprehensive, three‐dimensional size assessments of WT.

LIMITATION

This review had several limitations. Articles reviewed to determine FNA accuracy were retrospective in nature, and some had a small sample size. Another limitation was that sensitivity and NPV showed a high degree of heterogeneity. Cytodiagnosis terminology used when assessing the salivary glands has some variability between medical centers, which might have been even more pronounced in articles published before 2015, before the Milan system was developed. Given the variability in reported FN and FP malignancy rates established by this review, it is important that physicians be familiar with the institutional rate for malignancy when FNAC fails to diagnose correctly. This is key for advising patients and informing them adequately for decision making. FNAC results are impacted by many factors, including collection method, physician FNA training, freehand versus ultrasound‐guided technique, and pathologist‐ versus physician‐performed FNA. Most of the included studies failed to adequately report on these factors.

CONCLUSIONS

To the best of our knowledge, this is the first systematic review and meta‐analysis of the diagnostic accuracy of FNAC for WT of the parotid gland. The study found that FNAC has high performance in the diagnosis of WT at this site. Although FP results were not common, most turned out to be malignant. The overall high PPV suggests that selected patients with a cytological diagnosis of WT of the parotid can be assigned to active surveillance.

CONFLICT OF INTEREST

The authors declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

REFERENCES (10 of 41 shown)

1.  Diagnostic accuracy of fine-needle aspiration biopsy is determined by physician training in sampling technique.

Authors:  B M Ljung; A Drejet; N Chiampi; J Jeffrey; W H Goodson; K Chew; D H Moore; T R Miller
Journal:  Cancer       Date:  2001-08-25       Impact factor: 6.860

2.  Fine-needle aspiration cytology in parotid masses: our experience in Canterbury, New Zealand.

Authors:  Neil Riley; Robert Allison; Scott Stevenson
Journal:  ANZ J Surg       Date:  2005-03       Impact factor: 1.872

3.  Value of fine-needle aspiration cytology of parotid gland masses.

Authors:  P Zbären; C Schär; M A Hotz; H Loosli
Journal:  Laryngoscope       Date:  2001-11       Impact factor: 3.325

4.  Change in Warthin's tumor incidence: a 20-year joinpoint trend analysis.

Authors:  Orhan Tunç; Burhanettin Gönüldaş; Yusuf Arslanhan; Muzaffer Kanlıkama
Journal:  Eur Arch Otorhinolaryngol       Date:  2020-05-29       Impact factor: 2.503

5.  The growth rate and the positive prediction of needle biopsy of clinically diagnosed Warthin's tumor.

Authors:  Jungirl Seok; Woo-Jin Jeong; Soon-Hyun Ahn; Young Ho Jung
Journal:  Eur Arch Otorhinolaryngol       Date:  2019-06-05       Impact factor: 2.503

6.  Ultrasonography guided fine needle aspiration biopsy of parotid gland masses.

Authors:  Yücel Akbaş; Evrim Unsal Tuna; Alp Demireller; Hasan Ozcan; Cemil Ekinci
Journal:  Kulak Burun Bogaz Ihtis Derg       Date:  2004

7.  Clinical features of cystadenolymphoma (Warthin's tumor) of the parotid gland: a retrospective comparative study of 96 cases.

Authors:  A Teymoortash; Y Krasnewicz; J A Werner
Journal:  Oral Oncol       Date:  2006-02-15       Impact factor: 5.337

8.  Value of the cytological diagnosis in the treatment of parotid tumors.

Authors:  Alice Jafari; Benedicte Royer; Marine Lefevre; Pascal Corlieu; Sophie Périé; Jean Lacau St Guily
Journal:  Otolaryngol Head Neck Surg       Date:  2009-03       Impact factor: 3.497

9.  QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies.

Authors:  Penny F Whiting; Anne W S Rutjes; Marie E Westwood; Susan Mallett; Jonathan J Deeks; Johannes B Reitsma; Mariska M G Leeflang; Jonathan A C Sterne; Patrick M M Bossuyt
Journal:  Ann Intern Med       Date:  2011-10-18       Impact factor: 25.391

10.  Utility of clinical features with fine needle aspiration biopsy for diagnosis of Warthin tumor.

Authors:  Thomas So; Axel Sahovaler; Anthony Nichols; Kevin Fung; John Yoo; Michele M Weir; S Danielle MacNeil
Journal:  J Otolaryngol Head Neck Surg       Date:  2019-08-29