| Literature DB >> 35878374 |
Serenella Papparella1, Maria Ines Crescio2, Valeria Baldassarre1, Barbara Brunetti3, Giovanni P Burrai4,5, Cristiano Cocumelli6, Valeria Grieco7, Selina Iussich8, Lorella Maniscalco8, Francesca Mariotti9, Francesca Millanta10, Orlando Paciello1, Roberta Rasotto11, Mariarita Romanucci12, Alessandra Sfacteria13, Valentina Zappulli14.
Abstract
Histological diagnosis of Canine Mammary Tumours (CMTs) provides the basis for proper treatment and follow-up. Nowadays, its accuracy is poorly understood and variable interpretation of histological criteria leads to a lack of standardisation and impossibility to compare studies. This study aimed to quantify the reproducibility of histological diagnosis and grading in CMTs. A blinded ring test on 36 CMTs was performed by 15 veterinary pathologists with different levels of education, after discussion of critical points on the Davis-Thompson Foundation Classification and providing consensus guidelines. Kappa statistics were used to compare the interobserver variability. The overall concordance rate of diagnostic interpretations of WP on identification of hyperplasia-dysplasia/benign/malignant lesions showed a substantial agreement (average k ranging from 0.66 to 0.82, with a k-combined of 0.76). Instead, outcomes on ICD-O-3.2 morphological code /diagnosis of histotype had only a moderate agreement (average k ranging from 0.44 and 0.64, with a k-combined of 0.54). The results demonstrated that standardised classification and consensus guidelines can produce moderate to substantial agreement; however, further efforts are needed to increase this agreement in distinguishing benign versus malignant lesions and in histological grading.Entities:
Keywords: canine mammary tumours; classification; diagnostic agreement; guidelines; interobserver variability; standardisation
Year: 2022 PMID: 35878374 PMCID: PMC9325225 DOI: 10.3390/vetsci9070357
Source DB: PubMed Journal: Vet Sci ISSN: 2306-7381
Scientific studies dealing with inter- and intraobserver variability in the pathological diagnosis (references are listed in alphabetical order for the topic, separately for humans and animals).
| Study | Topic | Species | No. Pat | No. Cases | RD | Lesion Classification | Classification System | Grading System | Outcome | InterO Agreement | IntraO Agreement |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gilles et al., 2008 [ | brain tumours | H | 5 | 229 | NA | histological criteria | NA | NA | weighted k | 0.65 | NA |
| Eefting et al., 2009 [ | cartilagino-us tumours | H | 18 | 16 | NA | histological grade | NA | Evans (1977) [ | weighted k | 0.58–0.78 | NA |
| Corazza et al., 2007 [ | celiac disease | H | 6 | 60 | NA | histological criteria | Oberhuber (1999) [ | NA | weighted k | 0.35–0.55 | NA |
| Rugge et al., 2021 [ | gastric metaplasia | H | 3 | 74 | NA | histological criteria & IHC | Nagtegaal (2020) [ | NA | weighted k | 0.7–0.9 | 0.7 k |
| Barbosa et al., 2017 [ | gastric polyps | H | 3 | 128 | majority diagnosis | standard diagnosis | Park (2008) [ | NA | unweighted k | 0.40–0.79 | NA |
| Meyer et al., 2005 [ | HBC | H | 7 | 9000 | NA | histological grade & IHC | NA | Elston and Ellis (1991) [ | unweighted k | 0.5–0.59 | NA |
| Longacre et al., 2006 [ | HBC | H | 13 | 35 | NA | standard diagnosis & grade | structured report | Elston and Ellis (1991) [ | unweighted k | 0.3–1 | NA |
| Adams et al., 2009 [ | HBC | H | 5 | 38 | NA | histological grade | NA | Elston and Ellis (1991) [ | unweighted k | 0.32 vs. 0.47 | NA |
| Allison et al., 2014 [ | HBC | H | 3 | 201 | NA | standard diagnosis | NR | NA | % agreement | 62.70% | NA |
| Gomes et al., 2014 [ | HBC | H | 1 | 610 | original report | histological subtypes | Lakhani (2012) [ | NA | unweighted k | 0.22–0.68 | NA |
| Elmore et al., 2015 [ | HBC | H | 115 | 240 | 3 panel members | 4 categories | NR | NA | % agreement | 75.30% | NA |
| Mäkelä et al., 2018 [ | lung fibrosis | H | 4 | 60 | NA | 4 categories | Raghu (2011) [ | NA | unweighted k | 0.4–0.77 | NA |
| Hashisako et al., 2016 [ | lung (interstitial pneumonia) | H | 11 | 20 | NA | histological criteria | Raghu (2011) [ | NA | unweighted k | 0.23 | NA |
| Grilley-Olson et al., 2012 [ | lung tumours | H | 24 | 96 | majority diagnosis | standard diagnosis & categories | Travis (2004) [ | NA | weighted k, bootstrap for IC | 0.25–0.48 | NA |
| Thunnissen et al., 2012 [ | lung tumours | H | 26 (28) | 115 (64) | NA | standard diagnosis (invasion) | Yoshizawa (2011) [ | NA | unweighted k | 0.38–0.77 (0.08–0.55) | NA |
| Nicholson et al., 2018 [ | lung tumours | H | 16 | 126 | NA | histological criteria | Girard (2009) [ | NA | unweighted k | 0.6 | NA |
| Shi et al., 2021 [ | melanocytic neoplasms | H | 3 | 136 | NA | 3 categories | NR | NR | unweighted k | 0.496 | NA |
| Furness et al., 2003 [ | renal allografts | H | 21 | 85 | NA | histological criteria | Racusen (1999) [ | NA | unweighted k | 0.2–0.4 | NA |
| Ganti et al., 2021 [ | rhinosinusitis (chronic) | H | 2 | 92 | NA | histological criteria | Structured report | NA | unweighted k | 0.22–0.64 | NA |
| Hasegawa et al., 2002 [ | soft tissue sarcomas | H | 4 | 130 | expert panel | standard diagnosis, grade & IHC | NR | Hasegawa (2000) [ | % agreement, unweighted k | 75-100%, 0.34–0.86 | NA |
| Denkert et al., 2016 [ | TILs in HBC | H | 32 & 28 | 120 | NA | semiquantitative percentage | web-based & software system | NA | ICC, unweighted k | 0.7 & 0.89, 0.45 & 0.63 | NA |
| Tramm et al., 2018 [ | TILs in HBC | H | 9 | 124 | NA | cutoff categories | Salgado (2015) [ | NA | ICC, unweighted k | 0.71, 0.38–0.46 | NA |
| Kilmartin et al., 2021 [ | TILs in HBC | H | 23 | 49 | NA | absolute n. & cutoff categories | scoring digital tool (https://www.tilsinbreastcancer.org) | NA | ICC | 0.63 & 0.57 | NA |
| Phytian et al., 2016 [ | foot lesions | O | 8 | 1158 | test standard observer | macroscopic criteria | Hodginkson (2010) [ | NA | unweighted k | 0.47–0.72 | NA |
| Lidbury et al., 2017 [ | liver lesions | C | 6 | 50 | NA | scoring system | van den Ingh (2016) [ | NA | unweighted k | 0.16–0.35 | NA |
| Chu et al., 2011 [ | mammary tumours | C | 10 | 15 | NA | benign vs. malignant | NR | NA | unweighted k | 0.43 | NA |
| Santos et al., 2015 [ | mammary tumours | C | 3 | 46 | 2 panel vet members | histological grade | Goldschmidt (2011) [ | Karayannopoulou (2005) [ | weighted & unweighted k | 0.5–0.7 | NA |
| Northrup et al., 2005 [ | mast cell tumours | C | 10 | 60 | previous report | histological grade | NA | Patnaik (1984) [ | % agreement, weighted k | 62.1%, 0.62 | NA |
| Belluco et al., 2019 [ | nervous system tumours | C & F | 4 | 46 | neuropathologist | standard diagnosis & IHC | Higgins (2017) [ | NA | unweighted k | 0.66–0.76 | NA |
| Yap et al., 2016 [ | soft tissue sarcomas | C | 3 | 70 | NA | histological criteria & grade | Dennis (2011) [ | Dennis (2011) [ | ICC, unweighted k | 0.6 & 0.43k | 0.78–1 ICC |
RD, reference diagnosis; Pat, pathologists; H, human; O, ovine; C, canine; F, feline; HBC, human breast cancer, TILs, tumor infiltrating lymphocytes; NA, not applicable; NR, not reported; k, Fleiss’s/Cohen’s kappa; ICC, intraclass correlation coefficient; IntraO/interO, intraobserver/interobserver.
Davis Thomson Foundation classification of canine mammary tumours [7], associated ICD-O-3.2 codes [64], and applied category of lesion. Main categories are indicated in bold and subcategories in italics.
| Lesions | ICD-O-3.2 Codes | Category |
|---|---|---|
|
| ||
| NA | H | |
|
| NA | H |
|
| NA | H |
|
| NA | H |
|
| NA | H |
| NA | H | |
| 8060/0 | H | |
|
| ||
|
| ||
|
| 8211/0 | B |
|
| 8982/0 | B |
|
| ||
|
| 8983/0 | B |
|
| 8940/0 | B |
|
| 9010/0 | B |
|
| ||
|
| 8147/0 * | B |
|
| 8503/0 | B |
|
| ||
|
| not applied | |
|
| ||
|
| 8211/3 | M |
|
| 8263/3 | M |
|
| 8230/3 | M |
|
| 8507/3 | M |
|
| 8501/3 | M |
|
| 8021/3 | M |
|
| ||
|
| 8941/3 * | M |
|
| 8983/3 | M |
|
| 8562/3 | M |
|
| 8940/3 | M |
|
| ||
|
| 8147/3 * | M |
|
| 8503/3 | M |
|
| ||
| 8070/3 * | M | |
| 8560/3 * | M | |
| 8480/3 | M | |
| 8314/3 | M | |
| 8572/3 * | M | |
| 8982/3 * | M | |
|
| ||
| 9180/3 * | M | |
| 9220/3 * | M | |
| 8810/3 * | M | |
| 9120/3 * | M | |
| 8800/3 * | M | |
| 8980/3 * | M | |
|
| ||
| ND | H | |
| ND | H | |
|
| ||
|
| ||
|
| 8147/0 * | B |
|
| 8503/0 | B |
|
| ||
|
| 8147/3 * | M |
|
| 8503/3 | M |
| 8540/3 | M |
NA, not available; H, hyperplasia/dysplasia; B, Benign tumour; M, malignant tumour. * Code assigned also when name of histotype was different but histological description identical between human and canine lesions.
Histological grading for canine mammary tumours [2]. Main categories used for grading are indicated in bold.
| Feature | Points |
|---|---|
|
| |
| Tubules comprise >75% of the tumour | 1 |
| Tubules comprise 10–75% of the tumour (moderate formation of tubules admixed with non-tubular areas) | 2 |
| Tubules comprise <10% (minimal or no tubule formation) | 3 |
|
| |
| Uniform, regular, small nuclei with occasional small nucleoli | 1 |
| Moderate degree of variation in nuclear size and shape, hyperchromatic nucleus, presence of nucleoli (some of which can be prominent) | 2 |
| Marked variation in nuclear size, hyperchromatic nucleus, often with more than 1 prominent nucleoli | 3 |
|
| |
| 0–9/10 hpf | 1 |
| 10–19/10 hpf | 2 |
| 20 or more/10 hpf | 3 |
|
|
|
| I (low, well differentiated) | 3–5 |
| II (intermediate, moderately differentiated) | 6–7 |
| III (high, poorly differentiated) | 8–9 |
a In complex and mixed tumours, the percentage of tubular formation is scored considering only epithelial areas. In malignant myoepithelioma, tubular formation is 2. In heterogeneous canine mammary carcinomas, tubular scoring should be assessed in the most representative malignant area. b In complex and mixed tumours, nuclear pleomorphism is evaluated in all the malignant components. c HPF, high-power field. The fields are selected at the periphery or the most mitotically active parts of the sample (not only epithelial cells). Diameter of the field of view = 0.55 mm.
Characteristics of working panellists at the time of the study.
| Age | Sex | Affiliation | Years of Experience * | Position, Titles (In Addition to DVM) | CMTs Biopsies (per Week) | Self-Assessment of Level of Confidence in CMTs (High/Medium/Low) | Considered an Expert by Colleagues on CMTs | Published Papers on MTs° |
|---|---|---|---|---|---|---|---|---|
| 34 | F | Private and University | 6 | Histopathology Consultant, PhD, ECVP | 15 | medium | NO | 0 |
| 46 | F | University | 15 | AP, PhD, ECVP | 5 | high | YES | 14 |
| 40 | M | University | 7 | AsP, PhD, ECVP | 2 | high | YES | 8 |
| 40 | M | IZS° | 10 | Senior Scientist | 10 | medium | NO | 0 |
| 57 | F | University | 25 | PhD | 3 | medium | NO | 2 |
| 48 | F | University | 20 | AP, MSc^ | 4 | medium | NO | 6 |
| 38 | F | IZS° | 10 | Senior Scientist, PhD | 7 | medium | NO | 6 |
| 54 | F | University | 10 | AsP | 4 | medium | NO | 4 |
| 49 | F | University | 20 | AP | 20 | high | YES | 16 |
| 45 | M | University | 14 | FP, PhD | 5 | medium | NO | 0 |
| 62 | F | University | 25 | FP, PhD | 2 | medium | NO | 0 |
| 38 | F | Private | 14 | Senior Consultant, PhD ECVP | 8 | high | YES | 12 |
| 42 | F | University | 15 | AsP | 2 | medium | NO | 1 |
| 48 | F | University | 17 | AP, PhD | 1 | medium | NO | 2 |
| 46 | F | University | 15 | FP, MSc, PhD, ECVP | 6 | high | YES | 21 |
* Years interpreting mammary gland pathology cases (not including residency/fellowship training) at 1st June 2021. ^: Master of Science.
Figure 1Frequency of results in terms of hyperplasia-dysplasia (H), benign (B), or malignant (M) communicated by individual readers, by sample identification number.
Figure 2Performances of the panellists for outcomes expressed in terms of Hyperplasia-Dysplasia/Benign/Malignant (H, B, M). (a) mean k of each panellist versus all (k_ava); (b) mean k of each panellist versus GM (k_vGM). The points (green) represent the mean of Cohen’s k for each panellist; the vertical bars (orange) represent the 95% confidence interval. The horizontal line (red) represents the value above which the agreement is considered substantial.
Classification of histological subtypes by the 15 panellists (P) for the 36 canine mammary tumour samples included in the study. In bold red the diagnoses that differed from the majority opinion (GM), in grey boxes diagnoses repeated by the same panellist.
| S-ID | P01 | P02 | P03 | P04 | P05 | P06 | P07 | P08 | P09 | P10 | P11 | P12 | P13 | P14 | P15 | GM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | IDPA | IDPA | DC | DC | DC | IDPC | DC | IDPC | IDPC | DC | IDPC | IDPA | IDPC | DC | IDPC | |
| 2 | IMPC | IMPC |
| IMPC | IMPC | IMPC | IMPC | IMPC | IMPC | IMPC | IMPC |
| IMPC | IMPC | IMPC | IMPC |
| 3 | CAD | CAD |
|
|
| CAD | CAD | CAD | CAD | CAD | CAD | CAD |
| CAD | CAD | CAD |
| 4 | MC |
| MC | MC | MC | MC | MC |
| MC | MC |
|
| MC |
| MC | MC |
| 5 |
| STPC | STPC | STPC | STPC |
|
| STPC |
|
| STPC | STPC | STPC |
|
| STPC |
| 6 |
| STC | STC | STC | STC |
|
|
| STC |
| STC |
| STC |
| STC | STC |
| 7 |
| IDPA |
| IDPA | IDPA |
|
|
| IDPA | IDPA | IDPA |
| IDPA |
| IDPA | IDPA |
| 8 | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M | Skin M |
| 9 | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE | DE |
| 10 |
|
| SAD |
|
|
|
| SAD |
| SAD | SAD | SAD |
|
|
| SAD |
| 11 | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC | CoC |
| 12 | EP | EP |
| EP | EP | EP | EP | EP |
| EP | EP | EP | EP | EP |
| EP |
| 13 |
| TH |
| TH | TH | TH | TH |
| TH | TH | TH | TH |
| TH | TH | TH |
| 14 | LRC |
| LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC | LRC |
| 15 | DAD | DAD | DAD | DAD | DAD | DAD | DAD |
| DAD | DAD | DAD | DAD |
|
| DAD | DAD |
| 16 |
|
| SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC | SCC |
| 17 | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC | MuC |
| 18 |
| LH-F |
| LH-F | LH-F |
| V |
|
| LH-F | LH-F | LH-F | LH-F |
|
| LH-F |
| 19 | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP | MEP |
| 20 |
|
| Other S |
|
| Other S | Other S | Other S |
| Other S | Other S | Other S |
|
|
| Other S |
| 21 | AC | AC |
| AC | AC | AC | AC | AC | AC |
| AC | AC | AC |
| AC | AC |
| 22 | LH-R | LH-R | LH-R | LH-R | LH-R | LH-R |
|
|
| LH-R | LH-R | LH-R | LH-R | LH-R |
| LH-R |
| 23 | PAP | PAP | PAP | PAP | PAP | PAP | PAP | PAP | PAP | PAP | PAP | PAP |
| PAP | PAP | PAP |
| 24 |
| MC | MC |
|
|
|
| MC | MC |
| MC | MC |
| MC | MC | MC |
| 25 |
| OS | OS | OS | OS |
| OS | OS | OS | OS | OS | OS |
| OS | OS | OS |
| 26 |
| FAD | FAD | FAD | FAD |
| FAD | FAD |
|
| FAD | FAD |
|
|
| FAD |
| 27 |
| LRC | LRC | LRC | LRC | LRC | LRC |
| LRC |
| LRC | LRC |
| LRC | LRC | LRC |
| 28 |
| DC |
|
|
|
| DC |
| DC |
| DC | DC | DC | DC | DC | DC |
| 29 |
|
|
|
|
|
| LH-A | LH-A | LH-A |
| LH-A | LH-A |
|
| LH-A | LH-A |
| 30 |
|
|
|
|
| C&MM | C&MM | C&MM |
|
| C&MM | C&MM |
|
|
| C&MM |
| 31 |
| C&MM | C&MM | C&MM | C&MM |
|
|
|
| C&MM |
|
|
| C&MM |
| C&MM |
| 32 | BMT | BMT |
| BMT | BMT | BMT | BMT | BMT |
| BMT | BMT | BMT | BMT | BMT |
| BMT |
| 33 | ASC |
| ASC | ASC | ASC | ASC |
| ASC | ASC |
| ASC | ASC | ASC |
| ASC | ASC |
| 34 | CS | CS | CS | CS | CS | CS | CS | CS | CS | CS | CS | CS |
| CS | CS | CS |
| 35 | LH-S | LH-S | LH-S | LH-S | LH-S | LH-S |
| LH-S |
| LH-S | LH-S | LH-S |
| LH-S |
| LH-S |
| 36 | SCC | SoC | AC | HS | HS | HS | SoC | SoC | AC | HS | SoC | SoC | HS | AC | AC |
S-ID, sample identification; see Table 2 for other abbreviations.
Figure 3Performances of the panellists for outcomes expressed in terms of ICD-O morphological code/diagnosis. (a) mean k of each panellist versus all (k_ava); (b) mean k of each panellist versus GM (k_vGM). The points (green) represent the mean of Cohen’s k for each panellist; the vertical bars (orange) represent the 95% confidence interval. The horizontal line (red) represents the value above which the agreement is considered substantial.
Figure 4Left, dendrogram resulting after a cluster analysis on the variables synthesising the panellists’ experience; right, dendrogram resulting after a cluster analysis on the classification of lesions as hyperplasia, benign, malignant.