Literature DB >> 24843220

The Pooling-score (P-score): inter- and intra-rater reliability in endoscopic assessment of the severity of dysphagia.

D Farneti1, B Fattori2, A Nacci2, V Mancini2, M Simonelli3, G Ruoppolo4, E Genovese5.   

Abstract

This study evaluated the intra- and inter-rater reliability of the Pooling score (P-score) in clinical endoscopic evaluation of severity of swallowing disorder, considering excess residue in the pharynx and larynx. The score (minimum 4 - maximum 11) is obtained by the sum of the scores given to the site of the bolus, the amount and ability to control residue/bolus pooling, the latter assessed on the basis of cough, raclage, number of dry voluntary or reflex swallowing acts (< 2, 2-5, > 5). Four judges evaluated 30 short films of pharyngeal transit of 10 solid (1/4 of a cracker), 11 creamy (1 tablespoon of jam) and 9 liquid (1 tablespoon of 5 cc of water coloured with methlyene blue, 1 ml in 100 ml) boluses in 23 subjects (10 M/13 F, age from 31 to 76 yrs, mean age 58.56±11.76 years) with different pathologies. The films were randomly distributed on two CDs, which differed in terms of the sequence of the films, and were given to judges (after an explanatory session) at time 0, 24 hours later (time 1) and after 7 days (time 2). The inter- and intra-rater reliability of the P-score was calculated using the intra-class correlation coefficient (ICC; 3,k). The possibility that consistency of boluses could affect the scoring of the films was considered. The ICC for site, amount, management and the P-score total was found to be, respectively, 0.999, 0.997, 1.00 and 0.999. Clinical evaluation of a criterion of severity of a swallowing disorder remains a crucial point in the management of patients with pathologies that predispose to complications. The P-score, derived from static and dynamic parameters, yielded a very high correlation among the scores attributed by the four judges during observations carried out at different times. Bolus consistencies did not affect the outcome of the test: the analysis of variance, performed to verify if the scores attributed by the four judges to the parameters selected, might be influenced by the different consistencies of the boluses, was not significant. These initial data validate the clinical use of the P-score in the management of patients with deglutition disorders by a multidisciplinary team.

Entities:  

Keywords:  Aspiration; Deglutition disorders; Diagnosis; Dysphagia; FEES; Pooling; Residue; Severity

Mesh:

Year:  2014        PMID: 24843220      PMCID: PMC4025184     

Source DB:  PubMed          Journal:  Acta Otorhinolaryngol Ital        ISSN: 0392-100X            Impact factor:   2.124


Introduction

In patients with dysphagia due to different pathologies, predisposing to complications, the clinical severity of a swallowing disorder needs to be established . Respiratory complications, related to false paths, taken by bolus or pooling of secretions, can arise in the short or long term . Clinical non-instrumental assessment (clinical swallowing evaluation – CSE) alone can fail to identify silent aspirations or micro-aspirations. In at-risk conditions for the patient, and to define physiopathology of the swallowing disorder, instrumental assessment is essential in order to plan treatment. Currently, there is no instrumental technique that clearly defines the risk of false pathways: video- fluoroscopic study of swallowing (VFSS) and fiberoptic endoscopic evaluation of swallowing (FEES) both yield false positives and false negatives . Choosing the instrumental method that best responds to clinical questions and the expertise of the clinician carrying out the procedure become crucial factors in clinical management of patients with deglutition disorders. Endoscopic assessment provides an exhaustive definition of anatomical details of pharyngeal and laryngeal surfaces, including secretions or bolus residues that, in usual or unusual conditions, moisten the walls (coating) or occupy containment cavities (pooling, residue). Such a evaluation is considered a clinical criterion for severity that can be linked to the risk of respiratory complications . Although to date a standardised grading of pharyngeal residue is not available, this criterion is taken into account when planning treatment and management activities by a multidisciplinary team . In previous publications -, pooling of materials is considered in the broadest sense as any material that is present in the containment cavities of the hypopharynx and larynx, before and/or after the act of swallowing. The severity criterion proposed (Pooling score, P-score) (Table I) takes into account different parameters: 1) site: identified by anatomical landmarks; 2) amount: determined in a semi-quantitative fashion by the amount of pooling materials (coating, more or less than 50% of cavity containment capacity); 3) management: the ability of the patient to clear the residue. The score refers to a specific type of consistency and volume of the bolus, changing according to these parameters. The score considers the most severe condition for each parameter, reached after a sequence of boluses of the same consistency (in our department a sequence of at least 3 boluses for each consistency).
Table I.

P-score and P-SCA score.

PoolingEndoscopic landmarksBedside parameters
SensationCollaborationAge (years)
SiteVallecula Marginal zone Pyriform sinus Vestibule/vocal cords Lower vocal cords1 1 2 3 4Presence = -1 Absence = +1Presence = -1 Absence = +1+1 (< 65) +2 (65-75) +3 (> 75)
AmountCoating Minimum Maximum1 2 3
Management< 2 2-5 > 52 3 4
ScoreP 4-11P-SCA 3-16
P-score and P-SCA score. In clinical practice, the P-score may be integrated with other parameters of clinical assessment (CSE) that are more easily determined: sensation of the pharynx, patient collaboration and age (P-SCA score). Both the scores express, as a numerical value, a continuum of severity that may be used in different ways, with correlations that still have to be verified. Therefore, a minimum score (P-score 4-5) may indicate the absence of endoscopic signs of dysphagia. A low score (P-score 6-7) may identify mild dysphagia, a medium score (P-score 8-9) moderate dysphagia and a high score (P-score 10-11) severe dysphagia. This investigation considers the intra- and inter-rater reliability of the P-score among 4 judges with long-standing experience in the use of endoscopy. CSE parameters and the P-SCA score were not considered.

Materials and methods

Transits of 23 consecutive outpatients aged between 31 and 76 years (average age 58.56 ± 11.76), and referred to our department complaining of difficulties in swallowing, were enrolled. Sample characteristics and diseases are reported in Table II. Four judges with expertise in the FEES procedure were elected. Three judges have at least 10 years of experience, and one has 4 years of experience: they all routinely carry out at least 5 endoscopic assessments of swallowing per week.
Table II.

Patient characteristics and diseases.

SubjectGenderAgePathology
1 PSF47Globus
2 AAF68Cortical ictus sequelae
3 GAM65GERD
4 BAF62COPD
5 CLF56Dermatomyositis
6 RRM67Cortical ictus sequelae
7 MM1F64Laryngeal paralysis
8 XLF38GERD
9 TSF64Neurological degenerative
10 MM2M63Cortical ictus sequelae
11 QGM42Corea major
12 DAF71Myasthenia
13 MEM52H-N operated
14 DMF48Cortical ictus sequelae
15 MEF46Laryngeal paralysis
16 BGM62Sjögren's syndrome
17 CCF76Wallemberg sequaele
18 BFF73Laryngeal paralysis
19 CRM51Laryngeal paralysis
20 MPM71COPD
21 RFF67H-N operated
22 CLM31Neurologial degenerative
23 SRM63Cortical ictus sequelae
Patient characteristics and diseases. For some of the 23 subjects, the pharyngeal transit films of more than one bolus of a different consistency was recorded to obtain a total number of 30 films: 10 solid boluses (1/4 of a cracker), 11 creamy boluses (1 spoonful of jam) and 9 liquid boluses (1 tablespoon with 5 cc of water dyed with methylene blue, 1 cc in 100 ml, according to the procedure used in our department). The films are short sequences of acts of swallowing that are prolonged until the bolus has been completely swallowed, or at least 5 swallows. Film length varies from a few dozen seconds to no more than 180 seconds, so the complete session lasted no more than an hour, avoiding fatigue and a decrease in concentration among judges . All the films were collected on two CD copies, respectively CD0 and CD1, that differed with respect to the sequence of the films. Before the session, the rationale and application of the score were explained to the judges. Each judge was given a CD-test containing 4 pharyngeal transits of boluses with different consistencies, in subjects who differed by the severity of dysphagia. The application of the P-score was explained personally by the first judge at an explanatory session lasting 30 min. Judges were asked to watch CD0 immediately thereafter (time 0), CD1 at 24 hours later (time 1) and CD0 after 7 days (time 2) in a different order than at time 0. Data were collected on a predefined sheet, reporting separate findings for each parameter: pooling site, amount, management and total P-score. The different bolus consistencies used in our study between the pharyngeal transit films tested is a bias -. For this reason, we calculated the variance of scores attributed by judges to the parameters selected. Finally, the score can also be applied at the beginning of the observation . In the films considered, material pooling at the beginning of the observation was present in only a few cases, so it was decided not to apply the score at that time, but only after the transit of the bolus through the pharynx. The analysis of data obtained was carried out using SAS statistical software and the inter- and intra-rater reliability was calculated with the intra-class correlation coefficient (ICC; 3,k) .

Results

The scores attributed by judges to the site, amount and management of the pooled material are shown in Tables IIIa, IIIb and IIIc. The site, apparently simpler, was modified by both the first and fourth judges over time, reaching agreement at the last observation [ICC (3, k) 0.999]. The parameter amount was changed many times during observations, with differences maintained among judges [ICC (3, k) 0.997]. The parameter management was replicated by all judges in all three observations [ICC (3, k) 1.000]. The total P-score attributed by the four judges confirmed that the criterion of severity established by the first judge was essentially shared by the other three judges [ICC (3, k) 0.999] (Table IV).
Table IIIa.

Site (anatomical landmarks): descriptive analysis.

Time JUDGE
1234
NNNN
0116151515
24555
35555
45555
1115151515
25555
35556
45554
2115151515
25555
35555
45555

P-score: site

ICC(3,k) 0.999

Time: time of observation

N: number of observation

Table IIIb.

Amount: descriptive analysis.

Time JUDGE
1234
NNNN
0118191918
28778
34444
1118191918
27778
35444
2119191919
26777
35444

P-score: amount

ICC(3,k) 0.997

Time: time of observation

N: number of observation

Table IIIc.

Management: descriptive analysis.

Time JUDGE
1234
NNNN
028888
35555
417171717
128888
35555
417171717
228888
35555
417171717

P-score: management

ICC(3,k) 1.000

Time: time of observation

N: number of observation

Table IV.

Severity criteria: descriptive analysis.

Time JUDGE
1234
NNNN
048888
53333
77776
84445
95555
102222
111111
148888
53333
76777
84444
96555
102222
111111
248888
53333
76777
85444
95555
102222
111111

P-score: total

ICC(3,k) 0.999

Time: time of observation

N: number of observations

Site (anatomical landmarks): descriptive analysis. P-score: site ICC(3,k) 0.999 Time: time of observation N: number of observation Amount: descriptive analysis. P-score: amount ICC(3,k) 0.997 Time: time of observation N: number of observation Management: descriptive analysis. P-score: management ICC(3,k) 1.000 Time: time of observation N: number of observation Severity criteria: descriptive analysis. P-score: total ICC(3,k) 0.999 Time: time of observation N: number of observations The analysis of variance, performed to verify if the scores attributed by the four judges to the parameters selected, might have been influenced by the different consistency of the boluses, was not significant (interaction consistency*judge = 1.000 at times 0, 1, 2, respectively, for the three parameters of the score and P-score total). Thus, the different types of consistencies did not influence the scores attributed by judges to the 30 films.

Discussion

The evaluation of secretions or bolus pooling in the pharynx end/or larynx represents an important step in the endoscopic examination of swallowing, as it is closely correlated with respiratory complications . As previously mentioned, to date, standardised grading for pharyngeal material pooling is not available: the P-score could be used in this clinical context. From a physiopathological point of view, a swallowing disorder is the result of an imbalance between events that occur in the domain of space and time , domains in which vector forces guarantee the efficiency of defensive strategies, which protect the airways, or clear the containment cavities of the bolus passed through them . These events and forces can interact in different ways. In the domain of space, where forces are acting, the Pscore identifies the pathway and flow of the bolus: the pathway is identified by the direction along the digestive or respiratory tracts, as well as false route (penetration or aspiration); the flow is indicated by the amount of bolus that does not cross the pharynx while swallowing. This amount (expressed in a semi-quantitative manner by the score) represents the difference between the total amount of the bolus taken and the amount of bolus swallowed. In the domain of time, the score identifies events that occur before or after swallowing, considering that material pooling after a previous swallow becomes a bolus for the next swallow, with a volume that is either increased or decreased. Thus, the subsequent swallowing act can clear the residue or push it below into the airway . The P-score considers the sequence of swallowing acts in the "management" parameter, evaluating the fate and final amount of a bolus that persists in the pharynx/larynx after 5 empty swallows, and also gives information about the reaction of the patient to material pooling or to airway invasion. The occurrence, or absence, of dry swallowing, cough or throat clearing, in response to the residue or penetration/ aspiration before, during or after swallowing, express the interaction between vectors and volumes. In this way, the number of dry swallows or clearing activities, related to the final amount of material pooling, can be assumed to be a parameter of efficiency of the entire sequence, closely linked to fatigue of muscular swallowing effectors. These preliminary considerations can help us in the interpretation of the score attributed by the judges. The anatomical parameter (Table IIIa: site) was well identified by the 4 judges: disagreement may have been influenced by the amount and site of the residue, considering that the most severe condition has to be selected for scoring (the residue in a lower anatomical site may have been chosen with respect to a larger volume, both parameters indicating greater severity, or vice versa). In this case, site and amount are closely linked: in fact, "amount" (Table IIIb) created greater difficulties for the judges. It is worth remembering that the measurement of this parameter is semi-quantitative, and the four judges attributed a diversity of scores, which in some cases were modified during the three observations, but in other cases remained unchanged. A greater variability was seen when making a distinction between scores 2 and 3, while "coating" created fewer difficulties in score attribution. The third parameter (management) considers the effectiveness of the manoeuvre carried out to clear the residue, regardless of whether it was spontaneous or performed upon request, and in many cases it solves the dilemma between site and volume mentioned above. This parameter (Table IIIc) was easily evaluated by all judges and in all observations, with no discrepancies. If we consider the total score (from 4 to 11), which marks the continuum of clinical severity (Table IV), it can be observed that extreme scores were replicated by the four judges in all observations, with good agreement. This was easy to identify, and maintain judgement, for subjects without pooling or patients with bulky residues. Differences emerged in the medium classes of severity where the attribution of a different partial score may affect the attribution of a different class of clinical severity. In our work, these differences, sometimes replicated at the third observation, never reached statistical significance and can be assumed to be casual.

Conclusions

One of the most difficult tasks for clinicians and management team is to determine the severity of the swallowing disorder. This criterion very often refers to the risk for the lower airways to be invaded by the bolus or by material pooling into pharyngeal or laryngeal cavities. The instrumental criterion of severity (endoscopic or radiological) needs to be contextualised according to a more general clinical criterion of severity, which should make reference to the patient, considering that the non-instrumental assessment (CSE) tends to underestimate the risk of aspiration, whereas instrumental assessment tends to overestimate it . "Severity", in this case, becomes a relative criterion, which is quantifiable by the parameters that define it. In clinical practice, aspiration is the most significant event that marks a swallowing disorder, although it is not the only one. Endoscopic and radiological evaluations are complementary techniques , even if the former offers an optimal view of pharyngeal and laryngeal cavities, and particularly of material pooling . Parameters related to instrumental severity, devised for radiological evaluation, might not be the best for application in the clinical endoscopic field, though they maintain their effectiveness in terms of interand intra-rater reliability . Nonetheless, scores that can be easily applied in clinical practice are needed. There are several endoscopic scores reported in the literature, with severity criteria divided into 3, 4 or 5 levels. This division does not seem to interfere with the inter- and intra-rater reliability of a score . The P-score, which considers anatomical and functional parameters, evaluates the interaction between volumetric, vectorial and temporal events, disengaging the criterion of severity from the quantitative parameter alone. The high inter- and intra-rater reliability of the P-score was verified for anatomical, semi-quantitative and temporal parameters. Variability among judges, which was not statistically significant, was expressed with respect to the "site" and "amount", whereas "management" seems more easily determinable. Despite the high reliability of the P-score shown herein, one critical point could be represented by the unequal subdivision of clinical severity in the sample, taken from consecutive outpatients seen in our department. The patients with the most severe clinical conditions were not able to manage all consistencies or volumes tested by those less severely affected. In the former patients, for example, it was possible to administer only a few small amounts of the bolus with a creamy consistency. Nevertheless, the statistical analysis verifies the reliability of the judges in attributing a score for each parameter of the P-score, regardless of severity, which derives from the sum of the three parameters that determine it. This should not affect the clinical use of the score, which, in contrast, seems to be able to identify patients with small differences in severity. Further research is being undertaken to check the effectiveness of the P-score in clinical management of patients with swallowing disorders that have a different aetiology, as well as its usefulness in indicating variations after specific treatment.
  22 in total

1.  Interjudge and intrajudge reliabilities in fiberoptic endoscopic evaluation of swallowing (fees) using the penetration-aspiration scale: a replication study.

Authors:  Nancy Colodny
Journal:  Dysphagia       Date:  2002       Impact factor: 3.438

2.  Aspiration: the predictive value of some clinical and endoscopy signs. Evaluation of our case series.

Authors:  D Farneti; P Consolmagno
Journal:  Acta Otorhinolaryngol Ital       Date:  2005-02       Impact factor: 2.124

3.  Videofluoroscopic assessment of patients with dysphagia: pharyngeal retention is a predictive factor for aspiration.

Authors:  Edith Eisenhuber; Wolfgang Schima; Ewald Schober; Peter Pokieser; Alfred Stadler; Martina Scharitzer; Elisabeth Oschatz
Journal:  AJR Am J Roentgenol       Date:  2002-02       Impact factor: 3.959

4.  The significance of accumulated oropharyngeal secretions and swallowing frequency in predicting aspiration.

Authors:  J Murray; S E Langmore; S Ginsberg; A Dostie
Journal:  Dysphagia       Date:  1996       Impact factor: 3.438

5.  Fiberoptic endoscopic examination of swallowing safety: a new procedure.

Authors:  S E Langmore; K Schatz; N Olsen
Journal:  Dysphagia       Date:  1988       Impact factor: 3.438

Review 6.  New concepts in dysphagia management.

Authors:  M Mendelsohn
Journal:  J Otolaryngol       Date:  1993-02

7.  Predictors of aspiration pneumonia: how important is dysphagia?

Authors:  S E Langmore; M S Terpenning; A Schork; Y Chen; J T Murray; D Lopatin; W J Loesche
Journal:  Dysphagia       Date:  1998       Impact factor: 3.438

8.  Aspiration risk after acute stroke: comparison of clinical examination and fiberoptic endoscopic evaluation of swallowing.

Authors:  Steven B Leder; Julian F Espinosa
Journal:  Dysphagia       Date:  2002       Impact factor: 3.438

9.  Effect of swallowed bolus variables on oral and pharyngeal phases of swallowing.

Authors:  R O Dantas; M K Kern; B T Massey; W J Dodds; P J Kahrilas; J G Brasseur; I J Cook; I M Lang
Journal:  Am J Physiol       Date:  1990-05

10.  Endoscopic and videofluoroscopic evaluations of swallowing and aspiration.

Authors:  S E Langmore; K Schatz; N Olson
Journal:  Ann Otol Rhinol Laryngol       Date:  1991-08       Impact factor: 1.547

View more
  18 in total

Review 1.  History of Fiberoptic Endoscopic Evaluation of Swallowing for Evaluation and Management of Pharyngeal Dysphagia: Changes over the Years.

Authors:  Susan E Langmore
Journal:  Dysphagia       Date:  2017-01-18       Impact factor: 3.438

2.  Psychometric Properties of Visuoperceptual Measures of Videofluoroscopic and Fibre-Endoscopic Evaluations of Swallowing: A Systematic Review.

Authors:  Katina Swan; Reinie Cordier; Ted Brown; Renée Speyer
Journal:  Dysphagia       Date:  2018-07-17       Impact factor: 3.438

3.  Relationship between swallowing-related quality of life and fiberoptic endoscopic evaluation of swallowing in patients who underwent open partial horizontal laryngectomy.

Authors:  Nicole Pizzorni; Antonio Schindler; Marco Fantini; Andy Bertolin; Giuseppe Rizzotto; Federico Ambrogi; Giovanni Succo; Erika Crosetti
Journal:  Eur Arch Otorhinolaryngol       Date:  2018-02-08       Impact factor: 2.503

4.  Reliability of Untrained and Experienced Raters on FEES: Rating Overall Residue is a Simple Task.

Authors:  Jessica M Pisegna; James C Borders; Asako Kaneoka; Wendy J Coster; Rebecca Leonard; Susan E Langmore
Journal:  Dysphagia       Date:  2018-03-07       Impact factor: 3.438

Review 5.  Evaluating the safety of oral methylene blue during swallowing assessment: a systematic review.

Authors:  Bina Tariq; Sorina R Simon; Walmari Pilz; Andra Maxim; Bernd Kremer; Laura W J Baijens
Journal:  Eur Arch Otorhinolaryngol       Date:  2021-01-02       Impact factor: 2.503

6.  Pattern of dysphagia after swallowing-sparing intensity-modulated radiotherapy (IMRT) of head and neck cancers: results of a mono-institutional prospective study.

Authors:  Stefano Ursino; Paola Cocuzza; Veronica Seccia; Durim Delishaj; Agostino Cristaudo; Francesco Pasqualetti; Patrizia Giusti; Stefania Santopadre; Riccardo Morganti; Francesco Fiorica; Fabiola Paiar; Bruno Fattori
Journal:  Strahlenther Onkol       Date:  2018-07-09       Impact factor: 3.621

7.  Home-based intensive treatment of chronic radiation-associated dysphagia in head and neck cancer survivors (HIT-CRAD trial).

Authors:  Hanne Massonet; Ann Goeleven; Leen Van den Steen; Alice Vergauwen; Margot Baudelet; Gilles Van Haesendonck; Olivier Vanderveken; Heleen Bollen; Lisette van der Molen; Fréderic Duprez; Peter Tomassen; Sandra Nuyts; Gwen Van Nuffelen
Journal:  Trials       Date:  2022-10-22       Impact factor: 2.728

8.  Study protocol for a randomized controlled trial: tongue strengthening exercises in head and neck cancer patients, does exercise load matter?

Authors:  Gwen Van Nuffelen; Leen Van den Steen; Olivier Vanderveken; Pol Specenier; Carl Van Laer; Diane Van Rompaey; Cindy Guns; Steven Mariën; Marc Peeters; Paul Van de Heyning; Jan Vanderwegen; Marc De Bodt
Journal:  Trials       Date:  2015-09-04       Impact factor: 2.279

9.  Observers' Agreement on Measurements in Fiberoptic Endoscopic Evaluation of Swallowing.

Authors:  Walmari Pilz; Sophie Vanbelle; Bernd Kremer; Michel R van Hooren; Tine van Becelaere; Nel Roodenburg; Laura W J Baijens
Journal:  Dysphagia       Date:  2016-01-23       Impact factor: 3.438

10.  Laryngeal Sensitivity in Patients with Amyotrophic Lateral Sclerosis.

Authors:  Giovanni Ruoppolo; Emanuela Onesti; Maria Cristina Gori; Ilenia Schettino; Vittorio Frasca; Antonella Biasiotta; Carla Giordano; Marco Ceccanti; Chiara Cambieri; Antonio Greco; Costantino Eugenio Buonopane; Giorgio Cruccu; Marco De Vincentiis; Maurizio Inghilleri
Journal:  Front Neurol       Date:  2016-11-28       Impact factor: 4.003

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.