Literature DB >> 34816131

Use of technology for the objective evaluation of scratching behavior: A systematic review.

Albert F Yang1,2, Morgan Nguyen3, Alvin W Li2, Brad Lee3, Keum San Chun4, Ellen Wu3, Anna B Fishbein5, Amy S Paller2,6,7, Shuai Xu2,7,8,9.   

Abstract

INTRODUCTION: Pruritus is a common symptom across various dermatologic conditions, with a negative impact on quality of life. Devices to quantify itch objectively primarily use scratch as a proxy. This review compares and evaluates the performance of technologies aimed at objectively measuring scratch behavior.
METHODS: Articles identified from literature searches performed in October 2020 were reviewed and those that did not report a primary statistical performance measure (eg, sensitivity, specificity) were excluded. The articles were independently reviewed by 2 authors.
RESULTS: The literature search resulted in 6231 articles, of which 24 met eligibility criteria. Studies were categorized by technology, with actigraphy being the most studied (n = 21). Wrist actigraphy's performance is poorer in pruritic patients and inherently limited in finger-dominant scratch detection. It has moderate correlations with objective measures (Eczema and Area Severity Index/Investigator's Global Assessment: rs(ρ) = 0.70-0.76), but correlations with subjective measures are poor (r2 = 0.06, rs(ρ) = 0.18-0.40 for itch measured using a visual analog scale). This may be due to varied subjective perception of itch or actigraphy's underestimation of scratch.
CONCLUSION: Actigraphy's large variability in performance and limited understanding of its specificity for scratch merits larger studies looking at validation of data analysis algorithms and device performance, particularly within target patient populations.
© 2021 by the American Academy of Dermatology, Inc. Published by Elsevier Inc.

Entities:  

Keywords:  AD, Atopic dermatitis; PPV, Positive predictive value; RMSE, Root mean square error; TST%, total scratching time percentage; VAS, Visual analog scale; algorithm; atopic dermatitis; disease management; drug development; eczema; general dermatology; itch; machine learning; pediatric dermatology; pruritus; technology

Year:  2021        PMID: 34816131      PMCID: PMC8593746          DOI: 10.1016/j.jdin.2021.06.005

Source DB:  PubMed          Journal:  JAAD Int        ISSN: 2666-3287


We assessed ways to quantify itch by measuring scratching behavior via various technological modalities (eg, actigraphy, smartwatch applications, acoustic sensors). The overall performance of current objective tools for quantifying itch suffers from low accuracy and variable performance. Further development will allow for more-objective evaluation of disease management and treatment.

Introduction

Pruritus is a common symptom of systemic and dermatologic disorders, and scratching is the innate reflex. The itch-scratch cycle is a hallmark symptom of atopic dermatitis (AD) and perpetuates skin barrier dysfunction. Notably more severe during sleep, itch in AD has been shown to impact sleep quality.2, 3, 4, 5 Historically, itch has been assessed subjectively through visual analog scales (VAS) and numeric rating scales. However, these measures often do not correlate to visually observed scratch, especially in children.7, 8, 9 More recently, studies have explored device-driven methods to objectively measure scratch as a proxy for itch. Actigraphy is the most commonly tested method and entails the use of accelerometers to monitor wrist movements, a proxy for scratching. Other technologies include acoustic devices,, strain gauges,, pressure sensors,, and vibratory sensors.,,15, 16, 17, 18 The commonly accepted gold standard is video recording of scratching with manual coding by an observer, which is time-consuming and impractical in clinical settings.19, 20, 21 The purpose of this systematic review is to assess the performance and algorithms of technological methods currently available to evaluate scratching behavior objectively.

Methods

Search strategy

We queried PubMed, MEDLINE, Embase (Elsevier), Cochrane Library and Cochrane Central Register of Controlled Trials (CENTRAL), Scopus (Elsevier), Web of Science (Clarivate Analytics), and IEEE Xplore Digital Library in October 2020 without limits on publication date. The search strategy is fully detailed in the Supplemental Materials under “Search Strategy” (available via Mendeley at https://data.mendeley.com/datasets/ryg97c26t6/2).

Study selection

Eligibility assessment was performed independently by 2 authors. Included articles must feature critical assessment of a technology designed to measure itch objectively and report at least 1 of the primary outcomes described below. Exclusion criteria included studies of nonhuman subjects, articles without original data, and studies describing technology without assessing its performance.

Quality assessment

Study quality was assessed using a rating scheme (1-5), which was modified from the Oxford Centre for Evidence-Based Medicine for rating levels of evidence. The individual studies assessed are described in Tables I and II and assessment was performed by at least 2 authors.
Table I

Summary table for studies exploring wrist actigraphs and smartwatch applications

Device typesStudySample size and populationStudy focusVideo recording? (Yes/No)SensitivitySpecificityCorrelationAccuracyStudy quality (1-5)
Actigraphy Feuerstein23Healthy adults (n = 12)Testing k-means cluster algorithmNo0.90 ± 0.100.98 ± 0.05 (walking)0.88 ± 0.06 (restlessness)0.92 (scratch)0.92 (walking)0.97 (restless sleep)3
 Petersen29Healthy adults (n = 12)Testing logistic regression algorithmYes0.96 (all data)0.96 (cross-validation, mean)0.92 (all data)0.92 (cross-validation, mean)3
 Almazan28Healthy adults (n = 3), AD adults (n = 9)Testing BRNN algorithmYesrs (ρ) = 0.96 (actigraphy and video scoring)rs (ρ) = 0.90(number of scratching events at home and polysomnography)3
 Moreau24Healthy adults (n = 6), AD adults (n = 18)Testing BRNN algorithm compared to logistic regressionYesAD:0.45-0.91 (BRNN)0.00-0.10 (logistic regression)Healthy:0.00-0.75 (BRNN)0.00-0.50 (logistic regression)Total:0.66 (BRNN)0.06 (logistic regression)r2 = 0.98rs (ρ) = 0.95(BRNN and video recording)F1 scores:AD:0.27-0.90 (BRNN)0.00-0.14 (logistic regression)Healthy:0.00-0.29 (BRNN)0.00-0.08 (logistic regression)Total:0.68 (BRNN)0.09 (logistic regression)
 Kurihara12Healthy adults (n = 10)Actigraphy vs video recording and other devices for TST% calculationYesRMSE = 5.32%-8.12%2
 Murray8Study 1: healthy subjects (n = 24; 12 adults, 12 children), pruritic subjects (n = 118; 68 adults, 50 children)Study 2: AD adults (n = 20)Actigraphy vs VAS itchNoStudy 1: r2 = 0.06Study 2: r2 = 0.083
 Shino37Healthy adults (n = 1)Actigraphy vs video recording and other devices for TST% extraction via novel algorithmYesRMSE = 0.83s (0.64s)TST% error = +5.02% (+4.33%)(parentheses are from visually scoring outputs)3
 Wootton33AD children (n = 336)Actigraphy vs AD severity (SASSAD, POEM)Nors (ρ):SASSAD = 0.15 (P = .02)POEM = .10 (P = .13)3
 Hon30AD children (n = 24 for subjective surveys, n = 20 chemokines)Actigraphy vs SCORAD scores and AD-associated chemokinesNors (ρ):Total SCORAD = 0.52Objective SCORAD = 0.52SCORAD pruritus = 0.23SCORAD sleep loss = 0.36CTACK = 0.56§MDC = 0.63TARC = 0.543
 Hon31AD children (n = 28)Actigraphy vs BDNF and substance PNors (ρ):BDNF = 0.83-0.91Substance P = .83-.873
 Fujita27AD adults (n = 15)Actigraphy vs SCORAD, VAS itch, serum cytokinesNors (ρ):VAS daytime itch = 0.58SCORAD = 0.54TARC = 0.51LDH = 0.653
 Bender43Healthy adults (n = 14), AD adults (n = 14)Actigraphic sleep measures vs VAS itchNors (ρ):WASO = 0.35Sleep efficiency = 0.38Average sleep = 0.463
 Benjamin21Healthy children (n = 7), AD children (n = 14)Video recording (sleep time, scratch time, restlessness) vs actigraphy and VAS itchYesrs (ρ):Actigraphy, all > 0.92VAS itch = 0.16-0.30 (P > .05)3
 Bringhurst26Pruritic subjects (n = 33 adults, n = 25 children), healthy subjects (n = 30 adults, n = 17 children)Actigraphy vs subjective scores (VAS sleep, VAS itch, VAS skin disease), and SCORADNors (ρ):Children:VAS sleep = 0.48VAS itch = 0.40VAS skin disease = 0.49SCORAD = 0.62Adults:VAS sleep = −0.44VAS itch = 0.18VAS skin disease = 0.15SCORAD = 0.533
 Ebata25Healthy adults (n = 5), AD adults (n = 29)Actigraphy vs video recording in TST% calculationYesrs (ρ) = 0.913
 Sandoval32AD adults (n = 10)Actigraphic WASO vs IGA and EASI at baseline and after 5-day fluocinonide 0.1% creamNors (ρ):baseline EASI = 0.75baseline IGA = 0.76end treatment EASI = 0.70end treatment IGA = 0.733
 Kaburagi16Healthy adults (n = 12)TST% estimation algorithm for various devicesYesRMSE = 4.29% (4.85%)(parentheses are from visual scoring of outputs for TST%)4
Smartwatch applications Ikoma36AD adults (n = 5)“ItchTracker” (now “DermaTrack”) testing for scratch detectionYes0.85 ± 0.10R = 0.85-0.904
 Lee34Healthy adults (n = 3)“Itchtector” prototype testingYesdominant hand = 0.98-1.00nondominant hand = 0.63-0.82dominant hand = 0.98-1.00nondominant hand = 0.99dominant hand = 0.985-0.99nondominant hand = 0.933-0.9763
 Lee35Pruritic subjects (n = 13)“Itchtector” testing in pruritic subjectsYes0.750.903

AD, Atopic dermatitis; BDNF, brain-derived neurotrophic factor; BRNN, bidirectional recurrent neural network; CTACK, cutaneous T-cell-attracting chemokine; EASI, Eczema Area and Severity Index; IGA, Investigator's Global Assessment; r (ρ), Spearman's rank correlation coefficient; LDH, lactate dehydrogenase; MDC, macrophage-derived chemokine; r, coefficient of determination; RMSE, root mean square error; POEM, Patient-Oriented Eczema Measure; SASSAD, Six Area, Six Sign Atopic Dermatitis; SCORAD, SCORing Atopic Dermatitis; TARC, thymus and activation-regulated chemokine; TST%, total sleep time percentage; VAS, visual analog scale; WASO, wake after sleep onset.

Study quality was assessed using a rating scheme modified from the Oxford Centre for Evidence-Based Medicine for ratings of individual studies: (1) properly powered and conducted randomized clinical trial or systematic review with meta-analysis; (2) well-designed controlled trial without randomization or prospective comparative cohort trial; (3) case-control study or retrospective cohort study; (4) case series with or without intervention or cross-sectional study; and (5) opinion of respected authorities or case reports.

P < .05.

P < .01.

P < .005.

P < .001.

Table II

Summary table for studies exploring acoustic, vibratory, pressure, and strain gauge devices. Note that no specificity values are reported for any of the studies listed

Device typeStudySample size and populationStudy focusVideo recording? (Yes/No)SensitivityCorrelationAccuracyStudy quality (1-5)
Acoustic Kurihara12Healthy adults (n = 10)Finger-mounted microphone vs video recording and other devices for TST% calculationYesRMSE = 1.09%2
 Noro10Healthy adults (n = 8), AD adults (n = 4)Wristwatch-type piezoelectric device for scratching rate compared to video recordingYesr2 = 0.98 (nocturnal scratching rate by acoustic device vs video recording)3
Vibratory Kurihara18Healthy adults (n = 12)Validation of piezoceramic disk devices placed under bed legs vs video recording for scratch and nonscratchYesRMSE (staying calmly) = 0.35-0.72sRMSE (moving hand, turning over, moving foot) = 0.94-1.26sRMSE (scratching) = 0.56-1.29s3
 Kurihara12Healthy adults (n = 10)Piezoceramic disk bed devices placed under bed legs vs video recording and other devices for TST% calculationYesRMSE = 0.87 = 6.31%3
 Shino37Healthy adults (n = 1)Piezoceramic bed devices vs video recording and other devices for TST% extraction via novel algorithmYesRMSE = 0.68-0.79s (0.40-0.94s)TST% error = 2.13-4.11% (−6.51-0.82%)(parentheses are from visually scoring outputs)3
 Kaburagi16Healthy adults (n = 12)TST% estimation algorithm for various devicesYesRMSE (left bed head) = 1.51% (1.84%)RMSE (right bed head) = 0.92% (1.86%)RMSE (left bed foot) = 6.58% (6.27%)RMSE (right foot bed) = 3.97% (6.83%)(parentheses are from visual scoring of outputs for TST%)4
 Kogure17AD subjects (n = 20)Evaluation of sheet-shaped body vibrometer vs wrist actigraphy for measurement of scratching, activity count, and sleep efficiencyNors (ρ):activity count per minute = 0.63-0.82 sleep efficiency = 0.82-0.913
Pressure Sensor Endo14Healthy adults (n = 10), AD adults (n = 20 total; 10 male, 10 female)Evaluation of “Scratch Monitor” device on dorsal handNo0.74 (overall)0.65 (male)0.83 (female)3
 Kurihara12Healthy adults (n = 10)Ceramic sheet placed on dorsal hand vs video recording and other devices for TST% calculationYesRMSE = 0.73%3
Strain Gauge Kurihara12Healthy adults (n = 10)Strain gauge on index finger vs video recording and other devices for TST% calculationYesRMSE = 2.41%3
 Shino37Healthy adults (n = 1)Strain gauge on index finger vs video recording and other devices for TST% extraction via novel algorithmYesRMSE = 0.53s (0.37s)TST% error = +1.38% (−1.54%)(parentheses are from visually scoring outputs)3
 Kaburagi16Healthy adults (n = 12)TST% estimation algorithm for various devicesYesRMSE = 1.29% (1.63%)(parentheses are from visual scoring of outputs for TST%)4

AD, Atopic dermatitis; r, coefficient of determination; RMSE, root mean square error; r(ρ), Spearman's rank correlation coefficient; TST%, total sleep time percentage.

Study quality was assessed using a rating scheme modified from the Oxford Centre for Evidence-Based Medicine for ratings of individual studies: (1) properly powered and conducted randomized clinical trial or systematic review with meta-analysis; (2) well-designed controlled trial without randomization or prospective comparative cohort trial; (3) case-control study or retrospective cohort study; (4) case series with or without intervention or cross-sectional study; and (5) opinion of respected authorities or case reports.

P < .005.

P < .001.

Summary table for studies exploring wrist actigraphs and smartwatch applications AD, Atopic dermatitis; BDNF, brain-derived neurotrophic factor; BRNN, bidirectional recurrent neural network; CTACK, cutaneous T-cell-attracting chemokine; EASI, Eczema Area and Severity Index; IGA, Investigator's Global Assessment; r (ρ), Spearman's rank correlation coefficient; LDH, lactate dehydrogenase; MDC, macrophage-derived chemokine; r, coefficient of determination; RMSE, root mean square error; POEM, Patient-Oriented Eczema Measure; SASSAD, Six Area, Six Sign Atopic Dermatitis; SCORAD, SCORing Atopic Dermatitis; TARC, thymus and activation-regulated chemokine; TST%, total sleep time percentage; VAS, visual analog scale; WASO, wake after sleep onset. Study quality was assessed using a rating scheme modified from the Oxford Centre for Evidence-Based Medicine for ratings of individual studies: (1) properly powered and conducted randomized clinical trial or systematic review with meta-analysis; (2) well-designed controlled trial without randomization or prospective comparative cohort trial; (3) case-control study or retrospective cohort study; (4) case series with or without intervention or cross-sectional study; and (5) opinion of respected authorities or case reports. P < .05. P < .01. P < .005. P < .001. Summary table for studies exploring acoustic, vibratory, pressure, and strain gauge devices. Note that no specificity values are reported for any of the studies listed AD, Atopic dermatitis; r, coefficient of determination; RMSE, root mean square error; r(ρ), Spearman's rank correlation coefficient; TST%, total sleep time percentage. Study quality was assessed using a rating scheme modified from the Oxford Centre for Evidence-Based Medicine for ratings of individual studies: (1) properly powered and conducted randomized clinical trial or systematic review with meta-analysis; (2) well-designed controlled trial without randomization or prospective comparative cohort trial; (3) case-control study or retrospective cohort study; (4) case series with or without intervention or cross-sectional study; and (5) opinion of respected authorities or case reports. P < .005. P < .001.

Data extraction and outcomes

Performance values were extracted using a standardized survey. Primary outcomes included sensitivity, specificity, and positive and negative predictive values of scratch detection methods. Secondary outcomes included correlations of detection methods to other technologies and subjective assessments.

Performance metrics

Sensitivity is defined as the ability to detect the number of true positives (eg, true scratching) and specificity is the ability to detect the number of true negatives (eg, nonscratching movements). Positive predictive value (PPV, precision) is the proportion of positives that are true positives (eg, movements labeled as scratch that are true scratches). The F1 score encompasses both sensitivity and precision. Root mean square error (RMSE) is the standard deviation of residuals and is effectively an estimation of how well an algorithm predicts the observed data (ie, accuracy).

Algorithms

To efficiently extract and analyze device data, algorithms capable of distinguishing scratch from nonscratch movements are essential. Linear regression modeling is generated from the number of activity counts above a frequency threshold and total scratch time; however, this model is limited by confounding movements (eg, walking, restlessness). Logistic regression modeling is a simple approach to binary classification (eg, scratch vs nonscratch) and analogous to linear regression. Bidirectional recurrent neural networks are a form of machine learning whereby the network can detect patterns directly (eg, scratch waveforms) from raw input data, thereby eliminating precursory extraction of patterns required for other models. The k-means clustering analysis is another approach that involves clustering a set number of subgroups within a data set. The algorithm then allocates device signals into their respective subgroups based on frequency, waveform, or other qualities.

Results

Of the 6231 articles identified, 72 were assessed based on exclusion criteria and 24 fully met eligibility criteria. Most articles looked at AD, although other conditions were also examined (eg, urticaria). Articles reporting performance and correlation measures are summarized in Tables I and II. Sensitivity and specificity ranges of technologies compared to video recording are summarized in Table III. An overview of benefits and limitations is seen in Table IV.
Table III

Reported sensitivity of algorithms for scratch detection in studies focused on subjects with atopic dermatitis, which used video recording as comparison

Performance metricActigraphySmartwatch applications
Sensitivity (range)0.45-0.91 (BRNN)24 0.00-0.10 (logistic regression)240.75-0.8534,36

BRNN, Bidirectional recurrent neural network.

Table IV

Comparison of various technologies used to detect scratching

Device typeBenefits/prosLimitations/consAlgorithms for scratch detection
Actigraphy

Most studied, has a large literature base

Validated against video recording

High sensitivity for wrist-dominant scratching movements in healthy subjects

Statistically significant moderate correlation with other objective measures (eg, SCORAD, IGA, EASI)

Very poor correlation with subjective assessment tools for itch

Varied performance regarding scratch detection

Poor sensitivity for finger-dominant scratching movements

Deterioration of performance in pruritic subjects

Poor specificity given difficulty distinguishing wrist movements from scratching

False positives with similar waveforms (eg, walking)

Larger studies in target populations (eg, AD subjects) needed for algorithm development

The k-means cluster analysis algorithm has good performance, but impractical in clinical setting given required determination of all movements a priori23

The BRNN model has good performance in pruritic subjects (albeit poorer than healthy subjects) and moderate F1-scores24

Logistic regression model in the study by Petersen et al29 has comparable performance to k-means cluster analysis, but poorer performance in separate data set by Moreau et al24

Note that all of the aforementioned algorithms are for determination of TST

Smartwatch applications

Similar to actigraphy in that it utilizes the smartwatch's built-in accelerometer, more convenient for current smartwatch owners

Bluetooth and cloud capabilities make accessing data easy for both patients and health care providers

Few applications available

Some applications (eg, “DermaTrack”, formerly called “ItchTracker”) do not show raw data output

Smartwatches may be cumbersome for pediatric subjects, with no currently reported pediatric data

Algorithm proposed by Lee et al34,35 reveals good accuracy in pruritic subjects, but authors report false negatives (eg, nonperiodic scratching) and false positives (nonscratching periodic movements such as arm shaking)

Acoustic

Greater specificity with detection of scratch-generated sounds; will have different pattern than restlessness or turning over

Able to detect both finger and wrist scratching

Limited research

Privacy concerns/risk

Unable to use in patients who do not sleep alone or have OSA

Able to estimate TST% in healthy subjects with high accuracy (low RMSE compared to video recording) in healthy subjects

Vibratory

Noninvasive

Able to localize scratching based on different waveforms

Subject must use specific bed and unable to be used in patients who do not sleep alone

Able to estimate TST% with variable accuracy depending on distance between the sensor and scratch site in healthy subjects

Pressure sensor

Able to detect finger scratching if placed on dorsal hand along metatarsal bone (ceramic sheet)

Performance dependent on technology

Eg, false positives from any hand movement that causes changes in pressure

Limited research

Able to estimate TST% with high accuracy (low RMSE) due to distinct waveforms in healthy subjects

Strain gauge

Higher sensitivity for finger-dominant scratching when placed on index finger compared to actigraphy

False positives with nonscratch finger bending movements

Limited research

Able to estimate TST% with good accuracy in healthy subjects

AD, Atopic dermatitis; BRNN, bidirectional recurrent neural network; EASI, Eczema Area and Severity Index; IGA, Investigator's Global Assessment; N/A, not available; OSA, obstructive sleep apnea; RMSE, root mean square error; SCORAD, SCORing Atopic Dermatitis; TST, total scratch time; TST%, total sleep time percentage.

Reported sensitivity of algorithms for scratch detection in studies focused on subjects with atopic dermatitis, which used video recording as comparison BRNN, Bidirectional recurrent neural network. Comparison of various technologies used to detect scratching Most studied, has a large literature base Validated against video recording High sensitivity for wrist-dominant scratching movements in healthy subjects Statistically significant moderate correlation with other objective measures (eg, SCORAD, IGA, EASI) Very poor correlation with subjective assessment tools for itch Varied performance regarding scratch detection Poor sensitivity for finger-dominant scratching movements Deterioration of performance in pruritic subjects Poor specificity given difficulty distinguishing wrist movements from scratching False positives with similar waveforms (eg, walking) Larger studies in target populations (eg, AD subjects) needed for algorithm development The k-means cluster analysis algorithm has good performance, but impractical in clinical setting given required determination of all movements a priori The BRNN model has good performance in pruritic subjects (albeit poorer than healthy subjects) and moderate F1-scores Logistic regression model in the study by Petersen et al has comparable performance to k-means cluster analysis, but poorer performance in separate data set by Moreau et al Note that all of the aforementioned algorithms are for determination of TST Similar to actigraphy in that it utilizes the smartwatch's built-in accelerometer, more convenient for current smartwatch owners Bluetooth and cloud capabilities make accessing data easy for both patients and health care providers Few applications available Some applications (eg, “DermaTrack”, formerly called “ItchTracker”) do not show raw data output Smartwatches may be cumbersome for pediatric subjects, with no currently reported pediatric data Algorithm proposed by Lee et al, reveals good accuracy in pruritic subjects, but authors report false negatives (eg, nonperiodic scratching) and false positives (nonscratching periodic movements such as arm shaking) Greater specificity with detection of scratch-generated sounds; will have different pattern than restlessness or turning over Able to detect both finger and wrist scratching Limited research Privacy concerns/risk Unable to use in patients who do not sleep alone or have OSA Able to estimate TST% in healthy subjects with high accuracy (low RMSE compared to video recording) in healthy subjects Noninvasive Able to localize scratching based on different waveforms Subject must use specific bed and unable to be used in patients who do not sleep alone Able to estimate TST% with variable accuracy depending on distance between the sensor and scratch site in healthy subjects Able to detect finger scratching if placed on dorsal hand along metatarsal bone (ceramic sheet) Performance dependent on technology Eg, false positives from any hand movement that causes changes in pressure Limited research Able to estimate TST% with high accuracy (low RMSE) due to distinct waveforms in healthy subjects Higher sensitivity for finger-dominant scratching when placed on index finger compared to actigraphy False positives with nonscratch finger bending movements Limited research Able to estimate TST% with good accuracy in healthy subjects AD, Atopic dermatitis; BRNN, bidirectional recurrent neural network; EASI, Eczema Area and Severity Index; IGA, Investigator's Global Assessment; N/A, not available; OSA, obstructive sleep apnea; RMSE, root mean square error; SCORAD, SCORing Atopic Dermatitis; TST, total scratch time; TST%, total sleep time percentage.

Actigraphy

Performance

Actigraphy is the most studied technology.,,25, 26, 27 Twenty-one articles investigated actigraphy devices and data extraction algorithms, with 7 compared to video recording.,,,,,, While all 7 articles looked at healthy subjects, only 2 reported sensitivity values (0.00-0.96; zero values indicate no true positives)., Specificity was reported by 1 article (0.92). Four articles explored actigraphy in AD subjects, with 1 reporting sensitivity values (0.00-0.89) and PPV values (0.00-0.57). Specificity was not reported in this population. The large ranges likely stem from the various extraction algorithms and actigraphs (eg, PAM-RL, , Actiwatch Plus,, DigiTrac,). Each algorithm has its limitations. The k-means clustering analysis algorithm of Feuerstein et al yielded high performance values, but required all anticipated movements to be determined a priori. While logistic regression approach from Petersen et al for detecting total nocturnal scratch time yielded comparable performance to the algorithm from Feuerstein et al, the model had significantly decreased performance when tested with a separate data set. The bidirectional recurrent neural networks algorithm proposed by Moreau et al yielded higher sensitivity, PPV, and F1-scores than the logistic regression model; however, it has not been tested in further datasets. Correlation between actigraphy data and video recording was evaluated by Moreau et al, reporting Spearman rank correlation coefficients (rs(ρ)) of 0.95-0.96. Other studies report correlations between actigraphy and video recording for total scratching time percentage (TST%) calculation (rs(ρ) = 0.91), and correlation values between actigraphy and video recording of sleep efficiency were all reported to be greater than 0.92 by Benjamin et al.

Correlations with other objective and subjective measures

Several articles explored correlations between actigraphy and subjective sleep measures, disease severity, AD-associated serum markers, and subjective itch measures. Ten articles compared actigraphy to subjective sleep measures, with 4 reporting correlations. VAS sleep, a patient-reported measure of sleep quality, was examined in 1 article, reporting correlation coefficients of −0.44 in adults and 0.48 in children when compared to average hourly activity scores. The total scoring AD index, which includes both subjective (eg, itch and sleep) and objective (eg, disease severity) measures, had moderate correlations with various activity measures ranging from 0.53-0.64 in adults (P < .05) and 0.42-0.62 in children (P < .05).,, While total and objective total scoring AD indexes both resulted in rs(ρ) = 0.52 (P < .001) in children (n = 24) compared to wrist activity, correlations with pruritus and sleep subscores were not significant. Two articles evaluated other disease severity indices in children and adults, with moderate correlation for objective measures (Eczema and Area Severity Index and Investigator's Global Assessment) compared to actigraphic wake after sleep onset, ranging from 0.70-0.76 (P < .02, n = 10). Six area six sign AD was found to have a weak correlation with average nocturnal movement (rs(ρ) = 0.15, P = .02, n = 235). Four articles investigated -serum markers associated with AD. Statistically significant correlations with actigraphy measurements ranged from 0.51-0.93.,,, These studies were not compared to video recording, however, and thus conclusions specifically related to scratch are difficult to make. While there seems to be a moderate correlation between actigraphy and objective measures, this is not the case with subjective measures. Fourteen articles compared actigraphy to subjective itch, with 2 articles reporting correlation coefficients. Comparison between VAS itch and mean actigraphy scores yielded coefficients of determination (r2) of 0.06 in children and adults with various pruritic conditions (n = 118) and 0.08 in adult AD subjects (n = 20). VAS itch and hourly activity scores yielded rs(ρ) = 0.40 (P = .049) in children and 0.18 (P = .9) in adults. Actigraphy-based scratch measurements correlate poorly to VAS itch scores, sleep quality, and other subjective patient-reported outcomes.,,,, The reasons for this are likely multifactorial. In pediatric populations, proxy measures may be under or overestimated by caregivers. More likely, there are inherent differences between a subject's perception of itch and the objective actions of scratching. An individual may report a high level of subjective itch but exhibit an equally high level of scratching restraint. In contrast, some individuals with chronic itch are habituated to it and report low scores despite frequent scratching. Ultimately, scratch measurements with objective tools and subject-reported outcomes are interrelated outputs that provide complementary information.

Smartwatch applications

Applications leveraging smartwatches and their accelerometers show comparable performance in detecting scratch when compared to actigraphs. Three articles examined smartwatch applications compared to video recording. In preliminary testing of their “Itchtector” app, Lee et al reported sensitivity (0.63-1.00), specificity (0.98-1.00), PPV (0.83-0.98), negative predictive value (0.93-1.00), and accuracy (93.3%-99.0%) in healthy adults (n = 3). When cross-validated in pruritic subjects (n = 13), the app yielded lower sensitivity (0.75), PPV (0.74), and accuracy (90%), which may be due to the small initial sample size, different subject populations, and different smartwatches. Ikoma et al also tested the “ItchTracker” app in adult AD subjects (n = 5) and reported a sensitivity of 0.85 and PPV of 0.90. They reported a correlation between the app and video recording for an hourly scratch duration of rs(ρ) = 0.851-0.901 (P < .001). The authors further compared scratching duration percentage to current and 7-day itch in healthy and AD adults, reporting rs(ρ) = 0.36-0.43 (P < .001). Similar findings were reported regarding self-reported sleep disturbance (rs(ρ) = 0.45) and daytime disturbance (rs(ρ) = 0.42). Disease severity measured by the Eczema and Area Severity Index was significantly correlated to scratching duration percentage (rs(ρ) = 0.60). However, they excluded finger-only scratching movements. Additionally, the small sample size should be taken into consideration. Although smartwatch applications show good sensitivity, there are no reported specificity ranges for pruritic subjects, making it difficult to assess their ability to distinguish between scratch and nonscratch movements.

Acoustic

Acoustic devices detect sound waves generated from scratching. Two articles studied healthy subjects and compared the performance of their respective devices to that of video recording. No sensitivity or specificity values were reported. The finger-mounted microphone presented by Kurihara et al yielded an RMSE of 1.09% for TST% calculation when compared to video recording. Noro et al reported r2 = 0.98 when comparing scratching rate captured by their acoustic sensor and scratching rate obtained from video observation. While the devices show strong accuracy in detecting fine finger movements, the technology is not widely available and follow-up studies have not been conducted since first reported in 2014.

Vibratory

Vibratory devices allow for nonintrusive monitoring of body movements and mitigate lesion exacerbation by devices that require skin contact. Four articles studied bed vibratory sensors compared to video recording.,,, Accuracy was measured by RMSE, ranging 0.56-1.29s for scratching time and 0.87%-6.31% for TST% calculation. Shino et al reported comparable RMSE values for their TST% algorithm (0.68-0.79s) when compared to visually scored device outputs (0.40-0.94s) (n = 1). For both studies, the vibratory RMSE values were among the lowest when compared to other technologies. While vibratory devices have comparable accuracy to actigraphy and are largely burden-free once installed, their cost and setup may be deterrents.

Pressure sensors

Pressure sensors placed on the dorsal hand detect pressure changes with hand movements. Only 1 of 2 articles was compared to video recording. Kurihara et al compared a ceramic sheet to other devices in healthy subjects, and reported a RMSE of 0.72% for TST% calculation when compared to video recording, the lowest among the devices tested. Although not compared to video recording, the Scratch Monitor pressure sensor presented by Endo et al was tested in healthy adults and yielded sensitivity ranging from 0.65-0.83.

Strain gauge

Strain gauges placed on the index finger to measure finger bending were evaluated in 2 studies, both of which were compared to video recording and tested in healthy subjects. The devices yielded an RMSE of 2.41% for TST% calculation, half that of wrist actigraphy. The devices also yielded a TST% error of 1.38% when automatically extracted via an algorithm proposed by Shino et al, which was compared to a TST% error of −1.54% when the data were visually scored. No sensitivity or specificity values were reported. It should be noted that strain gauges may be more susceptible to false positives (eg, nonscratching finger movements).

Discussion

While the development of existing and novel devices has progressed tremendously, their performances reveal large areas in need of improvement. Actigraphy-based algorithms appear to have good sensitivity and specificity in healthy subjects; however, their performance deteriorates considerably when applied to pruritic subjects. This may be due to a lack of algorithm generalizability and failure to capture finger scratching. Additionally, most data used for establishing scratch parameters were obtained from small healthy samples. While there have been cross-validation studies with data from small AD samples, testing in larger samples of pruritic patients has not been performed. The same principle applies to newer scratch technologies, whereby further testing in both populations is needed for robust algorithms. While certain devices have demonstrated greater sensitivity for detecting finger scratching, the studies do not explicitly mention their abilities to detect rubbing or use of other scratching tools (eg, back scratchers). Rubbing, like scratching, is a natural reaction to itch; if devices are unable to distinguish rubbing or use of scratching tools from other motions, they may be underestimating itch. Further development of these technologies may help provide a more comprehensive picture of itch.

Performance metrics and algorithms

With advances in machine learning, data-driven approaches for objective scratch monitoring have gained significant interest. Various metrics have been employed to evaluate performance. While specificity and accuracy are useful, they need to be used with caution as they can be prone to class imbalances. Under typical situations, scratching arises sporadically, each over a brief period, ranging from a few seconds to several minutes depending on symptom severity. Thus, the majority of data collected features nonscratching behaviors; only a small amount of data feature scratching, resulting in a significant class imbalance. For example, a poor classification algorithm that predicts nonscratch all the time will, most likely, produce excellent accuracy and specificity. Given this problem, other metrics, such as sensitivity, precision, and F1-score are deemed more appropriate to quantify performance.

Future considerations

While patient history and examination remain important tools in assessing itch, there remains an ongoing need for adjunctive objective and precise, tools to quantify itch, such as in the case of subconscious habitual scratching. Many technologies and algorithmic strategies have been studied, though their performances are highly variable, with validation studies rarely extending beyond small samples. In addition, most studies focus on nocturnal scratching. Given that the perception of itch varies during the day, daytime scratching remains an important behavior that is largely unstudied. In this review, very few studies reported specificity values. While this is understandable in nocturnal scratching, during which the targeted behavior scratching is rare overall, daytime wear introduces other confounders, such as texting or walking. Thus, specificity may hold greater relevance in daytime wear, during which the wristwatch-based systems may struggle to differentiate scratching from other movements. Our group has introduced a novel mechano-acoustic skin device that incorporates actigraphy and acoustic detection of scratching by conforming to the dorsal hand and sampling at higher frequencies (~1600 Hz) compared to actigraphy (20-100 Hz). Scratch algorithm development performed in healthy subjects yielded high sensitivity and specificity with comparable performance among AD datasets using an IR camera gold standard, even with confounders., A comparison of data outputs for scratch from actigraphy, smartwatch application, and mechano-acoustic device is shown in Supplemental Fig 1.

Conclusion

While actigraphy remains the most frequently studied modality in clinical studies, performance is variable with no assessment of daytime performance. Further testing of these technologies will be needed before used in the clinical setting. A reliable technological modality would allow for objective support of drug development outcomes,40, 41, 42 guide disease management, and assess treatment response.

Conflicts of interest

Drs Yang, Nguyen, Li, Lee, Chun, Wu, Fishbein, and Paller have no conflicts of interest to declare. Dr Xu has equity in a private company with a commercial interest in scratch sensors and inventorship interest in patents related to a scratch sensor.
  31 in total

1.  Wrist actigraphy for scratch detection in the presence of confounding activities.

Authors:  Johanna Feuerstein; Daniel Austin; Robert Sack; Tamara L Hayes
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2011

2.  Actigraphy-based scratch detection using logistic regression.

Authors:  Johanna Petersen; Daniel Austin; Robert Sack; Tamara L Hayes
Journal:  IEEE J Biomed Health Inform       Date:  2012-06-13       Impact factor: 5.772

3.  Novel acoustic evaluation system for scratching behavior in itching dermatitis: rapid and accurate analysis for nocturnal scratching of atopic dermatitis patients.

Authors:  Yuichi Noro; Youichi Omoto; Koji Umeda; Futa Tanaka; Yousuke Shiratsuka; Tomomi Yamada; Kenichi Isoda; Kimiko Matsubara; Keiichi Yamanaka; Esteban C Gabazza; Masakatsu Nishikawa; Hitoshi Mizutani
Journal:  J Dermatol       Date:  2014-02-10       Impact factor: 4.005

4.  Correlation between nocturnal scratch behavior assessed by actigraphy and subjective/objective parameters in patients with atopic dermatitis.

Authors:  Hiroyuki Fujita; Mayumi Nagashima; Yoshihiro Takeshita; Michiko Aihara
Journal:  Eur J Dermatol       Date:  2014 Jan-Feb       Impact factor: 3.328

5.  Measurement of itch using actigraphy in pediatric and adult populations.

Authors:  Carol Bringhurst; Karen Waterston; Olivia Schofield; Kenneth Benjamin; Jonathan L Rees
Journal:  J Am Acad Dermatol       Date:  2004-12       Impact factor: 11.527

6.  Evaluation of scratch movements by a new scratch-monitor to analyze nocturnal itching in atopic dermatitis.

Authors:  K Endo; H Sumitsuji; T Fukuzumi; J Adachi; T Aoki
Journal:  Acta Derm Venereol       Date:  1997-11       Impact factor: 4.437

Review 7.  Itch.

Authors:  Gil Yosipovitch; Malcolm W Greaves; Martin Schmelz
Journal:  Lancet       Date:  2003-02-22       Impact factor: 79.321

8.  Treatment of atopic dermatitis with tralokinumab, an anti-IL-13 mAb.

Authors:  Andreas Wollenberg; Michael D Howell; Emma Guttman-Yassky; Jonathan I Silverberg; Christopher Kell; Koustubh Ranade; Rachel Moate; René van der Merwe
Journal:  J Allergy Clin Immunol       Date:  2018-06-12       Impact factor: 10.793

Review 9.  Nocturnal eczema: Review of sleep and circadian rhythms in children with atopic dermatitis and future research directions.

Authors:  Anna B Fishbein; Olivia Vitaterna; Isabel M Haugh; Aakash A Bavishi; Phyllis C Zee; Fred W Turek; Stephen H Sheldon; Jonathan I Silverberg; Amy S Paller
Journal:  J Allergy Clin Immunol       Date:  2015-11       Impact factor: 10.793

10.  Measure of atopic dermatitis disease severity using actigraphy.

Authors:  Laura F Sandoval; Karen Huang; Jenna L O'Neill; Cheryl J Gustafson; Emily Hix; Jessica Harrison; Adele Clark; Orfeu M Buxton; Steven R Feldman
Journal:  J Cutan Med Surg       Date:  2014 Jan-Feb       Impact factor: 2.092

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.