| Literature DB >> 32215257 |
Raphael Schuster1, Manuela Larissa Schreyer2, Tim Kaiser1, Thomas Berger3, Jan Philipp Klein4, Steffen Moritz5, Anton-Rupert Laireiter1,6, Wolfgang Trutschnig2.
Abstract
Smartphone-based devices are increasingly recognized to assess disease symptoms in daily life (e.g. ecological momentary assessment, EMA). Despite this development in digital psychiatry, clinical trials are mainly based on point assessments of psychopathology. This study investigated expectable increases in statistical power by intense assessment in randomized controlled trials (RCTs). A simulation study, based on three scenarios and several empirical data sets, estimated power gains of two- or fivefold pre-post-assessment. For each condition, data sets of various effect sizes were generated, and AN(C)OVAs were applied to the sample of interest (N = 50-N = 200). Power increases ranged from 6% to 92%, with higher gains in more underpowered scenarios and with higher number of repeated assessments. ANCOVA profited from a more precise estimation of the baseline covariate, resulting in additional gains in statistical power. Fivefold pre-post EMA resulted in highest absolute statistical power and clearly outperformed traditional questionnaire assessments. For example, ANCOVA of automatized PHQ-9 questionnaire data resulted in absolute power of 55 (for N = 200 and d = 0.3). Fivefold EMA, however, resulted in power of 88.9. Non-parametric and multi-level analyses resulted in comparable outcomes. Besides providing psychological treatment, digital mental health can help optimizing sensitivity in RCT-based research. Intense assessment appears advisable whenever psychopathology needs to be assessed with high precision at pre- and post-assessment (e.g. small sample sizes, small treatment effects, or when applying optimization problems like machine learning). First empiric studies are promising, but more evidence is needed. Simulations for various effects and a short guide for popular power software are provided for study planning.Entities:
Year: 2020 PMID: 32215257 PMCID: PMC7090342 DOI: 10.1016/j.invent.2020.100313
Source DB: PubMed Journal: Internet Interv ISSN: 2214-7829
Fig. 1Different slopes of improvement as a function of measurement day introduce measurement error.
Note. Point assessments of psychopathology (by standard questionnaires) introduce measurement error as symptoms fluctuate over time. For example, for a questionnaire with 16 items and a standard deviation of SD = 5, a fluctuation of 1 point on 2 items of a given Likert scale would result in 40% fluctuation of SD. This imprecision increases if both, pre- and post-assessment, are affected equally. Green lines represent three slopes of single point assessments. Red lines represent averaged slopes over a moving window of three measurement occasions.
Scenarios to test the impact of intense assessment.
| Scenario 1 (standard scenario) | Scenario 2 (emp. trial data) | Scenario 3 (emp. EMA data) | |
|---|---|---|---|
| Assessment method | Average questionnaire | Automatized PHQ-9 | Automatized EMA |
| Reliability of repeated pre-assessments ( | 0.7 | ≈0.4–0.65 | 0.4 |
| Reliability of repeated post-assessments ( | 0.7 | ≈0.4–0.65 | 0.4 |
| Quantity of pre-assessments | 2 or 5 | 2 or 5 | 2 or 5 |
| Quantity of post-assessments | 2 or 5 | 2 or 5 | 2 or 5 |
Abbreviations: EMA = ecological momentary assessment; PHQ-9 = Patient Health Questionnaire (depression); r = auto-correlation.
Fig. 2Power curves mapping effect size (x-axis) and achieved power (y-axis) for Scenario 1 (standard scenario).
Note. Green line = power gain; red line = 80% power level; dashed line: standard AN(C)OVA; solid line: intense assessment.
Fig. 3Power curves mapping effect size (x-axis) and achieved power (y-axis) for Scenario 2 (empiric data based on automatized PHQ-9 assessments).
Note. Green line = power gain; red line = 80% power level; dashed line: standard AN(C)OVA; solid line: intense assessment.
Fig. 4Power curves mapping effect size (x-axis) and achieved power (y-axis) for Scenario 3 (empiric EMA data).
Note. Green line = power gain; red line = 80% power level; dashed line: standard AN(C)OVA; solid line: intense assessment.
Achieved power through intense pre-post-assessment by automatized short questionnaires or sEMA.
| ANOVA | ANCOVA | |||||
|---|---|---|---|---|---|---|
| Standard pre-post | Twofold pre-post | Fivefold pre-post | Standard pre-post | Twofold pre-post | Fivefold pre-post | |
| Simulation | Power ( | Power ( | Power ( | Power ( | Power ( | Power ( |
| Scenario 2 (automatized PHQ-9) | ||||||
| | 72.1 (106) | 76.7 (113) | 83.8 (105) | 90.4 (113) | ||
| | 61.7 (106) | 66.7 (114) | 76,7 (109) | 83.8 (120) | ||
| | 47.9 (109) | 52.9 (121) | 60.4 (110) | 71,3 (130) | ||
| Scenario 3 (automatized EMA) | ||||||
| | 58.2 (100) | 71.9 (123) | 75.2 (100) | 89.5 (119) | ||
| | 51.3 (100) | 64.4 (125) | 66.9 (100) | 82.3 (123) | ||
| | 38.4 (100) | 50.0 (130) | 51.4 (100) | 67.5 (131) | ||
Note. Fivefold sEMA (columns 3 and 6 of Scenario 3) clearly outperforms questionnaire-based point assessments of psychopathology (columns 1 and 4 of Scenario 2) in terms of absolute statistical power.
Abbreviations: sEMA = intense pre-post-Ecological Momentary Assessment; %* = increase in percent relative to reference; N = number of participants.
Bold numbers indicate expectable power of questionnaire- and EMA-based assessment.
Cohen's d.
Advantages and disadvantages of intense assessment.
| Advantage | Disadvantage |
|---|---|
| Fits recent trends in clinical research | More feasibility research needed |
| Increases measurement precision | May act as intervention |
| Reduces impact of missing assessments | Increases burden for participants |
| Provides additional information on disease dynamics | Applicability decreases with number of items |
| Improves triangulation of data sources (e.g. neuroscience) | |
| Increases statistical power/reduces required sample size |