| Literature DB >> 30252908 |
Elizabeth A Handorf1, Carolyn J Heckman2, Susan Darlow3, Michael Slifker1, Lee Ritterband4.
Abstract
INTRODUCTION: Online surveys are a valuable tool for social science research, but the perceived anonymity provided by online administration may lead to problematic behaviors from study participants. Particularly, if a study offers incentives, some participants may attempt to enroll multiple times. We propose a method to identify clusters of non-independent enrollments in a web-based study, motivated by an analysis of survey data which tests the effectiveness of an online skin-cancer risk reduction program.Entities:
Mesh:
Year: 2018 PMID: 30252908 PMCID: PMC6155511 DOI: 10.1371/journal.pone.0204394
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Dendrogram representing clustering of screener responses in participants who went on to complete the baseline questionnaire.
Fig 2(A) shows the distribution of the observed cluster sizes and sil widths (black) compared to the distribution of size and sil widths from one simulated sample. (B) shows the distribution of the maximum SSWs from each of 1,000 simulations, compared to the top three observed SSWs in the true data. The two outliers of interest are represented by an X through a circle in both plots.
Cluster membership vs. latent class membership.
| Cluster Membership | Latent Class Membership | |||
|---|---|---|---|---|
| Class 1 | Class 2 | Class 3 | Total | |
| Other | 610 | 406 | 104 | 1120 |
| Cluster #1 | 7 | 55 | 3 | 65 |
| Cluster #2 | 2 | 4 | 43 | 49 |
| Total | 619 | 465 | 150 | 1234 |
*“Other” denotes membership in one of the clusters not identified as usually large/similar
Quality variables by cluster membership and predicted latent class.
| Quality variables | Full sample | Cluster Membership | Latent Class Membership | ||||
|---|---|---|---|---|---|---|---|
| Other | 1 | 2 | 1 | 2 | 3 | ||
| 1. Minutes to complete questionnaire (median) | 21 | 21 | 14 | 935 | 20 | 15 | 935 |
| 2. Correlation of synonyms (mean) | 0.4 | 0.39 | 0.58 | 0.33 | 0.28 | 0.59 | 0.33 |
| 3. Even-odd item correlation (mean) | 0.62 | 0.61 | 0.74 | 0.66 | 0.60 | 0.64 | 0.62 |
| 4. Distance from average (mean) | 15.64 | 15.82 | 12.85 | 15.33 | 17.40 | 13.45 | 15.18 |
| 5. Runs of identical responses (mean) | 0.43 | 0.44 | 0.47 | 0.27 | 0.43 | 0.49 | 0.27 |
| 6. Inconsistent state/climate selection | 15.0% | 12.9% | 16.9% | 61.2% | 6.8% | 13.8% | 52.7% |
| 7. Non-US phone | 16.0% | 15.3% | 13.9% | 36.7% | 13.1% | 8.4% | 52.0% |
| 8. Wrong phone number | 13.0% | 13.8% | 9.2% | 2.0% | 8.9% | 22.4% | 1.3% |
| 9. Obviously fake name | 6.8% | 6.8% | 12.3% | 0.0% | 0.5% | 16.3% | 3.3% |
| 10. Nonsensical feedback | 2.8% | 2.0% | 20.0% | 0.0% | 0.3% | 7.1% | 0.0% |
| 11. Discrepancies within questionnaire | 4.3% | 3.8% | 16.9% | 0.0% | 2.3% | 7.7% | 2.0% |
| 12. Other | 9.3% | 9.7% | 6.2% | 4.1% | 7.4% | 12.5% | 7.3% |
Note: Shading emphasizes unusual behavior patterns as measured by the quality indicators
*“Other” denotes membership in one of the clusters not identified as usually large/similar
Intervention effects on primary outcomes at 12 weeks, by cluster and latent class membership.
| UV exposure outcome | Effect | SE | 95% CI | P-val | |
|---|---|---|---|---|---|
| All participants | -0.19 | 0.054 | -0.30 | -0.09 | 0.0003 |
| Only non-clustered participants | -0.24 | 0.058 | -0.36 | -0.13 | <0.0001 |
| Only members of latent class 1 | -0.31 | 0.081 | -0.47 | -0.15 | 0.0001 |
| All participants | 0.31 | 0.081 | 0.15 | 0.47 | 0.0001 |
| Only non-clustered participants | 0.32 | 0.085 | 0.16 | 0.49 | 0.0001 |
| Only members of latent class 1 | 0.58 | 0.116 | 0.35 | 0.81 | <0.0001 |