Yiwen Zhang, Xianghua Luo, Chap T Le, Jasjit S Ahluwalia, Janet L Thomas.
Abstract
BACKGROUND: Missing data are common in tobacco studies. It is well known that the observed data alone cannot distinguish between missing-data mechanisms such as missing at random (MAR) and missing not at random (MNAR). In this paper, we propose a sensitivity analysis method that accommodates different missing-data mechanisms in cessation outcomes determined by self-report and urine validation results.
Keywords: Abstinence outcome; Imputation; Missing data; Sensitivity analysis
Year: 2018 PMID: 30563473 PMCID: PMC6299502 DOI: 10.1186/s12874-018-0635-2
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Two-by-two table of tobacco use status by missing for self-report data
| Missing status of self-report data | Self-report abstinence | Self-report failure | Total |
|---|---|---|---|
| Observed | n11 | n12 | n1. |
| Missing | **n21** | **n22** | n2. |
| Total | **n11 + n21** | **n12 + n22** | n |
Bolded entries indicate values that are not observable
Fig. 1 Data structure and notation for a single treatment group. Here n is the total sample size, n1. is the number of survey respondents, and n2. is the number of survey non-respondents. Among survey respondents, n11 and n12 are the numbers of observed self-reported abstinences and failures; among non-respondents, n21 and n22 are the corresponding imputed numbers. For the urine samples, u denotes the number of samples provided, in an observed version and an estimated version (the latter based on the imputed survey data), and v likewise denotes the number of unavailable urine samples. The urine data use analogous notation, with f instead of n denoting the numbers of subjects under different conditions (the superscripts 11, 12, 21, and 22 have the same meaning). Abstinence (f11) and failure (f12) counts are also obtained from the estimated available urine samples. The observed and estimated f11 counts are combined to give f11., the number of urine-verified abstinences among the urine samples that were actually provided or could have been provided had all surveys been completed; the f12 counts are combined in the same way to give f12., the number of urine-verified failures. OR1 denotes the assumed odds ratio between missing and smoking for the self-report data, and OR2 the corresponding odds ratio for the urine data. Dashed lines indicate where missing data are reallocated based on certain assumptions or estimations. Bolded notation denotes values that are not observed.
Fig. 2 Missing data in 6-month abstinence outcomes of the Enhanced Quit & Win study (subjects with missing abstinence data are shaded)
Summary of 6-month self-reported and urine verified abstinence and missing data by treatment arms and by type of intervention
| Treatment group | Total | Self-report abstinence | Self-report failure | Survey question missing | Urine-verified abstinence | Urine-verified failure | Urine missing |
|---|---|---|---|---|---|---|---|
| By treatment arm | |||||||
| Tx1 | 306 | 65 (21.2%) | 194 (63.4%) | 47 (15.4%) | 38 (58.5%) | 6 (9.2%) | 21 (32.3%) |
| Tx2 | 296 | 59 (19.9%) | 170 (57.4%) | 67 (22.6%) | 34 (57.6%) | 6 (10.2%) | 19 (32.2%) |
| Tx3 | 309 | 61 (19.7%) | 197 (63.8%) | 51 (16.5%) | 35 (57.4%) | 6 (9.8%) | 20 (32.8%) |
| Tx4 | 306 | 79 (25.8%) | 156 (51.0%) | 71 (23.2%) | 50 (63.3%) | 7 (8.9%) | 22 (27.8%) |
| By intervention | |||||||
| No counseling (Tx1 + Tx3) | 615 | 126 (20.5%) | 391 (63.6%) | 98 (15.9%) | 73 (57.9%) | 12 (9.5%) | 41 (32.5%) |
| Counseling (Tx2 + Tx4) | 602 | 138 (22.9%) | 326 (54.2%) | 138 (22.9%) | 84 (60.9%) | 13 (9.4%) | 41 (29.7%) |
| Single contest (Tx1 + Tx2) | 602 | 124 (20.6%) | 364 (60.5%) | 114 (18.9%) | 72 (58.1%) | 12 (9.7%) | 40 (32.2%) |
| Multiple contests (Tx3 + Tx4) | 615 | 140 (22.8%) | 353 (57.4%) | 122 (19.8%) | 85 (60.7%) | 13 (9.3%) | 42 (30.0%) |
| Overall | 1217 | 264 (21.7%) | 717 (58.9%) | 236 (19.4%) | 157 (59.5%)b | 25 (9.5%)b | 82 (31.1%) |
Tx1: single contest + no counseling; Tx2: single contest + counseling; Tx3: multiple contests + no counseling; Tx4: multiple contests + counseling
a Percentages in the three urine columns are out of those who self-reported abstinence
b Five subjects provided urine samples of inadequate amount for testing. These 5 missing urine test results were assumed to have the same distribution as the remaining 177 urine samples (86% verified abstinence and 14% verified failure) and were added to the two columns accordingly
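The complete-case comparison of counseling versus no counseling can be sketched directly from the counts in the table above. This is a minimal illustration, not the authors' code: the function names are hypothetical, and it assumes an uncorrected Pearson chi-square test with one degree of freedom on the self-report counts of respondents only.

```python
import math

def odds_ratio(a, b, c, d):
    """Odds ratio for a 2x2 table [[a, b], [c, d]] (rows: groups, columns: abstinence, failure)."""
    return (a * d) / (b * c)

def pearson_chi2_p(a, b, c, d):
    """P-value of the uncorrected Pearson chi-square test for a 2x2 table (1 degree of freedom)."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 df, P(X > chi2) = P(|Z| > sqrt(chi2)) = erfc(sqrt(chi2 / 2)).
    return math.erfc(math.sqrt(chi2 / 2))

# Self-report counts among survey respondents, taken from the table above.
counseling = (138, 326)      # abstinence, failure (Tx2 + Tx4)
no_counseling = (126, 391)   # abstinence, failure (Tx1 + Tx3)

effect = odds_ratio(*counseling, *no_counseling)
p_value = pearson_chi2_p(*counseling, *no_counseling)
print(f"complete-case odds ratio: {effect:.2f}, p-value: {p_value:.3f}")
```

Under these assumptions the odds ratio is about 1.31 and the p-value about .058, and the respondent-only abstinence rates are 138/464 and 126/517 (roughly 29.7% and 24.4%).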
Summary of imputation results for self-report abstinence assuming different levels of association between the survey missing status and self-report abstinence
| OR1 | Counseling (Tx2 + Tx4) | No counseling (Tx1 + Tx3) | Odds ratio for abstinence, counseling vs. no counseling | P-value | Multiple contests | Single contest | Odds ratio for abstinence, multiple vs. single contests | P-value |
|---|---|---|---|---|---|---|---|---|
| Complete case only | 29.7% | 24.4% | 1.31 | .058 | 28.4% | 25.4% | 1.16 | .291 |
| 1 | 29.8% | 24.4% | 1.31 | .034 | 28.6% | 25.4% | 1.18 | .212 |
| 2 | 27.0% | 22.7% | 1.26 | .086 | 26.2% | 23.4% | 1.16 | .251 |
| 3 | 25.8% | 22.0% | 1.23 | .125 | 25.2% | 22.5% | 1.16 | .275 |
| 4 | 25.1% | 21.7% | 1.21 | .154 | 24.7% | 22.1% | 1.15 | .290 |
| 5 | 24.7% | 21.5% | 1.20 | .175 | 24.3% | 21.8% | 1.15 | .301 |
| ⁞ | ||||||||
| +∞ | 22.9% | 20.5% | 1.15 | .303 | 22.8% | 20.6% | 1.14 | .359 |
OR1: odds ratio between missing and tobacco use status for self-report data, where OR1 = 1 corresponds to the situation when missing is independent of tobacco use and OR1 = positive infinity (+∞) corresponds to the situation when missing = smoking; Tx1: single contest + no counseling; Tx2: single contest + counseling; Tx3: multiple contests + no counseling; Tx4: multiple contests + counseling. P-values are based on the Chi-square test
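How an assumed OR1 translates into an imputed abstinence rate can be sketched as follows. This is an illustration under stated assumptions rather than the authors' procedure: it takes OR1 to be the odds of failure among survey non-respondents divided by the odds of failure among respondents, and it allocates non-respondents by their expected counts instead of drawing imputations. The function names and the `COUNSELING` / `NO_COUNSELING` constants are hypothetical; the counts come from the 6-month summary table.

```python
import math

def imputed_abstinence_rate(n_abst, n_fail, n_missing, or1):
    """Expected abstinence rate after allocating non-respondents under an assumed OR1."""
    # Assumed definition: odds of failure among missing = OR1 * odds of failure among observed.
    odds_fail_missing = or1 * n_fail / n_abst
    p_abst_missing = 1.0 / (1.0 + odds_fail_missing)  # equals 0.0 when or1 is infinite
    expected_abst = n_abst + n_missing * p_abst_missing
    return expected_abst / (n_abst + n_fail + n_missing)

def treatment_odds_ratio(p_treat, p_control):
    """Odds ratio for abstinence computed from two abstinence rates."""
    return (p_treat / (1 - p_treat)) / (p_control / (1 - p_control))

# (self-report abstinence, self-report failure, survey missing)
COUNSELING = (138, 326, 138)
NO_COUNSELING = (126, 391, 98)

def sweep(or1_values=(1, 2, 3, 4, 5, math.inf)):
    """Return (OR1, counseling rate, no-counseling rate, treatment odds ratio) per OR1."""
    rows = []
    for or1 in or1_values:
        p_c = imputed_abstinence_rate(*COUNSELING, or1)
        p_n = imputed_abstinence_rate(*NO_COUNSELING, or1)
        rows.append((or1, p_c, p_n, treatment_odds_ratio(p_c, p_n)))
    return rows

for or1, p_c, p_n, effect in sweep():
    print(f"OR1={or1}: counseling {p_c:.1%}, no counseling {p_n:.1%}, odds ratio {effect:.2f}")
```

The rates produced this way are close to the tabulated ones, though not always identical in the last digit, presumably because of rounding or differences in how the imputation was carried out. The sketch also shows the pattern in the table: as OR1 grows, more non-respondents are counted as failures, and both abstinence rates fall toward the missing = smoking values.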
Summary of imputation results for urine-verified abstinence assuming different levels of association between missing and abstinence
| OR1 | OR2 | Counseling | No counseling | Odds ratio for abstinence, counseling vs. no counseling | P-value | Multiple contests | Single contest | Odds ratio for abstinence, multiple vs. single contests | P-value |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 25.8% | 20.9% | 1.31 | .046 | 24.8% | 21.8% | 1.18 | .212 |
| 1 | 2 | 24.8% | 20.1% | 1.32 | .046 | 23.9% | 20.9% | 1.19 | .204 |
| 1 | 3 | 24.1% | 19.4% | 1.32 | .047 | 23.3% | 20.2% | 1.20 | .200 |
| 1 | 4 | 23.6% | 18.9% | 1.32 | .047 | 22.7% | 19.7% | 1.20 | .197 |
| 1 | 5 | 23.1% | 18.5% | 1.32 | .047 | 22.2% | 19.2% | 1.20 | .195 |
| 1 | +∞ | 18.1% | 14.1% | 1.35 | .058 | 17.4% | 14.8% | 1.22 | .209 |
| 2 | 1 | 23.3% | 19.5% | 1.26 | .102 | 22.7% | 20.0% | 1.18 | .249 |
| 2 | 2 | 22.5% | 18.7% | 1.26 | .101 | 21.9% | 19.2% | 1.18 | .241 |
| 2 | 3 | 21.9% | 18.1% | 1.27 | .100 | 21.3% | 18.6% | 1.19 | .236 |
| 2 | 4 | 21.3% | 17.6% | 1.27 | .100 | 20.8% | 18.1% | 1.19 | .233 |
| 2 | 5 | 20.9% | 17.2% | 1.27 | .100 | 20.4% | 17.7% | 1.19 | .231 |
| 2 | +∞ | 16.4% | 13.1% | 1.30 | .109 | 15.9% | 13.6% | 1.21 | .244 |
| 3 | 1 | 22.3% | 18.9% | 1.23 | .143 | 21.9% | 19.3% | 1.17 | .272 |
| 3 | 2 | 21.5% | 18.2% | 1.24 | .140 | 21.1% | 18.5% | 1.17 | .264 |
| 3 | 3 | 20.9% | 17.6% | 1.24 | .139 | 20.5% | 17.9% | 1.18 | .258 |
| 3 | 4 | 20.4% | 17.1% | 1.24 | .137 | 20.0% | 17.4% | 1.18 | .255 |
| 3 | 5 | 20.0% | 16.7% | 1.25 | .137 | 19.6% | 17.0% | 1.19 | .253 |
| 3 | +∞ | 15.7% | 12.8% | 1.27 | .143 | 15.3% | 13.1% | 1.20 | .264 |
| 4 | 1 | 21.8% | 18.6% | 1.22 | .171 | 21.4% | 18.9% | 1.16 | .287 |
| 4 | 2 | 21.0% | 17.9% | 1.22 | .168 | 20.6% | 18.2% | 1.17 | .278 |
| 4 | 3 | 20.4% | 17.3% | 1.23 | .165 | 20.0% | 17.6% | 1.17 | .272 |
| 4 | 4 | 19.9% | 16.8% | 1.23 | .164 | 19.6% | 17.1% | 1.18 | .269 |
| 4 | 5 | 19.5% | 16.4% | 1.23 | .163 | 19.2% | 16.7% | 1.18 | .266 |
| 4 | +∞ | 15.3% | 12.6% | 1.26 | .166 | 15.0% | 12.8% | 1.20 | .277 |
| 5 | 1 | 21.4% | 18.4% | 1.21 | .192 | 21.1% | 18.7% | 1.16 | .297 |
| 5 | 2 | 20.7% | 17.7% | 1.21 | .188 | 20.3% | 17.9% | 1.17 | .288 |
| 5 | 3 | 20.1% | 17.1% | 1.22 | .185 | 19.8% | 17.4% | 1.17 | .282 |
| 5 | 4 | 19.6% | 16.6% | 1.22 | .183 | 19.3% | 16.9% | 1.18 | .279 |
| 5 | 5 | 19.2% | 16.3% | 1.22 | .181 | 18.9% | 16.5% | 1.18 | .276 |
| 5 | +∞ | 15.1% | 12.4% | 1.25 | .183 | 14.8% | 12.7% | 1.20 | .285 |
| +∞ | 1 | 19.8% | 17.6% | 1.16 | .315 | 19.7% | 17.7% | 1.15 | .352 |
| +∞ | 2 | 19.1% | 16.9% | 1.17 | .306 | 19.0% | 16.9% | 1.15 | .342 |
| +∞ | 3 | 18.6% | 16.3% | 1.17 | .300 | 18.5% | 16.4% | 1.16 | .335 |
| +∞ | 4 | 18.1% | 15.9% | 1.17 | .295 | 18.0% | 15.9% | 1.16 | .331 |
| +∞ | 5 | 17.8% | 15.5% | 1.18 | .292 | 17.7% | 15.6% | 1.16 | .328 |
| +∞ | +∞ | 14.0% | 11.9% | 1.20 | .278 | 13.8% | 12.0% | 1.18 | .333 |
OR1: odds ratio between missing and tobacco use status for self-report data; OR2: odds ratio between urine missing and urine-verified failure among those who self-reported abstinence; Tx1: single contest + no counseling; Tx2: single contest + counseling; Tx3: multiple contests + no counseling; Tx4: multiple contests + counseling. P-values are based on the Chi-square test
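The two-stage logic behind the urine-verified rates can be sketched in the same spirit. This is a hedged reading of the notation, not the authors' implementation. It assumes that OR1 and OR2 each scale the observed odds of failure to give the odds of failure among the missing, that imputed self-reported abstainers have the same urine availability and verification pattern as observed abstainers, and that expected counts are used in place of drawn imputations. All function and argument names are hypothetical.

```python
import math

def p_abstinent_if_missing(n_abst, n_fail, assumed_or):
    """P(abstinent | missing), assuming odds of failure among missing = OR * observed odds."""
    return 1.0 / (1.0 + assumed_or * n_fail / n_abst)  # 0.0 when assumed_or is infinite

def verified_abstinence_rate(n11, n12, n2, f11, f12, v, or1, or2):
    """Expected urine-verified abstinence rate under assumed OR1 (survey) and OR2 (urine).

    n11, n12, n2: self-report abstinence, failure, and survey-missing counts.
    f11, f12, v:  urine-verified abstinence, failure, and urine-missing counts
                  among those who self-reported abstinence.
    """
    # Stage 1: expected self-reported abstainers among survey non-respondents.
    n21 = n2 * p_abstinent_if_missing(n11, n12, or1)
    # Stage 2: per self-reported abstainer, the chance of being a verified abstainer.
    observed_share = f11 / n11  # urine provided and verified abstinent
    missing_share = (v / n11) * p_abstinent_if_missing(f11, f12, or2)
    return (n11 + n21) * (observed_share + missing_share) / (n11 + n12 + n2)

# Counseling group (Tx2 + Tx4), counts from the 6-month summary table.
counseling = dict(n11=138, n12=326, n2=138, f11=84, f12=13, v=41)

for or1, or2 in [(1, 1), (1, 2), (math.inf, 1), (math.inf, math.inf)]:
    rate = verified_abstinence_rate(**counseling, or1=or1, or2=or2)
    print(f"OR1={or1}, OR2={or2}: urine-verified abstinence {rate:.1%}")
```

With both odds ratios infinite, every missing survey and every missing urine sample counts as a failure, so the rate reduces to the observed verified abstainers over the full sample (84/602 for the counseling group). The other combinations land near the tabulated values, with small last-digit differences that may come from rounding.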