Julia Becaria Coquet, Natalia Tumas, Alberto Ruben Osella, Matteo Tanzi, Isabella Franco, Maria Del Pilar Diaz.
Abstract
A number of studies have shown that modifiable lifestyle factors such as diet, breastfeeding and nutritional status affect breast cancer risk. However, none has addressed the missing data problem in nutritional epidemiologic research in South America. Missing data are a frequent problem in breast cancer studies and in epidemiological settings in general, and effect estimates obtained from such studies may be biased if no appropriate method for handling missing data is applied. We performed multiple imputation for missing covariate values in a breast cancer case-control study in Córdoba (Argentina) to optimize risk estimates. Data were obtained from a breast cancer case-control study conducted from 2008 to 2015 (318 cases, 526 controls). Complete case analysis and multiple imputation using chained equations were applied to estimate the effects of a Traditional dietary pattern and other recognized factors associated with breast cancer. Physical activity and socioeconomic status were imputed. Logistic regression models were fitted. Complete case analysis retained only 31% of the women. Although a positive association between the Traditional dietary pattern and breast cancer was observed with both approaches (complete case analysis OR=1.3, 95%CI=1.0-1.7; multiple imputation OR=1.4, 95%CI=1.2-1.7), effects of other covariates, such as BMI and breastfeeding, were identified only with multiple imputation. A Traditional dietary pattern, BMI and breastfeeding are associated with the occurrence of breast cancer in this Argentinean population when multiple imputation is appropriately performed. Multiple imputation is suggested for future epidemiologic studies in Latin America to optimize effect estimates.
Creative Commons Attribution License
Keywords: Body mass index; breastfeeding; cancer epidemiology; dietary pattern; multiple imputation
Year: 2016 PMID: 27892664 PMCID: PMC5454599 DOI: 10.22034/apjcp.2016.17.10.4567
Source DB: PubMed Journal: Asian Pac J Cancer Prev ISSN: 1513-7368
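The abstract combines logistic regression with multiple imputation, which means one model is fitted per imputed dataset and the results are then pooled. A minimal sketch of that pooling step using Rubin's rules follows. The function names and all numbers are hypothetical illustrations, not values or code from the study, and the interval uses a normal approximation rather than the t reference distribution that a full implementation would typically use.

```python
import math
from statistics import mean, variance


def pool_rubin(estimates, variances):
    """Pool per-imputation estimates (e.g. log odds ratios) with Rubin's rules."""
    m = len(estimates)
    q_bar = mean(estimates)        # pooled point estimate
    within = mean(variances)       # average within-imputation variance
    between = variance(estimates)  # between-imputation variance (m - 1 denominator)
    total = within + (1 + 1 / m) * between
    return q_bar, total


def odds_ratio_ci(log_or, total_var, z=1.96):
    """Back-transform a pooled log odds ratio to an OR with an approximate 95% CI."""
    se = math.sqrt(total_var)
    return math.exp(log_or), math.exp(log_or - z * se), math.exp(log_or + z * se)


# Hypothetical log odds ratios and squared standard errors from m = 5 imputed datasets.
log_ors = [0.30, 0.35, 0.33, 0.38, 0.34]
squared_ses = [0.010, 0.011, 0.009, 0.010, 0.010]

pooled_log_or, pooled_var = pool_rubin(log_ors, squared_ses)
or_hat, ci_low, ci_high = odds_ratio_ci(pooled_log_or, pooled_var)
print(f"pooled OR = {or_hat:.2f} (95% CI {ci_low:.2f}-{ci_high:.2f})")
```

The total variance adds the between-imputation spread to the average model variance, so the pooled interval reflects the uncertainty introduced by the missing values as well as ordinary sampling error.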
Subjects and Missing Data: Absolute and Relative Distributions of Outcome, Exposure and Other Covariates, Breast Cancer Case-Control Study Córdoba, Argentina 2008-2015
| Variable | n | % | Physical Activity (% missing) | Health Care (% missing) | N° of providers (% missing) | Computer (% missing) | Internet (% missing) | Debit Card (% missing) | Cars (% missing) |
|---|---|---|---|---|---|---|---|---|---|
| Total | 844 | 100 | | | | | | | |
| Breast Cancer | | | | | | | | | |
| No | 526 | 62.3 | 29.7 | 26.0 | 11.2 | 11.2 | 11.2 | 11.2 | 11.2 |
| Yes | 318 | 37.7 | 28.3 | 21.4 | 8.8 | 8.8 | 8.8 | 8.8 | 9.4 |
| Traditional dietary Pattern | |||||||||
| Tertile 1 | 245 | 29 | 31 | 24.1 | 6.9 | 6.9 | 6.9 | 6.9 | 7.3 |
| Tertile 2 | 277 | 32.8 | 28.5 | 24.2 | 10.5 | 10.1 | 10.1 | 10.1 | 10.1 |
| Tertile 3 | 322 | 38.2 | 28.4 | 24.5 | 12.7 | 13.0 | 13.0 | 13.0 | 13.3 |
| Age | |||||||||
| <45 years | 137 | 16.2 | 26.3 | 21.9 | 9.5 | 9.5 | 9.5 | 9.5 | 10.2 |
| 45-60 years | 307 | 36.4 | 24.4 | 26.4 | 11.4 | 11.4 | 11.4 | 11.4 | 11.4 |
| >60 years | 400 | 47.4 | 33.7 | 23.5 | 9.7 | 9.7 | 9.7 | 9.75 | 10 |
| BMI | |||||||||
| <25 kg/m² | 400 | 47.4 | 29 | 27.2 | 8.5 | 8.7 | 8.7 | 8.75 | 8.75 |
| 25-30 kg/m² | 268 | 31.7 | 27.9 | 22 | 9.3 | 9.3 | 9.3 | 9.33 | 9.33 |
| >30 kg/m² | 165 | 19.6 | 31.5 | 22.4 | 15.8 | 15.1 | 15.1 | 15.1 | 16.4 |
| Unknown | 11 | 1.3 | 27.3 | 0.0 | 18.2 | 18.2 | 18.2 | 18.2 | 18.2 |
| Physical Activity | |||||||||
| Sedentary | 182 | 21.6 | _ | 52.7 | 4.9 | 4.4 | 4.4 | 4.4 | 4.9 |
| Moderate | 228 | 27 | _ | 28.1 | 8.8 | 8.8 | 8.8 | 8.8 | 8.8 |
| Vigorous | 188 | 22.3 | _ | 10.6 | 23.9 | 24.5 | 24.5 | 24.5 | 25 |
| Unknown | 246 | 29.1 | _ | 10.2 | 5.3 | 5.3 | 5.3 | 5.3 | 5.3 |
| Education | |||||||||
| No Studies | 5 | 0.6 | 80.0 | 0.0 | 20.0 | 20.0 | 20 | 20 | 20 |
| Incomplete primary | 63 | 7.5 | 33.3 | 0.0 | 17.6 | 15.9 | 15.9 | 15.9 | 17.9 |
| Complete primary | 281 | 33.3 | 22.1 | 46.6 | 9.2 | 9.2 | 9.2 | 9.2 | 9.2 |
| Incomplete high school | 79 | 9.4 | 34.2 | 0.0 | 8.9 | 10.1 | 10.1 | 10.1 | 10.1 |
| Complete high school | 145 | 17.2 | 21.4 | 19.3 | 10.3 | 10.3 | 10.3 | 10.3 | 10.3 |
| Higher education | 249 | 29.5 | 36.9 | 13.6 | 6.0 | 6.0 | 6.0 | 6.0 | 6.4 |
| Unknown | 22 | 2.6 | 40.9 | 54.5 | 54.5 | 54.5 | 54.5 | 54.5 | 54.5 |
Statistically Significant Differences (p <0.05).
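Each cell in the "% missing" columns appears to give the share of women within a row category whose value for the column variable is missing. A minimal sketch of that calculation follows. The records and field names are hypothetical toy data, not the study dataset.

```python
from collections import defaultdict


def percent_missing_by_group(records, group_key, target_key):
    """Return {group level: % of records whose target_key is missing (None)}."""
    totals = defaultdict(int)
    missing = defaultdict(int)
    for row in records:
        level = row[group_key]
        totals[level] += 1
        if row[target_key] is None:
            missing[level] += 1
    return {level: round(100 * missing[level] / totals[level], 1) for level in totals}


# Hypothetical toy records; None marks a missing value.
records = [
    {"case": "No", "physical_activity": "Moderate"},
    {"case": "No", "physical_activity": None},
    {"case": "No", "physical_activity": "Sedentary"},
    {"case": "No", "physical_activity": "Vigorous"},
    {"case": "Yes", "physical_activity": None},
    {"case": "Yes", "physical_activity": "Moderate"},
]

result = percent_missing_by_group(records, "case", "physical_activity")
print(result)
```

Comparing these percentages across categories is one way to see whether missingness depends on observed variables, which is the kind of pattern the significance footnote refers to.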
Subjects and Missing Data: Absolute and Relative Distributions of Gynecologic and Socioeconomic Variables, Breast Cancer Case-Control Study Córdoba, Argentina 2008-2015
| Variable | n | % | Physical Activity (% missing) | Health Care (% missing) | N° of providers (% missing) | Computer (% missing) | Internet (% missing) | Debit Card (% missing) | Cars (% missing) |
|---|---|---|---|---|---|---|---|---|---|
| Total | 844 | 100 | | | | | | | |
| Having Children | | | | | | | | | |
| No | 120 | 14.2 | 32.5 | 24.2 | 10.8 | 10.8 | 10.83 | 10.8 | 11.7 |
| Yes | 666 | 78.9 | 29.9 | 26.3 | 10.4 | 10.4 | 10.36 | 10.4 | 10.5 |
| Unknown | 58 | 6.9 | 13.8 | 1.7 | 8.6 | 8.6 | 8.62 | 8.6 | 8.6 |
| Breastfeeding | |||||||||
| No | 266 | 31.5 | 31.2 | 21.8 | 7.5 | 7.9 | 7.89 | 7.9 | 8.3 |
| Yes | 521 | 61.7 | 29.4 | 28.2 | 11.7 | 11.5 | 11.52 | 11.5 | 11.7 |
| Unknown | 57 | 6.8 | 17.5 | 0.0 | 10.5 | 10.5 | 10.53 | 10.5 | 10.5 |
| Menopause | |||||||||
| No | 201 | 23.8 | 25.4 | 24.9 | 9.5 | 9.5 | 9.45 | 9.4 | 9.5 |
| Yes | 589 | 69.8 | 31.9 | 26.3 | 11.4 | 11.4 | 11.38 | 11.4 | 11.5 |
| Unknown | 54 | 6.4 | 12.9 | 0.0 | 1.8 | 1.8 | 1.85 | 1.8 | 3.7 |
| Menarche | |||||||||
| <12 years | 147 | 17.4 | 29.9 | 22.4 | 11.6 | 11.6 | 11.56 | 11.6 | 11.6 |
| >=12 years | 676 | 80.1 | 29.0 | 25.0 | 9.6 | 9.6 | 9.62 | 9.6 | 9.7 |
| Unknown | 21 | 2.5 | 28.6 | 14.3 | 23.8 | 23.8 | 23.81 | 23.81 | 28.6 |
| Health Care | |||||||||
| No | 68 | 8.1 | 33.8 | _ | 14.7 | 14.7 | 14.7 | 14.7 | 14.7 |
| Yes | 571 | 67.6 | 34.7 | _ | 11.4 | 11.4 | 11.4 | 11.4 | 11.4 |
| Unknown | 205 | 24.3 | 12.2 | _ | 5.9 | 5.8 | 5.8 | 5.8 | 5.8 |
| N° of providers | |||||||||
| One | 364 | 43.1 | 34.1 | 23.1 | 0.0 | 0.0 | 0.0 | 0.0 | 0.5 |
| Two or three | 379 | 44.9 | 27.9 | 28.0 | 0.0 | 0.3 | 0.3 | 0.3 | 0.3 |
| More than three | 14 | 1.7 | 21.4 | 21.4 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| Unknown | 87 | 10.3 | 14.9 | 13.8 | 100.0 | 98.8 | 98.8 | 98.8 | 98.8 |
| Computer | |||||||||
| No | 293 | 34.7 | 28.0 | 32.8 | 0.0 | _ | 0.0 | 0.0 | 0.3 |
| Yes | 464 | 55.0 | 32.5 | 20.9 | 0.2 | _ | 0.0 | 0.0 | 0.2 |
| Unknown | 87 | 10.3 | 14.9 | 13.8 | 98.8 | _ | 100.0 | 100.0 | 100.0 |
| Internet | |||||||||
| No | 346 | 41 | 27.5 | 32.7 | 0.3 | 0.0 | _ | 0.0 | 0.6 |
| Yes | 411 | 48.7 | 33.6 | 19.5 | 0.0 | 0.0 | _ | 0.0 | 0.0 |
| Unknown | 87 | 10.3 | 14.9 | 13.8 | 98.8 | 100.0 | _ | 100.0 | 100.0 |
| Debit Card | |||||||||
| No | 354 | 41.9 | 28.8 | 26.0 | 0.3 | 0.0 | 0.0 | _ | 0.6 |
| Yes | 403 | 47.8 | 32.5 | 25.1 | 0.0 | 0.0 | 0.0 | _ | 0.0 |
| Unknown | 87 | 10.3 | 14.9 | 13.8 | 98.8 | 100.0 | 100.0 | _ | 100 |
| Cars | |||||||||
| None | 361 | 42.8 | 35.2 | 19.7 | 0.3 | 0.0 | 0.0 | 0.0 | _ |
| One | 332 | 39.3 | 26.5 | 30.4 | 0.0 | 0.0 | 0.0 | 0.0 | _ |
| Two | 58 | 6.9 | 29.3 | 36.2 | 0.0 | 0.0 | 0.0 | 0.0 | _ |
| Three or more | 4 | 0.5 | 25.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | _ |
| Unknown | 89 | 10.5 | 14.6 | 13.5 | 96.6 | 97.7 | 97.7 | 97.7 | _ |
Statistically significant differences (p<0.05).
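Because complete case analysis drops every woman with at least one missing covariate, moderate per-variable missingness can compound into a large loss of sample. A minimal sketch of that filtering step follows. The records and field names are hypothetical toy data, not the study dataset.

```python
def complete_cases(records, required_keys):
    """Keep only records with no missing (None) value in any required key."""
    return [
        row for row in records
        if all(row.get(key) is not None for key in required_keys)
    ]


# Hypothetical toy records; None marks a missing value.
records = [
    {"bmi": 24.1, "physical_activity": "Moderate", "ses": "Middle"},
    {"bmi": 31.0, "physical_activity": None, "ses": "Upper"},
    {"bmi": None, "physical_activity": "Sedentary", "ses": "Middle"},
    {"bmi": 27.5, "physical_activity": "Vigorous", "ses": None},
    {"bmi": 22.8, "physical_activity": "Sedentary", "ses": "Upper-low"},
]

kept = complete_cases(records, ["bmi", "physical_activity", "ses"])
share_kept = 100 * len(kept) / len(records)
print(f"{len(kept)} of {len(records)} records retained ({share_kept:.0f}%)")
```

In this toy example each variable is missing in only one record, yet three of the five records are discarded, which illustrates how the retained fraction can fall well below any single variable's completeness.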
Association Measurements (Odds Ratio), Confidence Intervals and P-Values, Breast Cancer Case-Control Study Córdoba, Argentina 2008-2015
| Variable | Complete Case Analysis (n=265): Odds Ratio | Complete Case Analysis: 95% CI | Complete Case Analysis: p value | Multiple Imputation Analysis (n=703): Odds Ratio | Multiple Imputation Analysis: 95% CI | Multiple Imputation Analysis: p value |
|---|---|---|---|---|---|---|
| Traditional dietary Pattern | 1.3 | 1.0-1.8 | 0.039 | 1.4 | 1.2-1.6 | 0.00 |
| BMI | 1.1 | 0.9-1.1 | 0.262 | 1.1 | 1.0-1.1 | 0.03 |
| Breastfeeding | 0.6 | 0.3-1.1 | 0.087 | 0.5 | 0.4-0.8 | 0.003 |
| SES | ||||||
| Low-low | 0.4 | 0.1-1.8 | 0.260 | 1.1 | 0.5-2.9 | 0.766 |
| Upper-low | 0.9 | 0.3-2.9 | 0.985 | 1.3 | 0.6-2.8 | 0.454 |
| Middle | 1.1 | 0.4-3.3 | 0.872 | 1.9 | 0.9-4.1 | 0.112 |
| Upper middle | 1.5 | 0.5-4.1 | 0.443 | 1.6 | 0.8-3.2 | 0.213 |
| Upper | 1.3 | 0.4-3.7 | 0.673 | 1.4 | 0.7-2.9 | 0.369 |
| Menopause | 1.2 | 0.6-2.6 | 0.633 | 1.5 | 0.9-2.5 | 0.102 |
| Physical Activity | 0.9 | 0.9-1.0 | 0.694 | 1.0 | 0.9-1.0 | 0.510 |
| Age | 0.9 | 0.9-1.0 | 0.961 | 0.9 | 0.9-1.0 | 0.571 |
| Menarche | 0.9 | 0.8-1.1 | 0.458 | 1.0 | 0.9-1.1 | 0.653 |
| Having Children | 1.5 | 0.6-3.7 | 0.328 | 1.6 | 0.9-2.7 | 0.102 |
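The odds ratios, confidence intervals and p-values in a table like this are typically derived from logistic regression coefficients and their standard errors. A minimal sketch of that conversion follows, assuming a Wald-type normal approximation. The coefficient and standard error are hypothetical and are not taken from the study.

```python
import math


def summarize_coefficient(beta, se, z_crit=1.96):
    """Convert a logistic regression coefficient to an OR, 95% CI and Wald p-value."""
    odds_ratio = math.exp(beta)
    ci = (math.exp(beta - z_crit * se), math.exp(beta + z_crit * se))
    z = beta / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value under a standard normal
    return odds_ratio, ci, p_value


# Hypothetical coefficient and standard error for a protective factor.
odds_ratio, ci, p_value = summarize_coefficient(beta=-0.6, se=0.2)
print(f"OR = {odds_ratio:.2f}, 95% CI = {ci[0]:.2f}-{ci[1]:.2f}, p = {p_value:.4f}")
```

An interval that excludes 1 corresponds to p < 0.05 under this approximation, which is how the narrower intervals in the multiple imputation columns translate into the additional significant covariates.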
Figure 1. Diagnostic Plots for Physical Activity After Imputation for Imputed Datasets 2 and 17, Breast Cancer Case-Control Study Córdoba, Argentina 2008-2015.