| Literature DB >> 35270242 |
Stefanie A Busgang1, Lance A Waller2, Elena Colicino1, Ralph D'Agostino3, Irva Hertz-Picciotto4, Chris Gennings1.
Abstract
Rare-disease registries can be useful for studying the associations between environmental exposures and disease severity, but often require the addition of a healthy comparison control group. Defining a surrogate control group, matched and balanced on potentially confounding variables, would allow for the comparison of exposure distributions with cases from a registry. In the present study, we assess whether controls selected externally, using stratification score (SS) matching, can serve as effective proxies for internal controls. In addition, we use methyl paraben (MEPB) to compare the estimated associations between an externally matched sample and case-control frequencies in a cohort with internally matched controls. We started with 225 eligible cases of autism spectrum disorder (ASD) from Childhood Autism Risks from Genetics and the Environment (CHARGE), 241 internal controls from CHARGE, and 265 external controls from the National Health and Nutrition Examination Survey (NHANES) cycles 2005-2016. We calculated the SSs using demographic covariates and matched 1:1 using a caliper method without a replacement. The distribution of the covariates and the mean squared error of the paired differences (MSEpaired) in the SSs between the internal and external group were similar (MSEpaired = 0.007 and 0.011, respectively). The association between MEPB and ASD compared to the controls was similar between the externally matched SS pairs and the original frequency matched cohort. Controls selected externally, via SS matching, can provide a comparable bias reduction to that provided by the internal controls, and therefore may be a useful strategy in situations when the internal controls are not available.Entities:
Keywords: external controls; registry-based; stratification score matching
Mesh:
Year: 2022 PMID: 35270242 PMCID: PMC8909853 DOI: 10.3390/ijerph19052549
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Distributions of SS between cases (purple) and controls (pink) (a) before matching using the internal controls, (b) before matching using external controls, (c) after matching using the internal sample, and (d) after matching using the external controls.
Demographic characteristics (mean (SD) or frequency (%)) of the cases and controls in the internal sample, before and after matching with a 0.30 caliper using internal controls.
| Before Matching | After Matching | |||||
|---|---|---|---|---|---|---|
| Cases (N = 225) | Controls (N = 241) | Cases (N = 216) | Controls (N = 216) | |||
| Age of child (years) | 4.07 (0.72) | 3.83 (0.75) | <0.001 | 4.05 (0.73) | 3.91 (0.71) | 0.043 |
| Year of birth | 2007.26 (3.22) | 2006.47 (2.91) | 0.006 | 2007.19 (3.24) | 2006.67 (2.91) | 0.081 |
| Maternal age (years) | 30.74 (5.76) | 30.37 (5.39) | 0.476 | 30.60 (5.69) | 30.63 (5.40) | 0.952 |
| Sex | ||||||
| Female (ref) | 43 (19%) | 46 (19%) | 0.995 | 39 (18%) | 43 (20%) | 0.624 |
| Male | 182 (81%) | 195 (81%) | 177 (82%) | 173 (81%) | ||
| Maternal education | ||||||
| Bachelor/graduate/professional (ref) | 109 (48%) | 129 (53%) | 0.310 | 108 (50%) | 113 (52%) | 0.312 |
| Some college | 73 (32%) | 89 (37%) | 70 (32%) | 80 (37%) | ||
| High school/GED or less | 43 (19%) | 23 (10%) | 38 (18%) | 23 (11%) | ||
| Child’s race | ||||||
| White (non-Hispanic) (ref) | 107 (48%) | 132 (55%) | 0.161 | 105 (49%) | 118 (55%) | 0.355 |
| Non-White (non-Hispanic) | 55 (24%) | 46 (19%) | 52 (24%) | 44 (20%) | ||
| Hispanic | 63 (28%) | 63 (26%) | 59 (27%) | 54 (25%) | ||
| Birth place of mom | ||||||
| U.S.A. (ref) | 166 (74%) | 205 (85%) | 0.081 | 163 (75%) | 181 (84%) | 0.208 |
| Mexico | 18 (8%) | 10 (4%) | 15 (7%) | 9 (4%) | ||
| Outside the U.S.A. or Mexico | 41 (18%) | 26 (11%) | 38 (18%) | 26 (12%) | ||
| Homeowner status | ||||||
| No (ref) | 73 (32%) | 58 (24%) | 0.044 | 68 (31%) | 54 (25%) | 0.135 |
| Yes | 152 (68%) | 183 (76%) | 148 (69%) | 162 (75%) | ||
| Stratification score | 0.53 (0.15) | 0.54 (0.09) | <0.001 | 0.52 (0.14) | 0.46 (0.12) | <0.001 |
ap-values provided for either a two-sample t-test for continuous variables or a chi-square for categorical variables when comparing the demographics between cases and controls. Abbreviations: U.S.A. = United States of America; GED = general education diploma; ref = reference.
Demographic characteristics (mean (SD) or frequency (%)) of the cases and controls in the external sample, before and after matching with a 0.30 caliper using external controls.
| Before Matching | After Matching | |||||
|---|---|---|---|---|---|---|
| Cases (N = 225) | Controls (N = 265) | Cases (N = 71) | Controls (N = 71) | |||
| Age of child (years) | 4.07 (0.72) | 5.52 (0.90) | <0.001 | 4.44 (0.52) | 4.69 (1.21) | 0.112 |
| Year of birth | 2007.26 (3.22) | 2006.87 (4.12) | <0.001 | 2008.15 (3.14) | 2008.21 (5.09) | 0.937 |
| Maternal age (years) | 30.74 (5.76) | 28.04 (7.56) | <0.001 | 29.92 (6.37) | 29.69 (6.73) | 0.838 |
| Sex | ||||||
| Female (ref) | 43 (19%) | 145 (55%) | <0.001 | 21 (30%) | 22 (31%) | 0.855 |
| Male | 182 (81%) | 120 (45%) | 50 (70%) | 49 (69%) | ||
| Maternal education | ||||||
| Bachelor/graduate/professional (ref) | 109 (48%) | 43 (16%) | 0.256 | 25 (35%) | 24 (34%) | 0.730 |
| Some college | 73 (32%) | 99 (37%) | 28 (39%) | 26 (37%) | ||
| High school/GED or less | 43 (19%) | 123 (47%) | 18 (25%) | 21 (29%) | ||
| Child’s race/ethnicity | ||||||
| White (non-Hispanic) (ref) | 107 (48%) | 63 (24%) | <0.001 | 23 (32%) | 26 (37%) | 1.000 |
| Non-White (non-Hispanic) | 55 (24%) | 112 (42%) | 23 (32%) | 23 (32%) | ||
| Hispanic | 63 (28%) | 90 (34%) | 25 (25%) | 22 (31%) | ||
| Birth place of mom | ||||||
| U.S.A. (ref) | 166 (74%) | 217 (82%) | <0.001 | 53 (75%) | 56 (79%) | 1.000 |
| Mexico | 18 (8%) | 10 (4%) | 5 (7%) | 5 (7%) | ||
| Outside the U.S.A. or Mexico | 41 (18%) | 38 (14%) | 13 (18%) | 10 (14%) | ||
| Homeowner status | ||||||
| No (ref) | 73 (32%) | 167 (63%) | 35 (49%) | 31 (44%) | ||
| Yes | 152 (68%) | 98 (37%) | <0.001 | 36 (51%) | 40 (56%) | 0.501 |
| Stratification score | 0.81 (0.24) | 0.16 (0.24) | <0.001 | 0.57 (0.27) | 0.49 (0.27) | 0.060 |
ap-values provided for either a two-sample t-test for continuous variables or a chi-square for categorical variables when comparing demographics between cases and controls. Abbreviations: U.S.A. = United States of America; GED = general education diploma; ref = reference.
Figure 2Pre- and post-matching standardized differences of individual covariates between case and control groups for (a) the internal sample and (b) the external sample. The dashed lines represent the suggested threshold standardized difference proposed by Normand et al. (2001) of less than 0.10 [22,24]. Abbreviations: SS = Stratification score; U.S.A. = United States of America; GED = general education diploma.
Figure 3The distribution of error in matched pairs for (a) the sample using internal controls (N = 216 pairs) and (b) the sample using external controls (N = 71 pairs) using the caliper method with a 0.30 caliper. Abbreviations: SS = stratification score; MSE = mean squared error.
Association of MEPB and ASD compared to the controls among three conditions.
| OR (95% CI) | ||
|---|---|---|
| Frequency matched (N = 466 total participants) | 1.20 (1.09, 1.33) | <0.001 |
| Internally matched (N = 216 pairs) | 1.15 (1.04, 1.27) | 0.005 |
| Externally matched (N = 71 pairs) | 1.16 (0.98, 1.37) | 0.080 |