| Literature DB >> 32709231 |
Janet Y Bang1,2,3, Megha Sharda4, Aparna S Nadig5,6.
Abstract
BACKGROUND: Matching is one commonly utilized method in quasi-experimental designs involving individuals with neurodevelopmental disorders (NDD). This method ensures two or more groups (e.g., individuals with an NDD versus neurotypical individuals) are balanced on pre-existing covariates (e.g., IQ), enabling researchers to interpret performance on outcome measures as being attributed to group membership. While much attention has been paid to the statistical criteria of how to assess whether groups are well-matched, relatively little attention has been given to a crucial prior step: the selection of the individuals that are included in matched groups. The selection of individuals is often an undocumented process, which can invite unintentional, arbitrary, and biased decision-making. Limited documentation can result in findings that have limited reproducibility and replicability and thereby have poor potential for generalization to the broader population. Especially given the heterogeneity of individuals with NDDs, interpretation of research findings depends on minimizing bias at all stages of data collection and analysis.Entities:
Keywords: Covariate; Group comparison; Matching; Propensity score; Reproducibility; Transparency
Mesh:
Year: 2020 PMID: 32709231 PMCID: PMC7382075 DOI: 10.1186/s11689-020-09321-6
Source DB: PubMed Journal: J Neurodev Disord ISSN: 1866-1947 Impact factor: 4.025
Visual analysis and statistics to assess group matching
| Graph or statistic | |
|---|---|
| Visual analysis | • Boxplots • Histograms • Density plots • Dot plots |
| Descriptive statistics | • Means • Standard deviation • Range • Cohen’s • Variance ratio (includes both groups) [ |
| Inferential statistics | • • Chi-square test [ |
Fig. 1Workflow to achieve matched groups. The four key steps of our proposed workflow to achieve matched groups
Fig. 2Correlation matrix of potential covariates. ELF-4-RS = CELF-4 Recalling Sentences (scaled scores); CELF-4-WC = CELF-4 Word Classes (scaled scores); and CELF-4-WA = CELF-4 Word Associations (raw scores, because no normative data is provided)
Full sample comparison between TD and ASD groups on age and nonverbal IQ
| ASD ( | TD ( | ||
|---|---|---|---|
| Age (years) | 8.93 (1.34) | 8.75 (1.14) | .570 |
| Nonverbal IQ (Leiter) | 107.16 (13.61) | 114.53 (14.18) | .039 |
Fig. 3Distribution of propensity scores when including age and nonverbal IQ with the full sample (ASD n = 25, TD n = 43). Matched treatment units = children with ASD; matched control units = selected matches of TD children; unmatched control units = remaining unmatched TD children. Propensity scores calculated using the nearest neighbor and optimal matching methods resulted in the same values. This plot indicates the distribution of propensity scores when including covariates of age and IQ for all 25 children with ASD and 43 TD children. We see a similar distribution of propensity scores for matched treatment units and matched control units, ranging from scores of 0.2 to above 0.6. Among the matched treatment units, there appears to be one outlier where a child with ASD was assigned a propensity score of approximately 0.8
Descriptive statistics for final matched groups (ASD n = 24; TD n = 24)
| ASD ( | TD ( | vr | |||
|---|---|---|---|---|---|
| English- and French-dominant speaking children (En to Fr) | 11:13 | 10:14 | 1 | ||
| Block order (order 1 to order 2) | 12:12 | 13:11 | 1 | ||
| Agea | 8.83 (1.26) | 8.70 (1.12) | .713 | .11 | 1.27 |
| Nonverbal IQa | 108.29 (12.65) | 109.50 (13.24) | .748 | − .09 | .91 |
| CELF-4 Word Associationsa | 29.92 (15.01) | 33.29 (11.17) | .382 | − .26 | 1.80 |
| Sex (M to F) | 21:3 | 18:6 | .461 | ||
| Parental education (below to above university)c | 12:12 | 6:18 | .136 | ||
| CELF-4, Word Classes Totala, b | 9.74 (3.74) | 12.08 (3.06) | .024* | − .69 | 1.49 |
| CELF-4, Recalling Sentencesa | 8.08 (4.16) | 11.17 (2.18) | .003** | − .93 | 3.64 |
| Vineland Socialization subscalea | 76.83 (11.64) | 110.00 (11.88) | < .001*** | − 2.82 | 0.96 |
| Social Communication Questionnairea | 20.88 (5.83) | 4.42 (2.62) | < .001*** | 3.64 | 4.95 |
Variables are sorted in descending order based on p values
Continuous and categorical variables were analyzed using paired sample t tests and Fisher’s exact tests, respectively
Negative values for Cohen’s d indicate higher values in the TD group
d Cohen’s d, vr variance ratio
*p < .05, **p < .01, ***p < .001
aThe values shown are the mean (SD)
bOne child with ASD did not complete this measure
cFor all children, this is based on the mother except for one TD child where the mother’s education was not provided; thus, the father’s education was used instead
Fig. 4Violin plots for continuous demographic variables in final matched groups. Points represent observations per participant. For age and nonverbal IQ, matching was achieved according to criteria of p > .5, Cohen’s d close to 0, and variance ratios close to 1. CELF-4 Word Associations did not meet the criteria of p > .5, but distributions on this variable appear similar between groups. Groups are significantly different on other language measures of CELF-4 Recalling Sentences and Word Classes, as well as the Social Communication Questionnaire (SCQ) and the Vineland (VABS-II) Socialization Domain
Fig. 5Distribution of propensity scores when including age, nonverbal IQ, and CELF-4 Recalling Sentences with the full sample (ASD = 25, TD = 43). Matched treatment units = children with ASD; matched control units = selected matches of TD children; unmatched control units = remaining unmatched TD children. This plot indicates the distribution of propensity scores when including covariates of age, IQ, and CELF-4 Recalling Sentences for all 25 children with ASD and 43 TD children. Propensity scores were calculated using the nearest neighbor method. There are 8 children with ASD who appear to be outliers relative to the propensity scores for TD children
Comparison between ASD and TD groups on age, nonverbal IQ, and CELF-4 Recalling Sentences (ASD = 25, TD = 25)
| Nearest neighbor | Optimal | ||||
|---|---|---|---|---|---|
| Cohen’s | vr | Cohen’s | vr | ||
| Propensity scores | .55 | 3.57 | .55 | 3.72 | |
| ASD ( | TD ( | TD ( | |||
| Age (years) | 8.93 (1.34) | 8.73 (1.24) | .587 | 8.69 (1.31) | .520 |
| Nonverbal IQ (Leiter) | 107.16 (13.61) | 113.00 (14.89) | .154 | 113.80 (15.10) | .109 |
| CELF-4 Recalling Sentences | 7.84 (4.25) | 10.00 (2.10) | .029 | 9.88 (1.94) | .036 |