| Literature DB >> 35417495 |
Maria A Gartstein1, D Erich Seamon2, Jennifer A Mattera1, Michelle Bosquet Enlow3, Rosalind J Wright4,5, Koraly Perez-Edgar6, Kristin A Buss6, Vanessa LoBue7, Martha Ann Bell8, Sherryl H Goodman9, Susan Spieker10, David J Bridgett11, Amy L Salisbury12, Megan R Gunnar13, Shanna B Mliner13, Maria Muzik14, Cynthia A Stifter6, Elizabeth M Planalp15, Samuel A Mehr16, Elizabeth S Spelke16, Angela F Lukowski17, Ashley M Groh18, Diane M Lickenbrock19, Rebecca Santelli20, Tina Du Rocher Schudlich21, Stephanie Anzman-Frasca22, Catherine Thrasher23, Anjolii Diaz24, Carolyn Dayton25, Kameron J Moding26, Evan M Jordan27.
Abstract
Age and gender differences are prominent in the temperament literature, with the former particularly salient in infancy and the latter noted as early as the first year of life. This study represents a meta-analysis utilizing Infant Behavior Questionnaire-Revised (IBQ-R) data collected across multiple laboratories (N = 4438) to overcome limitations of smaller samples in elucidating links among temperament, age, and gender in early childhood. Algorithmic modeling techniques were leveraged to discern the extent to which the 14 IBQ-R subscale scores accurately classified participating children as boys (n = 2,298) and girls (n = 2,093), and into three age groups: youngest (< 24 weeks; n = 1,102), mid-range (24 to 48 weeks; n = 2,557), and oldest (> 48 weeks; n = 779). Additionally, simultaneous classification into age and gender categories was performed, providing an opportunity to consider the extent to which gender differences in temperament are informed by infant age. Results indicated that overall age group classification was more accurate than child gender models, suggesting that age-related changes are more salient than gender differences in early childhood with respect to temperament attributes. However, gender-based classification was superior in the oldest age group, suggesting temperament differences between boys and girls are accentuated with development. Fear emerged as the subscale contributing to accurate classifications most notably overall. This study leads infancy research and meta-analytic investigations more broadly in a new direction as a methodological demonstration, and also provides most optimal comparative data for the IBQ-R based on the largest and most representative dataset to date.Entities:
Mesh:
Year: 2022 PMID: 35417495 PMCID: PMC9007342 DOI: 10.1371/journal.pone.0266026
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample descriptions.
| Researcher(s) | Sample Size ( | Infant Age (Weeks) | Gender | Race | Sample Description |
|---|---|---|---|---|---|
| Bosquet & Wright | 668 | 20.23–63.25 | 53.3 | 71.1 | Community sample of infants |
| Gartstein | Study 1: 387 | 15.00–52.00 | 50.1 | NA | Community sample of infants |
| Perez-Edgar, Buss, & LoBue | Study 1: 138 | 16.00–47.20 | 55.0 | 26.8 | Community sample of infants |
| Bell & Calkins | 353 | 20.57–57.00 | 49.3 | 23.8 | Community sample of healthy infants |
| Goodman | Study 1: 82 | 12.00–52.00 | 62.2 | 43.9 | Community sample of mothers with history of major depression |
| Spieker | 221 | 22.00–40.00 | 54.8 | 81.4 | Mothers received mental health treatment during pregnancy |
| Bridgett | 178 | 16.00–48.00 | 47.2 | 29.2 | Full term, healthy infants |
| Salisbury | 172 | 23.00–32.00 | 51.7 | 47.7 | Prenatal exposure to depression, antidepressants |
| Mliner & Gunnar | 158 | 48.53–89.20 | 50.6 | 60.8 | Full term, healthy infants |
| Muzik | 157 | 23.27–44.40 | 52.2 | 43.3 | Mothers oversampled for trauma |
| Stifter | 149 | 24.57–57.29 | 53.0 | 8.1 | Community sample of full-term infants |
| Planalp | 148 | 23.00–87.00 | 48.0 | 24.3 | Community sample of infants |
| Mehr & Spelke | 123 | 11.71–88.43 | 59.3 | 32.5 | Community sample of full-term infants |
| Lukowski | 108 | 39.71–46.14 | 53.7 | 38.0 | Full term, healthy infants |
| Groh | 91 | 25.81–42.93 | 52.2 | 21.1 | Full term, healthy infants |
| Lickenbrock | 80 | 12.00–35.00 | 60.0 | 15.0 | Low-risk community sample of infants |
| Santelli | 73 | 47.57–70.14 | 47.9 | 32.9 | Vaginally delivered infants exclusively breastfed until 1 month of age |
| Du Rocher Shudlich | 73 | 24.80–58.80 | 52.1 | 16.4 | Parents living together since birth of child |
| Anzman-Frasca | 59 | 51.00–57.00 | 54.2 | 11.9 | Full term, healthy infants (a portion of the entire sample was included in this study) |
| Thrasher | Study 1: 12 | 6.33–8.67 | 73.0 | NA | Full term, healthy infants |
| Diaz | 47 | 40.00 | 44.7 | 23.4 | Full term, healthy infants |
| Dayton | 47 | 16.00–31.00 | 42.9 | 35.7 | High risk sample of families (e.g., poverty, violence exposure, psychopathology) |
| Moding | 43 | 26.00–102.00 | 41.9 | 34.9 | No food allergies, feeding difficulties |
| Jordan | 42 | 20.00–45.00 | 31.0 | 19.0 | Full term, healthy infants |
Descriptive statistics for the temperament subscales by gender and age group.
| Models | Gender | Age Group | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Girls | Boys | Youngest < 24 weeks | Mid-Range 24 to 48 weeks | Oldest > 48 weeks | |||||||||||
| Mean | SD | Range | Mean | SD | Range | Mean | SD | Range | Mean | SD | Range | Mean | SD | Range | |
| Activity | 4.25 | 1.11 | 0.33–6.93 | 4.29 | 1.08 | 0.47–6.80 | 4.12 | 0.89 | 0.53–6.67 | 4.29 | 1.21 | 0.33–6.93 | 4.43 | 0.87 | 0.47–6.87 |
| Approach | 4.79 | 1.39 | 0.17–7.00 | 4.84 | 1.37 | 0.17–7.00 | 3.98 | 1.50 | 0.17–7.00 | 5.00 | 1.23 | 0.33–7.00 | 5.55 | 0.91 | 1.42–7.00 |
| Smiling/ Laughter | 4.61 | 1.36 | 0.10–7.00 | 4.63 | 1.34 | 0.10–7.00 | 4.37 | 1.15 | 0.20–7.00 | 4.63 | 1.49 | 0.10–7.00 | 5.01 | 0.91 | 0.70–7.00 |
| High Intensity Pleasure | 5.32 | 1.41 | 0.09–7.00 | 5.49 | 1.38 | 0.09–7.00 | 4.98 | 1.23 | 0.55–7.00 | 5.47 | 1.53 | 0.09–7.00 | 5.95 | 0.74 | 0.27–7.00 |
| Perceptual Sensitivity | 3.27 | 1.35 | 0.08–6.83 | 3.33 | 1.36 | 0.17–7.00 | 2.89 | 1.25 | 0.17–7.00 | 3.38 | 1.40 | 0.08–7.00 | 3.71 | 1.18 | 0.42–6.83 |
| Vocal Reactivity | 4.42 | 1.38 | 0.08–7.00 | 4.41 | 1.35 | 0.17–7.00 | 3.92 | 1.10 | 0.33–7.00 | 4.43 | 1.47 | 0.08–7.00 | 5.22 | 0.89 | 1.00–7.00 |
| Distress to Limitations | 3.46 | 0.90 | 0.69–6.31 | 3.56 | 0.92 | 0.19–6.38 | 3.27 | 0.83 | 0.19–6.25 | 3.55 | 0.91 | 0.56–6.31 | 3.71 | 0.95 | 0.25–6.38 |
| Fear | 2.51 | 1.07 | 0.19–6.44 | 2.28 | 0.95 | 0.06–6.69 | 2.05 | 0.90 | 0.31–6.25 | 2.43 | 1.02 | 0.19–6.44 | 2.74 | 1.02 | 0.06–6.69 |
| Falling Reactivity | 4.57 | 1.20 | 0.23–6.92 | 4.50 | 1.19 | 0.08–7.00 | 4.63 | 1.07 | 1.08–7.00 | 4.62 | 1.03 | 1.15–7.00 | 4.13 | 1.67 | 0.08–6.92 |
| Sadness | 2.97 | 0.98 | 0.14–6.29 | 3.03 | 0.96 | 0.14–5.79 | 2.91 | 0.99 | 0.36–6.29 | 3.01 | 0.98 | 0.14–6.21 | 3.10 | 0.89 | 0.14–5.79 |
| Cuddliness | 5.12 | 1.11 | 0.53–7.00 | 5.08 | 1.13 | 0.29–7.00 | 5.39 | 1.13 | 0.76–7.00 | 5.03 | 1.12 | 0.29–7.00 | 4.87 | 0.97 | 0.41–6.82 |
| Duration of Orienting | 3.69 | 1.16 | 0.17–7.00 | 3.69 | 1.13 | 0.25–7.00 | 3.62 | 1.19 | 0.08–7.00 | 3.73 | 1.16 | 0.17–7.00 | 3.63 | 1.01 | 0.92–6.83 |
| Low Intensity Pleasure | 4.79 | 1.07 | 0.69–7.00 | 4.72 | 1.06 | 1.23–7.00 | 4.74 | 1.12 | 0.69–7.00 | 4.82 | 1.05 | 1.23–7.00 | 4.52 | 0.98 | 1.77–7.00 |
| Soothability | 4.64 | 1.07 | 0.50–7.00 | 4.58 | 1.12 | 0.39–7.00 | 5.39 | 1.13 | 0.76–7.00 | 4.62 | 1.13 | 0.50–7.00 | 4.66 | 1.09 | 0.94–7.00 |
Classification effectiveness indicators across machine learning algorithms: Gender and age-based classification with temperament features.
| Gender Classification: boys vs. girls | Age Classification: youngest (age < 24 weeks) vs. mid-range (age 24 to 48 weeks) vs. oldest (age > 48 weeks) | |||||
|---|---|---|---|---|---|---|
| Models | Accuracy | Kappa | AUC | Accuracy | Kappa | AUC |
| Linear Discriminant Analysis | .558 | .162 | .422 | .641 | .284 | .517 |
| Generalized Linear Modeling | .569 | .153 | .485 | .630 | .295 | .526 |
| Support Vector Machines | .559 | .169 | .432 | .637 | .308 | .517 |
| K-Nearest Neighbor | .556 | .084 | .471 | .650 | .271 | .529 |
| Naïve Bayes | .577 | .094 | .451 | .634 | .272 | .512 |
| Classification and Regression Trees | .565 | .099 | .424 | .645 | .240 | .514 |
| C5.0 Classification | .575 | .099 | .422 | .625 | .272 | .538 |
| Bootstrapped Aggregated Trees | .580 | .099 | .422 | .640 | .274 | .535 |
| Ensembled Decision Trees (Random Forest) | .580 | .133 | .485 | .641 | .289 | .535 |
| Gradient Boosting | .556 | .157 | .432 | .631 | .306 | .522 |
| Multi-class Adaptive Boosting (AdaBoost) | .558 | .141 | .471 | .641 | .241 | .517 |
*AUC for Age Classification analysis represents a multiclass ROC indicator, based on 3 groups.
Fig 1Note: lda—Linear Discriminant Analysis; glm—Generalized Linear Modeling; svm—Support Vector Machines; knn—K-Nearest Neighbor; nb—Naïve Bayes; cart—Classification and Regression Trees; c50—C5.0 Classification; treebag—Bootstrapped Aggregated Trees; rf—Ensembled Decision Trees (Random Forest); gbm—Gradient Boosting Method; adabag—Multi-class Adaptive Boosting (AdaBoost).
Fig 3Note: lda—Linear Discriminant Analysis; glm—Generalized Linear Modeling; svm—Support Vector Machines; knn—K-Nearest Neighbor; nb—Naïve Bayes; cart—Classification and Regression Trees; c50—C5.0 Classification; treebag—Bootstrapped Aggregated Trees; rf—Ensembled Decision Trees (Random Forest); gbm—Gradient Boosting Method; adabag—Multi-class Adaptive Boosting (AdaBoost).
Classification effectiveness indicators across machine learning algorithms: Gender by age with temperament features.
| Age Group 1 (< 24 weeks; | Age Group 2 (24 to 48 weeks; | Age Group 3 (> 48 weeks; | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Models | Accuracy | Kappa | AUC | Accuracy | Kappa | AUC | Accuracy | Kappa | AUC |
| Linear Discriminant Analysis | .563 | .164 | .404 | .557 | .148 | .429 | .527 | .152 | .452 |
| Generalized Linear Modeling | .549 | .154 | .407 | .551 | .147 | .436 | .574 | .112 | .501 |
| Support Vector Machines | .530 | .185 | .439 | .559 | .130 | .463 | .608 | .093 | .525 |
| K-Nearest Neighbor | .569 | .066 | .427 | .558 | .098 | .450 | .589 | .138 | .570 |
| Naïve Bayes | .594 | .117 | .455 | .556 | .087 | .436 | .572 | .194 | .542 |
| Classification and Regression Trees | .536 | .075 | .437 | .548 | .075 | .471 | .546 | .133 | .536 |
| C5.0 Classification | .567 | .087 | .457 | .573 | .112 | .436 | .571 | .159 | .487 |
| Bootstrapped Aggregated Trees | .572 | .092 | .410 | .568 | .060 | .422 | .618 | .093 | .565 |
| Ensembled Decision Trees (Random Forest) | .577 | .105 | .386 | .559 | .109 | .451 | .584 | .138 | .552 |
| Gradient Boosting Method | .540 | .123 | .395 | .567 | .155 | .405 | .540 | .214 | .576 |
| Multi-class Adaptive Boosting (AdaBoost) | .563 | .119 | .404 | .557 | .131 | .429 | .527 | .100 | .452 |