Literature DB >> 33035264

Prediction of child and adolescent outcomes with broadband and narrowband dimensions of internalizing and externalizing behavior using the child and adolescent version of the Strengths and Difficulties Questionnaire.

Pawel R Kulawiak1,2, Jürgen Wilbert1, Robert Schlack3, Moritz Börnert-Ringleb4.   

Abstract

The Strengths and Difficulties Questionnaire (SDQ) is a frequently used screening instrument for behavioral problems in children and adolescents. There is an ongoing controversy-not only in educational research-regarding the factor structure of the SDQ. Research results speak for a 3-factor as well as a 5-factor structure. The narrowband scales (5-factor structure) can be combined into broadband scales (3-factor structure). The question remains: Which factors (narrowband vs. broadband) are better predictors? With the prediction of child and adolescent outcomes (academic grades, well-being, and self-belief), we evaluated whether the broadband scales of internalizing and externalizing behavior (3-factor structure) or narrowband scales of behavior (5-factor structure) are better suited for predictive purposes in a cross-sectional study setting. The sample includes students in grades 5 to 9 (N = 4642) from the representative German Health Interview and Examination Survey for Children and Adolescents (KiGGS study). The results of model comparisons (broadband scale vs. narrowband scales) did not support the superiority of the broadband scales with regard to the prediction of child and adolescent outcomes. There is no benefit from subsuming narrowband scales (5-factor structure) into broadband scales (3-factor structure). The application of narrowband scales, providing a more differentiated picture of students' academic and social situation, was more appropriate for predictive purposes. For the purpose of identifying students at risk of struggling in educational contexts, using the set of narrowband dimensions of behavior seems to be more suitable.

Entities:  

Mesh:

Year:  2020        PMID: 33035264      PMCID: PMC7546492          DOI: 10.1371/journal.pone.0240312

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Internalizing and externalizing behavior problems are considered a substantial risk factor for students’ social and academic well-being [1]. Both dimensions are consistently associated with difficulties on different levels in schools. Children and adolescents with externalizing and/or internalizing behavior problems are exposed to higher risks of academic failure [2-6]. In addition to academic difficulties, subtypes of externalizing as well as internalizing behavior problems are associated with a range of short- and long-term developmental risks, such as higher rates of social exclusion/rejection [7], school suspension [6], lack of bonding to school [7], or criminal arrest [8]. Therefore, the early and adequate identification of students at-risk for the development of severe externalizing and internalizing behavior problems in educational practice is of high relevance [9]. The gained insights in students’ behaviors might at the same time serve as a reference for designing appropriate behavioral interventions. Teachers and school psychologists consequently play an important role in supporting and identifying students who suffer from behavioral problems. Among others, the Strengths and Difficulties Questionnaire (SDQ) [10] is a frequently used screening instrument for behavioral and emotional problems in children and adolescents [11] and is commonly applied in educational research and practice [4, 12]. The SDQ comprises 25 items that can be grouped into subscales for emotional symptoms, peer problems, conduct problems, hyperactivity, and prosocial behavior, containing five items each [10]. The narrowband subscales for emotional symptoms and peer problems can be combined into the broadband scale internalizing behavior and the narrowband subscales of hyperactivity and conduct problems can be subsumed into the broadband scale externalizing behavior [13]. Correlations of behavioral and emotional problems, measured by means of sum scores on the SDQ, with academic outcome variables have been addressed in recent research. Previous studies have highlighted negative associations between the SDQ scales and academic achievement [4, 14–16]. Metsäpelto et al. [4, 15] found that higher externalizing behavior (broadband scale) was related to lower reading skills. Similarly, Palmu et al. [16] highlighted that externalizing behavior (broadband scale) was associated with lower academic grades. Similarly, DeVries et al. [17] concluded that peer problems (narrowband scale) are negatively associated with academic grades. Higher internalizing and externalizing behavior (broadband scales) were linked to lower scholastic performance [18]. Hyperactivity and conduct problems (narrowband scales) were negatively correlated with reading and mathematical skills [14]. Not only are the associations between the SDQ scales and academic performance of interest from a pedagogical standpoint, but also their relation to social outcomes, such as social integration within the classroom, the well-being of students in their families, school absenteeism, and feelings of self-worth and self-efficacy. Higher emotional symptoms, peer problems, conduct problems, and hyperactivity (narrowband scales) are associated with a lower social preference within the classroom and with more difficulties within the family [19] and lower feelings of self-worth [20]. Students rejected by peers are more likely to show internalizing behavior problems (broadband scale) and controversial students are more likely to exhibit externalizing behavior (broadband scale) [21]. Higher emotional symptoms (narrowband scale) are associated with lower feelings of self-efficacy [22]. Lenzen et al. [23] highlighted the association of greater conduct problems and emotional symptoms (narrowband scales) with increased school absenteeism. School absence was also linked to internalizing behavior (broadband scale) while school discipline referrals were related to externalizing behavior (broadband scale) [24]. Moreover, students with special educational needs, especially those with learning disabilities, have greater peer problems (narrowband scale) [12]. Although the originally proposed 5-factor structure of the SDQ (5 narrowband subscales) is often applied in research [12, 17, 19, 20, 22, 23], the narrowband subscales are sometimes subsumed into broadband behavior scales (internalizing and externalizing behavior) [4, 14, 15, 18, 21, 24], which has resulted in an ongoing controversy, not only in educational research, regarding the factor structure of the SDQ [25-27]. This controversy implies a discussion about the usefulness of broadband and narrowband scales of behavior.

Broadband and narrowband scales of behavior

Some studies on the psychometric properties of the SDQ [13, 25, 26, 28, 29] have confirmed the original 5-factor structure (emotional symptoms, peer problems, conduct problems, hyperactivity, and prosocial behavior) proposed by Goodman [10]. At the same time, several studies have highlighted potential model fit shortcomings of the 5-factor structure [30-33]. As a reaction to these concerns, several authors have proposed a 3-factor model as a possible alternative to the original 5-factor model [13, 33]. In the revised 3-factor model, the narrowband subscales of emotional symptoms and peer problems are combined into the broadband scale for internalizing behavior, and the narrowband subscales of hyperactivity and conduct problems are subsumed into the broadband scale for externalizing behavior (the narrowband subscale of prosocial behavior remains unchanged). Recent research has provided partial support for the appropriateness of the revised 3-factor model. In studies comparing the different factor structures (3 vs. 5) in terms of model fit, the superiority of either the 3-factor structure [34-36] or the 5-factor structure [13, 37–39] or also the adequateness of both factor structures [13, 25, 28, 33] have been documented. Goodman et al. ([7] p. 1189) conclude that “there may be no single best set of subscales to use in the SDQ; rather, the optimal choice may depend in part upon one’s study population and study aims.” Subsuming narrowband subscales into broadband scales takes place from a clinical perspective and represents the well-known hierarchical model of child and adolescent psychopathology [40, 41]. For example, the SDQ narrowband scales of hyperactivity and conduct problems describe distinctive psychopathological phenomena but are often co-occurring [42], which is why both scales form a broadband externalizing scale. The broadband perspective on child and adolescent behavior and emotions leads to a two-dimensional taxonomy of psychopathology distinguishing between internalizing and externalizing behavior. The question of whether child and adolescent psychopathology is best described by narrowband or broadband measures is the subject of ongoing debate [43-45]: “[…] the once plausible goal of identifying homogeneous populations for treatment and research resulted in narrow diagnostic categories that did not capture clinical reality, symptom heterogeneity within disorders, and significant sharing of symptoms across multiple disorders. The historical aspiration of achieving diagnostic homogeneity by progressive subtyping within disorder categories no longer is sensible […]” ([32] p. 12) However, it must also be considered that different narrowband dimensions of behavior (e.g., aggressive vs. non-aggressive rule-breaking behavior) are related to different etiological factors [46]. Vice versa, different narrowband dimensions of behavior might explain educational outcome variables to varying degrees, for example, conduct problems are associated with arithmetic skills (r = -.20), while the association is stronger for hyperactivity (r = -.38) [14]. Subsuming these narrowband dimensions of behavior into a broader category of behavior might therefore lead to a loss of information; that is, differentiated effects (between narrowband scales) in predicting educational outcomes are not described by a broadband scale. The usage of narrowband scales could provide a nuanced description of the association between dimensions of behavior and child and adolescent outcomes.

Aims

Empirical evidence concerning the superiority of the narrowband scales of behavior in the prediction of child and adolescent outcomes is lacking. Comparisons between the narrowband scales (5-factor structure) and broadband scales (3-factor structure) with regard to the prediction of outcomes is sparse. The question remains: Which factors (narrowband vs. broadband) are better predictors? In addition, the vast majority of previous research using the SDQ has focused on parent- or teacher-reported student behavior. There is a lack of studies that examine the associations between self-reports of students on the SDQ and relevant outcomes. However, children and adolescents can be considered experts of their own well-being [47] and consequently might depict a valid and important source of information on their own behavior. In the present study, therefore, we examined how behavioral and emotional problems, measured by means of the different self-report SDQ scores (narrowband and broadband scales), are associated with child and adolescent outcomes, such as measures of academic success (grades), well-being (school, friends, and family), and self-belief (self-esteem and self-efficacy). We thereby assume that narrowband scales of behavior are more informative predictors of outcomes than broadband scales of behavior. This comparison (narrowband vs. broadband dimensions of behavior) seems of particular importance for emphasizing the need to differentiate behavioral problems when examining associations with child and adolescent outcomes.

Method

Study design and participants

The analyzed cross-sectional sample was obtained from the baseline of the German Health Interview and Examination Survey for Children and Adolescents (KiGGS study) [48]. The KiGGS study is a nationally representative health survey comprising children and adolescents. The survey’s main objective is to obtain information on key physical and mental health indicators, risk factors, health service utilization, health behavior, and living conditions of children and adolescents in Germany. Study participants were not recruited in schools (non-nested data structure) but randomly selected from the official registers of local residents. The data were collected from 2003 to 2006. The KiGGS sample consists of 17641 children and adolescents aged 0 to 17 years. The children were given a physical examination and the parents as well as the children and adolescents themselves (from age 11 on) were interviewed via written questionnaires. The study was approved by the Charité/Universitätsmedizin Berlin ethics committee and the Federal Office for the Protection of Data. The present analysis represents a secondary data analysis with a focus on behavior/emotions and child and adolescent outcomes such as measures of well-being (school, friends, and family), academic success (grades), and self-belief (self-esteem and self-efficacy), which are factors of interest to professionals in educational contexts. We will focus on the child/adolescent-reported data and refer to children and adolescents of compulsory education age. Compulsory education in Germany usually ends with the completion of grade 9 (usually 15-year-old adolescents). The survey’s questionnaires were addressed to children and adolescents from the age of 11 years onwards (usually children in grade 5 and above). The present sample therefore includes children and adolescents with a minimum age of 11 years in grades 5 to 9 (N = 4642; 52% boys; age in years: M = 13.46, SD = 1.47, Min = 11.00, Q1 = 12.17, Md = 13.42, Q3 = 14.67, Max = 17.92). The distribution of the children and adolescents across grades 6 to 9 is nearly equal (approximately 21.5% in each grade, but 14% in grade 5). The proportion of children and adolescents in grade 5 (usually 10- and 11-year-olds) is small, as individuals under the age of 11 are not included in the present sample.

Measures

Child and adolescent behavior and emotions

The German self-report version of the Strengths and Difficulties Questionnaire (SDQ) was used to assess child and adolescent behavior and emotions [10, 49]. This questionnaire (25 items) quantifies emotional symptoms, peer problems, conduct problems, hyperactivity, and, as a dimension of strength, prosocial behavior (original narrowband scales). Emotional symptoms and peer problems can be combined into a broadband internalizing behavior subscale, while conduct problems and hyperactivity can be subsumed into a broadband externalizing behavior subscale [13]. The prosocial behavior subscale was not considered in the present study, because the focus was on a comparison of the two broadband subscales with the underlying narrowband subscales. SDQ items are rated on a three-point scale (0 for "not true," 1 for "somewhat true," and 2 for "certainly true"). High subscale scores indicate elevated behavioral problems. The children and adolescents with the highest subscale scores, which are the upper 10% of the normative sample, can be categorized as “abnormal” and are considered to be at risk for psychiatric disorders [10, 50]. We therefore refer to these individuals as at-risk children and adolescents. A conservative classification rule [51], which minimizes false positive cases by selecting a cutoff value below 10%, was used to identify at-risk children and adolescents (emotional symptoms ≥ 6, peer problems ≥ 5, conduct problems ≥ 5, hyperactivity ≥ 7, internalizing behavior ≥ 9, and externalizing behavior ≥ 10; calculations based on KiGGS baseline data, Table 1). With regard to the self-report version used in the KiGGS study (data at hand), the internal consistencies (Table 2) for the subscales of peer problems and conduct problems are insufficient (α and ω ≤ .50), but moderate for the subscales of emotional symptoms, hyperactivity, and internalizing and externalizing behavior (α and ω ≥ .60).
Table 1

At-risk children and adolescents (SDQ: “Abnormal”).

percent (frequency)score range
measureat-riskborderlinenormalat-riskborderlinenormal
emotional symptoms6.2% (286)6.3% (293)87.5% (4063)≥ 65≤ 4
peer problems6.5% (303)8.9% (415)84.5% (3924)≥ 54≤ 3
conduct problems5.3% (245)7.5% (348)87.2% (4049)≥ 54≤ 3
hyperactivity9.2% (428)9.2% (426)81.6% (3788)≥ 76≤ 5
internalizing behavior7.9% (369)4.5% (211)87.5% (4062)≥ 98≤ 7
externalizing behavior10.0% (467)6.0% (281)83.9% (3894)≥ 109≤ 8
Table 2

Correlations (Pearson’s correlation), means, standard deviations, and reliabilities of SDQ subscales and child and adolescent outcome variables.

Measure123456789101112MSDαbωb
1. emotional symptoms (SDQ)2.301.86.60.61
2. peer problems (SDQ).311.981.56.48.50
3. conduct problems (SDQ).26.201.951.39.43.46
4. hyperactivity (SDQ).25.11.403.742.04.64.69
5. internalizing behavior (SDQ).85.77.29.234.282.77.64.69
6. externalizing behavior (SDQ).30.17.76.90.305.692.88.67.62
7. school (KINDL-R)-.37-.19-.29-.33-.36-.3714.772.72.53.62
8. friends (KINDL-R)-.31-.49-.17-.08-.48-.14.2516.572.37.53.54
9. self-esteem (KINDL-R)-.20-.16-.14-.17-.23-.19.24.2713.092.96.67.69
10. family (KINDL-R)-.26-.14-.34-.23-.25-.33.35.24.2117.232.37.68.68
11. math grade-.11-.04-.14-.20-.10-.20.29.02.07.144.090.94--
12. German grade-.03-.09-.15-.19-.07-.21.26.01.05.10.474.120.85--
13. general self-efficacya-.27-.25-.13-.23-.32-.23.28.32.40.17.10.0829.474.39.83.84

Note. All parameters are reported with regard to the raw data. Significant correlations (p < .05) are in bold.

aThe scale was only used in adolescents aged 14 years and older (N = 1750).

bReliability coefficients are Cronbach’s α and McDonald’s ω.

Note. All parameters are reported with regard to the raw data. Significant correlations (p < .05) are in bold. aThe scale was only used in adolescents aged 14 years and older (N = 1750). bReliability coefficients are Cronbach’s α and McDonald’s ω.

Child and adolescent outcomes

Health-related quality of life. The self-report version of the KINDL-R is a brief questionnaire to measure the health-related quality of life of children and adolescents [52]. The subscales school (e.g., “doing my schoolwork was easy”), friends (e.g., “I played with friends”), self-esteem (e.g., “I was proud of myself”), and family (e.g., “I got on well with my parents”) describe the students’ well-being related to daily school life, friendship, family life, and feelings of self-worth. Each subscale consists of 4 items. Items are rated on a five-point scale (1 for “never,” 2 for “seldom,” 3 for “sometimes,” 4 for “often,” and 5 for “all the time”). High scores indicate a positive quality of life in the specific domain. With regard to the KiGGS study (data at hand), the internal consistencies (Table 2) of the mentioned subscales are mediocre (α and ω range from .53 to .69). The subscales physical and emotional well-being are not used in the present study. School grades. The school grades (math and German) received on the last report card (half-year term) were reported by the parents. Germany uses a 6-point grading scale. School grades vary from 1 (excellent) to 6 (insufficient), which were reversed so higher values indicate a better academic performance. General self-efficacy. The general self-efficacy scale is a 10-item questionnaire that was designed to assess optimistic self-beliefs in coping with a variety of difficult demands in life [53] (e.g., “I can always manage to solve difficult problems if I try hard enough”). In the KiGGS study, the scale was only used in adolescents aged 14 years and older (N = 1750). Items are rated on a four-point scale (1 for “not at all true,” 2 for “hardly true,” 3 for “moderately true,” and 4 for “exactly true”). High scores indicate stronger self-efficacy. With regard to the KiGGS study (data at hand), the internal consistency (Table 2) of the scale is good (α and ω > .80).

Statistical analysis strategy

Ordinary least square regression models will be formulated with regard to the prediction of child and adolescent outcomes (outcomes regressed on SDQ subscales). The term “prediction” and cognate terms are used here in a statistical sense and shall not be confused with the concept of predictive validity, which describes the ability of a measure to forecast outcomes in the future [54]. To judge the statistical predictive performance of the different subscales of the SDQ (broadband vs. narrowband), the regression model with the broadband subscale (model 1: outcome regressed on broadband subscale, e.g., internalizing behavior) will be compared to the regression model with both underlying narrowband subscales jointly as predictors in one regression model (e.g., model 2: outcome regressed on emotional symptoms and peer problems). This model comparison (narrowband vs. broadband) will be conducted with regard to each predicted outcome and separately for internalizing and externalizing behavior (internalizing behavior vs. emotional symptoms and peer problems; externalizing behavior vs. conduct problems and hyperactivity). To evaluate the predictive performance of the different models (predictive performance of the broadband and narrowband subscales), we report two goodness-of-fit indices for each regression model. The adjusted R is the proportion of variance in the outcome that is predictable from the predictors (SDQ subscales). The Akaike Information Criterion (AIC) takes into account both model complexity (total number of estimated model parameters) and goodness of model fit (maximized likelihood) and balances these two [55]. The individual AIC values are not interpretable. However, the smaller the AIC value, the better the model fit. Consequently, models with less complexity (fewer predictors) along with a high goodness of fit are deemed to be good models. If the difference in AIC values between the models is less than 3 (model with broadband scale vs. model with underlying narrowband scales), then the model with the higher AIC value is almost as good as the model with the smaller AIC value [55]. For the application of AIC model selection in the fields of psychology and psychometrics, see Vrieze [56]. The SDQ subscales are used as dummy variables. The reference is the at-risk category (“abnormal”). Therefore, the intercept (constant) of each regression model is interpretable as the expected average outcome for the at-risk children and adolescents. Since all the outcomes are standardized (M = 0, SD = 1), the intercept represents the average outcome for the at-risk group as a deviation from the overall sample mean in units of standard deviation. The regression parameters (B) for all the other SDQ subscale scores are interpretable as the average difference in the outcomes (in units of standard deviation) between the at-risk group and the children and adolescents with the particular SDQ subscale score. These types of analyses emphasize the clinical category “abnormal” (at-risk). In some additional regression analyses, we will use the SDQ subscales as continuous predictors. If the SDQ subscale is a continuous predictor, the intercept represents the average outcome for children without behavioral problems (SDQ subscale score equals zero) as a deviation from the overall sample mean in units of standard deviation, and the regression parameter (B) is the average change (slope) of the outcome (in units of standard deviation) when the SDQ score increases on average by one unit. All statistical analyses were conducted in R 3.6.0.

Results

Preliminary results

Model fit and measurement invariance of the 3- and 5-factor structures of the SDQ

Confirmatory factor analyses (weighted least square mean and variance adjusted estimation) reveal an appropriate model fit (RMSEA < 0.08, for details see [57]) for both the 3- and 5-factor structures of the SDQ, although the 5-factor structure shows a better model fit (RMSEA = .05, CFI = .89, TLI = .88, χ2 = 3250.87, df = 265, p = .00) than the 3-factor structure (RMSEA = .06, CFI = .85, TLI = .83, χ2 = 4540.10, df = 272, p = .00). However, CFI (< .90) and TLI (< .95) do not indicate good fit for both the 3- and 5-factor structures. Both the 3- and 5-factor structures meet the standards for metric invariance [58, 59] across gender and age groups (multi‑group confirmatory factor analysis; comparison of metric and configural model: difference in the models’ CFI ≤ .01), which can be interpreted to indicate that the measured dimensions of behavior (narrowband and broadband scales) manifest in the same way across boys and girls as well as different age groups (quartile age groups in years: [11,12.2], (12.2,13.4], (13.4,14.7], and (14.7,17.9]). As the goal of the present paper is to compare the statistical predictive performance of the broadband (3-factor structure) and narrowband (5-factor structure) scales of behavior and as it is not the goal to highlight differences between boys and girls or different age groups, sex and age are not considered as predictors of the child and adolescent outcomes.

Descriptive results

Based on the SDQ narrowband subscales, the proportion of at-risk children and adolescents ranges between 5.3% and 9.2% (conduct problems: 5.3%; emotional symptoms: 6.2%; peer problems: 6.5%; hyperactivity: 9.2%), while the SDQ broadband subscales reveal a proportion of 7.9% (internalizing behavior) and 10% (externalizing behavior) of at-risk children and adolescents (Table 1). The correlations, means, and standard deviations of the SDQ subscales and the child and adolescent outcomes are displayed in Table 2. All outcomes are positively associated. Increased positive correlations are observed between the grades in math and German (r = .47), as well as between the KINDL-R subscales of family and school (r = .35). The general self-efficacy scale is likewise considerably correlated with the KINDL-R subscales of self-esteem (r = .40) and friends (r = .32). The different SDQ subscales were negatively correlated with all outcomes, which means that increased behavioral problems measured by means of the different SDQ subscales are associated with lower values for the outcomes, indicating adverse outcomes. The KINDL-R friends subscale is highly correlated with the SDQ subscales of peer problems (r = -.49) and internalizing behavior (r = -.48). The grades (math and German) were only weakly correlated with the SDQ subscales of peer problems and internalizing behavior (r ranges from -.04 to -.10). Also, the correlation between the German grades and emotional symptoms is close to zero (r = -.03). Another small correlation is between the KINDL-R subscale for friends and hyperactivity (r = -.08).

Main results: Predictive performance of the SDQ subscales

Internalizing behavior vs. emotional symptoms and peer problems

Each child and adolescent outcome is regressed on the different SDQ subscales, which are the broadband subscale for internalizing behavior (model 1) and both underlying narrowband subscales, i.e., emotional symptoms and peer problems jointly as predictors in one regression model (model 2). The regression coefficients (B) and model fit parameters (R2 and AIC) are displayed in Table 3.
Table 3

Child and adolescent outcome variables regressed on SDQ subscales (internalizing behavior, emotional symptoms, peer problems).

school KINDL-Rfriends KINDL-Rself-esteem KINDL-Rfamily KINDL-Rmath gradeGerman gradegeneral self-efficacyb
BSEBSEBSEBSEBSEBSEBSE
Model 1 (internalizing behavior)
intercepta-0.720.05-1.150.05-0.480.05-0.560.05-0.200.05-0.110.05-0.690.08
score 80.210.080.520.080.180.080.110.080.030.09-0.100.090.270.13
score 70.350.070.800.070.230.080.340.070.100.080.120.080.460.11
score 60.510.070.870.060.400.070.450.070.160.070.060.070.490.11
score 50.550.061.070.060.450.070.490.070.210.070.120.070.690.10
score 40.740.061.210.060.460.060.580.060.180.070.050.070.640.10
score 30.810.061.350.060.540.060.670.060.240.070.140.070.810.10
score 21.090.061.530.060.700.060.870.060.220.070.100.070.940.10
score 11.140.061.650.060.760.070.760.070.320.070.260.071.130.11
score 01.400.081.940.080.870.090.970.080.410.090.310.091.560.14
R2/AIC.13/12373.22/11872.05/12718.07/12691.01/12611.01/12608.11/4777
Model 2 (emotional symptoms and peer problems)
intercepta-0.970.07-1.610.07-0.670.07-0.780.07-0.230.08-0.130.08-0.820.11
emotional symptoms
score 50.370.080.230.070.290.080.220.080.040.090.000.090.120.12
score 40.530.070.340.060.310.070.370.070.040.08-0.100.080.190.11
score 30.670.070.310.060.470.070.530.070.140.070.000.070.320.10
score 20.820.060.460.060.450.070.660.070.140.07-0.120.070.450.10
score 11.040.060.560.060.590.070.750.070.230.07-0.020.070.560.10
score 01.300.070.690.060.650.070.860.070.340.07-0.020.070.760.10
peer problems
score 40.040.070.620.070.020.070.060.070.030.080.020.080.230.12
score 30.070.060.980.060.100.070.170.070.100.070.090.070.260.11
score 20.150.061.220.060.220.060.180.060.090.070.170.070.370.10
score 10.210.061.410.060.280.060.250.060.070.070.230.070.480.10
score 00.320.071.660.060.400.070.260.070.060.070.300.070.780.11
R2/AIC.14/12296.25/11674.05/12708.07/12667.01/12600.01/12598.11/4783

Note. All outcome variables are standardized (M = 0, SD = 1). The SDQ subscales are dummy coded.

aReference: at-risk (“abnormal”), internalizing behavior ≥ 9, emotional symptoms ≥ 6, peer problems ≥ 5.

bThe scale was only used in adolescents aged 14 years and older (N = 1750).

Note. All outcome variables are standardized (M = 0, SD = 1). The SDQ subscales are dummy coded. aReference: at-risk (“abnormal”), internalizing behavior ≥ 9, emotional symptoms ≥ 6, peer problems ≥ 5. bThe scale was only used in adolescents aged 14 years and older (N = 1750). With regard to the results of models 1 and 2, it can be stated that the associations between the outcomes and the SDQ subscales are negative, which means that increased behavioral problems as indicated by high SDQ subscale scores are associated with lower values of the outcomes, indicating adverse outcomes. Taken as a whole, the at-risk children and adolescents have the lowest average outcome values (the intercept ranges from -1.61 to -0.11). With reference to the model fit parameters, the narrowband subscales of emotional symptoms and peer problems (model 2: jointly as predictors in one regression model) outperform the broadband subscale of internalizing behavior (model 1) in the prediction of all outcomes (comparable R2 and lower AIC values), except for the prediction of general self-efficacy (AIC value favors the predictive performance of the broadband subscale). However, the predictive performance of the narrowband and broadband scales is poor with regard to the prediction of the grades (math and German), i.e., the proportion of explained variance is close to zero (R2 ≤ .01). Therefore, it is hard to judge the predictive superiority of one of the SDQ subscales (with regard to grade prediction), although the AIC values favor the predictive performance of the narrowband subscales of emotional symptoms and peer problems (model 2). Besides this, model 2 offers a deeper insight into the magnitude of the effect sizes of the two narrowband subscales. For example, in the prediction of the KINDL-R school subscale, the regression coefficients for emotional symptoms are remarkably higher (B ranges from 0.37 to 1.30) than the coefficients for peer problems (B ranges from 0.04 to 0.32). As well, in the prediction of the KINDL-R family subscale and in the prediction of the math grade, the emotional symptoms subscale shows noticeably higher coefficients than the peer problems subscale (family: B ranges from 0.22 to 0.86 vs. 0.06 to 0.26, math: B ranges from 0.04 to 0.34 vs. 0.03 to 0.06). The situation is reversed in the case of predicting the KINDL-R friends subscale, i.e., higher coefficients are observable for peer problems (B ranges from 0.62 to 1.66), while lower coefficients are detected for emotional symptoms (B ranges from 0.23 to 0.69). Also in the prediction of the German grade, the peer problems subscale shows higher coefficients than the emotional symptoms subscale (B ranges from 0.02 to 0.30 vs. -0.10 to 0.00). This detailed information about the differences in effect sizes between the two narrowband subscales (model 2) is not depicted when the broadband subscale for internalizing behavior is used as a predictor (model 1). The results are almost the same when the SDQ subscales are considered as continuous predictors (Table 4). With regard to the model fit parameters, the narrowband subscales of emotional symptoms and peer problems (model 2: jointly as predictors in one regression model) outperform the broadband subscale for internalizing behavior (model 1) in the prediction of all outcomes (comparable or lower R2 and lower AIC values), except for the prediction of self-esteem and general self-efficacy (AIC values favor the predictive performance of the broadband subscale). Differences in effect sizes (B) between the two narrowband subscales (model 2) are apparent.
Table 4

Child and adolescent outcome variables regressed on SDQ subscales (continuous predictors: Internalizing behavior, emotional symptoms, peer problems).

school KINDL-Rfriends KINDL-Rself-esteem KINDL-Rfamily KINDL-Rmath gradeGerman gradegeneral self-efficacya
BSEBSEBSEBSEBSEBSEBSE
Model 1 (internalizing behavior)
intercept0.550.030.740.020.350.030.390.030.150.030.110.030.520.04
internalizing behavior-0.130.00-0.170.00-0.080.01-0.090.01-0.040.01-0.030.01-0.120.01
R2/AIC.13/12372.23/11804.05/12697.06/12687.01/12594.01/12605.11/4775
Model 2 (emotional symptoms and peer problems)
intercept0.540.030.760.020.350.030.380.030.140.030.120.030.520.04
emotional symptoms-0.180.01-0.090.01-0.090.01-0.130.01-0.060.010.000.01-0.110.01
peer problems-0.060.01-0.280.01-0.070.01-0.040.01-0.010.01-0.060.01-0.120.02
R2/AIC.14/12290.27/11594.05/12698.07/12654.01/12586.01/12590.10/4777

Note. All outcome variables are standardized (M = 0, SD = 1).

aThe scale was only used in adolescents aged 14 years and older (N = 1750).

Note. All outcome variables are standardized (M = 0, SD = 1). aThe scale was only used in adolescents aged 14 years and older (N = 1750).

Externalizing behavior vs. conduct problems and hyperactivity

Each child and adolescent outcome is regressed on the different SDQ subscales, which are the broadband subscale for externalizing behavior (model 1) and both underlying narrowband subscales, i.e., conduct problems and hyperactivity jointly as predictors in one regression model (model 2). The regression coefficients (B) and model fit parameters (R2 and AIC) are displayed in Table 5.
Table 5

Child and adolescent outcome variables regressed on SDQ subscales (externalizing behavior, conduct problems, hyperactivity).

school KINDL-Rfriends KINDL-Rself-esteem KINDL-Rfamily KINDL-Rmath gradeGerman gradegeneral self-efficacyb
BSEBSEBSEBSEBSEBSEBSE
Model 1 (externalizing behavior)
intercepta-0.650.04-0.180.05-0.270.05-0.690.04-0.360.05-0.340.05-0.240.07
score 90.130.07-0.040.080.100.080.370.070.110.080.110.08-0.080.12
score 80.380.060.090.070.140.070.470.060.200.070.180.070.020.10
score 70.460.060.070.060.120.060.550.060.300.060.230.060.060.10
score 60.620.060.200.060.220.060.680.060.340.060.310.060.260.09
score 50.680.060.200.060.280.060.760.060.360.060.380.060.220.09
score 40.850.060.270.060.360.060.940.060.440.060.460.060.310.10
score 30.980.060.250.060.490.060.980.060.630.070.570.070.440.10
score 21.160.070.440.070.540.071.100.070.590.070.630.070.650.11
score 11.270.080.340.080.620.081.090.080.770.080.650.080.730.14
score 01.610.110.730.110.890.111.450.110.800.110.850.111.350.20
R2/AIC.14/12324.02/12922.04/12778.11/12485.04/12457.04/12457.06/4871
Model 2 (conduct problems and hyperactivity)
intercepta-0.850.07-0.310.07-0.320.07-0.930.07-0.550.07-0.560.07-0.350.11
conduct problems
score 40.170.080.120.080.030.080.200.080.180.080.140.080.080.12
score 30.350.070.140.070.070.070.520.070.190.070.220.07-0.020.11
score 20.450.070.230.070.060.070.750.070.250.070.300.07-0.070.10
score 10.580.070.440.070.150.070.940.070.310.070.300.070.030.10
score 00.790.070.570.080.370.081.140.080.310.080.450.080.330.12
hyperactivity
score 60.080.060.000.070.030.070.100.060.170.070.200.070.020.11
score 50.210.060.000.060.120.060.110.060.200.060.160.060.220.10
score 40.340.060.020.060.240.060.190.060.280.060.270.060.300.10
score 30.420.06-0.010.060.170.060.240.060.310.060.290.060.380.10
score 20.610.060.010.060.310.060.270.060.440.060.400.060.420.10
score 10.720.07-0.050.070.400.070.290.070.560.070.530.070.640.11
score 01.000.080.200.080.550.080.520.080.720.080.610.080.970.14
R2/AIC.14/12322.03/12876.04/12765.13/12378.04/12456.04/12445.07/4860

Note. All outcome variables are standardized (M = 0, SD = 1). The SDQ subscales are dummy coded.

aReference: at-risk (“abnormal”), externalizing behavior ≥ 10, conduct problems ≥ 5, hyperactivity ≥ 7.

bThe scale was only used in adolescents aged 14 years and older (N = 1750).

Note. All outcome variables are standardized (M = 0, SD = 1). The SDQ subscales are dummy coded. aReference: at-risk (“abnormal”), externalizing behavior ≥ 10, conduct problems ≥ 5, hyperactivity ≥ 7. bThe scale was only used in adolescents aged 14 years and older (N = 1750). With regard to the results of models 1 and 2, it can be stated that the associations between the outcomes and the SDQ subscales are negative, which means that increased behavioral problems as indicated by high SDQ subscale scores are associated with lower values for the outcomes, indicating adverse outcomes. Taken as a whole, the at-risk children and adolescents have the lowest average outcome values (the intercept ranges from -0.93 to -0.18). With regard to the model fit parameters, the narrowband subscales for conduct problems and hyperactivity (model 2: jointly as predictors in one regression model) outperform the broadband subscale of externalizing behavior (model 1) in the prediction of all outcomes (comparable or higher R2 and lower AIC values). In addition, model 2 offers a deeper insight into the magnitude of the effect sizes of the two narrowband subscales. For example, in the prediction of the KINDL-R friends subscale, the regression coefficients for conduct problems are remarkably higher (B ranges from 0.12 to 0.57) than the coefficients for hyperactivity (B ranges from -0.05 to 0.20). Similarly, in the prediction of the KINDL-R family subscale, the conduct problems subscale shows considerably higher coefficients (B ranges from 0.20 to 1.14) than the hyperactivity subscale (B ranges from 0.10 to 0.52). The situation is reversed in the case of predicting the math grade, i.e., higher coefficients are observable for hyperactivity (B ranges from 0.17 to 0.72), while lower coefficients are detected for conduct problems (B ranges from 0.18 to 0.31). In addition, in the prediction of general self-efficacy, the hyperactivity subscale shows noticeably higher coefficients (B ranges from 0.02 to 0.97) than the emotional symptoms subscale (B ranges from -0.07 to 0.33). This detailed information about the differences in effect sizes between the narrowband subscales (model 2) is not depicted when the broadband subscale for externalizing behavior is used as a predictor (model 1). If the SDQ subscales are considered as continuous predictors (Table 6), the broadband subscale for externalizing behavior (model 1) outperforms the narrowband subscales (model 2) in the prediction of the KINDL-R subscales of school and self-esteem as well as in the prediction of the German grade (comparable R2 and lower AIC values). In these predictions (KINDL-R school and self-esteem subscales as well as the German grade), there are no differences in effect sizes (B) between the two narrowband subscales, but they are apparent in the predictions of the other outcomes (KINDL-R subscales for friends and family, as well as math grades and general self-efficacy).
Table 6

Child and adolescent outcome variables regressed on SDQ subscales (continuous predictors: Externalizing behavior, conduct problems, hyperactivity).

school KINDL-Rfriends KINDL-Rself-esteem KINDL-Rfamily KINDL-Rmath gradeGerman gradegeneral self-efficacya
BSEBSEBSEBSEBSEBSEBSE
Model 1 (externalizing behavior)
intercept0.730.030.270.030.360.030.650.030.400.030.410.030.460.05
externalizing behavior-0.130.00-0.050.01-0.060.01-0.110.00-0.070.01-0.070.01-0.080.01
R2/AIC.14/12318.02/12923.03/12777.11/12462.04/12445.04/12433.05/4878
Model 2 (conduct problems and hyperactivity)
intercept0.730.030.250.030.370.030.630.030.410.030.410.030.470.05
conduct problems-0.140.01-0.120.01-0.060.01-0.220.01-0.050.01-0.070.01-0.030.02
hyperactivity-0.120.01-0.010.01-0.070.01-0.060.01-0.080.01-0.070.01-0.110.01
R2/AIC.14/12319.03/12880.03/12779.13/12358.04/12444.04/12435.05/4872

Note. All outcome variables are standardized (M = 0, SD = 1).

aThe scale was only used in adolescents aged 14 years and older (N = 1750).

Note. All outcome variables are standardized (M = 0, SD = 1). aThe scale was only used in adolescents aged 14 years and older (N = 1750).

Discussion

For the first time, the SDQ broadband and narrowband scales were compared with regard to their criterion validity in predicting child and adolescent outcomes. The results of the study indicated the relevance of the different SDQ subscales for the description of students’ socio-emotional and academic situation. At the same time, the results could not support a superiority of the broadband subscales with regard to prediction of the outcomes. This interpretation can be described for the internalizing and externalizing behavior subscales (except for the prediction of general self-efficacy, where the internalizing behavior scale shows the best model fit). If the SDQ scales are considered as continuous predictors, the broadband scale for internalizing behavior outperforms the narrowband scales in the prediction of general self-efficacy and self-esteem. The same holds true for the prediction of the KINDL-R subscales of school and self-esteem, as well as for the prediction of the German grade, where the continuous externalizing subscale shows the best model fit. At any rate, in all cases where a continuous broadband scale outperforms the underlying narrowband scales, the difference in AIC values between the models is less than 3 [55], i.e., the models with narrowband scales are almost as good as the models with the broadband scales. The use of sum scores for the narrowband subscales of emotional symptoms and peer problems is more informative with respect to the range of predicted outcomes. The models using the narrowband scales indicate that there are differences between emotional symptoms and peer problems with regard to their effect on different outcome variables. This information is not depicted through the use of the broadband subscale of internalizing behavior, which might therefore lead to a loss of information. At the same time, it must be stated that children and adolescents might exhibit symptomatic behaviors related to emotional symptoms but not have major peer problems or vice versa [60]. Similar observations can be made for conduct problems and hyperactivity [61]. It can be assumed that different categories of behavioral problems (e.g. conduct problems and attention deficit/hyperactivity) also go in hand with the development of different outcomes over time [62, 63]. Similarly, different conditions and predictors might lead to either the one or the other narrowband behavior. Moreover, different developmental trajectories become clear when focusing different narrowband behaviors problems (e.g. emotional problems and peer problems) [60, 64, 65], Aggregating scores of narrowband into broadband scales might therefore run the risk of blending distinctive behaviors associated with different developmental outcomes. This evidence seems to strengthen the assumptions of Tandon et al. ([66] p. 593) who argue that: “[…] a major shift, and advance in this area, has been the study of more discrete differentiated disorders instead of lumping of all internalizing symptoms into one broad category of the two-dimensional internalizing versus externalizing taxonomy of childhood psychopathology.” Therefore, the differentiation between different narrow facets of internalizing behavior seems to be of particular importance in the description of child and adolescent psychopathology and the prediction of relevant outcomes. With respect to the broadband subscale of externalizing behavior, these assumptions can also be partially supported. In line with similar previous findings, differences between the narrowband subscales of conduct problems and hyperactivity with regard to their effect on the outcomes can also be described for most predictions.

Limitations

In this context, however, it must be noted that the chosen outcome variables do not fully describe the levels of educational development of children and adolescents. Further research is desirable that applies in-depth assessment of educational outcomes, such as domain-specific academic achievement, social integration, cognitive or self-regulation processes and uses a multi-informant approach (especially data reported by educators and teachers are of relevance). Compared to school grades, a domain-specific assessment (e.g., reading comprehension) would provide a more detailed picture of the academic performance. Besides this, the reporting of school grades from the last report card by the parents might be prone to recall bias. In addition, some of the chosen outcome variables (KINDL-R scales school and friends) show only low internal consistencies and might therefore not be reliable. Unsatisfactory internal consistency can also be described with regard to some of the SDQ scales used (conduct problems and peer problems). However, it is important to note that poor reliability does not necessarily affect the goodness of predictive analysis [67].

Conclusion

Despite the aforementioned limitations of the study at hand, the results shed light on the predictive abilities of different subscales of the SDQ. In addition to previous studies [13, 25, 28], these insights can be used for a further discussion of the advantages and disadvantages of different factor structures of the SDQ. In the sense of predicting educational outcomes, no advantage of the broadband scales (resulting from the 3-factor structure) become clear. The application of narrowband scales (resulting from the 5-factor structure), providing a more differentiated picture of the socio-emotional and academic situation of students, seems to be more appropriate for the prediction of child and adolescent outcomes. This interpretation is of course limited, as it refers to a selection of criteria and needs to be replicated with further educational outcome variables. Furthermore, future research should examine the validity of these results when using parent- or teacher-reported student behavior. In addition, the finding that differentiation of behavioral problems might be a benefit for the description of educational outcomes should be replicated using other behavior assessment tools than the SDQ. Nonetheless, for the purpose of identifying students at risk of struggling in educational contexts, using the set of narrowband dimensions of behavior seems to be more appropriate in educational research and practice.

Implications

The study at hand indicated the need of focusing narrowband behaviors in educational practice in order to gain the most differentiated insights into possible predictors of the emotional-social as well as academic development of children and adolescents. It becomes clear, that different narrowband behaviors are more or less associated with different outcome variables. Subsuming behaviors into broadband categories in educational practice might lead to the fact that students, who might have been identified as at-risk because of salient narrowband behavior, might not be identified as at-risk in broadband categories (classification accuracy). In extension to these results of our analysis, the question arises, whether in educational practice, focusing narrowband behaviors might also be the most appropriate approach with regard to educational planning. Consequently, Casale et al. [68] argue, that the early identification of at-risk students and the provision of individualized intervention might be a key advantage of applying universal screening procedures (e.g. the SDQ) in schools. Gaining insights in specific behaviors might offer the most detailed information on educational needs, which might be addressed in subsequent behavioral interventions. Subsuming behaviors might however lead to a loss of relevant information. 16 Jul 2020 PONE-D-20-15994 Prediction of School-Relevant Outcomes with Broadband and Narrowband Dimensions of Internalizing and Externalizing Behavior Using the Child and Adolescent Version of the Strengths and Difficulties Questionnaire PLOS ONE Dear Dr. Kulawiak, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. After thoroughly considering the reviews and reading the paper myself, I offer a number of points to consider in a potential revision, as well. 1. Please enter a Financial Disclosure statement: The author(s) received no specific funding for this work. 2. The use of the term “school relevant outcomes” for measures of subjective well-being, self-esteem and self-efficacy seems problematic. These constructs in themselves are the focus of important areas of research, therefore, for a multidisciplinary journal such as PLOS ONE, it would be preferable to use a wider term, for example “child outcomes” or “developmental outcomes”. I would also suggest to change the ambiguous term “social and personal factors” to more specific terms. 3. What are reliabilities and cut-offs for the externalizing and internalizing scales? 4. Please describe the KINDL-R in more detail: How many items are there? How many scales? What are the reliabilities? 5. Grades. What time period is covered by the last report card? 6. Please present the reliability of the general self-efficacy scale. 7. Please provide chi square statistics and CFI for the CFA. Reviewer 2 also addresses this issue. 8. Please provide the significance levels for the correlations in Table 2: Please submit your revised manuscript by Aug 30 2020 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter. If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols We look forward to receiving your revised manuscript. Kind regards, Helena R. Slobodskaya, M.D., Ph.D., D.Sc. Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2.Thank you for stating the following financial disclosure:  [NO]. At this time, please address the following queries: a) Please clarify the sources of funding (financial or material support) for your study. List the grants or organizations that supported your study, including funding received from your institution. b) State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.” c) If any authors received a salary from any of your funders, please state which authors and which funders. d) If you did not receive any funding for this study, please state: “The authors received no specific funding for this work.” Please include your amended statements within your cover letter; we will change the online submission form on your behalf. 3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For more information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. In your revised cover letter, please address the following prompts: a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially sensitive information, data are owned by a third-party organization, etc.) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent. b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories. We will update your Data Availability statement on your behalf to reflect the information you provide. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: No Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: No Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: The authors present an interesting question: how do the different factor-structures of the SDQ predict school grades. It is a question of interest to many educational researchers. However, it is difficult to assess how this question is answered based upon the unclear analyses used. It is necessary to further describe the methods in detail. It may be advised to conduct additional analyses to account for the complex data structure and to provide analyses that correspond more clearly to the goals of the article. Below are my more detailed comments: Introduction: 1) The authors argue that the literature linking SDQ scales to academic achievement is sparse - there is research linking subscales of it to achievement, as well as SDQ to other related scales. Additional work describing some of these works may be useful for the reader. (e.g., DeVries, Rathmann, Gebhardt, 2018). Failing the specific comparisons of SDQ subscales to achievement, perhaps a discussion about the underlying constructs relationships to achievement could be expanded upon further. 2) In general the literature review is a bit brief, but to the point. It may be worth adding a section about predictive validity of the SDQ to other variables - who made what predictions and did they use narrow or broad scales? Methods/Results: 3) Alpha and omegas in the .6 range while technically considered acceptable are still quite low - it may be worth making a cautionary note somewhere about this. 4) Also, why not give the exact values, as you do in other places (lines 170, 171, 173). 5) Alpha of for the KiGGS study is also very low, do you have the available omega? (line 191). 6) It appears you dropped prosocial behavior from all results. It may be worth a sentence somewhere explaining this and that it was done because it does not relate to the goals of the study. 7) It is altogether unclear exactly what analyses were done. Judging from the text and output tables, the authors appear to have run separate regressions for each dependent and independent variables. A multiple linear regression technique would be more appropriate here. It may be advised to run multiple linear models where the predictors are combined into the model, instead of a separate model for each predictor. Regardless, more information is required as the precise models that were developed and tested. 8) Is there no attempt to asses possible clusters and group effects? For this, a multilevel multiple regression may be advised. Discussion 8) Without a clear picture to the exact analyses run, it is impossible to evaluate the validity of any conclusions made. The general argument appears to be that the more differentiated model(s?) has(have?) a better fit and moreover there is a variation in the betas for the subscores. This line of reasoning is possibly convincing, but it requires some additional details. can you provide some theoretical connection to this reasoning? The discussion itself is altogether brief, and would benefit from this as well as a deeper connection to previous work (which may be needed also in the Intro). 9) Some discussion of the predictive power of variables with low reliability may be relevant - either as a limitation or a caution. Essentially, the low reliability of the scales may have a major impact on predictive validity, which is a major focus of the article. other notes: line 84: provide to provided Reviewer #2: The study entitled “Prediction of School-Relevant Outcomes with Broadband and Narrowband Dimensions of Internalizing and Externalizing Behavior Using the Child and Adolescent Version of the Strengths and Difficulties Questionnaire” is of great interest in the field of Child and Adolescent Psychological Health. The questionnaire SDQ is one of the most frequently used in this field and the approach is very stimulating. It contains new scientific knowledge and provides comprehensive information for further development in this productive line of research. This paper is well-argued and clearly worthy of publication. It has several strengths, amongst others: The background is adequate and up to date. The sample used is adequate and cross-sectional self-report data was obtained. Regression models have been compared with regard to the prediction of school-relevant outcomes using narrowband vs. broadband scales of behavior. The data are presented in a clear and easy-to-understand fashion. The results are clear and related to main goals. The conclusions are supported by data. As minor comments I would like to say: Introduction The literature review could include some more recent research on the use of the SDQ questionnaire, as well as provide a stronger rational for the study of internalizing and externalizing behavior problems in this specific population. Method The age group is not clearly defined. It would be necessary to include frequencies and percentages of the age and sex distribution. Results In the result section, the authors informed that models for both 3 or 5 factors show good fit indices. Please include more than one indicator of model fit besides the RMSEA, such as Chi-square (χ2); degrees of freedom (df); p-value, CFI or TLI. In addition to this information, factor invariance of gender and age would be interesting to include in the analyses. In table 1, in addition to the percentages of "abnormal" scores, it would be interesting to include those that are within the normal range and those that are at the limit. Please, provide a better rational for choosing this type of regression and the AIC measure rather than, for example, other types of regression such as hierarchical regressions. Why did you test so many different regression models? Please clarify the purpose. Discussion In the discussion section, please describe in more detail the contribution of your study and its implications for practice in the educational context. ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: J M DeVries Reviewer #2: Yes: Inmaculada Montoya-Castilla [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step. 14 Sep 2020 Points raised by the academic editor 1. Please enter a Financial Disclosure statement: The author(s) received no specific funding for this work. We added the statement to the current cover letter: “The author(s) received no specific funding for this work.” 2. The use of the term “school relevant outcomes” for measures of subjective well-being, self-esteem and self-efficacy seems problematic. These constructs in themselves are the focus of important areas of research, therefore, for a multidisciplinary journal such as PLOS ONE, it would be preferable to use a wider term, for example “child outcomes” or “developmental outcomes”. I would also suggest to change the ambiguous term “social and personal factors” to more specific terms. We changed the term “school relevant outcomes” to “child and adolescent outcomes”. We removed the term “social and personal factors”. Instead, we use the term “self-belief”, for example (l. 157): “measures of academic success (grades), well-being (school, friends, and family), and self-belief (self-esteem and self-efficacy).” 3. What are reliabilities and cut-offs for the externalizing and internalizing scales? Cut-offs (l. 206): “A conservative classification rule (37), which minimizes false positive cases by selecting a cutoff value below 10%, was used to identify at-risk children and adolescents (emotional symptoms ≥ 6, peer problems ≥ 5, conduct problems ≥ 5, hyperactivity ≥ 7, internalizing behavior ≥ 9, and externalizing behavior ≥ 10).” Reliability coefficients (Cronbach’s α and McDonald’s ω calculated with the data at hand) are reported in Table 2. 4. Please describe the KINDL-R in more detail: How many items are there? How many scales? What are the reliabilities? Items (l. 230): “Each subscale consists of 4 items.”. Additional information about the subscales (l. 234): “The subscales physical and emotional well-being are not used in the present study.” Reliability coefficients (Cronbach’s α and McDonald’s ω calculated with the data at hand) are reported in Table 2. 5. Grades. What time period is covered by the last report card? l. 235: “The school grades (math and German) received on the last report card (half-year term) were reported by the parents.” 6. Please present the reliability of the general self-efficacy scale. Reliability coefficients (Cronbach’s α and McDonald’s ω calculated with the data at hand) are reported in Table 2. 7. Please provide chi square statistics and CFI for the CFA. Reviewer 2 also addresses this issue. Additional model fit parameters have been included (l. 307). 8. Please provide the significance levels for the correlations in Table 2. Table note (Table 2): “Significant correlations (p < .05) are in bold.” Reviewer #1 1. The authors argue that the literature linking SDQ scales to academic achievement is sparse - there is research linking subscales of it to achievement, as well as SDQ to other related scales. Additional work describing some of these works may be useful for the reader. (e.g., DeVries, Rathmann, Gebhardt, 2018). Failing the specific comparisons of SDQ subscales to achievement, perhaps a discussion about the underlying constructs relationships to achievement could be expanded upon further. The point we want to address is not that the literature that links SDQ scales to academic achievement is sparse, but that the literature that compares the narrowband and broadband scales in the prediction of outcomes is sparse. We added more information for clarification. We hope this clarifies your query. We have updated the literature to describe the issue in more detail: (Aanondsen et al., 2020; DeVries et al., 2018; Essau et al., 2012; Jones et al., 2020; McAloney-Kocaman & McPherson, 2017; Niclasen & Dammeyer, 2016; Zarrella et al., 2018) 2. In general the literature review is a bit brief, but to the point. It may be worth adding a section about predictive validity of the SDQ to other variables - who made what predictions and did they use narrow or broad scales? We now make it clear (introduction) which studies used narrowband scales and which broadband scales. 3. Alpha and omegas in the .6 range while technically considered acceptable are still quite low - it may be worth making a cautionary note somewhere about this. We updated the limitations (l. 502): “In addition, some of the chosen outcome variables (KINDL-R scales school and friends) show only low internal consistencies and might therefore not be reliable. Unsatisfactory internal consistency can also be described with regard to some of the SDQ scales used (conduct problems and peer problems). However, it is important to note that poor reliability does not necessarily affect the goodness of predictive analysis (67).”. 67 = Smits, N., van der Ark, L. A., & Conijn, J. M. (2018). Measurement versus prediction in the construction of patient-reported outcome questionnaires: Can we have our cake and eat it? Quality of Life Research, 27(7), 1673–1682. https://doi.org/10.1007/s11136-017-1720-4 4. Also, why not give the exact values, as you do in other places (lines 170, 171, 173). Reliability coefficients (Cronbach’s α and McDonald’s ω calculated with the data at hand) are reported in Table 2. 5. Alpha of for the KiGGS study is also very low, do you have the available omega? (line 191). Reliability coefficients (Cronbach’s α and McDonald’s ω calculated with the data at hand) are reported in Table 2. 6. It appears you dropped prosocial behavior from all results. It may be worth a sentence somewhere explaining this and that it was done because it does not relate to the goals of the study. l. 199: “The prosocial behavior subscale was not considered in the present study, because the focus was on a comparison of the two broadband subscales with the underlying narrowband subscales.” 7. It is altogether unclear exactly what analyses were done. Judging from the text and output tables, the authors appear to have run separate regressions for each dependent and independent variables. A multiple linear regression technique would be more appropriate here. It may be advised to run multiple linear models where the predictors are combined into the model, instead of a separate model for each predictor. Regardless, more information is required as the precise models that were developed and tested. Each outcome is predicted by the SDQ scales: Outcome regressed on broadband scale (Model 1) and outcome regressed on both underlying narrowband scales (Model 2). Hence, model 2 is a multiple linear model (two predictors: two narrowband scales). Both models are compared (AIC comparison) to judge the statistical predictive performance of the different subscales of the SDQ (broadband vs. narrowband), whereby this type of model comparison is conducted separately for internalizing and externalizing behavior (internalizing behavior vs. emotional symptoms and peer problems; externalizing behavior vs. conduct problems and hyperactivity). We reorganized the section “Statistical Analysis Strategy” (starts at l. 248) and added more information for clarification. We hope this clarifies your query. 8. Is there no attempt to asses possible clusters and group effects? For this, a multilevel multiple regression may be advised. l. 171: “Study participants were not recruited in schools (non-nested data structure) but randomly selected from the official registers of local residents.” A multilevel regression is not necessary because the data structure is non-nested (random sample). 9. Without a clear picture to the exact analyses run, it is impossible to evaluate the validity of any conclusions made. The general argument appears to be that the more differentiated model(s?) has(have?) a better fit and moreover there is a variation in the betas for the subscores. This line of reasoning is possibly convincing, but it requires some additional details. can you provide some theoretical connection to this reasoning? The discussion itself is altogether brief, and would benefit from this as well as a deeper connection to previous work (which may be needed also in the Intro). Your summary of our research describes exactly the main point. We are glad that our main point is understandable. We added more information to highlight the research question and aims. In addition, we tried to address the need of a stronger theoretical connection as well as a stronger connection to previous work. Therefore, we extended the discussion with regard to previous findings on the independence of narrowband behaviors (ll.47 and 470) as well as the implications for educational planning (l. 520) 10. Some discussion of the predictive power of variables with low reliability may be relevant - either as a limitation or a caution. Essentially, the low reliability of the scales may have a major impact on predictive validity, which is a major focus of the article. We updated the limitations (l. 502): “In addition, some of the chosen outcome variables (KINDL-R scales school and friends) show only low internal consistencies and might therefore not be reliable. Unsatisfactory internal consistency can also be described with regard to some of the SDQ scales used (conduct problems and peer problems). However, it is important to note that poor reliability does not necessarily affect the goodness of predictive analysis (67).” 67 = Smits, N., van der Ark, L. A., & Conijn, J. M. (2018). Measurement versus prediction in the construction of patient-reported outcome questionnaires: Can we have our cake and eat it? Quality of Life Research, 27(7), 1673–1682. https://doi.org/10.1007/s11136-017-1720-4 11. line 84: provide to provided Done Reviewer #2 1. The literature review could include some more recent research on the use of the SDQ questionnaire, as well as provide a stronger rational for the study of internalizing and externalizing behavior problems in this specific population. We have updated the literature: (Aanondsen et al., 2020; DeVries et al., 2018; Essau et al., 2012; Jones et al., 2020; McAloney-Kocaman & McPherson, 2017; Niclasen & Dammeyer, 2016; Zarrella et al., 2018). We see the need of providing a stronger rational for the study of internalizing and externalizing behavior in this specific population. Therefore, we revised parts of the theoretical background (l. 47) and added some references emphasizing the need and rational for the study of internalizing and externalizing behavior problems in this specific population. 2. The age group is not clearly defined. It would be necessary to include frequencies and percentages of the age and sex distribution. Minimum, maximum and quartiles have been included to describe the distribution (age), l. 185: “The present sample therefore includes children and adolescents with a minimum age of 11 years in grades 5 to 9 (N = 4654; 52% boys; age in years: M = 13.46, SD = 1.47, Min = 11.00, Q1 = 12.17, Md = 13.42, Q3 = 14.67, Max = 17.92).” 3. In the result section, the authors informed that models for both 3 or 5 factors show good fit indices. Please include more than one indicator of model fit besides the RMSEA, such as Chi-square (χ2); degrees of freedom (df); p-value, CFI or TLI. In addition to this information, factor invariance of gender and age would be interesting to include in the analyses. Additional model fit parameters have been included (l. 307). Factor invariance (also referred to as measurement invariance): “Both the 3- and 5-factor structures meet the standards for metric invariance (47,48) across gender and age groups (multi‑group confirmatory factor analysis; comparison of metric and configural model: difference in the models’ CFI ≤ .01), which can be interpreted to indicate that the measured dimensions of behavior (narrowband and broadband scales) manifest in the same way across boys and girls as well as different age groups (quartile age groups in years: [11,12.2], (12.2,13.4], (13.4,14.7], and (14.7,17.9]).” (l. 309) 4. In table 1, in addition to the percentages of "abnormal" scores, it would be interesting to include those that are within the normal range and those that are at the limit. Done (see Table 1) 5. Please, provide a better rational for choosing this type of regression and the AIC measure rather than, for example, other types of regression such as hierarchical regressions. Why did you test so many different regression models? Please clarify the purpose. We have added an important source: “For the application of AIC model selection in the field of psychology and psychometrics, see Vrieze (53).” (l. 271). Vrieze, S. I. (2012). Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Each outcome is predicted by the SDQ scales: Outcome regressed on broadband scale (Model 1) and outcome regressed on both underlying narrowband scales (Model 2). Hence, model 2 is a multiple linear model (two predictors: two narrowband scales). Both models are compared (AIC comparison) to judge the statistical predictive performance of the different subscales of the SDQ (broadband vs. narrowband), whereby this type of model comparison is conducted separately for internalizing and externalizing behavior (internalizing behavior vs. emotional symptoms and peer problems; externalizing behavior vs. conduct problems and hyperactivity). We reorganized the section “Statistical Analysis Strategy” (starts at l. 248) and added more information for clarification. We hope this clarifies your query. 6. In the discussion section, please describe in more detail the contribution of your study and its implications for practice in the educational context. We have added a subsection on the “Implications” of our study (l. 520 discussing the advantages of applying broad- or narrowband categorizations of behavior in schools with regard to diagnostic practice and educational planning. Submitted filename: Response to Reviewers.docx Click here for additional data file. 24 Sep 2020 Prediction of Child and Adolescent Outcomes with Broadband and Narrowband Dimensions of Internalizing and Externalizing Behavior Using the Child and Adolescent Version of the Strengths and Difficulties Questionnaire PONE-D-20-15994R1 Dear Dr. Kulawiak, We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements. Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication. An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. Kind regards, Helena R. Slobodskaya, M.D., Ph.D., D.Sc. Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: All recommended changes have been addressed. The exact analyses and procedures are now quite clear, as well as the conclusions and limitations. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: No 29 Sep 2020 PONE-D-20-15994R1 Prediction of child and adolescent outcomes with broadband and narrowband dimensions of internalizing and externalizing behavior using the child and adolescent version of the strengths and difficulties questionnaire Dear Dr. Kulawiak: I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. If we can help with anything else, please email us at plosone@plos.org. Thank you for submitting your work to PLOS ONE and supporting open access. Kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Helena R. Slobodskaya Academic Editor PLOS ONE
  37 in total

1.  Psychiatric comorbidity in children and adolescents with reading disability.

Authors:  E G Willcutt; B F Pennington
Journal:  J Child Psychol Psychiatry       Date:  2000-11       Impact factor: 8.982

Review 2.  Model selection and psychological theory: a discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC).

Authors:  Scott I Vrieze
Journal:  Psychol Methods       Date:  2012-02-06

Review 3.  Internalizing disorders in early childhood: a review of depressive and anxiety disorders.

Authors:  Mini Tandon; Emma Cardeli; Joan Luby
Journal:  Child Adolesc Psychiatr Clin N Am       Date:  2009-07

4.  On the distinction between attentional deficits/hyperactivity and conduct problems/aggression in child psychopathology.

Authors:  S P Hinshaw
Journal:  Psychol Bull       Date:  1987-05       Impact factor: 17.737

5.  Psychometric properties of the strengths and difficulties questionnaire.

Authors:  R Goodman
Journal:  J Am Acad Child Adolesc Psychiatry       Date:  2001-11       Impact factor: 8.829

6.  Psychometric properties of the Strength and Difficulties Questionnaire from five European countries.

Authors:  Cecilia A Essau; Beatriz Olaya; Xenia Anastassiou-Hadjicharalambous; Gina Pauli; Catherine Gilvarry; Diane Bray; Jean O'callaghan; Thomas H Ollendick
Journal:  Int J Methods Psychiatr Res       Date:  2012-08-14       Impact factor: 4.035

7.  Problems of behaviour, reading and arithmetic: assessments of comorbidity using the Strengths and Difficulties Questionnaire.

Authors:  J W Adams; M J Snowling; S M Hennessy; P Kind
Journal:  Br J Educ Psychol       Date:  1999-12

8.  Validation of the Strengths and Difficulties Self-Report in Norwegian Sign Language.

Authors:  Chris Margaret Aanondsen; Thomas Jozefiak; Kerstin Heiling; Tormod Rimehaug
Journal:  J Deaf Stud Deaf Educ       Date:  2020-01-03

9.  The Strengths and Difficulties Questionnaire: psychometric properties of the parent and teacher version in children aged 4-7.

Authors:  Lisanne L Stone; Jan M A M Janssens; Ad A Vermulst; Marloes Van Der Maten; Rutger C M E Engels; Roy Otten
Journal:  BMC Psychol       Date:  2015-02-20

10.  How Does Social Behavior Relate to Both Grades and Achievement Scores?

Authors:  Jeffrey M DeVries; Katharina Rathmann; Markus Gebhardt
Journal:  Front Psychol       Date:  2018-06-04
View more
  1 in total

1.  Short measures of youth psychopathology: psychometric properties of the brief problem monitor (BPM) and the behavior and feelings survey (BFS) in a Norwegian clinical sample.

Authors:  Kristian Rognstad; Siri Saugstad Helland; Simon-Peter Neumer; Silje Baardstu; John Kjøbli
Journal:  BMC Psychol       Date:  2022-07-24
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.