Literature DB >> 34412151

A Bayesian nonparametric approach to dynamic item-response modeling: An application to the GUSTO cohort study.

Andrea Cremaschi¹, Maria De Iorio^1,2,3,4, Yap Seng Chong^1,2, Birit Broekman⁵, Michael J Meaney^1,6, Michelle Z L Kee¹.

Abstract

Statistical analysis of questionnaire data is often performed employing techniques from item-response theory. In this framework, it is possible to differentiate respondent profiles and characterize the questions (items) included in the questionnaire via interpretable parameters. These models are often crosssectional and aim at evaluating the performance of the respondents. The motivating application of this work is the analysis of psychometric questionnaires taken by a group of mothers at different time points and by their children at one later time point. The data are available through the GUSTO cohort study. To this end, we propose a Bayesian semiparametric model and extend the current literature by: (i) introducing temporal dependence among questionnaires taken at different time points; (ii) jointly modeling the responses to questionnaires taken from different, but related, groups of subjects (in our case mothers and children), introducing a further dependency structure and therefore sharing of information; (iii) allowing clustering of subjects based on their latent response profile. The proposed model is able to identify three main groups of mother/child pairs characterized by their response profiles. Furthermore, we report an interesting maternal reporting bias effect strongly affecting the clustering structure of the mother/child dyads.

Entities: Chemical

Keywords: Dirichlet process; clustering; cohort study; item-response theory; questionnaire data

Mesh：

Year: 2021 PMID： 34412151 PMCID： PMC9546363 DOI： 10.1002/sim.9167

Source DB: PubMed Journal: Stat Med ISSN： 0277-6715 Impact factor: 2.497

INTRODUCTION

Research and clinical practice in child psychopathology continues to rely heavily on the reports of adults, most notably the parents and especially mothers. Assessments of child psychopathology commonly collect information from multiple informants, including parents, most often the mother and teachers. Studies of the convergence of these measures often reveal only moderate (correlation coefficient r = 0.3‐0.6) alignment. , , , Alternatively, the divergence could represent very real effects of context, whereby the function of the child varies as a function of the setting. , , The divergence might reflect reporter bias associated with the nature of the behavioral problems within a context. Teachers, for example, seem particularly sensitive to reporting of externalizing problems, likely associated with classroom disruptions. However, the major concern for parental reports is that of bias derived from the mental health of the parent. For example, depressed mothers have been found to overreport child emotional or behavioral problems. , , Cognitive theories of depression predict that depressed parents and children will globally report negatively on many aspects of their lives. , Nevertheless, a number of studies , report association between maternal symptoms of depression at one stage in development and child behavioral problems at a later stage, while controlling for depressive symptoms at the time of maternal reporting. This approach should minimize the impact of the reporter mental health at the time of the assessment. Moreover, maternal symptoms of depression do predict child outcomes on a wide range of objective measures assessed directly, with the child independent of reporter bias. , , , Thus, reporter bias seems unlikely to account entirely for the association between mental health of the mother and that of the child. Among studies with evidence for maternal bias, the bias appears to depend upon the nature of the psychopathological problem (eg, internalizing vs externalizing behaviors ) as well as the gender of the child. The nature of the maternal psychopathology is also a factor, with greater evidence for maternal bias in mothers with comorbid anxiety and depression. , The issue is clearly not as simple as a maternal bias linked to mental health status that is expressed universally. In addition to the variation associated with the target outcome and the gender of the child, there is also the possibility that a maternal bias may vary as a function of the mother and/or of the mother‐child dyad. A dyadic relation was reported where the analysis suggested that maternal psychopathology and the reported child behavior appear to interact to determine the degree of the bias. In this work, we analyze data on mothers reporting on their children's behavior, and on the children's self‐reporting, obtained from their answers to clinical screening psychometric questionnaires. The mothers answered the Child Behavior Checklist (CBCL), which is the most commonly used parent or teacher tool for child assessment. The children answered the Child Depression Inventory and the Multidimensional Anxiety Scale for Children as self‐reporting tools. The availability of both type of questionnaires allows for a deeper evaluation of the psychopathological assessment of the mother‐child dyads. Furthermore, the mothers responded to questionnaires at different time points, adding a longitudinal dimension to the study. From a statistical point of view, our work builds upon techniques from item‐response theory (IRT), which are commonly used for the analysis of questionnaire data. , , , However, the statistical approach used here best exploits the interesting features of the dataset, adding novel features to the existing literature. First, by setting the study in a Bayesian framework, we allow for joint modeling of both mothers' and children's answers to the respective questionnaires. By borrowing information between these two components of the dataset, the proposed approach exploits the dependencies within the dyads, which would be otherwise overlooked if analyzed in two independent marginal studies. Second, the longitudinal aspect of the maternal reports is taken into account by including an autoregressive structure in the model, capturing the temporal profile of the maternal evaluation of the children's behavior. Furthermore, in our setting there is not a complete overlap between questions answered by the mothers at different time points. We account for this feature of the data by specifying question‐specific parameters which are shared over different time points when the questions are repeated over time, therefore allowing for additional sharing of information. Finally, to introduce extra flexibility, dyad‐specific latent parameters capturing the respondents' profiles are modeled via Bayesian nonparametric processes, naturally inducing clustering of the mother‐child pairs. As shown later, this feature of the model allows for the identification of three main clusters of interest, presenting in‐depth differences. In particular, our analysis shows evidence of the existence of reporting biases, a crucial finding enabled by our novel statistical approach. The article is structured as follows: the dataset is described in Section 2, while the proposed Bayesian semiparametric IRT model is introduced in Section 3, followed by posterior inference results in Section 4. Section 5 concludes with a discussion.

DESCRIPTION OF THE DATASET

In this work, we develop a modeling approach for data collected from questionnaires answered by large heterogeneous cohorts over time. In particular, we focus on responses from a cohort of mothers to psychometric questions describing the behavior of their children at different time points (2, 3, 4, and 7 years of age of the children), available through the GUSTO cohort study. Furthermore, we have access to the answers to two questionnaires given to the corresponding children at 8.5 years of age, aimed at measuring the presence of depressive/anxious traits. Due to the parental relationship between the respondents, we expect specific dependency structures in the data, relating the behaviors described by the mothers with the well‐being of the children. This dependency structure is the main object of this study and is of clinical interest. To better understand the relationship between the mother and the child's perception of behavior, we develop a Bayesian semi‐parametric model able to account for covariates and time evolution. First, we describe the data relative to the mothers' questionnaires. At each time point (2, 3, 4, and 7 years), the mothers provide information about the behavioral well‐being of the children by answering the CBCL. The questions in the CBCL are grouped into psychometric subscales, as reported in the second column of Supplementary Table S1, as well as syndrome subscales as reported in Supplementary Table S2. Clearly, the children's personality, attitude and abilities evolve as they grow, and therefore different questions are asked throughout time in order to properly capture the evolution of their behavioral and developmental profile. For instance, for the last time point, subscales relative to social interaction (Social Problems) and thinking processes (Thought Problems) are included, different from earlier time points. The total number of questions asked increases from 99 at the first three time points, to 119 at the last one, with only partial overlap. The number of subjects answering the questionnaires varies throughout the years, indicating that some mothers participate in the study only at specific time points due, in part, to missed visits, thus representing missing information in the dataset. In this work, we consider mothers who answered to the CBCL questionnaires at, at least, three different time points. Answers to each question in the CBCL questionnaire vary between not true (0), somewhat true (1), and very true (2). For each subscale, the responses are summarized in Supplementary Figures S1 to S5, by reporting the proportions of each answer category, grouped according to the subscales provided in Supplementary Table S1. The missing information (ie, unanswered questions) in the mothers' questionnaire datasets is 28%, 7%, 3%, and 11% at the four time points, respectively. The self‐reported depressive and anxiety symptoms from the children are obtained using two different questionnaires: the Children Depression Inventory 2nd edition (CDI2 ), and the Multidimensional Anxiety Score for Children 2nd edition (MASC2 ). The corresponding subscales are reported in the bottom part of Supplementary Table S1. The children in the study answered a total of 28 questions in CDI2 and of 50 questions in MASC2. The answers to the questions are recorded on a scale from 0 to 2 for the CDI2 system (eg, 0 = I am sad once in a while, 1 = I am sad many times, 2 = I am sad all the time), and from 0 to 3 for the MASC2 system (eg, 0 = Never, 1 = Rarely, 2 = Sometimes, 3 = Often). The proportions of answer categories for the CDI2 and MASC2 systems are reported in Supplementary Figures S6 and S7. The missing information from the children's questionnaires is very low, with only 1% missing information in CDI2, and none in MASC2. Besides the answers to the above‐described questionnaires, additional information is available to be used as predictors in the model. This includes gender (Gender) of the child and three indices measuring mothers' depression and anxiety: the Beck's Depression Inventory (BDI ), the Edinburgh Postnatal Depression Scale (EPDS ) and the State‐Trait Anxiety Inventory (STAI ). The resulting scores are used as covariate in our model, indicated by BDI, EPDS, STAI. No missing information is present in the covariates for the mother/child pairs considered in the analysis.

BNP FOR ITEM‐RESPONSE MODELLING

The analysis of questionnaire data is usually performed using techniques from IRT. The aim of IRT is to describe the answering profiles of the respondents by quantifying their ability to answer consistently and to capture the discriminating power and difficulty of each question. All the subjects are usually evaluated on the same set of questions (items). The questions can be dichotomous (binary answers), or polytomous (answers in more than two categories). Dichotomous answers are often modeled via the Rasch model, which is based on a logistic random effects model including a latent trait parameter for each respondent and a set of discriminatory and difficulty parameters for each question. This model can be extended to its polytomous version, allowing for answers with multiple categories. In this context, different models are available such as the Rating Scale Model, , the Graded Response Model, and the Partial Credit Model. We opt for the latter, which provides the most general analysis framework. Specifically, the Partial Credit Model can be seen as an extension of the Rating Scale Model where the questions are allowed to have categories varying in number and structure, an appealing feature in our application. On the other hand, the Graded Response Model specifies the cumulative distribution functions of the responses directly, modeling regions of the latent space separated by boundaries representing the categories of each item. This approach does not allow the algebraic separation of latent traits and question‐specific parameters, preventing the identification of a sufficient statistic for either. The Partial Credit Model describes the probability that individual i answers the jth question with category h as follows: where is the number of categories available for the jth question. The latent variable is a univariate subject‐specific parameters, while and for are item‐specific parameters. Model (1) is able to estimate the main features involved in the answering process. In particular, the ability parameter describes the attitude of the respondent toward the features that the questionnaire intends to measure. For instance, in our application this can be the perception of the mothers toward their children's behavior, or how the children view themselves through the self‐reported behavioral questionnaires. For each question , the positive parameter indicates the power of each question in discriminating between subjects, acting as a question‐specific steepness parameter for the probability curve as a function of the ability parameter , while the parameter quantifies the difficulty (offset) of reaching the hth category, expressing how likely it is that a person would answer using category h rather than , for , specifying regions of the latent space for the ability parameter which correspond to each answer category. The interpretability of the parameters is an appealing feature of this model and explains its popularity in psychology and social sciences, which has led to the development of a variety of extensions, aimed at dealing with particular applications. , , For example, extensions of model (1) to include covariates have been proposed. In a longitudinal framework, where subjects answer questionnaires at different time points, random effects can be introduced in model (1), in order to account for time variation and be able to infer changes in the behavior/condition of the respondents as compared with baseline. In a Bayesian framework, flexible hierarchical models can be specified allowing for time‐varying parameters and sharing of information. Furthermore, still in a Bayesian framework, different covariance patterns for the latent ability parameters can be specified, depending on the modeling assumptions. An important scenario sees the inclusion of fixed effects to model the subjects' group membership (eg, when additional information about the respondents is available. ) However, clustering of the subjects can be one of the goals of the analysis. To this aim, we mention works from the Bayesian nonparametric literature, , where the latent trait parameters for are modeled as a sample from a Dirichlet process (DP) prior. The DP describes a prior over the space of probability distributions in such a way that its samples are almost surely discrete measures, that is if a random measure , then P is almost surely discrete. In this notation, represents the mass parameter controlling the departure from the base measure , around which the DP is centered. A useful and straightforward constructive definition of the DP is provided by its stick‐breaking representation: where is the Dirac measure equal to 1 at the location x and 0 otherwise. The infinite sequence of location parameters is i.i.d. from the continuous base measure , while the weights follow the stick‐breaking construction: Discreteness is a key feature of the DP exploited in our modeling choice, since it allows for ties in the sample of latent trait parameters, and therefore induces a clustering structure over the set of subject indices . In particular, in this semi‐parametric setting, the partition of the subject indices is itself a random variable and object of posterior inference. In our application, we face various challenges: (i) the subject‐specific ability parameters can be of difficult interpretation, since they describe complex psychological and behavioral traits, departing from the setting where the questions have correct/wrong answers; (ii) the sets of questions from the CBCL questionnaire vary over different time points in the maternal behavioral assessments, with only partial overlap between time points and (iii) we have responses from two related sides of the same cohort, mothers and children, answering different but related questionnaires. The latter is a problem that has been explored before, for instance, by employing a Graded Response Model and linking the questionnaires answered by different respondents by sharing the same latent ability parameters, therefore not differentiating between informants in the model but rather focusing on the assessment of the same topic, or by simply analyzing the questionnaires separately and comparing the results on a common scale. In our case, however, we are interested in incorporating additional information regarding the informant/respondent, therefore maintaining the latent ability traits separated, and rather introduce dependency via a common random effect. These considerations require a nontrivial generalization of model (1). First of all, we introduce some notation. Let and be the data arrays of responses corresponding to the mothers and the children, respectively. In this dataset, we have pairs. Each entry represents the response at time t of mother i to question j, for , , and , with indicating the set of questions asked at time t. Notice that the total number of questions is allowed to change over time and therefore the global set of questions answered by the mothers is indicated as . The number of questions answered by the mothers are for and , with and an overlap of questions, for a total number of unique questions equal to . The variable can take value , with possible answer categories to question j, for . Although the number of answer categories might vary with question and time, we have here that , for and . Similarly, the entry of the children's dataset contains the response of child i to question j of questionnaire q, for . In our application, for the children's reports we have different questionnaires (CDI2 and MASC2) composed of and questions for the CDI2 and MASC2 questionnaires, respectively, and such that , for and for . It is possible that some of the interviewed mothers/children do not respond to some of the questions, or only respond partially, leaving the corresponding entries of the data arrays unobserved (ie, missing). These variables will be treated as missing‐at‐random, and therefore imputed in a Bayesian framework under this assumption by sampling from their full‐conditional distributions (more details in Section 2 of Supplementary Material). As typical in longitudinal cohort studies, covariate information is also available. Let and be the covariates associated to the mothers and the children, respectively. We include the covariate Gender in modeling both mothers' and children's latent traits (see Equation (5) below). For the mothers, additional covariates indicating the time from the first visit (in years), that is, a variable taking values 0, 1, 2, 5, as well as the four depression/anxiety indicators introduced in Section 2, are available. Thus, , with and , with . We specify a joint model for : and introduce dependence between the two sets of data by jointly modeling the latent parameter vector . Conditionally on the children and mothers' responses are independent. Building on (1), we propose the following joint model: Likelihood Item parameters Subject parameters Hyperpriors Nonparametric component where is a d‐dimensional vector of zeros and is the d‐dimensional identity matrix. The vector of latent variables is such that and with and , for . This description emphasizes the inclusion of parameters for the mothers' responses that are shared between time points, and parameters for the children's responses that are questionnaire‐specific. Therefore, the parameters are indexed over the global set of questions for and do not change over time, allowing for sharing of information between time points. This modeling choice reflects the assumption that the questions in the CBCL questionnaires, some which are common through all time points, aim at measuring the same characteristics of the mothers' behavioral assessment. Due to the high number of free parameters in the model, in order to ensure identifiability, we impose a constraint on the difficulty parameters in (4), as discussed in previous work on the subject. This implies that the prior distribution the difficulty parameters is defined over a lower dimensional space, imposing a constraint on one of the and (in our case we choose ). Furthermore, the prior distribution of the parameters has unitary variance. The trait parameters for the mothers are modeled via a covariate‐dependent autoregressive model, which is able to incorporate information about the evolution of the respondent's profile through time, as well as covariate information through the vectors , for and . In particular, referring to the description of Section 2, the covariate vectors contain the variables Gender, BDI, EPDS, STAI, and a time‐varying component representing the time from the first questionnaire (in years). The latter is used to provide information on how the elapsed time influences the respondent's profile, besides the autoregressive term. For the children, since we only have one time point, the trait parameters are modeled via linear regression using the covariate vectors , for , containing only Gender. The regression coefficients are a priori normally distributed with mean zero and variance one, as the continuous covariates are standardized so that their effect is comparable. The interpretation of the latent trait parameters is very important as, in our application, they are proxy for complex psychological behavioral assessments. In the proposed model, increasing values of the parameters and correspond to respondents who are more prone to assessing the behavior described in the questions as happening more often or being more likely to be observed. Considering the nature of the specific questions posed in the CBCL, CDI2, and MASC2 questionnaires, higher values of the latent trait parameters translate into more problematic behavioral assessments. For example, for the jth question asked to the mothers characterized by the parameters , a positive value of the latent trait is associated with higher probability of answering category very true (ie, 2), which is associated with a more “negative/problematic” description of the child's behavior. A similar interpretation holds for the children's questionnaire, reflecting their perception of themselves. However, the latent trait parameters of the mothers and children are defined over different latent scales in our application. Questionnaire data from mothers and children are linked through the random effect vectors , for , which are modeled as time invariant. The vector contains parameters influencing the mothers' behavioral perception of their children , the children's self‐perception of behavior , as well as an autoregressive coefficient , which captures the time evolution of the maternal latent traits. These parameters can be seen as drivers of the respondents' profiles. Therefore, it is of interest to be able to clusters the mother and child pairs based on their response patterns through these parameters. To this end, we specify a joint distribution for these variables, by modeling the vectors with a DP prior. The discreteness of this measure allows us to cluster the parameter vectors based on ties in the sample and induces a partition of the observations a‐posteriori. Specifying the base measure is straightforward if independence between the components of the vector is assumed a priori, as in our case. In Supplementary Material Section 3.2, we compare the performance of the proposed model with competitor models. In particular, we consider a parametric version of our model, in which all the observations are assumed to come from the same cluster. Furthermore, we investigate the performance of the following extensions of our modeling strategy: (i) subject‐specific regression coefficients in Equation (5), that is, and , for and the DP prior is defined on the augmented vector ; (ii) a model in which the AR(1) dependency in the mothers' temporal profile is specified directly on the residuals of the latent process. In addition, the regression coefficients are time‐specific, that is, for and the covariate time from first questionnaire is removed; (iii) a model where instead of modeling the temporal dependence with an autoregressive component, hence linearly, we introduce time‐specific intercepts, that is, and we specify the DP prior directly on the augmented vector . In addition, the regression coefficients are time‐specific, that is, for and the covariate time from first questionnaire is removed. This latter model allows for a nonlinear temporal effect on the questionnaire temporal profiles. Our comparison shows that the proposed strategy is robust in terms of clustering results and goodness‐of‐fit, still maintaining interpretability. In Supplementary Material, we also highlight the properties of the alternative approaches. We note that model (iii) is the best competitor in terms of clustering properties, flexibility and regression results, although the extra flexibility deriving from not assuming a linear temporal dependence comes at the cost of increasing the dimension of the parameter space.

POSTERIOR INFERENCE ON MOTHER/CHILD PAIRS

We fit model (3) to the mother/child pairs introduced in Section 2. The prior distributions assumed for the item‐specific parameters, reported in (4), are multivariate Gaussian distribution with null mean vector and identity covariance matrix. The scalar variances and modeling the variances of the subject‐specific parameters are a priori distributed as inverse‐gamma random variables with unitary mean and variance. The mass parameter of the DP is chosen such that the a priori expected number of clusters is approximately three, yielding . In Supplementary Material Section 3.1, we perform a sensitivity analysis to the choice of the parameter , and show that posterior inference is sensitive (in terms of estimated number of clusters) to such choice. This is typical of nonparametric priors based on the DP and the problem could be overcome by setting a hyperprior on . Finally, we fix the hyperparameters of as reported in (7), using standard Gaussian distributions for the univariate parameters and , and a noninformative Uniform distribution defined over the interval for the AR(1) parameter . Inference for the proposed model is obtained by running a Markov Chain Monte Carlo (MCMC) algorithm with code written in Rcpp. Most of the steps in the MCMC algorithm involve updating parameters that are nonconjugate in the proposed model. These steps are performed using an adaptive Metropolis‐Hastings approach within the main Gibbs sampler. The cluster allocations are updated using an algorithm for nonconjugate DP mixture models. , After an initial 100 iterations used to initialize the adaptive updates, the model is run for 50 000 iterations, of which the first 40 000 are discarded as burn‐in and the last 10 000 are thinned every two iterations, yielding a final sample of size 5000 which is used for posterior inference.

Analysis of question‐specific parameters

The posterior distributions of the item‐specific parameters and are among the main objects of inference, since they allow identifying those questions that are able to differentiate between respondent profiles. Figure 1 shows the posterior mean and 95% credible intervals for the discriminatory parameters relative to the questions contained in the CBCL (), grouped into clinical subscales for the first three and last time points, as reported in Supplementary Table S1. We observe how, within each subscale group, some questions have more discriminatory power, especially in the first time points where we have more information. Some of the subscales are characterized by higher variability in the distributions of the discriminatory parameters, probably due to the different ways mothers answered the questions contained in these subscales. For instance, while most of the questions at year 7 are characterized by narrow posterior credible intervals, indicating little uncertainty around the estimation of the corresponding discriminatory parameters, we observe high variability in the subscale rule‐breaking behavior. An explanation of such result can be found in the nature of the questions contained in this subscale, often involving negative or extreme behaviors, which are not usually encountered by the majority of the population, therefore strongly identifying different subjects based on their answers to these questions, and requiring higher variability of such parameters. In order to better understand this behavior on a question‐specific level, item characteristic curves (ICCs) are usually shown in IRT analysis. The ICCs represent the probability of answering a question with a specific category as a function of the latent trait parameter . By studying ICCs, one is able to locate the values of the latent traits where the propensity for a category increases with respect to the previous one, as indicated by the intersection of the two corresponding curves. Figure 2 shows some examples of ICCs for those questions in for which the posterior medians of the corresponding discriminatory parameters are the highest, median and lowest (top, middle and bottom rows). The ICCs are plotted over the posterior range of values of spanned during the MCMC sample. The questions characterized by very high discriminatory parameters show ICCs with higher slopes, indicating clear areas of the latent space associated with a specific category. In the case of questions number 141 and 138, shown in the top row of Figure 2, larger credible intervals are also observed for category somewhat true, indicating an almost dichotomous characterization of the questions. On the other hand, notice the increasing flatness of the ICCs when the discriminatory power of the question decreases, allowing for wider ranges of values of where the middle category is preferred.

FIGURE 1

FIGURE 2

Posterior item characteristic curves and their 95% credible intervals (dashed lines) for questions presenting decreasing discriminatory power (top = highest, middle = median, bottom = lowest). The discriminatory power of each question is assessed by ranking the posterior median of the discriminatory parameters [Colour figure can be viewed at wileyonlinelibrary.com]

Posterior means and 95% credible intervals for the discrimination parameters for the Child Behavior Checklist questionnaire, grouped into its clinical subscales. The abbreviations are defined in Supplementary Table S2. Vertical dashed lines indicate the reference value 1. Questions asked at years 2, 3, and 4 (A) or 7 (B) [Colour figure can be viewed at wileyonlinelibrary.com] Posterior item characteristic curves and their 95% credible intervals (dashed lines) for questions presenting decreasing discriminatory power (top = highest, middle = median, bottom = lowest). The discriminatory power of each question is assessed by ranking the posterior median of the discriminatory parameters [Colour figure can be viewed at wileyonlinelibrary.com] We now discuss the children's questionnaires. Figure 3 shows the posterior mean and 95% credible intervals for the discriminatory parameters for the CDI2 and MASC2 questionnaires, respectively, each grouped according to the their subscales as reported in Supplementary Table S1. We observe how most of the posterior distributions of these parameters are centered around similar values for CDI2, indicating analogous discriminating power of such questions. However, this is not the case for the questions belonging to the MASC2 questionnaire, where many range below the reference unitary value (eg, Separation anxiety/Phobias, Generalized Anxiety Disorder, Obsessions/Compulsions, Tense/Restless, and Harm avoidance subscales). Following the same approach used for the investigation of the mothers' responses, we select the questions with highest/median/lowest posterior median of the discrimination parameters , for . Their ICCs are shown in Figure 4. Once again, we identify the questions with highest posterior median (top row) to have a more clear separation between categories in association with the value of the latent trait parameter , as characterized by higher slopes. By contrast, the bottom row shows how the middle category in question 15 of CDI2 and the categories rarely and sometimes in question 15 of MASC2 have overlapping credible intervals with the other categories for most values of , indicating lower discriminating power in terms of respondent propensity to pick a particular category. A more gradual shift between the curves for each categories is observed in the middle row.

FIGURE 3

FIGURE 4

Posterior item characteristic curves and their 95% credible intervals (dashed lines) for questions presenting decreasing discriminatory power (top = highest, middle = median, bottom = lowest). The discriminatory power of each question is assessed by computing the posterior median of the discriminatory parameters , for [Colour figure can be viewed at wileyonlinelibrary.com]

Posterior means and 95% credible intervals for the discrimination parameters for CDI2 (A) and MASC2 (B), grouped into their respective subscales. The abbreviations are defined in Supplementary Table S2. Vertical dashed lines indicate the reference value 1 [Colour figure can be viewed at wileyonlinelibrary.com] Posterior item characteristic curves and their 95% credible intervals (dashed lines) for questions presenting decreasing discriminatory power (top = highest, middle = median, bottom = lowest). The discriminatory power of each question is assessed by computing the posterior median of the discriminatory parameters , for [Colour figure can be viewed at wileyonlinelibrary.com]

Clustering analysis

The latent traits for the mothers and for the children incorporate features regarding the answering profiles of the respondents pairs . Due to the complex phenomenon under study, the interpretation of these parameters represents a challenge. As we observed in the ICCs curves presented in the previous section, the range of parameter values associated with a specific answer category is dictated by the posterior estimates of the coefficients and . In general, small values of the latent subject‐specific parameters are related to high probability of answering the lowest category (eg, not true, never), and vice‐versa large values of these parameters are associated with higher categories (eg, very true, often). In general, we can describe the behavior of the latent profiles by looking at the predictive densities of the latent variables responsible for their clustering structure, that is, of . See Figure 5. As it is easily observed from the multimodality of these predictive distributions, the driving variables for clustering are and , indicating that the mothers' latent profile contribute the most to the partition structure. This is an interesting result, suggesting that the mothers exhibit a more diverse response behavior than their children.

FIGURE 5

Contour plots for the bivariate predictive distribution of the latent variables [Colour figure can be viewed at wileyonlinelibrary.com]

Contour plots for the bivariate predictive distribution of the latent variables [Colour figure can be viewed at wileyonlinelibrary.com] Thanks to the specification of a Bayesian nonparametric prior, the posterior distribution of the vector of parameters allows the identification of clusters of mother/child dyads characterized by similar pairs of response profiles. First, we provide a posterior estimate of the partition of the data by minimizing the Binder's loss function with equal misclassification costs. This approach minimizes a loss function assigning equal costs to each type of misclassification error, that is, when two subjects are clustered together rather than separately, and vice‐versa. The Binder's loss is then obtained as the sum of such costs over all possible pairs of data points, and its expectation is minimized using the MCMC output of the algorithm. , The computation of this estimate is achieved via the R package salso and we refer to it as Binder partition. The matrix of posterior coclustering probabilities used to minimize the Binder's loss criterion, as well as the posterior distribution of the number of clusters, are reported in Supplementary Figure S12. The estimated Binder partition is composed of clusters of sizes , respectively, and labeled according to their cluster sizes in decreasing order. Due to label switching problems, estimates of the posterior distribution of the latent parameters are not trivially obtained from the MCMC output. Therefore, we study the posterior distribution of the mean of the parameters within each cluster. To this end, we first fix the partition of the subjects to the estimated Binder partition, and compute, for each MCMC iteration, the average of the values of over the subjects within each cluster and then we explore their posterior distribution. Recall that is the number of clusters for the Binder estimated partition and Figure 6 reports such posterior distributions (one for each cluster). The cluster‐specific posterior distributions of the autoregressive coefficient and the children's latent trait intercept (two bottom rows of Figure 6) are concentrated around negative values for most of the clusters, indicating latent traits oscillating more in time (), and that overall lower categories are preferred for the questions answered by children (), indicating respondents more prone to positive characterization of behavior (eg, 0 = I am sad once in a while is better than 1 = I am sad many times, or than 2 = I am sad all the time). Interestingly, Cluster 3 presents higher variability and spans a range of values including both negative and positive terms for the parameter , but it is centered around zero. On the other hand, the intercept parameter presents strong differences between clusters, although all distributions are concentrated on negative values, with highest values for the ones relative to Cluster 4. The three largest clusters also show higher precision in the posterior distributions of the averages of , while still exhibiting opposite behavior, indicating discordance in the assessments between mothers in these clusters. For instance, the assessment of Cluster 1 is better than Cluster 3, since the average values of are overall lower, but it is worse than that of Cluster 2.

FIGURE 6

Violin plots of the mean values of , , and within six clusters identified by the Binder partition. The numbers within the violin plots represent the cluster size [Colour figure can be viewed at wileyonlinelibrary.com] The relationship between the clustering structure and the way the subjects within each cluster respond to the questionnaires is presented in details in Supplementary Figures S13 and S14 (for the same sets of questions previously highlighted in Figures 2 and 4), showing the posterior expected value curves (EVCs) and their 95% credible interval. EVCs represent the mean answers to each question as a function of the latent traits, easily obtainable from the MCMC output. The EVCs are overlapped with the posterior means and 95% credible intervals of the average latent trait parameters and within each cluster (and, for the mothers, for the two extreme time points), as depicted by the horizontal lines in the figures. From the mothers' EVCs, the separation between the way subjects within clusters respond is very clear, as highlighted by the different locations of the estimates of for each cluster. In particular, in all questions presented here, Cluster 3 is associated with the most negative assessment of the children's behavior (ie, corresponds to the highest values of ). However, in all clusters apart from the single‐dyad Cluster 6, an improvement between year 2 and year 7 is observed (ie, the values of at year 7 are lower than at year 2). Another important aspect highlighted by these figures, is the relationship between the clusters and the item parameters. Specifically, questions with higher discriminatory power (ie, top rows of Figures 2 and 4 and of Supplementary Figures S13 and S14) are able to isolate the most negative Cluster 3 from the other clusters more neatly. By looking at Figure Supplementary Figure S14 for the children's assessment, we observe a similar distinction between clusters in the questions from the CDI2 and MASC2 questionnaires, again showing a strong relation between the discriminatory power of each question and the cluster separation. Interestingly, while in Supplementary Figures S13 Cluster 5 presents the lowest values for for the CBCL questions, Supplementary Figures S14 shows that this is true for Cluster 4, again capturing a discrepancy in assessment between mothers and children.

Analysis of covariate effects

The model for latent trait parameters contains a regression component, including an autoregressive term for the maternal questionnaires that are observed at several time points. In a Bayesian framework, the overall effect of covariates can be established by computing the posterior 95% credible intervals (95% CI) of the corresponding coefficients, here denoted as and for the two groups of subjects. Whenever the credible interval contains the value zero, we regard the corresponding covariate as not having a relevant effect in explaining the subjects' latent traits. The posterior means and credible intervals for the coefficients and are reported in Supplementary Figure S15. In this analysis, the time from the first questionnaire has a strong effect on the maternal latent trait evolution (95% CI ), inducing lower values of the mothers' latent traits as time passes, indicating an improvement in their behavioral assessment over time. This finding is not surprising as older children, with language development, become more responsive to parental instruction. By contrast, the maternal mental health index BDI shows a negative effect on the maternal rating (95% CI ). For the children's self‐reporting, the only covariate included (Gender) has high variability although posterior mean relatively far away from zero (95% CI ). To further characterize the clusters in the Binder partition, we look at the empirical distribution of the time‐homogeneous covariates within each cluster. To have an indication of which covariates differ the most among clusters, we perform a ‐test for the categorical covariate Gender and Kruskal‐Wallis test for the continuous mental health indicators (analyses not shown). None of the covariates is associated with a significant test result, pointing toward the existence of further (unmeasured) factors driving the clustering.

Scale‐specific analysis

In this section, we combine the information provided by the available subscales, reported exhaustively in Supplementary Table S2, and the posterior clustering estimate, with the aim of understanding the behavior of the respondents from a clinical point of view. In the analysis of questionnaire data via IRT models, a widely used summary statistics for the latent trait parameters is the raw score, obtained from each respondent as the sum of all the answers in the questionnaire or a specific subscale. Such statistic is sufficient for the latent trait parameters in the proposed model. For each subscale of interest, conditionally on the Binder partition, we can compute the raw score statistic by sampling answers from the predictive probabilities , for and for the mothers and , for and for the children. The predictive probabilities of answering a specific category within each cluster are estimated by averaging over the posterior MCMC samples, where at each iteration the subject‐specific parameters are replaced with their average within the clusters. The raw scores so obtained are representative of the subjects' behavior within each cluster, providing clinical information regarding the behavior and well‐being of the children. The distribution of such raw scores for a selection of composite subscales, for both mothers and children data in the three largest cluster, are shown in Supplementary Figures S16 to S18 for internalizing and emotional problems, anxiety and depression, and dysregulation, respectively. The latter is a composite scale of clinical interest, and is composed of scores resulting from aggregating the questions in the aggressive behavior, anxious/depressed, and attention problems CBCL subscales (also termed AAA). The differences in the predictive distributions of the raw scores within each cluster are more evident in the mothers' subscales than in the children's ones. Furthermore, Clusters 1 and 2 appear to be similar with respect to children's self‐assessment. This confirms the results discussed in Section 4.2, highlighting a discrepancy between mothers' and children's profiles. In addition, from the first two rows of Supplementary Figure S16 and the left column of Supplementary Figure S17, it is observable a general shift to smaller values in the predictive distributions of the raw scores within each cluster, indicating an improvement in the maternal assessment over time, and confirming the phenomenon already noticed in Supplementary Figure S13.

DISCUSSION

Our analysis points to the existence of three main mother‐child clusters based on a latent trait reflecting the nature of the maternal assessment of the child, jointly with an analogous latent trait reflecting the nature of the child self‐assessment (see Figures 5 and 6). In brief, all mothers in the three largest clusters show improved child ratings over time, with mothers in Clusters 1 and 2 overall more positive in their assessments. Mothers in Cluster 3 are generally more negative (see Supplementary Figure S13). However, the same is not completely matched by the children's self‐reports (see Supplementary Figure S14), who show very similar behavior in Clusters 1 and 2, supporting the evidence of bias in the mothers reporting process. These findings reflect the previously reported discrepancy between maternal reports and those of their children. Furthermore, the inclusion of mental health indicators as covariates in the model show that only the maternal mental health index BDI is a global factor in characterizing the mothers' latent traits, while none contributes to the characterization of the clusters.

Maternal mental health

Rater bias is a major concern in the assessment of child psychopathology (see Introduction). The greatest concern in the literature is that of a maternal bias in child reports linked to maternal mental health. Our analysis provides evidence for a global association between maternal response biases and maternal mental health (see Section 4.3). As such, our longitudinal analyses reveal evidence that the critical latent trait that reflects the response pattern of the mothers is associated with maternal mental health. In details, our findings suggest some evidence of an association between maternal symptoms of depression BDI, but not those of anxiety STAI. This finding is consistent with a detailed study comparing potential maternal reporting biases showing no evidence for symptoms of anxiety when compared directly with those of depression. Furthermore, as noted above, our clustering of mothers provides a unique ability for a more nuanced analysis exploring whether a mental health‐linked maternal bias might appear in a subset of mothers. In addition, there is evidence of reporting bias when clinical raw scores between mothers and children are compared (Supplementary Figures S16‐S18). This is consistent with studies reporting maternal bias associated with mental health status , which examine mothers with clinical levels of depressive symptoms. Studies reporting evidence of a maternal bias compare reports of mothers with and without clinical levels of depression. An extensive analysis provides compelling evidence for a maternal bias among clinically depressed mothers. Maternal reports using the CBCL of child behavior problems by mothers who are depressed and/or anxious at the time they provide the report are systematically different from those of unimpaired mothers when compared with child self‐reports (using the Youth Self‐Report ). It has been found that children of mothers with a history of major depression have more internalizing problems according to both mothers and teachers. When comorbidity is taken into account, mothers with symptoms of anxiety, a combination of depression and anxiety, or substance abuse appear to overstate their children's internalizing and externalizing behavioral problems, but no evidence of overreporting is found when mothers are depressed but not anxious. The majority of the depressed mothers in a previous study shows comorbid anxiety. Indeed there are other streams of evidence showing that mothers with comorbid anxiety and depression represent a significantly more affected population.

Gender

Interestingly in this analysis the gender of the child is not a major factor driving variation in the maternal latent traits across all dyads (Supplementary Figure S14, left panel). This finding is consistent with previous work that finds a gender bias in reports of teachers, but not parents. However, Gender is a relevant factor in the modeling of the latent traits for the children's answering profiles (Supplementary Figure S14, right panel). This finding is consistent with known gender differences in the self‐reports of symptoms of depression and anxiety, with greater evidence of problems in girls.

Child self‐reported symptoms

We find that the children's self‐report in the three main clusters are consistent with each other, with slightly higher values of the raw scores for Cluster 3, and lowest values reported in Clusters 1 and 2 (see Supplementary Figures S16‐S18). This is not completely consistent with the maternal report. For example, our analysis identifies maternal Clusters 1 and 2 to be associated with the lowest, but different, maternally reported Anxiety and Depression features, but the same is not observed among the children, where the predictive distributions of the raw scores are overlapping (Supplementary Figure S17). Recall that child self‐report measures are obtained at about 9 years of age, 2 years following the final maternal CBCL reports. The same pattern is observed for internalizing and externalizing problems. Thus, the relation between the maternal response latent trait and the child's self‐report is apparent across a range of distinct behavioral problems.

Conclusions

The association between parental mental health and that of the offspring has been established for some time. However, there is a long‐standing methodological concern that emphasizes the weakness of the parent as both the source of risk and the reporter of child outcomes. The concern, simply put, is that more depressed mothers tend to overrate the mental health problems of their children. Previous analyses of the issue reveal highly mixed results with reports of essential no effects, small effects or moderate effects. Detailed analyses provide a more nuanced understanding. First, there is evidence for a greater maternal reporting bias associated with depressive symptoms when rating boys compared with girls. Second, there is compelling evidence showing that the influence of a depression‐related maternal bias depends not only on gender, but on the reported outcome: internalizing problems reveal greater evidence for a maternal reporting bias than do externalizing problems, with evidence that certain emotional functions reveal greater bias than do others. A limitation of the existing literature is a reliance on crosssectional data. In this work, we analyzed maternal reports of their children's behavior in conjunction with symptom self‐reports from the children obtained from clinical psychometric questionnaires commonly used as screening tools in child clinical studies. The availability of both types of reporting systems allows for a deeper evaluation of the psycho‐pathological assessment of the mother‐child dyads. Furthermore, the mothers responded to questionnaires at multiple time points adding a valuable longitudinal dimension to the study. A significant and novel contribution of this analysis is the ability to identify specific mother—child dyadic clusters. The implication is that of variation across clusters such that the issue of potential reporting bias must consider the nature of the mother—child dyad. Among the measures of maternal symptoms of anxiety or depression, the BDI influences the mothers' profiles in the direction of greater negative assessment. However, our clusters did not associate with maternal mental health indicators nor with the gender of the child. Appendix S1 Click here for additional data file.

38 in total

1. Bayesian infinite mixture model based clustering of gene expression profiles.

Authors: Mario Medvedovic; Siva Sivaganesan
Journal: Bioinformatics Date: 2002-09 Impact factor: 6.937

2. Severity of the aggression/anxiety-depression/attention child behavior checklist profile discriminates between different levels of deficits in emotional regulation in youth with attention-deficit hyperactivity disorder.

Authors: Joseph Biederman; Carter R Petty; Helen Day; Rachel L Goldin; Thomas Spencer; Stephen V Faraone; Craig B H Surman; Janet Wozniak
Journal: J Dev Behav Pediatr Date: 2012-04 Impact factor: 2.225

A Bayesian nonparametric approach to dynamic item-response modeling: An application to the GUSTO cohort study.

INTRODUCTION

DESCRIPTION OF THE DATASET

BNP FOR ITEM‐RESPONSE MODELLING

POSTERIOR INFERENCE ON MOTHER/CHILD PAIRS

Analysis of question‐specific parameters

Clustering analysis

Analysis of covariate effects

Scale‐specific analysis

DISCUSSION

Maternal mental health

Gender

Child self‐reported symptoms

Conclusions

1. Bayesian infinite mixture model based clustering of gene expression profiles.

2. Severity of the aggression/anxiety-depression/attention child behavior checklist profile discriminates between different levels of deficits in emotional regulation in youth with attention-deficit hyperactivity disorder.

3. A multilevel item response theory model was investigated for longitudinal vision-related quality-of-life data.

4. The influence of distress on mothers' and fathers' reports of childhood emotional and behavioral problems.

5. Perinatal Maternal Depressive Symptoms as an Issue for Population Health.

6. Maternal depression and child psychopathology: a meta-analytic review.

Review 7. Children of depressed parents: an integrative review.

8. Diagnostic efficiency of the child and parent versions of the Multidimensional Anxiety Scale for Children.

9. Parental psychiatric disorder: effects on children.

10. A Bayesian nonparametric approach to dynamic item-response modeling: An application to the GUSTO cohort study.

1. A Bayesian nonparametric approach to dynamic item-response modeling: An application to the GUSTO cohort study.