Literature DB >> 30487888

A meta-analytic review of the MMPI validity scales and indexes to detect defensiveness in custody evaluations.

Francisca Fariña1, Laura Redondo1, Dolores Seijo2, Mercedes Novo2, Ramón Arce2.   

Abstract

Background/Objective: In child custody disputes, one of the remit of the forensic psychologist is to evaluate parental attributes while suspecting defensiveness. The instrument of choice for undertaking this double task is the MMPI. Method: As to establish the state of the art on this, a meta-analysis was undertaken with a total of 32 primary studies from which 256 effect sizes were assessed. A meta-analysis was undertaken, effect sizes were corrected for sampling error and criterion unreliability.
Results: The results revealed a positive, significant, large and generalizable mean true effect size for the L, K, S and MP scales, and the L + K and L + K-F indexes. The Wsd was positive, significant and large, but not generalizable. A negative and significant, but not generalizable mean true effect size was found for the F and generalizable for F-K index. The effect sizes for the L, K, S and MP scales, and the L + K and L + K-F indexes were equal. Both the gender of parents (father vs. mother) and the context of evaluation (parent child custody disputes vs. parenting capacity) were assessed as moderators. Conclusions: The results are discussed in relation to forensic practice.

Entities:  

Keywords:  Defensiveness; MMPI; Meta-analysis; Parent child custody disputes; Parenting capacity assessment

Year:  2017        PMID: 30487888      PMCID: PMC6220924          DOI: 10.1016/j.ijchp.2017.02.002

Source DB:  PubMed          Journal:  Int J Clin Health Psychol        ISSN: 1697-2600


Forensic psychological evaluation in child custody disputes is regulated by standards and guidelines established by an array of associations around the world such as the American Psychological Association (2010), the Association of Family and Conciliation Courts (Martindale, Martin, Austin, & Task Force Members, 2007), or the Spanish Psychological Association [Colegio Oficial de Psicólogos] (Chacón, García, García, Gómez, & Vázquez, 2009). Though these standards and guidelines may vary slightly, they all have common aims i.e., to determine the child's psychological best interests, to guide professionals in the evaluation of children, parents and the child-parent interaction in order to identify the child's psychological needs and parental attributes to find the best fit between child's needs and parental attributes. The primary aim of these standards and guidelines is to evaluate parenting attributes in terms of the knowledge, abilities and skills required to effectively cater for the child's needs, and to detect deficits and psychopathology that may put the child at risk. Both separation and divorce are psychosocial stressors closely linked to clinical symptomatology (Amato and Keith, 1991, Cheng et al., 2006, Weaver and Schofield, 2015). Moreover, defensive responding should be suspected (Arce et al., 2015a, Bagby and Marshall, 2004, Strong et al., 1999), affecting an estimated 30 to 40% of evaluations (Baer and Miller, 2002, Fariña et al., 2010, Strong et al., 1999). To evaluate parental attributes, psychologists employ psychological tests, clinical interviews, behavioural observation (e.g., parent-child interactions), home visits, and collateral contacts (e.g., extended family). The clinical interview, in particular the forensic-clinical interview (Vilariño, Arce, & Fariña, 2013), and psychological tests, primarily the MMPI-2, the psychometric instrument most extensively used worldwide for forensic psychological assessment which has been translated into over 40 languages (Archer et al., 2006, Fariña et al., 2014, Rogers et al., 2003), and used in over 90% of parental evaluations in child custody disputes (Ackerman and Pritzl, 2011, Arch et al., 2011, Fariña et al., 2010), serve to evaluate both parental attributes and defensiveness. When defensiveness or malingering is suspected in the assessment of psychological and personal attributes, the combination of clinical interview and psychometric evaluation is required (Arce et al., 2015b, Graham, 2011). The MMPI-2 includes the L, F and K original validity scales. The L scale was designed to detect the deliberate and overt acknowledgment of uncommon virtues. The F scale was initially designed to detect random responding, but empirical research has shown that F was also sensitive to intentional attempts to portray one's own negative image. The K scale was as a subtle indicator (F and L are more obvious) of attempts to exaggerate psychopathology and to appear in a very unfavourable way (low scores), or deny psychopathology and to present oneself in a favourable way (high scores). Due to the Restructured Form of the MMPI-2, the MMPI-2-RF, the original validity scales for defensiveness, L and K, were also reformulated as L-r and K-r. The L-r scale consisted of 14 items, sharing 11 with the original L scale and adding three additional items, while the K-r scale consisted of 14 items from the original K scale (16 were deleted and the scoring direction for one was reversed). No evidence or rationality was provided to support actions in both scales (Greene, 2011). Moreover, the MMPI-2 contains additional scales for measuring defensiveness: Positive Malingering Scale (MP); Wiggins's Social Desirability Scale (Wsd); Edward's Social Desirability (Esd); O-S Scale (Obvious-Subtle); Test Taking Defensiveness Scale (Tt); Other Deception (Od); Superlative Scale (S); and the Positive Mental Health Scale (PMH4). The S scale measures the denial of psychological problems and moral shortcomings, as well as the endorsement of unrealistically positive personal and interpersonal attributes; the Wsd, Od, Mp, Esd and Tt scales measure social desirability (Od is an upgrade version of MP and Wsd); the PMH-4, the denial of various forms of psychological maladjustment; and the O-S subscale reports underreporting when subtle items are endorsed more than obvious (negative scores). Finally, three indices, F-K, L + K and L + K-F, were related with defensiveness (Baer and Miller, 2002, Graham, 2011, Lanyon and Lutz, 1984, Posthuma and Harper, 1998). As for distortions related to defensiveness, two response patterns have been observed i.e., self-deception (SD) and positive impression management (IM) according to whether the individual is conscious or not of manipulating them (Paulhus, 1984). These response patterns have different legal implications since the IM entails a deliberate attempt (volitional component) to wilfully deceive in spite of being fully aware it is illegal (intent, cognitive component), whereas the SD implies a unreal (volitional component), but honest (cognitive component) responses (Fariña et al., 2010). In the context of forensic evaluation of custody disputes both types of response patterns can be expected. Thus, the SD would be a stable trait of a subject generalizable to all measurement contexts, whereas IM is characteristic to this measurement context, involving approximately 40% of the population under evaluation (Arce, Fariña, & Vilariño, 2015). The MMPI Wsd, L, Od and MP scales assessed IM, and the Esd, K, S and PMH4 scales assessed the SD (Bagby and Marshall, 2004, Greene, 2011, Strong et al., 1999, Strong et al., 2002). These standards and guidelines are enshrined in the professional practice of psychologists (Ackerman and Pritzl, 2011, Arch et al., 2011, Archer and Wygant, 2012, Bow and Quinnell, 2001). Moreover, judges and the courts classify, according to psychological reports, parental attributes as incapacitating characteristics for child custody (e.g., drug addiction, negligence), negative for custody (e.g., parental incompetence, mental disorders), and positive (e.g., parental abilities to cater for the child's needs) (Arce, Fariña, & Seijo, 2005). Research on defensiveness evaluation has focused mainly on two contexts, personnel selection (Strong et al., 2002) and child custody disputes (Strong et al., 1999), suggesting that these MMPI scales and indexes might perform in a different manner across different assessment contexts (Bagby & Marshall, 2004). Thus, Baer and Miller's (2002) meta-analysis has shown that the mean effect size of MMPI-2 traditional and supplementary indices of underreporting was higher for job applicants (d = 1.55) than for child custody litigants (d = 0.99). Nevertheless, these and other results of this meta-analysis published in the reference journal of psychological evaluation Psychological Assessment are not valid since the results were incorrectly computed, given that the mean effect sizes were not corrected for sampling error (nor corrected for criterion unreliability); notwithstanding, these are used worldwide in forensic setting as valid assessments. For example, the unweighted overall mean effect size (non-corrected for sampling error) reported for the K scale was d = 1.13, whereas the corrected sampling error was d = 1.47. The gap between corrected and uncorrected effect sizes, d = 0.34, implies that the K scale classified correctly 16.8% (r = .16) more defensiveness than Baer and Miller's results. Moreover, all the scales and indexes were inappropriately mixed in a global effect size. Additionally, the systematic conclusions of the literature, based mainly on the classification accuracy or incremental validity, concerning the superiority of certain scales and indexes over others (e.g., Baer and Miller, 2002, Bagby et al., 1999, Butcher, 1997, Carr et al., 2005), are not statistically supported. In fact, the application of statistical tools to the data provided by Baer and Miller about the classification accuracy (e.g., the computed 95% CIs, meaning there are no mean differences), does not corroborate the superiority. Bearing in mind these gaps, the time lapsed from the last review of the literature, 2002, a meta-analysis was undertaken to determine the mean true effect size for each of the MMPI scales and indexes of defensiveness and to assess their utility in forensic practice for evaluating parents involved child custody litigation.

Method

Search of studies

The search strategy was aimed at detecting studies evaluating parents in child custody disputes using any of the family instruments on the MMPI: MMPI (Hathaway & McKinley, 1940), MMPI-2 (Butcher et al., 1989, Butcher et al., 2001), or MMPI-2-RF (Ben-Porath & Tellegen, 2008/2011). The initial search was intended to locate previous systematic reviews and meta-analyses (i.e., Baer and Miller, 2002, Cooke, 2010, Roma et al., 2014) from which to draw a list of reviewed articles and descriptors for subsequent searches (i.e., MMPI, response styles, validity scales, child custody litigation, child custody dispute, underreporting, child custody evaluations, defensiveness, faking good, parental capacity assessment). These descriptors were used to design search algorithms applied to leading scientific databases: Web of Science, Scopus, PsycInfo and Proquest Dissertation & Theses. Finally, a search was performed in the metasearch-engine ‘google scholar’. The search was performed in July 2016. These systems yielded a total of 4,310 publications that were applied the following inclusion criteria: a) participants were parents involved in child custody litigation proceedings; b) empirical studies reporting effect size or sufficient data for its computation (when this contingency was not met but the others were, the authors were contacted to obtain the data); and c) parents were evaluated using the family instrument of the MMPI. Studies in which subjects had been instructed to respond (simulation research) like parents (most instructed students to behave as litigant parents) in custody disputes were excluded because the results of these simulation studies enjoy high face validity, while external validity remains untested (Konecni & Ebbesen, 1992), and real subjects and those in feigning conditions (simulation research) provide significant different results (Amado et al., 2015, Amado et al., 2016) and have been found to perform different tasks (Fariña, Arce, & Real, 1994). All of the studies published meeting these criteria were included. After screening, a total of 32 primary studies (21 articles in journals, 2 unpublished studies and 9 doctoral theses) were selected in which the effect sizes of one or more scales measuring defensiveness on the MMPI were obtained. Samples duplicity was controlled, 256 effect sizes were obtained: 67 for the L scale, 65 for the K, 51 for the F, 19 for the S, 15 for the Wsd, 9 for the MP, and 1 for the Esd and Od, 10 for the F-K index, 6 for L + K, and 12 for L + K-F.

Coding of primary studies

In order to proceed with the meta-analysis the following data from the studies was codified: a) article reference; b) article source (paper, unpublished data, doctoral thesis); c) sample characteristics (i.e., size, gender); d) design characteristics (evaluation of custody disputes or evaluation of parenting abilities, level of conflict, reports of sexual abuse, physical abuse, negligence or abandonment, family violence, alienation, descriptor favourable or unfavourable); e) the statistics required for computing the effect size. This task was carried out separately by two researchers, with total concordance (Cohen's k = 1) in the coding. The characteristics of the primary studies included in this review are shown in Appendix 1.

Data analysis

The effect size of the primary studies was obtained with Cohen's d since the means were systematically reported (no study was correlational) of groups in the custody dispute evaluation condition (i.e., the target population of this meta-analysis). Primary studies compared independent groups of cases-controls, multiples groups, repeated measures, and the experimental group with a test value. Moreover, some studies were found to report their results in raw scores, but others used T scores. Similarly, different versions of the MMPI i.e., MMPI-1, MMPI-2 and MMPI-2-RF were used. When the results were reported in T values, the effect size was obtained as for a single sample using the formula of Glass (Glass, 1976, Glass et al., 1981), where the measure and standard deviation of the ‘test value’ were 50 and 10, respectively. The use of the normative group was preferred to the particular study control group as the idiosyncrasies of this specific control group were controlled by taking the normative group that represents the general population (Hunter & Schmidt, 2015). When the results were reported in raw scores, these were transformed into T scores by using the means and standard deviations of the normative population in the MMPI manuals. For the scales and indexes not included in the MMPI manuals, the test value for computing d was the mean cutting scores for coached participants to be applied to test takers involved in legal proceedings (Baer & Miller, 2002), and the standard deviation for the experimental group. Having computed the effects sizes the meta-analysis was performed and corrected for sampling error and criterion unreliability (procedure of Hunter and Schmidt, 2015), for each of the scales and measurement indexes of defensiveness. Amado et al. (2015) have shown the utility of three statistics for forensic practice: U1, Binomial Effect Size Display (BESD), and Probability Superiority (PS). Thus, these were computed to derive the measures on the effectiveness of the scales and indexes for detecting defensiveness over the natural tendency for defensiveness i.e., responding defensively even with nothing to hide or to give a positive presentation (Osuna et al., 2015, Palmer et al., 2013).

Criterion reliability

Criterion reliability for the original validity scales (Table 1) assessing the MMPI and the MMPI-2 (the original scales remain in both versions with the exception of 4 items on the F scale that were eliminated from version 2 for being offensive) were taken from a meta-analytical review on the reliability of the L (70 studies), F (70 studies), and K scales (71 studies) of Hunsley, Hanson, and Parkeret (1988); and the MMPI-2-RF Manual for administration, scoring, and interpretation (Ben-Porath & Tellegen, 2008/2011). As for the additional defensiveness scales, the reliability of the S Scale (Superlative) was taken from its creators (Butcher & Han, 1995), the Wsd was taken from the only study reporting it (Paulhus, 1984), and for the MP, as no study was found reporting reliability, it was calculated on the basis of 892 normative subjects evaluated under standard response conditions (control group in studies) from the Forensic Psychology Institute of the University of Santiago de Compostela (Spain). No meta-analysis calculated Esd and Od scales as only one study was identified. Finally, the reliability of the composites (i.e., F-K, L + K, L + K-F) was calculated using the formula of Mosier (1943).
Table 1

Criterion reliability.

Scale/Indexα1α2
L.77.70
F.77.61
K.82.68
S.86---
Wsd.51---
MP.70---
F-K.85---
L + K.84---
L + K-F.86---

Note.

MMPI/MMPI-2.

MMPI-2-RF, --- Scale/index not available at this instrument.

Criterion reliability. Note. MMPI/MMPI-2. MMPI-2-RF, --- Scale/index not available at this instrument.

Results

Study of outliers

Initially outliers [±1.5*IQR] in each of the scales and indexes of defensiveness were eliminated. This tool found 2 (3.8%) outliers in 53 effect sizes in the F scale; 2 (22%) of 9 in MP; 1 of 15 (6.7%) in Wsd; 1 of 10 (10%) in F-K index; and 4 (33%) of 12 in L + K-F. As this technique eliminated many effect sizes of the MP Scale, the F-K index and the L + K-F index (≥10%, De Dreu and Weingart, 2003, Hunter and Schmidt, 2015, Tukey, 1960), it is likely they were moderators, not outliers. Moreover, the elimination should not account for an excessive percentage of evaluated subjects (N), which would substantially affect the MP Scale with the loss of 56.34% of participants. Thus, a second screening with the criterion M ± 2SD was performed, being the results generalizable to 96% of the future samples, with 1 outlier, the same as with criterion ±1.5*IQR, in F-K and in L + K-F, and none in MP. Hence, the meta-analysis calculated MP and L + K-F with the effect sizes within the region M ± 2SD. Nonetheless, given that the elimination of outliers reduces the variance, and in turn the effect size, for the L and Wsd scales the mean true effect sizes of the samples obtained with the interquartile range (IQR) criterion were computed. The results showed equivalent results for L + K-F (δ = 1.24 and 1.20 for the criterion ±1.5*IQR and M ± 2SD, respectively), and similar for MP (a positive, significant and generalizable mean true effect size), but different in size (medium, δ = 0.48, with the ±1.5*IQR criterion; and large, δ = 1.08, with the M ± 2SD criterion).

Defensiveness scales and indexes

The results of the effect size calculated for each scale and index, the total number of effect sizes obtained (k); sample size (N); the uncorrected effect size weighted by sample size (dw), and the standard deviation (SD); the effect size corrected for criterion unreliability (δ); the percentage of variance explained by the artifactual errors (%Var), 95% confidence intervals, and 80% credibility interval (when both intervals have no zero, it indicated the estimated effect size was significant and generalizable, respectively), are shown in Table 2. The results for the L, K, S, MP scales and L + K and L + K-F indexes reveal a significant (when the confidence interval has no zero, indicating the effect size was significant), positive (between child custody litigants and defensiveness), generalizable (the credibility interval had no zero, indicating the effect size was generalizable to 90% of other samples), and large (δ>0.80) mean true effect size (δ). Similar results i.e., a significant, positive and large mean true effect size, was found for Wsd, but it was not generalizable. As for the F scale and the F-K index, a significant and negative mean true effect size was found, small (0.20 > δ<0.50) and not generalizable (credibility interval had zero) for F scale, and medium (0.50 > δ<0.80) and generalizable for F-K index. As only one effect size was found for the Esd and Od scales, the mean true effect sizes could not be estimated, the uncorrected effect size were 1.24 and 1.38, respectively.
Table 2

Results of the meta-analyses between parents in child custody disputes and the normative population.

Scale/IndexkNdwSDdSDpreSDresδSDδ%Var95% CId80% CIδ
L +67106420.870.370.160.340.990.3819.470.83, 0.910.50, 1.49
L ++5895300.930.350.160.311.060.3521.790.89, 0.970.60, 1.52
K +65101540.820.270.160.220.910.2436.460.78, 0.860.60, 1.23
K ++5790740.800.280.160.220.890.2534.770.76, 0.840.57, 1.21
F +519212-0.230.300.150.26-0.270.3023.67-0.27, -0.19-0.66, 0.13
F ++438132-0.270.290.140.25-0.310.2924.94-0.31, -0.23-0.68, 0.06
S +++1932630.850.290.160.240.910.2629.850.78, 0.920.57, 1.25
Wsd + + +1412440.780.650.220.611.100.8611.510.66, 0.90-0.01, 2.20
MP + + +910880.910.450.190.411.080.4917.900.79, 1.030.45, 1.71
F-K + + +9673-0.600.300.230.19-0.650.2159.19-0.76, -0.44-0.92, -0.38
L + K+++61880.760.080.3700.8301000.47, 1.050.83
L + K-F+++113391.110.560.390.401.200.4348.640.87, 1.340.64, 1.75

Note. +studies from original validity scales of MMPI, MMPI-2 and reformulated scales of MMPI-2-RF; ++studies from original validity scales of MMPI-2; ++ + studies from the additional validity scales of MMPI-2; k = number of studies; N = total sample size; dw = effect size weighted for sample size; SD = observed standard deviation of d; SDpre = standard deviation of observed correlations predicted from all artifacts; SDres = standard deviation of observed correlations after removal of variance due to all artifacts; δ = effect size corrected for criterion unreliability; SDδ = standard deviation of δ; %Var = variance accounted for by artifactual errors; 95% CI = 95% confidence interval for d; 80% CIδ = 80% credibility interval for δ.

Results of the meta-analyses between parents in child custody disputes and the normative population. Note. +studies from original validity scales of MMPI, MMPI-2 and reformulated scales of MMPI-2-RF; ++studies from original validity scales of MMPI-2; ++ + studies from the additional validity scales of MMPI-2; k = number of studies; N = total sample size; dw = effect size weighted for sample size; SD = observed standard deviation of d; SDpre = standard deviation of observed correlations predicted from all artifacts; SDres = standard deviation of observed correlations after removal of variance due to all artifacts; δ = effect size corrected for criterion unreliability; SDδ = standard deviation of δ; %Var = variance accounted for by artifactual errors; 95% CI = 95% confidence interval for d; 80% CIδ = 80% credibility interval for δ. Comparatively, the mean true effect sizes in the scale and indexes with a positive and generalizable relationship with defensiveness for L, δ = 0.99, 95%CI [0.95, 1.03], K, δ = 0.91, 95% CI [0.87, 0.95], S, δ = 0.91, 95%CI [0.84, 0.98], and MP, δ = 1.08, 95%CI [0.95, 1.21], scales and for L + K index, δ = 0.83, 95%CI [0.53, 1.13] and L + K-F, δ = 1.20, 95%CI [0.97, 1.43], indexes were equal (if the 95% CIs for δ overlap, it indicates no mean differences). In terms of utility for forensic practice (Table 3), the results revealed that the L scale classified as defensiveness 44.4% more (BESD) protocols in the population of custody disputes than in the normative group; 55.0% (U1 = .55) of the area covering both populations (normative and custody disputes) did not overlap i.e., they were totally independent; and a probability of .75 (PS) that subjects in custody disputes score higher on the L scale than the population normative. In K, S, MP, L + K and L + K-F, the defensiveness classification rate in the custody disputes population was, respectively, 41.4, 41.4, 47.6, 38.4, and 51.6%, more than in the normative population; the distributions for the normative population and custody disputes were totally independent in 51.9, 51.9, 58.2, 48.7 and 62.2%; and the probability of superiority was .74, .74, .77, .72 and .80, that is, these would be the probabilities for the population under custody disputes scoring higher on these scales than the normative population.
Table 3

Practical utility indicators.

Scale/IndexU1rPS
L.55.44.75
K.51.41.74
S.51.41.74
MP.58.47.77
L + K.48.38.72
L + K-F.62.51.80

Note. Only for scales and indexes with generalizable effects sizes; U1 = Cohen's U1 statistic; r = correlation for BESD compute; PS = probability of superiority.

Practical utility indicators. Note. Only for scales and indexes with generalizable effects sizes; U1 = Cohen's U1 statistic; r = correlation for BESD compute; PS = probability of superiority. The 75% rule (Hunter & Schmidt, 2015) warrants the study of moderators, except for the L + K index (%Var = 100, indicating the primary studies were not entirely randomly distributed, and N [<400] was insufficient for the study of moderators). The literature suggests the parent's gender could play a relevant role in defensiveness (Roma et al., 2014), as well as the situational factor (parent child custody disputes [PCCDs] vs. parenting capacity assessment in child protection cases [PCA-CPCs]) (Carr et al., 2005). Other moderators could not be analysed due to insufficient effect sizes or Ns. A last moderator, the version of the MMPI i.e., the original MMPI, the MMPI-2 and the MMPI-2-RF, could not be examined as the studies with the original MMPI and the MMPI-2-RF, are only available for the original validity scales, and were insufficient (N < 400 and/or k≤3). Thus, results were computed for all versions and only for the MMPI-2 (see Table 2).

Gender as a moderator

The meta-analysis on the gender of the litigator as a moderator (Table 4), in line with the general meta-analysis, showed for the L, K and S scales a significant, positive, generalizable and large (or nearly large) mean true effect size for both fathers and mothers. The mean true effect sizes for fathers and mothers were equal (Table 4) in the three scales (the 95%CIs for δ overlapped).
Table 4

Results of the meta-analyses for the gender of the litigator as moderator.

Scale/SubsamplekNTdwSDdSDpreSDresδSDδ%Var95% CId80% CIδ95% CIδ
L Scale
 Fathers2427830.670.350.190.300.760.3428.730.59, 0.750.32, 1.210.68, 0.84
 Mothers2428570.810.410.190.370.920.4221.210.73, 0.890.38, 1.460.84, 1.00



K Scale
 Fathers2327230.670.220.190.110.740.1372.180.59, 0.750.57, 0.910.66, 0.82
 Mothers2328010.750.230.180.140.840.1564.550.67, 0.830.64, 1.040.76, 0.92



F Scale
 Fathers182514-0.340.250.170.18-0.390.2145.95-0.42, -0.26-0.66, -0.11-0.47, -0.31
 Mothers172499-0.170.270.160.22-0.200.2536.19-0.25, -0.09-0.52, 0.12-0.28, -0.12



S Scale
 Fathers813060.810.220.160.160.870.1751.250.69, 0.930.65, 1.090.75, 0.99
 Mothers814180.900.220.150.150.970.1651.050.80, 0.990.76, 1.190.85, 1.09

Note. Studies only from MMPI-2; 95% CIδ = 95% confidence interval for δ.

Results of the meta-analyses for the gender of the litigator as moderator. Note. Studies only from MMPI-2; 95% CIδ = 95% confidence interval for δ. In the F scale, as in the general meta-analysis, a significant and negative mean true effect size for both fathers and mothers was observed. Nevertheless, this negative mean true effect size may be generalised to other samples for fathers, but not for populations of mothers. The meta-analysis for the Wsd and MP scales, and F-K and L + K-F indexes are not shown as k (≤3) and/or N (<400) were too low to guarantee stability in sampling estimates (Hunter & Schmidt, 2015), which were in line with the general meta-analysis and equal to gender.

The context of disputes as moderator

The context of evaluation (parent child custody disputes [PCCDs] vs. parenting capacity assessment in child protection cases [PCA-CPCs]) appears in primary studies as a potential moderator of differences in the evaluation of parents/caregivers in custody disputes. To this effect, the L, K, F and S scales were evaluated. The results (Table 5) reveal a positive, significant, generalizable and large mean true effect size for the L scale for both parents in custody disputes and for parents in PCA-CPCs. Notwithstanding, the effect size was significantly larger in PCA-CPCs, δ = 1.41, 95%CI [1.22, 1.60], than in PCCDs, δ = 0.97, 95%CI [0.93, 1.01]. As for the K scale, the results of the meta-analysis showed a positive, significant, generalizable and large mean true effect size for PCCs and of a small size for PCA-CPCs. In contrast to the L, the effect size for the K scale was significantly larger in PCCDs, δ = 0.95, 95%CI [0.91, 0.99], than in PCA-CPCs, δ = 0.28, 95%CI [0.11, 0.45]. In the F scale the results show an inverse relationship: a negative, significant, generalizable and small mean true effect size for PCCDs, and positive, significant, generalizable and large mean true effect size in PCA-CPCs. Finally, the results for the S scale showed a positive, significant, generalizable and large mean true effect size in PCCDs, and a non-significant mean true effect size in PCA-CPCs.
Table 5

Results of the meta-analyses for the evaluation context as moderator.

Scale/IndexkNTdwSDdSDpreSDresδSDδ%Var95% CId80% CIδ
L Scale
 PCCDs60100990.850.370.160.340.970.3818.430.81, 0.890.47, 1.47
 PCA-CPCs75431.240.140.2401.4101001.06, 1.421.41



K Scale
 PCCDs5896110.850.240.160.180.950.2043.220.81, 0.890.68, 1.21
 PCA-CPCs75430.260.100.2200.2801000.08, 0.440.28



F Scale
 PCCDs478785-0.280.220.140.16-0.320.1943.27-0.32, -0.24-0.57, -0.07
 PCA-CPCs54460.710.290.210.190.810.2255.970.53, 0.890.53, 1.10



S Scale
 PCCDs1630430.890.240.150.180.960.2040.580.81, 0.970.71, 1.22
 PCA-CPCs32200.200.150.2300.210100-0.07, 0.470.21

Note. Studies only from MMPI-2; Meta-analysis only for generalized scales and indexes.

Results of the meta-analyses for the evaluation context as moderator. Note. Studies only from MMPI-2; Meta-analysis only for generalized scales and indexes.

Discussion

The following conclusions may be derived from the results of this study. First, none of the scales or indexes detected totally defensiveness. Thus, no indicator of defensiveness was a fully efficacious detector on its own and had to be used in combination or accumulatively to enhance efficacy. Second, in line with the original models, the L, K, S, Mp, Wsd, Od and Esd scales and the L + K and L + K-F indexes were positively related to defensiveness, whereas the F scale and the F-K index were negatively related. Third, the results undermine the findings of studies claiming the superiority of scale over the other on the basis of simply observing the means and classification accuracy (e.g., Bagby et al., 1999, Butcher, 1997, Carr et al., 2005), MMPI reference manuals (Graham, 2011, Greene, 2011), and other meta-analysis (Baer & Miller, 2002), which should be revised. However, the results for Wsd, F, and F-K were not generalizable i.e., they did not consistently detect inter-study defensiveness. Likewise, the findings of studies reporting the validity of these scales and indexes as detectors of defensiveness should also be reviewed (e.g., Baer and Miller, 2002, Baer et al., 1992, Baer et al., 1995a; Baer et al., 1995b, Bagby et al., 1997). Forth, the L, K, S and MP scales, and L + K and L + K-F indexes, whose efficacy in detecting defensiveness was similar, were found to be the best detectors. Fifth, the scales and indexes with generalizable results (i.e., L, K, S, MP, L + K, L + K-F) add to the classification baseline of defensiveness (normative group), with approximately 40 to 50% more cases; the discrimination rate (independence distributions) between protocols of populations in custody disputes and the normative population (honest response) ranged from 50 to 60%; and the probability that parents in custody disputes obtained higher scores on the scales and indexes with generalizable results ranged approximately from .75 to .80. Sixth, the defensiveness attitudes of men and women in the evaluation of child custody disputes were similar, which disagrees with the findings of studies claiming different attitudes towards the evaluation (defensiveness) in men and women in child custody disputes (Roma et al., 2014). Seventh, L was a significantly better detector of defensiveness in the PCA-CPC than in PCCD evaluation context, and both K and S were in PCCDs. Surprisingly, the F scale was related, in line with the model (high scores suspect potential feigning), negatively (between parents in child custody disputes and defensiveness) in PCCDs, but positively related in PCA-CPCs (contrary to the model). In short, attitudes towards the evaluation (defensiveness) were measured according the evaluation context i.e., PCCDs vs. PCA-CPCs. This meta-analysis has several limitations that should be borne in mind such as: a) the results were obtained from studies on parent child custody disputes or parenting capacity assessment in child protection cases, and caution should be exercised in generalizing the findings to other contexts; b) the results of the meta-analysis in certain conditions may be subject to a degree of variability given that Ns <400 o k ≤3 is no guarantee of the stability of sampling estimates (Hunter & Schmidt, 2015); c) due to insufficient primary studies in the Esd and Od Scales, the effect sizes could not be corrected; and d) the results of the self-deception (SD) and positive impression management (IM) scales cannot be directly generalized to forensic practice since they are mediated by conscious or not manipulation that have different legal implications. Further research is required to assess the defensiveness detection capacity of the Esd and Od scales given the lack of studies in the literature and the insufficient Ns; to evaluate the effects of the evaluation context, and to assess the revised MMPI-2-RF scales that could not be used as a moderator in this study owing to the lack of studies. Thus, more studies with the MMPI-2-RF validity scales are necessary. Nevertheless, as for the substitution of the L and K, the original defensiveness scales of the MMPI-2, as well as the F scale as it was used to compute indexes, for the reformulated ones in the MMPI-2-RF i.e., the L-r, K-r, and F-r, and the subsequent indexes derived from these, a great number of studies with a significantly higher mean true effect size will be required. Hence, a File Drawer Analysis showed that for L, K and F scales would be necessary 615, 498 and 143 studies, respectively, to reverse the results from MMPI-2 to a trivial effect or to attribute them to a sampling bias. Additionally, there is no evidence about the performance of the indexes with the MMPI-2-RF. Moreover, the S and MP additional validity scales (results from Wsd scale are not generalizable) were not reformulated for MMPI-2-RF. As a combination of all the measures of defensiveness is necessary to classify defensiveness in forensic practice (the wrong classification of a protocol as defensive is not permitted in forensic practice as it supposes a false allegation against assessed person) (Arce, Fariña, & Vilariño, 2015; Fariña et al., 2010), while awaiting further evidence for MMPI-2-RF and for the reformulation of additional validity scales, the MMPI-2 must be preferred.

Funding

This research has been sponsored by a grant of the Spanish Ministry of Economy and Competitiveness (PSI2014-53085-R).
SourceInstrumentNSubsampleEvaluation context
Agüero and Álvarez-Icaza (2014)PaperMMPI-2345FathersPCCD
342MothersPCCD
Arce, Fariña, and Vilariño (2015)PaperMMPI-2488AllPCCD
Archer, Hagan, Mason, Handle, and Archer (2012)PaperMMPI-2-RF172FathersPCCD
172MothersPCCD
Bagby et al. (1999)PaperMMPI-257FathersPCCD
58MothersPCCD
Bathurst, Gottfried, and Gottfried (1997)PaperMMPI-2258FathersPCCD
250MothersPCCD
Butcher (1997)PaperMMPI-2868FathersPCCD
911MothersPCCD
Caldwell (2004)UnpublishedMMPI-21867AllPCCD
Carr et al. (2005)PaperMMPI-273FathersPCA-CPC
91MothersPCA-CPC
Cooke (2010)PaperMMPI-250FathersPCCD
50MothersPCCD
Daskalakis (2004)Doctoral thesisMMPI-249AllPCCD
Ezzo, Pinsoneault, and Evans (2007)PaperMMPI-270AllPCCD
205AllPCCD
Fariña et al. (2010)PaperMMPI-2126AllPCCD
Gordon, Stoffey, and Bottinelli (2008)PaperMMPI-279FathersPCCD
79MothersPCCD
Gordon, Stoffey, and Bottinelli (2008)PaperMMPI-241FathersPCCD
41MothersPCCD
7FathersPCCD
7MothersPCCD
31FathersPCCD
31MothersPCCD
Gready (2006)Doctoral thesisMMPI-231FathersPCA-CPC
66MothersPCA-CPC
116FathersPCCD
124MothersPCCD
Hopkins (1999)Doctoral thesisMMPI/MMPI-2207FathersPCCD
219MothersPCCD
Kauffman, Stolberg, and Madero (2015)PaperMMPI-251AllPCCD
Leib (2006)Doctoral thesisMMPI-26FathersPCCD
18MothersPCCD
7FathersPCCD
18MothersPCCD
Mandappa (2004)Doctoral thesisMMPI-2420AllPCCD
Moreland and Greenberg (1993)UnpublishedMMPI201AllPCCD
MMPI-233FathersPCCD
32MothersPCCD
Normington (2006)Doctoral thesisMMPI-219AllPCA-CPC
19AllPCCD
Ollendick and Collings (1984)PaperMMPI38FathersPCCD
38MothersPCCD
Peters (2012)Doctoral thesisMMPI-268AllPCCD
57AllPCCD
Posthuma and Harper (1998)PaperMMPI-240FathersPCCD
40MothersPCCD
27FathersPCCD
27MothersPCCD
27FathersPCCD
27MothersPCCD
Rehil (2011)Doctoral thesisMMPI-261AllPCCD
Resendes and Lecci (2012)PaperMMPI-2136AllPCA-CPC
Roma et al. (2014)PaperMMPI-2194FathersPCCD
197MothersPCCD
Schenk (1996)PaperMMPI-260FathersPCCD
56MothersPCCD
46FathersPCCD
34MothersPCCD
Stredny and Archer (2006)PaperMMPI-2127AllPCA-CPC
Strong et al. (1999)PaperMMPI-2206FathersPCCD
206MothersPCCD
Wakefield and Underwager (1990)PaperMMPI-232FathersPCCD
27MothersPCCD
Wisneski (2006)Doctoral thesisMMPI-2626AllPCCD

Note. PCCD = parent child custody disputes; PCA-CPC = parenting capacity assessment in child protection cases.

  20 in total

Review 1.  Underreporting of psychopathology on the MMPI-2: a meta-analytic review.

Authors:  Ruth A Baer; Joshua Miller
Journal:  Psychol Assess       Date:  2002-03

2.  Effects of information about validity scales on underreporting of symptoms on the personality assessment inventory.

Authors:  R A Baer; M W Wetter
Journal:  J Pers Assess       Date:  1997-04

3.  A survey of psychological test use patterns among forensic psychologists.

Authors:  Robert P Archer; Jacqueline K Buffington-Vollum; Rebecca Vauter Stredny; Richard W Handel
Journal:  J Pers Assess       Date:  2006-08

4.  Task versus relationship conflict, team performance, and team member satisfaction: a meta-analysis.

Authors:  Carsten K W De Dreu; Laurie R Weingart
Journal:  J Appl Psychol       Date:  2003-08

5.  Comparing the MMPI-2 scale scores of parents involved in parental competency and child custody assessments.

Authors:  John Resendes; Len Lecci
Journal:  Psychol Assess       Date:  2012-05-21

6.  MMPI discrimination of defensive and nondefensive felony sex offenders.

Authors:  R I Lanyon; R W Lutz
Journal:  J Consult Clin Psychol       Date:  1984-10

7.  Assessing Impression Management With the MMPI-2 in Child Custody Litigation.

Authors:  Ramón Arce; Francisca Fariña; Dolores Seijo; Mercedes Novo
Journal:  Assessment       Date:  2014-11-19

8.  Assessing underreporting response bias on the MMPI-2.

Authors:  R Michael Bagby; Margarita B Marshall
Journal:  Assessment       Date:  2004-06

9.  Assessment of the standard forensic procedure for the evaluation of psychological injury in intimate-partner violence.

Authors:  Francisca Fariña; Ramón Arce; Manuel Vilariño; Mercedes Novo
Journal:  Span J Psychol       Date:  2014       Impact factor: 1.264

10.  Factors moderating children's adjustment to parental separation: findings from a community study in England.

Authors:  Helen Cheng; Judy Dunn; Thomas G O'connor; Jean Golding
Journal:  J Abnorm Child Psychol       Date:  2006-02-23
View more
  4 in total

1.  MMPI-2-RF Profiles in Child Custody Litigants.

Authors:  Cristina Mazza; Franco Burla; Maria Cristina Verrocchio; Daniela Marchetti; Alberto Di Domenico; Stefano Ferracuti; Paolo Roma
Journal:  Front Psychiatry       Date:  2019-10-16       Impact factor: 4.157

2.  What Goes on in This House Do Not Stay in This House: Family Variables Related to Adolescent-to-Parent Offenses.

Authors:  Antonia Hernández; Ana M Martín; Stephany Hess-Medler; Juan García-García
Journal:  Front Psychol       Date:  2020-12-07

3.  Psychosocial Profile of Juvenile and Adult Offenders Who Acknowledge Having Committed Child-to-Parent Violence.

Authors:  Ana M Martín; Leticia De la Fuente; Antonia Hernández; Flor Zaldívar; Elena Ortega-Campos; Juan García-García
Journal:  Int J Environ Res Public Health       Date:  2022-01-05       Impact factor: 3.390

4.  The Reasons for Doing Physical Exercise Mediate the Effect of Self-Esteem on Uncontrolled Eating Amongst Nursing Personnel.

Authors:  María Del Carmen Pérez-Fuentes; María Del Mar Molero Jurado; María Del Mar Simón Márquez; José Jesús Gázquez Linares
Journal:  Nutrients       Date:  2019-01-31       Impact factor: 5.717

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.