| Literature DB >> 22851128 |
M Johnson1, V Schmeid, S J Lupton, M-P Austin, S M Matthey, L Kemp, T Meade, A E Yeo.
Abstract
The purpose of this review was to critically analyse existing tools to measure perinatal mental health risk and report on the psychometric properties of the various approaches using defined criteria. An initial literature search revealed 379 papers, from which 21 papers relating to ten instruments were included in the final review. A further four papers were identified from experts (one excluded) in the field. The psychometric properties of six multidimensional tools and/or criteria were assessed. None of the instruments met all of the requirements of the psychometric properties defined. Some had used large sample sizes but reported low positive predictive values (Antenatal Risk Questionnaire (ANRQ)) or insufficient information regarding their clinical performance (Antenatal Routine Psychosocial Assessment (ARPA)), while others had insufficient sample sizes (Antenatal Psychosocial Health Assessment Tool, Camberwell Assessment of Need-Mothers and Contextual Assessment of Maternity Experience). The ANRQ has fulfilled the requirements of this analysis more comprehensively than any other instrument examined based on the defined rating criteria. While it is desirable to recommend a tool for clinical practice, it is important that clinicians are made aware of their limitations. The ANRQ and ARPA represent multidimensional instruments commonly used within Australia, developed within large samples with either cutoff scores or numbers of risk factors related to service outcomes. Clinicians can use these tools, within the limitations presented here, to determine the need for further intervention or to refer women to mental health services. However, the effectiveness of routine perinatal psychosocial assessment continues to be debated, with further research required.Entities:
Mesh:
Year: 2012 PMID: 22851128 PMCID: PMC3443336 DOI: 10.1007/s00737-012-0297-8
Source DB: PubMed Journal: Arch Womens Ment Health ISSN: 1434-1816 Impact factor: 3.633
Fig. 1Systematic literature review inclusion and exclusion criteria
Reported validity and reliability for multidimensional perinatal mental health screening tools
| Scale; reference | Total no. of items; no. of domains; scoring | Domains/criteria | Reliability and validity | Sensitivity (95 % CI) (Sn), Specificity (95 % CI) (Sp), negative predictive value (NPV), positive predictive value (PPV) | Sample size; normative data | Overall ratinga |
|---|---|---|---|---|---|---|
| Multidimensional | ||||||
| Antenatal Psychosocial Health Assessment Tool (ALPHA), Carroll et al. | 35; 4; 1–3 | 1. Family factors | OR (identifying a concern), 1.005 (95 % CI 0.6–1.7, | Sn, not provided | 227; yes | Not recommended with the present level of evidence due to: small sample size, sensitivity, specificity, positive and negative |
| 2. Maternal factors | Sp, not provided | |||||
| 3. Substance use | NPV, not provided | |||||
| 4. Family violence | PPV, not provided | |||||
| Antenatal Risk Questionnaire (ANRQ), Austin et al. | 12; 7; 1–5/6 or Y/N | 1 Emotional support from subject's own mother in childhood | ROC AUCb, 0.69; 95 % CI = 0.61–0.77 (acceptable). Developed by a panel of experts, based on past reviews of postnatal depression risk factors, and on face and construct validity of these factors. The cutoff was based on ‘known groups’ using a diagnostic interview on women with high depression scores or items identifying distress. | At the most clinically relevant cutoff of 23 (out of a possible 62): | 1,196; no | Not recommended with the present level of evidence. Due to: normative scores are not provided |
| 2 Past history of depressed mood or mental illness and treatment received | Sn, 0.62 | |||||
| 3 Perceived level of support available following the birth of the baby | Sp, 0.64 | |||||
| 4 Partner emotional support | PPV, 0.30 | |||||
| 5 Life stresses in previous 12 months | NPV, 0.87 | |||||
| 6 Personality style (anxious or perfectionistic traits) | ||||||
| 7 History of abuse (emotional, physical and sexual) | ||||||
| Australian Routine Psychosocial Assessment (ARPA), Matthey et al. | 12; 7; 1–5 | 1 Support | Face/content validity of items derived from existing known psychosocial risk factors. ‘Known groups’ validation was presented. No reliability/stability testing of items was reported. | Sn, not provided | 2,167; no | Not recommended with the present level of evidence. Due to: normative scores, sensitivity, specificity, negative and positive predictive values not provided |
| 2 Stressors | Sp, not provided | |||||
| 3 Personality | NPV, not provided | |||||
| 4 Mental health | PPV, not provided | |||||
| 5 Childhood abuse | ||||||
| 6 Family violence | ||||||
| 7 Current mood measured by the Edinburgh Depression Scale [EDS] | ||||||
| Camberwell Assessment of Need—Mothers (CAN-M), Howard et al. | 26; 26; 1–3 | 1 Accommodation | Calculated using the total number of unmet needs: Inter-rater reliability = 0.93 (service users), 0.83 (staff); test–retest reliability = 0.91 (service users), 0.85 (staff); content validity, expert reviewed; consensual validity, expert reviewed; concurrent validity = compared with the GAF-S (−0.36) and GAF-D (−0.52), using total summary scores | Sn, not provided | 72; yes | Not recommended with the present level of evidence. Due to: small sample size, sensitivity, specificity, negative and positive predictive values not provided. |
| 2 Food | Sp, not provided | |||||
| 3 Looking after the home | NPV, not provided | |||||
| 4 Self care | PPV, not provided | |||||
| 5 Daytime activities | ||||||
| 6 General physical health | ||||||
| 7 Pregnancy care | ||||||
| 8 Sleep | ||||||
| 9 Psychotic symptoms | ||||||
| 10 Psychological distress | ||||||
| 11 Information | ||||||
| 12 Safety to self | ||||||
| 13 Safety to child and others | ||||||
| 14 Substance misuse | ||||||
| 15 Company | ||||||
| 16 Intimate relationships | ||||||
| 17 Sexual health | ||||||
| 18 Violence and abuse | ||||||
| 19 Practical demands of childcare | ||||||
| 20 Emotional demands of childcare | ||||||
| 21 Basic education | ||||||
| 22 Telephone | ||||||
| 23 Transport | ||||||
| 24 Budgeting | ||||||
| 25 Benefits | ||||||
| 26 Language, culture and religion | ||||||
| Contextual Assessment of Maternity Experience (CAME), Bernazzani et al. ( | 4; 3; 1–4 | 1 Recent life adversity or stressors | Prospective study: | Sn, not provided | 85; yes | Not recommended with the present level of evidence. Due to: small sample size, sensitivity, specificity, negative and positive predictive values were not reported. |
| 2 Quality of social support and key relationships including partner relationship | Domain 1 | Sp, not provided | ||||
| 3 Maternal feelings towards pregnancy, motherhood and the baby | Predictive validity: RR, 1.57 (95 % CI 1.06–2.33) | NPV, not provided | ||||
| Domain 2 | PPV, not provided | |||||
| Internal consistency: | ||||||
| Concurrent validity: compared to measurement using EPDS | ||||||
| Domain 3 | ||||||
| Internal consistency (maternal feelings towards pregnancy): | ||||||
| Internal consistency (maternal feelings towards motherhood and the baby): | ||||||
| Concurrent validity: compared to measurement using EPDS | ||||||
| Retrospective study: | ||||||
| Domain 2 | ||||||
| Internal consistency: | ||||||
| Concurrent validity: compared to measurement using EPDS | ||||||
| Domain 3 | ||||||
| Internal consistency (maternal feelings towards pregnancy): | ||||||
| Internal consistency (maternal feelings towards motherhood and the baby): | ||||||
| Concurrent validity: compared to measurement using EPDS. For maternal feelings towards motherhood and the baby the concurrent validity was modest, and three overall indices were used in further analysis | ||||||
| Pregnancy Risk Questionnaire (PRQ), Austin et al. | 21; 12; 1–6 | 1 Mother's attitude to her pregnancy | Area under the curve (AUC)b, 0.788 (95 % CI, 0.727–0.848). Compared to EDS. The AUC between the PRQ and the EDS were significantly different (0.788 and 0.659, respectively, | Sn, 0.44 | 1,296; yes | Not recommended with the present level of evidence. Low PPV |
| 2 Mother's experience of parenting in childhood | Sp, 0.92 | |||||
| 3 History of physical or sexual abuse | NPV, 0.968 | |||||
| 4 History of depression | PPV, 0.235 (at maximum | |||||
| 5 Impact of depression on psychosocial function | ||||||
| 6 Whether treatment was sought or recommended | ||||||
| 7 Presence of emotional support from partner and mother | ||||||
| 8 Presence of emotional support from partner and mother | ||||||
| 9 Presence of stressors during pregnancy | ||||||
| 10 Trait anxiety | ||||||
| 11 Obsessional traits | ||||||
| 12 Self-esteem | ||||||
aIn order to assist readers in comparing the psychometric properties of the screening tools, an overall rating was ascribed to each tool based on the model used by Hammill et al. (1992). This rating is not intended to provide a definitive statement on a tool’s psychometric rigor, which can alter with additional validation studies but rather to offer a guide based upon presently available evidence. The Hammill et al. scoring system awards the rating of a tool's reliability, validity and normative data as either ‘good’—A, ‘acceptable’—B or ‘unacceptable’—F. Ratings are based on criteria such as internal consistency, stability and test–retest reliability values, the current volume and quality of accumulated evidence regarding tool validity and the size, diversity of demographic characteristics and recency of the normative sample. To obtain a final score, normative, reliability and validity ratings were combined to provide an overall rating of the scales as being either ‘highly recommended’ (two As), ‘recommended’ (As and/or Bs) or ‘not recommended’ (any Fs) {Hammill 1992, #381}
bReceiver operating characteristic (ROC) and area under the curve (AUC): the area under the ROC curve analysis provides an indication of a particular scale's diagnostic ability to discriminate between those with and without a particular diagnosis. The AUC values range from 0.5 to 1.0, where a value of 0.5 indicates that the scale is performing at a chance level, and 1.0 indicates perfect discrimination. There is no agreed standard for interpreting the significance of the AUC statistics. However, it has been suggested that values between 0.5 and 0.70 represent a scale with low accuracy, values between 0.70 and 0.90 are indicative of a useful screening scale and a value of 0.90 and above is indicative of a highly accurate screening scale with a perfect ability to identify those with the target diagnosis {Austin et al. 2005, #389}