| Literature DB >> 24093910 |
Joanne Allen1, Kerry J Inder, Terry J Lewin, John R Attia, Frances J Kay-Lambkin, Amanda L Baker, Trevor Hazell, Brian J Kelly.
Abstract
BACKGROUND: Epidemiologic studies often struggle to adequately represent populations and outcomes of interest. Differences in methodology, data analysis and research questions often mean that reviews and synthesis of the existing literature have significant limitations. The current paper details our experiences in combining individual participant data from two existing cohort studies to address questions about the influence of social factors on health outcomes within a representative sample of urban to remote areas of Australia. The eXtending Treatments, Education and Networks in Depression study involved pooling individual participant data from the Australian Rural Mental Health Study (T0 N = 2639) and the Hunter Community Study (T0 N = 3253) as well as conducting a common three-year follow-up phase (T1 N = 3513). Pooling these data extended the capacity of these studies by: enabling research questions of common interest to be addressed; facilitating the harmonization of baseline measures; permitting investigation of a range of psychosocial, physical and contextual factors over time; and contributing to the development and implementation of targeted interventions for persons experiencing depression and alcohol issues. DISCUSSION: The current paper describes the rationale, challenges encountered, and solutions devised by a project aiming to maximise the benefits derived from existing cohort studies. We also highlight opportunities for such individual participant data analyses to assess common assumptions in research synthesis, such as measurement invariance, and opportunities for extending ongoing cohorts by conducting a common follow-up phase.Entities:
Mesh:
Year: 2013 PMID: 24093910 PMCID: PMC3856520 DOI: 10.1186/1471-2288-13-122
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Bar chart depicting increase in number of journal publications (articles or conference abstracts) including terms referring to pooling raw data, as found by a keyword search between dates 2003-2012 (N = 544). [Database: OvidSP titles listed 2003-2012].
Reasons for combining experimental or observational research data across cohorts
| Pooling results in an area of research | |
| Increasing generalizability | ✓ |
| Interest in known or potential sources of heterogeneity | ✓ |
| Replication of results | |
| Questions of interest centre on association/modelling | ✓ |
| Interest in a subset of data | |
| Identifying directions for future research | ✓ |
| | |
| Effects of interest are small but important | ✓ |
| More sophisticated models necessary | ✓ |
| Increasing observations of infrequent events | ✓ |
| Minimising effects of attrition over time | ✓ |
| Standardizing modelling used in predicting outcomes | |
| Aggregation of data from repeated experiments | |
| Maximising existing resources | ✓ |
| Time efficiency/producing information of current public interest | ✓ |
| Cost efficiency | ✓ |
| Preliminary exploration to support funding for more comprehensive research design or inform later phases of research | ✓ |
| Features appealing to funding bodies | ✓ |
Key benefits for the eXtending Treatment, Education and Networks in Depression (xTEND) study are denoted by ✓.
Potential threats to inference when examining data across cohorts
| Contextual | The specific contexts from which samples were derived and recruited may influence results. |
| Historical | Events occurring between observations may influence results. May also relate to factors impacting on one cohort but not another at baseline assessments. |
| Time synchronicity | Studies are not conducted at a similar point in time, allowing a potential for factors or events associated with the time of administration to influence results. The length of time between follow-up assessments may also differ. |
| Geographic region | Similar to contextual factors, but specifically associated with features of geographical region. |
| Sampling frame and methods | Sampling frame (who was recruited) and methods could influence results (e.g., survey |
| Measurement equivalence | Measurement methods or characteristics may differ across cohorts or change differentially (e.g., for assessments to be comparable across samples and timepoints, we may need to examine participant responses and demonstrate that the same latent factors were assessed). |
Figure 2Proportion of the pooled eXtending Treatment, Education and Networks in Depression (xTEND) sample at baseline (T) and follow-up (T) by remoteness category and Hunter Community Study (HCS)/Australian Rural Mental Health Study (ARMHS) membership, compared to New South Wales (NSW) population (2008).
Comparability of Australian Rural Mental Health Study (ARMHS) and Hunter Community Study (HCS) measures/samples at baseline and common follow-up
| Age | I | I |
| Gender | I | I |
| Education | LI | LI |
| Marital status | LI | LI |
| Retirement status | LI | LI |
| Employment status | LI | LI |
| | | |
| Personal & network support | S | I |
| Sense of place (Environmental distress scale) | M | I |
| Family support | M | I |
| | | |
| Kessler 10 (K-10) | I | I |
| Patient Health Questionnaire (PHQ-9) - Depression | M | I |
| Depressive symptomatology (CES-D) | M | I |
| Suicidal ideation | M | I |
| Solastalgia (Environmental distress scale) | I | I |
| Self-reported quality of life (AQoL-6D) | I^ | I |
| Personal hopefulness (HOPES-12) | M | I |
| Neuroticism (Brief Eysenck scale) | M | I |
| | | |
| Body Mass Index (BMI) | S | I |
| Chronic illness | LI | LI |
| Adverse life events | M | I |
| Alcohol use | M | I |
| Current smoking | LI | LI |
| Satisfaction with life | M | I |
| Physical and mental wellbeing (SF-36) | M | I |
| | | |
| Remoteness and SEIFA (postal code) | I | I |
| % rural employment, % land use for agriculture, and % population change (LGA) | I | I |
| Social capital and Health service accessibility (regional) | I | . |
Note: I, Ideal circumstances for data combination; LI, Less than ideal circumstances for data combination (e.g., data re-coding required); S, Statistical intervention required for data combination; M, Missing from one sample or measures not comparable; ^ One subscale was missing from AQoL-6D at ARMHS baseline and imputation was required. # Contextual measures were derived using postal code information, from which indices at the relevant level of aggregation could be geocoded; SEIFA, Socio-Economic Indexes for Areas; LGA, Local government area.