| Literature DB >> 24397587 |
Ikhlaaq Ahmed, Thomas P A Debray, Karel G M Moons, Richard D Riley1.
Abstract
BACKGROUND: Risk prediction models estimate the risk of developing future outcomes for individuals based on one or more underlying characteristics (predictors). We review how researchers develop and validate risk prediction models within an individual participant data (IPD) meta-analysis, in order to assess the feasibility and conduct of the approach.Entities:
Mesh:
Year: 2014 PMID: 24397587 PMCID: PMC3890557 DOI: 10.1186/1471-2288-14-3
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Summary of the 15 articles included in the review
| Italy | To identify predictors of short-term and sustained Alanine transaminase (ALT) normalization after interferon treatment in adult patients with hepatitis C | Adult patients with transfusion-related or community-acquired | Short term and sustained response (ALT normalization) | Collaborative group | 2 (NA) | |
| USA | To determine the predictive accuracy of pH for identifying patients with malignant pleural effusions who will fail pleurodesis | Patients with malignant pleural effusions | Failure of pleurodesis | Literature review | 6 (12) | |
| Canada | To determine the ability of intermediate plasma viral load (pVL) measurements to predict virologic outcome at 52 weeks of follow-up in clinical trials of antiretroviral therapy | Patients within a particular range of CD4 cell counts, naive to antiretroviral therapy and not been previously diagnosed with AIDS | Virologic outcome at 52 weeks of follow-up | Collaborative group | 3 (NA) | |
| Netherlands | To develop a prognostic tool for patients with unresectable pancreatic cancer to distinguish between low or high probabilities of survival 3 to 9 months after diagnosis. | Patients diagnosed with pancreatic cancer | Overall survival | Literature review | 8 (15) | |
| United Kingdom | To identify baseline patient- or tumour-related prognostic factors; and to assess whether pre-treatment quality of life predicts survival in patients with locally advanced or metastatic esophago-gastric cancer. | Patients with histologically confirmed inoperable adenocarcinoma, squamous cell carcinoma, or undifferentiated carcinoma of the oesophagus, esophago-gastric junction, or stomach | Overall survival | Collaborative group | 3 (NA) | |
| Netherlands | Investigate if transcranial doppler monitoring for micro embolic signals, directly after carotid endarterectomy (CEA) identifies patients at risk of developing ischaemic complications. | Carotid endarterectomy patients | Cerebral ischaemic complications, defined as new neurological deficits within 1st week after CEA | Literature review | 7 (10) | |
| Germany | Identifying the predictive value of biologically effective dose as function of the risk of myelopathy | Patients with spinal cord retreatment | Development of radiation myelopathy | Literature review | 8 (8) | |
| Australia | To investigate the generalisability of current definitions of the metabolic syndrome in Asia-Pacific populations, and assess the prognostic value of metabolic risk factors to discriminate fatal coronary heart disease (CHD) risk | Healthy patients aged 30–75 | Fatal CHD within 10 years | Collaborative group | 26 (NA) | |
| Belgium | To predict a superficial bladder cancer patient’s probability of recurrence and progression at one and five years | Stage Ta, T1, and Tis bladder cancer patients who have undergone transurethral resection | Time to first recurrence (disease-free interval) and time to progression to muscle invasive disease | Collaborative group | 7 (NA) | |
| USA | Early prediction of hypocalcaemia after thyroidectomy using parathyroid hormone | Patients undergoing thyroidectomy | Postoperative symptomatic hypocalcaemia | Literature review | 9 (15) | |
| Netherlands | To determine the predictors of a prolonged course for children with acute otitis media (AOM) to discriminate between children with and without poor outcomes | Children with AOM | A prolonged course of AOM (pain and/or fever at 3 to 7 days) | Literature review | 6 (10) | |
| Germany | To identify prognostic indicators in acute myeloid leukaemia (AML) to provide a new prognostic model for risk stratification of AML patients | AML patients | Overall survival and relapse-free survival | Collaborative group | 8 (NA) | |
| United Kingdom | To determine if the ankle brachial index provides information on the risk of cardiovascular events and mortality independently of the Framingham risk score and can improve risk prediction | Participants of any age and sex derived from a general population | Total and cardiovascular mortality | Literature review | 16 (20) | |
| Netherlands | To develop prediction model for predicting unfavourable outcome according to the glasgow outcome scale (GOS) at 6 months after traumatic brain injury (TBI) | Patients with moderate or severe TBI (with GOS > = 12) | 6-months mortality and unfavourable outcomes defined by 6 months GOS | Collaborative group | 11 (NA) | |
| United Kingdom | To design a prognostic indicator using demographic information to select patients at risk of dying after myocardial infarction (MI) | Patients at day 45 post-MI up to 2 years | All-cause, arrhythmic and non-arrhythmic cardiac mortality within 2 years | Not stated | 4 (unknown) |
NA not applicable.
Figure 1Number of studies for which IPD was requested and obtained in the seven articles using a literature review to identify relevant studies to seek IPD from.
Methodological challenges when developing and validating a risk prediction model using IPD from multiple studies as identified from those 15 articles in our review (written below in a framework similar to recommendations by Abo-Zaid et al.[37]for prognostic factors)
| • Unavailability of IPD in some studies | |
| • How to assess quality of studies available | |
| • Inability of IPD to overcome deficiencies of original studies, such as missing participant data or of being low methodological quality. | |
| • Dealing with different definitions of disease or outcome | |
| • Dealing with different (or out-dated) treatment strategies, especially when a mixture of older and newer studies are combined | |
| • Dealing with a mixture of IPD from retrospective and prospective studies | |
| • Missing data, including: missing predictor values and missing outcome data for some participants within a study, and completely unavailable predictors in some studies | |
| • Difficulty in using a continuous scale for continuous factors in meta-analysis when some IPD studies give values on a continuous scale and others do not | |
| • Dealing with IPD from trials where both control and treatment groups are available | |
| • How to assess the impact of excluded studies who did not provide IPD | |
| • Accounting for clustering of patients within different IPD studies | |
| • Allowing for heterogeneity in baseline risk (intercept term) across studies | |
| • Allowing for heterogeneity in predictor effects across studies | |
| • Lack of external validation if all studies used for model development | |
| • Sample size required to implement the internal-external approach (i.e. sample size of studies to be excluded, and also the total number of IPD studies needed) |
Recommendations for improved research when developing and validating risk prediction models from multiple studies
| • Produce a protocol for the project, detailing rationale, conduct and statistical analysis and reference this | |
| • Report how the primary study authors were approached for their IPD | |
| • Report strategy used to identify relevant studies, e.g. literature review/collaborative group | |
| • If literature review performed, then report search strategy, including keywords and databases used | |
| • Provide a flowchart showing the search strategy, classification of identified articles, and retrieval of IPD from relevant studies | |
| • Report any prior sample size considerations used, such as the number of IPD studies deemed necessary and the number of patients and events required. If no sample size requirements were considered, report this also | |
| • Report the number of patients and events for each study used in model development and/or validation | |
| • Report the missing data for each study (e.g. whether predictors were missing entirely, or how many patients had predictor values missing), and whether some patients or studies were entirely excluded for this reason | |
| • Detail the reasons why IPD was unavailable in some desired studies (if applicable), and report the number of patients and events from these studies | |
| • If any studies were excluded after IPD was obtained, provide the number of studies excluded and explain why they were removed (e.g. missing predictors, different outcome definition, different methods of measurement) | |
| • Compare and report the quality of studies for which IPD was obtained | |
| • Account for clustering of patients within studies, for example by allowing for a separate intercept per study | |
| • Report the selection criteria and procedure used to decide which predictors are included in the final model | |
| • Assess and report any between study heterogeneity in the effects of included predictors | |
| • If large heterogeneity does exist in particular predictors, then try to reduce it by including more predictors or simply focus on including homogenous or weakly heterogeneous factors | |
| • Where possible model continuous predictors on their continuous scale, unless it is important to categorise with good clinical or statistical reason | |
| • Report the final developed model in original format with alpha (baseline risk) and beta estimates, so that others can ascertain how apply the model in practice | |
| • Detail how missing patient-level data and missing study-level factors were dealt with in the analysis | |
| • Validate the model that has been developed using internal-external cross-validation; we tentatively suggest at least 4 studies are required for this approach however. | |
| • Explain the choice of intercept (baseline hazard) to be used when implementing the model in the excluded study | |
| • Report validation statistics for each study excluded in the internal-external cross validation method | |
| • Report clearly whether there is evidence the model performs consistently well during the internal-external validation | |
| • If it performs consistently well, clearly report the final overall prediction model to be used in practice, and emphasise again how the intercept should be chosen upon application | |
| - If it does not perform consistently well, clearly flag those populations for which the model cannot be applied and draw attention to the model’s lack of generalisability | |
| • If possible, compare the populations of those studies not providing IPD to those studies providing IPD, to be able to understand whether the developed model may need further generalisation in such populations in the future |