| Literature DB >> 23935460 |
Valerie C Henderson1, Jonathan Kimmelman, Dean Fergusson, Jeremy M Grimshaw, Dan G Hackam.
Abstract
BACKGROUND: The vast majority of medical interventions introduced into clinical development prove unsafe or ineffective. One prominent explanation for the dismal success rate is flawed preclinical research. We conducted a systematic review of preclinical research guidelines and organized recommendations according to the type of validity threat (internal, construct, or external) or programmatic research activity they primarily address. METHODS ANDEntities:
Mesh:
Year: 2013 PMID: 23935460 PMCID: PMC3720257 DOI: 10.1371/journal.pmed.1001489
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Summary of preclinical guidelines for in vivo experiments identified through various database searches.
| Database Search or Source | Date of Search/Acquisition | Unique Guidelines Identified |
| MEDLINE 1 | April 2, 2013 | STAIR |
| Ludolph et al. | ||
| Rice et al. | ||
| Schwartz et al. | ||
| Verhagen et al. | ||
| García-Bonilla et al. | ||
| Kelloff et al. | ||
| Kamath et al. | ||
| MEDLINE 2 | April 2, 2013 | Bellomo et al. |
| MEDLINE 3 | April 2, 2013 | Moreno et al. |
| Google Scholar | January 19, 2012 | Scott et al. |
| Curtis et al. | ||
| Piper et al. | ||
| Liu et al. | ||
| Google Scholar | April 9, 2013 | Margulies and Hicks |
| Landis et al. | ||
| January 24, 2012 | Bolon et al. | |
| Macleod et al. | ||
| NINDS-NIH | ||
| Pullen et al. | ||
| Shineman et al. | ||
| Willmann et al. | ||
| Bolli et al. | ||
| Correspondence | April 5–31, 2013 | Grounds et al. |
| Savitz et al. | ||
| Katz et al. |
No unique guidelines that had not been previously identified through previous search strategies were found by searching the EQUATOR Network or through hand searching of references in identified guidelines.
The guidelines are listed under the search strategy by which they were first identified.
Guidelines that were grouped together during analysis (e.g., identical guidelines that were published in more than one journal).
NINDS-NIH, US National Institutes of Health National Institute of Neurological Disorders and Stroke.
Results of recommendation extraction from guidelines addressing validity threats in preclinical experiments.
| Recommendation Number | Validity Type | Application | Topic Addressed by the Recommendation | Number of Guidelines | General | Neurological and Cerebrovascular | Cardiac and Circulatory | Neuromuscular | Chemoprevention | Pain | Endometriosis | Arthritis | Sepsis | Renal Failure | Infectious Diseases | |||||||||||||||
| Landis et al. | Ludolph et al. | NINDS-NIH | Scott et al. | Shineman et al. | Moreno et al. | Katz et al. | STAIR | Macleod et al. | Liu et al. | García-Bonilla et al. | Savitz et al. | Margulies and Hicks | Curtis et al. | Schwartz et al. | Bolli et al. | Willmann et al. | Grounds et al. | Verhagen et al. | Kelloff et al. | Rice et al. | Pullen et al. | Bolon et al. | Piper et al. | Bellomo et al. | Kamath et al. | |||||
| 1 | IV | U | Matching or balancing treatment allocation of animals | 7 | X | X | X | X | X | X | Δ | |||||||||||||||||||
| 2 | IV | U | Standardized handling of animals | 8 | X | X | X | X | X | X | X | X | ||||||||||||||||||
| 3 | IV | U | Randomized allocation of animals to treatment | 20 | X | X | X | X | X | X | X | X | Δ | X | Δ | X | X | X | X | X | X | X | X | Δ | ||||||
| 4 | IV | U | Monitoring emergence of confounding characteristics in animals | 12 | X | Δ | X | Ŧ | X | X | X | X | X | X | X | Δ | ||||||||||||||
| 5 | IV | U | Specification of unit of analysis | 1 | X | |||||||||||||||||||||||||
| 6 | IV | T | Addressing confounds associated with anesthesia or analgesia | 5 | X | X | X | X | X | |||||||||||||||||||||
| 7 | IV | T | Selection of appropriate control groups | 15 | X | X | X | X | X | X | X | X | X | X | X | X | X | Δ | X | |||||||||||
| 8 | IV | T | Concealed allocation of treatment | 14 | X | X | X | X | X | X | X | X | Δ | X | Ŧ | X | X | Ŧ | ||||||||||||
| 9 | IV | T | Study of dose–response relationships | 15 | X | X | X | X | X | X | Ŧ | X | Δ | X | X | X | X | X | X | |||||||||||
| 10 | IV | O | Use of multiple time points for measuring outcomes | 5 | X | X | X | X | X | |||||||||||||||||||||
| 11 | IV | O | Consistency of outcome measurement | 8 | X | X | X | X | X | X | X | X | ||||||||||||||||||
| 12 | IV | O | Blinding of outcome assessment | 20 | X | X | X | X | X | X | X | X | X | Δ | X | Δ | X | X | X | X | X | X | X | Δ | ||||||
| 13 | IV | Total | Establishment of primary and secondary end points | 4 | X | X | X | X | ||||||||||||||||||||||
| 14 | IV | Total | Precision of effect size | 6 | X | X | X | X | X | Ŧ | ||||||||||||||||||||
| 15 | IV | Total | Management of interest conflicts | 8 | X | X | X | X | X | X | Δ | Ŧ | ||||||||||||||||||
| 16 | IV | Total | Choice of statistical methods for inferential analysis | 14 | X | X | X | X | X | X | X | X | X | X | X | X | X | X | ||||||||||||
| 17 | IV | Total | Flow of animals through an experiment | 16 | X | X | X | X | X | X | X | Δ | X | Ŧ | X | X | X | X | X | Ŧ | ||||||||||
| 18 | IV | Total | A priori statements of hypothesis | 3 | X | X | X | |||||||||||||||||||||||
| 19 | IV | Total | Choice of sample size | 23 | X | X | X | X | X | X | X | X | X | Δ | X | Ŧ | X | X | X | X | X | X | X | X | X | X | Δ | |||
| 20 | CV | U | Matching model to human manifestation of the disease | 19 | X | X | X | X | X | X | Δ | X | X | Ŧ | X | X | X | X | X | X | X | Δ | X | |||||||
| 21 | CV | U | Matching model to sex of patients in clinical setting | 9 | X | X | X | Ŧ | X | X | Δ | X | X | |||||||||||||||||
| 22 | CV | U | Matching model to co-interventions in clinical setting | 7 | X | Ŧ | X | Δ | X | X | Ŧ | |||||||||||||||||||
| 23 | CV | U | Matching model to co-morbidities in clinical setting | 10 | X | Ŧ | X | X | Ŧ | X | X | X | X | Δ | ||||||||||||||||
| 24 | CV | U | Matching model to age of patients in clinical setting | 11 | X | X | X | Ŧ | X | X | Ŧ | X | X | X | X | |||||||||||||||
| 25 | CV | U | Characterization of animal properties at baseline | 20 | X | X | X | X | X | X | X | X | Δ | X | Ŧ | X | X | X | X | X | X | X | X | Ŧ | ||||||
| 26 | CV | U | Comparability of control group characteristics to those of previous studies | 1 | X | |||||||||||||||||||||||||
| 27 | CV | T | Optimization of complex treatment parameters | 5 | X | X | X | X | X | |||||||||||||||||||||
| 28 | CV | T | Matching timing of treatment delivery to clinical setting | 10 | X | X | X | X | Ŧ | X | Δ | X | X | Ŧ | ||||||||||||||||
| 29 | CV | T | Matching route/method of treatment delivery to clinical setting | 8 | X | X | X | Ŧ | X | Ŧ | X | X | ||||||||||||||||||
| 30 | CV | T | Pharmacokinetics to support treatment decisions | 9 | X | X | X | X | Ŧ | X | Δ | X | X | |||||||||||||||||
| 31 | CV | T | Matching the duration/exposure of treatment to clinical setting | 10 | X | X | X | X | X | Ŧ | X | Δ | X | X | ||||||||||||||||
| 32 | CV | T | Definition of treatment | 2 | X | X | ||||||||||||||||||||||||
| 33 | CV | T | Faithful delivery of intended treatment | 6 | X | X | X | X | X | X | ||||||||||||||||||||
| 34 | CV | T | Addressing confounds associated with treatment | 9 | X | X | X | X | X | X | X | X | Ŧ | |||||||||||||||||
| 35 | CV | O | Matching outcome measure to clinical setting | 14 | X | X | X | Ŧ | X | Ŧ | X | X | X | X | X | X | X | Δ | ||||||||||||
| 36 | CV | O | Degree of characterization and validity of outcome measure chosen | 9 | X | X | X | X | X | X | X | X | X | |||||||||||||||||
| 37 | CV | O | Treatment response along mechanistic pathway | 15 | X | X | X | X | X | Ŧ | X | Δ | X | X | X | X | X | X | X | |||||||||||
| 38 | CV | O | Assessment of multiple manifestations of disease phenotype | 10 | X | X | X | X | Δ | X | Δ | X | X | X | ||||||||||||||||
| 39 | CV | O | Assessment of outcome at late/clinically relevant time points | 7 | X | X | Ŧ | X | Δ | X | X | |||||||||||||||||||
| 40 | CV | O | Addressing treatment interactions with clinically relevant co-morbidities | 1 | X | |||||||||||||||||||||||||
| 41 | CV | O | Use of validated assay for molecular pathways assessment | 1 | X | |||||||||||||||||||||||||
| 42 | CV | O | Definition of outcome measurement criteria | 7 | X | X | X | X | X | X | X | |||||||||||||||||||
| 43 | CV | O | Addressing confounds associated with experimental setting | 3 | X | X | X | |||||||||||||||||||||||
| 44 | CV | Total | Addressing confounds associated with setting | 8 | X | X | X | X | X | X | X | X | ||||||||||||||||||
| 45 | EV | U | Replication in different models of the same disease | 13 | X | X | X | X | Δ | X | Δ | X | X | X | X | X | Ŧ | |||||||||||||
| 46 | EV | U | Replication in different species | 8 | X | X | Ŧ | X | Δ | X | X | X | ||||||||||||||||||
| 47 | EV | U | Replication at different ages | 1 | X | |||||||||||||||||||||||||
| 48 | EV | U | Replication at different levels of disease severity | 1 | X | |||||||||||||||||||||||||
| 49 | EV | T | Replication using variations in treatment | 2 | X | X | ||||||||||||||||||||||||
| 50 | EV | Total | Independent replication | 12 | X | X | X | X | X | Ŧ | X | X | Δ | X | X | X | ||||||||||||||
| 51 | PROG | O | Inter-study standardization of end point choice | 3 | X | X | X | |||||||||||||||||||||||
| 52 | PROG | Total | Define programmatic purpose of research | 4 | X | X | X | X | ||||||||||||||||||||||
| 53 | PROG | Total | Inter-study standardization of experimental design | 14 | X | X | X | X | X | X | X | X | X | X | X | X | X | X | ||||||||||||
| 54 | PROG | Total | Research within multicenter consortia | 3 | X | X | X | |||||||||||||||||||||||
| 55 | PROG | Total | Critical appraisal of literature or systematic review during design phase | 2 | X | X | ||||||||||||||||||||||||
Explicit endorsement of STAIR [10],[12].
Explicit endorsement Piper et al. [53].
CV, threat to construct validity; EV, threat to external validity; IV, threat to internal validity; O, outcome; PROG, research program recommendations; T, treatment; Ŧ, recommendation imported from an endorsed guideline but not otherwise stated in the endorsing guideline; U, units (animals); Δ, recommendation imported from an endorsed guideline and also explicitly stated in the endorsing guideline; Total, all parts of the experiment; X, recommendation explicitly stated in the guideline.
NINDS-NIH, US National Institutes of Health National Institute of Neurological Disorders and Stroke.
Figure 1Flow of database searches and eligibility screening for guideline documents addressing preclinical efficacy experiments.
Sample sizes at the identification stage reflect the raw output of the search and do not reflect the removal of duplicate entries between search strategies.
To what extent individual guidelines address each type of validity threat and make recommendations regarding the overall research program.
| Category | Study | Number (Percent) of Recommendations Addressing Each Validity Type | Total ( | |||
| IV ( | CV ( | EV ( | PROG ( | |||
|
| Landis et al. | 10 (53) | 2 (8) | 1 (17) | 0 (0) | 13 (24) |
|
| Ludolph et al. | 5 (26) | 12 (48) | 3 (50) | 3 (60) | 23 (42) |
| NINDS-NIH | 9 (47) | 4 (16) | 1 (17) | 0 (0) | 14 (25) | |
| Scott et al. | 8 (42) | 2 (8) | 0 (0) | 1 (20) | 11 (20) | |
| Shineman et al. | 15 (79) | 12 (48) | 1 (17) | 1 (20) | 29 (53) | |
| Moreno et al. | 10 (53) | 10 (40) | 0 (0) | 1 (20) | 21 (38) | |
| Katz et al. | 10 (53) | 11 (44) | 2 (33) | 2 (40) | 25 (45) | |
| STAIR | 8 (42) | 14 (56) | 3 (50) | 0 (0) | 25 (45) | |
| Macleod et al. | 8 (42) | 1 (4) | 0 (0) | 0 (0) | 9 (16) | |
| Liu et al. | 12 (63) | 10 (40) | 3 (50) | 1 (20) | 26 (47) | |
| García-Bonilla et al. | 11 (58) | 8 (32) | 1 (17) | 1 (20) | 21 (38) | |
| Savitz et al. | 3 (16) | 16 (64) | 3 (50) | 1 (20) | 23 (42) | |
| Margulies and Hicks | 8 (42) | 10 (40) | 5 (83) | 2 (40) | 25 (45) | |
|
| Curtis et al. | 11 (58) | 11 (44) | 3 (50) | 2 (40) | 27 (49) |
| Schwartz et al. | 9 (47) | 10 (40) | 1 (17) | 0 (0) | 20 (36) | |
| Bolli et al. | 6 (32) | 6 (24) | 3 (50) | 2 (40) | 17 (31) | |
|
| Willmann et al. | 6 (32) | 6 (24) | 0 (0) | 3 (60) | 15 (27) |
| Grounds et al. | 6 (32) | 7 (28) | 0 (0) | 1 (20) | 14 (25) | |
|
| Verhagen et al. | 8 (42) | 10 (40) | 1 (17) | 0 (0) | 19 (35) |
| Kelloff et al. | 1 (5) | 0 (0) | 1 (17) | 0 (0) | 2 (4) | |
|
| Rice et al. | 9 (47) | 10 (40) | 0 (0) | 0 (0) | 19 (35) |
|
| Pullen et al. | 5 (26) | 4 (16) | 1 (17) | 1 (20) | 11 (20) |
|
| Bolon et al. | 6 (32) | 7 (28) | 0 (0) | 1 (20) | 14 (25) |
|
| Piper et al. | 9 (47) | 7 (28) | 1 (17) | 2 (40) | 19 (35) |
|
| Bellomo et al. | 10 (53) | 4 (16) | 2 (33) | 0 (0) | 16 (29) |
|
| Kamath et al. | 1 (5) | 1 (4) | 1 (17) | 1 (20) | 4 (7) |
CV, threat to construct validity; EV, threat to external validity; IV, threat to internal validity; NINDS-NIH, US National Institutes of Health National Institute of Neurological Disorders and Stroke; PROG, research program recommendations.
Most frequent recommendations appearing in preclinical research guidelines for in vivo animal experiments.
| Validity Type | Recommendation Category | Examples |
|
|
| Choice of sample size | Power calculation, larger sample sizes | 23 (89) |
| Randomized allocation of animals to treatment | Various methods of randomization | 20 (77) | |
| Blinding of outcome assessment | Blinded measurement or analysis | 20 (77) | |
| Flow of animals through an experiment | Recording animals excluded from treatment through to analysis | 16 (62) | |
| Selection of appropriate control groups | Using negative, positive, concurrent, or vehicle control groups | 15 (58) | |
| Study of dose–response relationships | Testing above and below optimal therapeutic dose | 15 (58) | |
|
| Characterization of animal properties at baseline | Characterizing inclusion/exclusion criteria, disease severity, age, or sex | 20 (77) |
| Matching model to human manifestation of the disease | Matching mechanism, chronicity, or symptoms | 19 (73) | |
| Treatment response along mechanistic pathway | Characterizing pathway in terms of molecular biology, histology, physiology, or behaviour | 15 (58) | |
| Matching outcome measure to clinical setting | Using functional or non-surrogate outcome measures | 14 (54) | |
| Matching model to age of patients in clinical setting | Using aged or juvenile animals | 11 (42) | |
|
| Replication in different models of the same disease | Different transgenics, strains, or lesion techniques | 13 (50) |
| Independent replication | Different investigators or research groups | 12 (46) | |
| Replication in different species | Rodents and nonhuman primates | 8 (31) | |
|
| Inter-study standardization of experimental design | Coordination between independent research groups | 14 (54) |
| Defining programmatic purpose of research | Study purpose is preclinical, proof of concept, or exploratory | 4 (15) |
Recommendations concerning the coordination of experimental design practices across a program of research.