Literature DB >> 31819300

Evaluation of Published Preclinical Experimental Studies in Medicine: Methodology Issues.

Slobodan M Jankovic1, Belma Kapo2, Aziz Sukalo2, Izet Masic3.   

Abstract

INTRODUCTION: Inappropriate design of experimental studies in medicine inevitably leads to inaccurate or false results, which serve as basis for erroneous and biased conclusions. AIM: The aim of our study was to investigate prevalence of implementing basic principles of experimental design (local control, replication and randomization) in preclinical experimental studies, performed either on animals in vivo, or animal/human material in vitro.
MATERIAL AND METHODS: Preclinical experimental studies were retrieved from the PubMed database, and the sample for analysis was randomly chosen from the retrieved publications. Implementation rate of basic experimental research principles (local control, randomization and replication) was established by careful reading of the sampled publications and their checking against predefined criteria.
RESULTS: Our study showed that only a minority of experimental preclinical studies had basic principles of design completely implemented (7%), while implementation rate of single aspects of appropriate experimental design varied from as low as 9% to maximum 86%. Average impact factor of the surveyed studies was high, and publication date relatively recent, suggesting generalizability of our results to highly ranked contemporary journals.
CONCLUSION: Prevalence of experimental preclinical studies that did not implement completely basic principles of research design is high, raising suspicion to validity of their results. If incorrect and biased, results of published studies may mislead authors of future studies and cause conduction of fruitless research that will waste precious resources.
© 2019 Slobodan M. Jankovic, Belma Kapo, Aziz Sukalo, Izet Masic.

Entities:  

Keywords:  control experiments; internal validity; randomization; replication

Mesh:

Year:  2019        PMID: 31819300      PMCID: PMC6885208          DOI: 10.5455/medarh.2019.73.298-302

Source DB:  PubMed          Journal:  Med Arch        ISSN: 0350-199X


INTRODUCTION

Inappropriate design of experimental studies in medicine inevitably leads to inaccurate or false results, which serve as basis for erroneous and biased conclusions (1). Although numerous attempts were made in the past to prevent errors in research design, like establishing guidelines for experimental studies (2) or teaching experimental desing at postgraduate studies (3), evidence shows that some of the basic principles of experimental research design are still not implemented in more than half of the studies published in medical journals (4). There are three basic principles of experimental design that guarantee reliability of the results: having appropriate negative and positive controls for treatment or a factor that is tested, replicating experiments on independent experimental units sufficient number of times and randomly assigning a treatment (or factor) that is tested and control treatment (or factor) to experimental units (5). Failure to acknowledge and implement these principles when planning a study usually causes production of false positive experimental results, which are rather consequence of uncontrolled factors, like concomitant conditions or maturation of ecxperimental units, than of the treatment (or a factor) that is actually tested (6).

AIM

The aim of our study was to investigate prevalence of implementing basic principles of experimental design (local control, replication and randomization) in preclinical experimental studies, performed either on animals in vivo, or animal/human material in vitro.

MATERIAL AND METHODS

The studies were retrieved for analysis from the PubMed database. The following inclusion criteria defined the pool of the studies from which the study sample was extracted: journal article, original experimental study, animal study, in vitro study and full text availability. The exclusion criteria were: review articles, clinical trials of phase I-IV, cohort studies, case control studies and cross-sectional studies. The following search strategy was used to implement inclusion and exclusion criteria and select the pool of the studies for futher analysis: ((“animals”[MeSH Terms:noexp] OR animal[All Fields]) AND study[All Fields]) OR ((“in vitro techniques”[MeSH Terms] OR (“vitro”[All Fields] AND “techniques”[All Fields]) OR “in vitro techniques”[All Fields] OR “vitro”[All Fields] OR “in vitro”[All Fields]) AND study[All Fields]) NOT (“review”[Publication Type] OR “review literature as topic”[MeSH Terms] OR “review”[All Fields]) AND (Journal Article[ptyp] AND “loattrfree full text”[sb]). Size of the study sample (n=43) was calculated on the basis of the following assumptions: rate of inappropriate research design 0.5 (4) and width of the 95% confidence interval ± 0.15. The formula n = (1.96)2 x 4*p*(1-p)/d2 was used for the calculation, where „n“ is the sample size, „p“ probability of inappropriate research design and „d“ width of the confidence interval (7). Since the studies retrieved by the abovementioned search strategy were numbered orderly in the PubMed database, the study sample od 43 studies was extracted by simple randomization technique, activating for 43 times random number generator in Excel, using formula RANDBETWEEN(1;666,342). The extracted studies were analyzed for internal methodological validity, checking whether basic principles of correct experimental design (replication, control and randomization) were implemented. For the purpose of this analysis, the checklist with 8 questions was prepared, as shown in the Table 1. The results of the analysis were tabulated and described by rates and percentages when categorical, and by means, srtandard deviations, medians and interquartile ranges, if continuous.
Table 1.

Results of the survey of the experimental studies (n = 43)

RequirementSatisfied n (%)Not satisfied n (%)Unclear n (%)Not applicable n (%)
Sample size reported for the experiment?26 (60%)17 (40%)--
Number of observations reported for the experiment33 (77%)10 (23%)--
Value of test statistics, exact p value and degrees of freedom reported6 (14%)37 (86%)--
Error bars correspond to the analysis (i.e. standard error is based on number of independent observations)19 (44%)6 (14%)12 (28%)6 (14%)
Only independent observations were taken into account for statistical tests19 (44%)4 (9%)20 (47%)-
Is there negative control?32 (74%)11 (26%)--
Was positive control necessary, and if so, was it used?20 (47%)19 (44%)-4 (9%)
Were treatments randomly allocated to experimental units?4 (9%)28 (65%)2 (5%)9 (21%)
Number of citations: mean, standard deviation, median, interquartile range28.6 ± 37.2; 12.0; 29.0
Time passed from the publication (years)12.4 ± 10.9; 9.0; 13

RESULTS

In total 43 journal articles were retrieved randomly from pool of 666,342 articles in the PubMed database defined by the inclusion and exclusion criteria, and then analyzed according to predefined criteria of research design quality. Average impact factor of the journals (for the years when the articles were published) was 3.9739 ± 1.9125, median impact factor was 3.7490, and interquartile range 2.240. Compliance of the articles with the criteria, average number of citations per article and average time elapsed from the publication of the articles are shown in the Table 1. Only three of the analyzed studies (7.0%) had all basic principles of experimental design completely implemented. Number of satisfied criteria per study was not correlated either with journal impact factor (Spearman’s rho = 0.058, p = 0.710) or with number of citations (Spearman’s rho = -0.254, p = 0.100). The time elapsed from the publication also was not correlated with the number of satisfied criteria per study (Spearman’s rho = -0.227, p = 0.144).

DISCUSSION

Our study showed that only a minority of experimental preclinical studies had basic principles of design completely implemented (7%), while implementation rate of single aspects of appropriate experimental design varied from as low as 9% to maximum 86%. Average impact factor of the surveyed studies was high, and publication date relatively recent, suggesting generalizability of our results to highly ranked contemporary journals. Prevalence of certain aspects of inappropriate design in our study was similar to values reported by other studies, especially in regard to lack of randomization, which was observed in 70% of studies from our sample and in 87% of studies surveyed by Kilkenny et al (8). A number of the authors of experimental studies on animals, human cells, or tissues are misleaded by superficial similarity between the experimental units, derived from their common origin (the same cell line, the same clone of animals, the same species from the same breeding line, etc.), and may wrongly assume that they are completely the same. However, even identical twins are not completely identical, as many external factors with shape them differently, so randomization is always necessary in experimental studies, regardless of the similarities between the experimental units (9). While necessity of having local control in their experiments was understood by authors of majority of analyzed studies, the replication issue remained obtunded, and difference between true replication (repeating experiments on independent experimental units) and pseudo replication (repeating experiments on the same experimental unit) was not appreciated by majority. Pseudo replication leads to inappropriate testing of hypothesis (because statistical tests for testing difference between groups assume independence of experimental units) and to false precision, as improved estimate after repeating measurements on the same experimental unit just gives more precise results for that experimental unit, and not for the population that is investigated (10-13). Pseudo replication can also undermine the conclusions of a statistical analysis, and it would be easier to detect if the sample size, degrees of freedom, the test statistic, and precise p-values are reported. This information should be a requirement for all publications. The articles we analyzed in this study were highly cited regardless of their methodological shortcomings and possibly wrong conclusions, that may lead to erroneous assumptions when designing future studies and unnecessary wasting of research resources (14). Seven threats to the internal validity of experiments were discussed by Donald T. Campbell in his classic 1957 article: history, maturation, testing, instrument decay, statistical regression, selection, and mortality. These concepts are said to be threats to the internal validity of experiments because they pose alternate explanations for the apparent causal relationship between the independent variable and dependent variable of an experiment if they are not adequately controlled. Unlike with observational studies, experimental design is based on elimination of confusing variables with inclusion/exclusion criteria and on control of extraneous factors by setup of the experiment which should include randomization and local control. It is critical that experimental setup excludes extraneous influences, because they are not taken into account during statistical analysis, and may bias the results. If basic principles of experimental design are not implemented, extraneous variables will not be controlled properly, and the observed effects on experimental model may not be consequence of the tested treatment or factor, but of the extraneous variables themselves (15). Widespread failure to comply with basic rules of experimental design also led to crisis in reproducibility of experimental results. Many so-called breakthroughs in experimental science turned out to be spurious and false when independent study groups tried to repeat experiments described in published papers. Some authors believe that majority of published experimental results will not stand the test of time (16), because numerous authors all over the world do not adhere to good experimental practice being under pressure to “publish or perish”. Some of measures that could improve the situation are: insisting on standards of data presentation, publication of negative results in scientific journals, and changes in principles of funding research that would prevent making profit on just having publications without any real impact on science and healthcare (17, 18).

Limitations of the study

The results of our study are limited to only one database (PubMed) having journals that are on average ranked highly than journals in some other databases with less strict inclusion criteria. Therefore, our results could underestimate the problem of inadequate experimental design, and should be interpreted with caution. Besides, not all published papers had enough data presented to allow for complete estimate of methodological issues.

CONCLUSION

Prevalence of experimental preclinical studies that did not implement completely basic principles of research design is high, raising suspicion to validity of their results. If incorrect and biased, results of published studies may mislead authors of future studies and cause conduction of fruitless research that will waste precious resources.
Table 2.

Check list of 43 published papers used for assessment of the frequency of un-adequated study design

StudySample size reported for the experiment?Number of observations reported for the experimentValue of test statistics, exact p value and degrees of freedom reportedError bars correspond to the analysis (i.e. standard error is based on number of independent observations)Only independent observations were taken into account for statistical testsIs there negative control?Was positive control necessary, and if so, was it used?Were treatments randomly allocated to experimental units?Number of citations
Study 1nononononononono71
Study 2yesnononoIt is not clearyesnono25
Stdey 3yesnononounclearnonono11
Study 4yesyesyesNot shownyesyesyesno64
Study 5yesyesnounclearunclearyesnono7
Study 6noyesnoyesunclearyesnono23
Study 7noyesnonounclearnoyesno1
Study 8yesyesnoyesyesyesnoYes, but nor explained how5
Study 9nonoyesNot clearNot clearyesnono36
Study 10noyesnonoyesyesyesMentioned, but not explained8
Study 11yesyesnoyesNot clearyesnono3
Study 12yesyesnoyesyesnoyesyes4
Study 13noyesnoyesyesnonoNot applicable33
Study 14noyesnoNot shownnononoNot applicable122
Study 15yesyesnoyesyesyesyesyes60
Study 16yesyesnoNot applicableNot clearnoyesNot applicable4
Study 17yesyesnoyesNot clearyesnoNot applicable12
Study 18yesyesyesyesyesyesyesno153
Study 19No yesNoNot clearNoYes Yes no8
Study 20yesyesyesyesyesyesyesNot applicable12
Study 21yesyesnoNot applicableyesnoyesno5
Study 22yesyesnoNot applicableyesyesnoNot applicable9
Study 23yesyesnoyesyesnoyesyes2
Study 24yesyesnoNot clearyesyesnono6
          
Study 25nononoNot applicableNot clearnonono19
Study 26.Noyesnoyesyesyesyesno11
Study 27yesyesnoyesyesyesnoNot applicable9
Study 28nononoNot applicableNot clearnonoNot applicable159
Study 29yesnonoyesNot clearyesyesno20
Study 30yesnonoyesyesyesnono27
Study 31yesNonoNot clearNot clearyesnono8
Study 32yesyesnoyesyesyesyesno3
Study 33yesyesyesyesyesyesNot applicableyes7
Study 34NoYes noyesNot clearyesyesno1
Study 35yesyesnoyesNot clearyesyesno30
Study 36noyesnononoyesyesno50
Study 37noyesnoyesyesyesyesno13
Study 38yesyesyesNot applicableyesyesNot applicableNot applicable31
Study 39noyesnoNot clearNor clearyesNono41
Study 40yesyesnoNot clearNot clearyesyesno7
Study 41yesyesnoNot clearNot clearyesNot applicableno49
Study 42noyesnoNot clearNot clearyesNot applicableno22
Study 43NoNonoNot clearNot clearyesyesno40
  15 in total

1.  Factors relevant to the validity of experiments in social settings.

Authors:  D T CAMPBELL
Journal:  Psychol Bull       Date:  1957-07       Impact factor: 17.737

Review 2.  Teaching experimental design.

Authors:  Derek J Fry
Journal:  ILAR J       Date:  2014

Review 3.  Reproducibility in science: improving the standard for basic and preclinical research.

Authors:  C Glenn Begley; John P A Ioannidis
Journal:  Circ Res       Date:  2015-01-02       Impact factor: 17.367

4.  Study Design Rigor in Animal-Experimental Research Published in Anesthesia Journals.

Authors:  Janine M Hoerauf; Angela F Moss; Ana Fernandez-Bustamante; Karsten Bartels
Journal:  Anesth Analg       Date:  2018-01       Impact factor: 5.108

5.  Why Is the One-Group Pretest-Posttest Design Still Used?

Authors:  Thomas R Knapp
Journal:  Clin Nurs Res       Date:  2016-08-24       Impact factor: 2.075

6.  Guidelines for experimental studies.

Authors:  J E Moorhead; P V Rao; K J Anusavice
Journal:  Dent Mater       Date:  1994-01       Impact factor: 5.304

7.  Systematic survey of the design, statistical analysis, and reporting of studies published in the 2008 volume of the Journal of Cerebral Blood Flow and Metabolism.

Authors:  Hanna M Vesterinen; Hanna V Vesterinen; Kieren Egan; Amelie Deister; Peter Schlattmann; Malcolm R Macleod; Ulrich Dirnagl
Journal:  J Cereb Blood Flow Metab       Date:  2010-12-15       Impact factor: 6.200

8.  The bias of experimental design, including strain background, in the determination of critical Streptococcus suis serotype 2 virulence factors.

Authors:  Jean-Philippe Auger; Sarah Chuzeville; David Roy; Annabelle Mathieu-Denoncourt; Jianguo Xu; Daniel Grenier; Marcelo Gottschalk
Journal:  PLoS One       Date:  2017-07-28       Impact factor: 3.240

9.  The Second Mediterranean Seminar on Science Writing, Editing and Publishing (SWEP - 2018), Sarajevo, December 8th, 2018.

Authors:  Izet Masic; Miro Jakovljevic; Osman Sinanovic; Srecko Gajovic; Mirko Spiroski; Rasim Jusufovic; Sekib Sokolovic; Besim Prnjavorac; Enver Zerem; Benjamin Djulbegovic; Selma Porovic; Slobodan Jankovic; Mirsad Hadzikadic; Lejla Zunic; Edin Begic; Edin Nislic; Nedim Begic; Emir Becirovic; Anis Cerovac; Venesa Skrijelj; Jasmina Nuhanovic
Journal:  Acta Inform Med       Date:  2018-12

10.  Survey of the quality of experimental design, statistical analysis and reporting of research using animals.

Authors:  Carol Kilkenny; Nick Parsons; Ed Kadyszewski; Michael F W Festing; Innes C Cuthill; Derek Fry; Jane Hutton; Douglas G Altman
Journal:  PLoS One       Date:  2009-11-30       Impact factor: 3.240

View more
  9 in total

1.  Unethical Behaviors of Authors Who Published Papers in the Biomedical Journals Became a Global Problem.

Authors:  Izet Masic
Journal:  Med Arch       Date:  2020-02

2.  Predatory Journals and Publishers - Dilemmas: How to Assess it and How to Avoid it?

Authors:  Izet Masic
Journal:  Med Arch       Date:  2021-10

3.  Weaknesses in Experimental Design and Reporting Decrease the Likelihood of Reproducibility and Generalization of Recent Cardiovascular Research.

Authors:  John L Williams; Hsini Cindy Chu; Marissa K Lown; Joseph Daniel; Renate D Meckl; Darshit Patel; Radwa Ibrahim
Journal:  Cureus       Date:  2022-01-10

Review 4.  Systematic review of preclinical studies on the neutrophil-mediated immune response to air pollutants, 1980-2020.

Authors:  Andrés Valderrama; Maria Isabel Zapata; Juan C Hernandez; Jaiberth A Cardona-Arias
Journal:  Heliyon       Date:  2022-01-25

5.  On the Occasion of the Symposium "Scientometry, Citation, Plagiarism and Predatory in Scientific Publishing", Sarajevo, 2021.

Authors:  Izet Masic
Journal:  Med Arch       Date:  2021-12

Review 6.  The Hitchhiker's Guide to Human Therapeutic Nanoparticle Development.

Authors:  Thelvia I Ramos; Carlos A Villacis-Aguirre; Katherine V López-Aguilar; Leandro Santiago Padilla; Claudia Altamirano; Jorge R Toledo; Nelson Santiago Vispo
Journal:  Pharmaceutics       Date:  2022-01-21       Impact factor: 6.321

7.  Evaluation of Preclinical and Clinical Studies Published in Medical Journals of Bosnia and Herzegovina: Methodology Issues.

Authors:  Slobodan M Jankovic; Izet Masic
Journal:  Acta Inform Med       Date:  2020-03

8.  Comparative Analysis of Web of Science and Pubmed Indexed Medical Journals Published in Former Yugoslav Countries.

Authors:  Izet Masic; Slobodan M Jankovic
Journal:  Med Arch       Date:  2020-08

9.  Guidelines for Editing Biomedical Journals: Recommended by Academy of Medical Sciences of Bosnia and Herzegovina.

Authors:  Izet Masic; Slobodan M Jankovic; Asim Kurjak; Doncho M Donev; Muharem Zildzic; Osman Sinanovic; Izet Hozo; Snjezana Milicevic; Sefik Hasukic; Emir Mujanovic; Kenan Arnautovic; Senaid Trnacevic; Enisa Mesic; Mirza Biscevic; Mustafa Sefic; Vjekoslav Gerc; Abdulah Kucukalic; Zlatko Hrgovic; Jacob Bergsland; Mirko Grujic
Journal:  Acta Inform Med       Date:  2020-12
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.