Literature DB >> 31819300

Evaluation of Published Preclinical Experimental Studies in Medicine: Methodology Issues.

Slobodan M Jankovic¹, Belma Kapo², Aziz Sukalo², Izet Masic³.

Abstract

INTRODUCTION: Inappropriate design of experimental studies in medicine inevitably leads to inaccurate or false results, which serve as basis for erroneous and biased conclusions. AIM: The aim of our study was to investigate prevalence of implementing basic principles of experimental design (local control, replication and randomization) in preclinical experimental studies, performed either on animals in vivo, or animal/human material in vitro.
MATERIAL AND METHODS: Preclinical experimental studies were retrieved from the PubMed database, and the sample for analysis was randomly chosen from the retrieved publications. Implementation rate of basic experimental research principles (local control, randomization and replication) was established by careful reading of the sampled publications and their checking against predefined criteria.
RESULTS: Our study showed that only a minority of experimental preclinical studies had basic principles of design completely implemented (7%), while implementation rate of single aspects of appropriate experimental design varied from as low as 9% to maximum 86%. Average impact factor of the surveyed studies was high, and publication date relatively recent, suggesting generalizability of our results to highly ranked contemporary journals.
CONCLUSION: Prevalence of experimental preclinical studies that did not implement completely basic principles of research design is high, raising suspicion to validity of their results. If incorrect and biased, results of published studies may mislead authors of future studies and cause conduction of fruitless research that will waste precious resources.

Entities: Chemical

Keywords: control experiments; internal validity; randomization; replication

Mesh：

Year: 2019 PMID： 31819300 PMCID： PMC6885208 DOI： 10.5455/medarh.2019.73.298-302

Source DB: PubMed Journal: Med Arch ISSN： 0350-199X

INTRODUCTION

Inappropriate design of experimental studies in medicine inevitably leads to inaccurate or false results, which serve as basis for erroneous and biased conclusions (1). Although numerous attempts were made in the past to prevent errors in research design, like establishing guidelines for experimental studies (2) or teaching experimental desing at postgraduate studies (3), evidence shows that some of the basic principles of experimental research design are still not implemented in more than half of the studies published in medical journals (4). There are three basic principles of experimental design that guarantee reliability of the results: having appropriate negative and positive controls for treatment or a factor that is tested, replicating experiments on independent experimental units sufficient number of times and randomly assigning a treatment (or factor) that is tested and control treatment (or factor) to experimental units (5). Failure to acknowledge and implement these principles when planning a study usually causes production of false positive experimental results, which are rather consequence of uncontrolled factors, like concomitant conditions or maturation of ecxperimental units, than of the treatment (or a factor) that is actually tested (6).

AIM

The aim of our study was to investigate prevalence of implementing basic principles of experimental design (local control, replication and randomization) in preclinical experimental studies, performed either on animals in vivo, or animal/human material in vitro.

MATERIAL AND METHODS

The studies were retrieved for analysis from the PubMed database. The following inclusion criteria defined the pool of the studies from which the study sample was extracted: journal article, original experimental study, animal study, in vitro study and full text availability. The exclusion criteria were: review articles, clinical trials of phase I-IV, cohort studies, case control studies and cross-sectional studies. The following search strategy was used to implement inclusion and exclusion criteria and select the pool of the studies for futher analysis: ((“animals”[MeSH Terms:noexp] OR animal[All Fields]) AND study[All Fields]) OR ((“in vitro techniques”[MeSH Terms] OR (“vitro”[All Fields] AND “techniques”[All Fields]) OR “in vitro techniques”[All Fields] OR “vitro”[All Fields] OR “in vitro”[All Fields]) AND study[All Fields]) NOT (“review”[Publication Type] OR “review literature as topic”[MeSH Terms] OR “review”[All Fields]) AND (Journal Article[ptyp] AND “loattrfree full text”[sb]). Size of the study sample (n=43) was calculated on the basis of the following assumptions: rate of inappropriate research design 0.5 (4) and width of the 95% confidence interval ± 0.15. The formula n = (1.96)2 x 4*p*(1-p)/d2 was used for the calculation, where „n“ is the sample size, „p“ probability of inappropriate research design and „d“ width of the confidence interval (7). Since the studies retrieved by the abovementioned search strategy were numbered orderly in the PubMed database, the study sample od 43 studies was extracted by simple randomization technique, activating for 43 times random number generator in Excel, using formula RANDBETWEEN(1;666,342). The extracted studies were analyzed for internal methodological validity, checking whether basic principles of correct experimental design (replication, control and randomization) were implemented. For the purpose of this analysis, the checklist with 8 questions was prepared, as shown in the Table 1. The results of the analysis were tabulated and described by rates and percentages when categorical, and by means, srtandard deviations, medians and interquartile ranges, if continuous.

Table 1.

Results of the survey of the experimental studies (n = 43)

Requirement	Satisfied n (%)	Not satisfied n (%)	Unclear n (%)	Not applicable n (%)
Sample size reported for the experiment?	26 (60%)	17 (40%)	-	-
Number of observations reported for the experiment	33 (77%)	10 (23%)	-	-
Value of test statistics, exact p value and degrees of freedom reported	6 (14%)	37 (86%)	-	-
Error bars correspond to the analysis (i.e. standard error is based on number of independent observations)	19 (44%)	6 (14%)	12 (28%)	6 (14%)
Only independent observations were taken into account for statistical tests	19 (44%)	4 (9%)	20 (47%)	-
Is there negative control?	32 (74%)	11 (26%)	-	-
Was positive control necessary, and if so, was it used?	20 (47%)	19 (44%)	-	4 (9%)
Were treatments randomly allocated to experimental units?	4 (9%)	28 (65%)	2 (5%)	9 (21%)
Number of citations: mean, standard deviation, median, interquartile range	28.6 ± 37.2; 12.0; 29.0
Time passed from the publication (years)	12.4 ± 10.9; 9.0; 13

RESULTS

In total 43 journal articles were retrieved randomly from pool of 666,342 articles in the PubMed database defined by the inclusion and exclusion criteria, and then analyzed according to predefined criteria of research design quality. Average impact factor of the journals (for the years when the articles were published) was 3.9739 ± 1.9125, median impact factor was 3.7490, and interquartile range 2.240. Compliance of the articles with the criteria, average number of citations per article and average time elapsed from the publication of the articles are shown in the Table 1. Only three of the analyzed studies (7.0%) had all basic principles of experimental design completely implemented. Number of satisfied criteria per study was not correlated either with journal impact factor (Spearman’s rho = 0.058, p = 0.710) or with number of citations (Spearman’s rho = -0.254, p = 0.100). The time elapsed from the publication also was not correlated with the number of satisfied criteria per study (Spearman’s rho = -0.227, p = 0.144).

DISCUSSION

Our study showed that only a minority of experimental preclinical studies had basic principles of design completely implemented (7%), while implementation rate of single aspects of appropriate experimental design varied from as low as 9% to maximum 86%. Average impact factor of the surveyed studies was high, and publication date relatively recent, suggesting generalizability of our results to highly ranked contemporary journals. Prevalence of certain aspects of inappropriate design in our study was similar to values reported by other studies, especially in regard to lack of randomization, which was observed in 70% of studies from our sample and in 87% of studies surveyed by Kilkenny et al (8). A number of the authors of experimental studies on animals, human cells, or tissues are misleaded by superficial similarity between the experimental units, derived from their common origin (the same cell line, the same clone of animals, the same species from the same breeding line, etc.), and may wrongly assume that they are completely the same. However, even identical twins are not completely identical, as many external factors with shape them differently, so randomization is always necessary in experimental studies, regardless of the similarities between the experimental units (9). While necessity of having local control in their experiments was understood by authors of majority of analyzed studies, the replication issue remained obtunded, and difference between true replication (repeating experiments on independent experimental units) and pseudo replication (repeating experiments on the same experimental unit) was not appreciated by majority. Pseudo replication leads to inappropriate testing of hypothesis (because statistical tests for testing difference between groups assume independence of experimental units) and to false precision, as improved estimate after repeating measurements on the same experimental unit just gives more precise results for that experimental unit, and not for the population that is investigated (10-13). Pseudo replication can also undermine the conclusions of a statistical analysis, and it would be easier to detect if the sample size, degrees of freedom, the test statistic, and precise p-values are reported. This information should be a requirement for all publications. The articles we analyzed in this study were highly cited regardless of their methodological shortcomings and possibly wrong conclusions, that may lead to erroneous assumptions when designing future studies and unnecessary wasting of research resources (14). Seven threats to the internal validity of experiments were discussed by Donald T. Campbell in his classic 1957 article: history, maturation, testing, instrument decay, statistical regression, selection, and mortality. These concepts are said to be threats to the internal validity of experiments because they pose alternate explanations for the apparent causal relationship between the independent variable and dependent variable of an experiment if they are not adequately controlled. Unlike with observational studies, experimental design is based on elimination of confusing variables with inclusion/exclusion criteria and on control of extraneous factors by setup of the experiment which should include randomization and local control. It is critical that experimental setup excludes extraneous influences, because they are not taken into account during statistical analysis, and may bias the results. If basic principles of experimental design are not implemented, extraneous variables will not be controlled properly, and the observed effects on experimental model may not be consequence of the tested treatment or factor, but of the extraneous variables themselves (15). Widespread failure to comply with basic rules of experimental design also led to crisis in reproducibility of experimental results. Many so-called breakthroughs in experimental science turned out to be spurious and false when independent study groups tried to repeat experiments described in published papers. Some authors believe that majority of published experimental results will not stand the test of time (16), because numerous authors all over the world do not adhere to good experimental practice being under pressure to “publish or perish”. Some of measures that could improve the situation are: insisting on standards of data presentation, publication of negative results in scientific journals, and changes in principles of funding research that would prevent making profit on just having publications without any real impact on science and healthcare (17, 18).

Limitations of the study

The results of our study are limited to only one database (PubMed) having journals that are on average ranked highly than journals in some other databases with less strict inclusion criteria. Therefore, our results could underestimate the problem of inadequate experimental design, and should be interpreted with caution. Besides, not all published papers had enough data presented to allow for complete estimate of methodological issues.

CONCLUSION

Prevalence of experimental preclinical studies that did not implement completely basic principles of research design is high, raising suspicion to validity of their results. If incorrect and biased, results of published studies may mislead authors of future studies and cause conduction of fruitless research that will waste precious resources.

Table 2.

Check list of 43 published papers used for assessment of the frequency of un-adequated study design

Study	Sample size reported for the experiment?	Number of observations reported for the experiment	Value of test statistics, exact p value and degrees of freedom reported	Error bars correspond to the analysis (i.e. standard error is based on number of independent observations)	Only independent observations were taken into account for statistical tests	Is there negative control?	Was positive control necessary, and if so, was it used?	Were treatments randomly allocated to experimental units?	Number of citations
Study 1	no	no	no	no	no	no	no	no	71
Study 2	yes	no	no	no	It is not clear	yes	no	no	25
Stdey 3	yes	no	no	no	unclear	no	no	no	11
Study 4	yes	yes	yes	Not shown	yes	yes	yes	no	64
Study 5	yes	yes	no	unclear	unclear	yes	no	no	7
Study 6	no	yes	no	yes	unclear	yes	no	no	23
Study 7	no	yes	no	no	unclear	no	yes	no	1
Study 8	yes	yes	no	yes	yes	yes	no	Yes, but nor explained how	5
Study 9	no	no	yes	Not clear	Not clear	yes	no	no	36
Study 10	no	yes	no	no	yes	yes	yes	Mentioned, but not explained	8
Study 11	yes	yes	no	yes	Not clear	yes	no	no	3
Study 12	yes	yes	no	yes	yes	no	yes	yes	4
Study 13	no	yes	no	yes	yes	no	no	Not applicable	33
Study 14	no	yes	no	Not shown	no	no	no	Not applicable	122
Study 15	yes	yes	no	yes	yes	yes	yes	yes	60
Study 16	yes	yes	no	Not applicable	Not clear	no	yes	Not applicable	4
Study 17	yes	yes	no	yes	Not clear	yes	no	Not applicable	12
Study 18	yes	yes	yes	yes	yes	yes	yes	no	153
Study 19	No	yes	No	Not clear	No	Yes	Yes	no	8
Study 20	yes	yes	yes	yes	yes	yes	yes	Not applicable	12
Study 21	yes	yes	no	Not applicable	yes	no	yes	no	5
Study 22	yes	yes	no	Not applicable	yes	yes	no	Not applicable	9
Study 23	yes	yes	no	yes	yes	no	yes	yes	2
Study 24	yes	yes	no	Not clear	yes	yes	no	no	6

Study 25	no	no	no	Not applicable	Not clear	no	no	no	19
Study 26.	No	yes	no	yes	yes	yes	yes	no	11
Study 27	yes	yes	no	yes	yes	yes	no	Not applicable	9
Study 28	no	no	no	Not applicable	Not clear	no	no	Not applicable	159
Study 29	yes	no	no	yes	Not clear	yes	yes	no	20
Study 30	yes	no	no	yes	yes	yes	no	no	27
Study 31	yes	No	no	Not clear	Not clear	yes	no	no	8
Study 32	yes	yes	no	yes	yes	yes	yes	no	3
Study 33	yes	yes	yes	yes	yes	yes	Not applicable	yes	7
Study 34	No	Yes	no	yes	Not clear	yes	yes	no	1
Study 35	yes	yes	no	yes	Not clear	yes	yes	no	30
Study 36	no	yes	no	no	no	yes	yes	no	50
Study 37	no	yes	no	yes	yes	yes	yes	no	13
Study 38	yes	yes	yes	Not applicable	yes	yes	Not applicable	Not applicable	31
Study 39	no	yes	no	Not clear	Nor clear	yes	No	no	41
Study 40	yes	yes	no	Not clear	Not clear	yes	yes	no	7
Study 41	yes	yes	no	Not clear	Not clear	yes	Not applicable	no	49
Study 42	no	yes	no	Not clear	Not clear	yes	Not applicable	no	22
Study 43	No	No	no	Not clear	Not clear	yes	yes	no	40

15 in total

1. Factors relevant to the validity of experiments in social settings.

Authors: D T CAMPBELL
Journal: Psychol Bull Date: 1957-07 Impact factor: 17.737

Review 2. Teaching experimental design.

Authors: Derek J Fry
Journal: ILAR J Date: 2014

Review 3. Reproducibility in science: improving the standard for basic and preclinical research.

Authors: C Glenn Begley; John P A Ioannidis
Journal: Circ Res Date: 2015-01-02 Impact factor: 17.367

4. Study Design Rigor in Animal-Experimental Research Published in Anesthesia Journals.

Authors: Janine M Hoerauf; Angela F Moss; Ana Fernandez-Bustamante; Karsten Bartels
Journal: Anesth Analg Date: 2018-01 Impact factor: 5.108

5. Why Is the One-Group Pretest-Posttest Design Still Used?

Authors: Thomas R Knapp
Journal: Clin Nurs Res Date: 2016-08-24 Impact factor: 2.075

6. Guidelines for experimental studies.

Authors: J E Moorhead; P V Rao; K J Anusavice
Journal: Dent Mater Date: 1994-01 Impact factor: 5.304

7. Systematic survey of the design, statistical analysis, and reporting of studies published in the 2008 volume of the Journal of Cerebral Blood Flow and Metabolism.

Authors: Hanna M Vesterinen; Hanna V Vesterinen; Kieren Egan; Amelie Deister; Peter Schlattmann; Malcolm R Macleod; Ulrich Dirnagl
Journal: J Cereb Blood Flow Metab Date: 2010-12-15 Impact factor: 6.200

8. The bias of experimental design, including strain background, in the determination of critical Streptococcus suis serotype 2 virulence factors.

Authors: Jean-Philippe Auger; Sarah Chuzeville; David Roy; Annabelle Mathieu-Denoncourt; Jianguo Xu; Daniel Grenier; Marcelo Gottschalk
Journal: PLoS One Date: 2017-07-28 Impact factor: 3.240

9. The Second Mediterranean Seminar on Science Writing, Editing and Publishing (SWEP - 2018), Sarajevo, December 8th, 2018.

Authors: Izet Masic; Miro Jakovljevic; Osman Sinanovic; Srecko Gajovic; Mirko Spiroski; Rasim Jusufovic; Sekib Sokolovic; Besim Prnjavorac; Enver Zerem; Benjamin Djulbegovic; Selma Porovic; Slobodan Jankovic; Mirsad Hadzikadic; Lejla Zunic; Edin Begic; Edin Nislic; Nedim Begic; Emir Becirovic; Anis Cerovac; Venesa Skrijelj; Jasmina Nuhanovic
Journal: Acta Inform Med Date: 2018-12

10. Survey of the quality of experimental design, statistical analysis and reporting of research using animals.

Authors: Carol Kilkenny; Nick Parsons; Ed Kadyszewski; Michael F W Festing; Innes C Cuthill; Derek Fry; Jane Hutton; Douglas G Altman
Journal: PLoS One Date: 2009-11-30 Impact factor: 3.240

9 in total

1. Unethical Behaviors of Authors Who Published Papers in the Biomedical Journals Became a Global Problem.

Authors: Izet Masic
Journal: Med Arch Date: 2020-02

2. Predatory Journals and Publishers - Dilemmas: How to Assess it and How to Avoid it?

Authors: Izet Masic
Journal: Med Arch Date: 2021-10

3. Weaknesses in Experimental Design and Reporting Decrease the Likelihood of Reproducibility and Generalization of Recent Cardiovascular Research.

Authors: John L Williams; Hsini Cindy Chu; Marissa K Lown; Joseph Daniel; Renate D Meckl; Darshit Patel; Radwa Ibrahim
Journal: Cureus Date: 2022-01-10

Review 4. Systematic review of preclinical studies on the neutrophil-mediated immune response to air pollutants, 1980-2020.

Authors: Andrés Valderrama; Maria Isabel Zapata; Juan C Hernandez; Jaiberth A Cardona-Arias
Journal: Heliyon Date: 2022-01-25

5. On the Occasion of the Symposium "Scientometry, Citation, Plagiarism and Predatory in Scientific Publishing", Sarajevo, 2021.

Authors: Izet Masic
Journal: Med Arch Date: 2021-12

Review 6. The Hitchhiker's Guide to Human Therapeutic Nanoparticle Development.

Authors: Thelvia I Ramos; Carlos A Villacis-Aguirre; Katherine V López-Aguilar; Leandro Santiago Padilla; Claudia Altamirano; Jorge R Toledo; Nelson Santiago Vispo
Journal: Pharmaceutics Date: 2022-01-21 Impact factor: 6.321

7. Evaluation of Preclinical and Clinical Studies Published in Medical Journals of Bosnia and Herzegovina: Methodology Issues.

Authors: Slobodan M Jankovic; Izet Masic
Journal: Acta Inform Med Date: 2020-03

8. Comparative Analysis of Web of Science and Pubmed Indexed Medical Journals Published in Former Yugoslav Countries.

Authors: Izet Masic; Slobodan M Jankovic
Journal: Med Arch Date: 2020-08

9. Guidelines for Editing Biomedical Journals: Recommended by Academy of Medical Sciences of Bosnia and Herzegovina.

Authors: Izet Masic; Slobodan M Jankovic; Asim Kurjak; Doncho M Donev; Muharem Zildzic; Osman Sinanovic; Izet Hozo; Snjezana Milicevic; Sefik Hasukic; Emir Mujanovic; Kenan Arnautovic; Senaid Trnacevic; Enisa Mesic; Mirza Biscevic; Mustafa Sefic; Vjekoslav Gerc; Abdulah Kucukalic; Zlatko Hrgovic; Jacob Bergsland; Mirko Grujic
Journal: Acta Inform Med Date: 2020-12

9 in total