| Literature DB >> 32960878 |
Nadine M Mansour1,2, E Andrew Balas1, Frances M Yang3, Marlo M Vernon4.
Abstract
BACKGROUND Studies have found that many published life sciences research results are irreproducible. Our goal was to provide comprehensive risk estimates of familiar reproducibility deficiencies to support quality improvement in research. MATERIAL AND METHODS Reports included were peer-reviewed, published between 1980 and 2016, and presented frequency data of basic biomedical research deficiencies. Manual and electronic literature searches were performed in seven bibliographic databases. For deficiency concepts with at least four frequency studies and with a sample size of at least 15 units in each, a meta-analysis was performed. RESULTS Overall, 68 publications met our inclusion criteria. The study identified several major groups of research quality defects: study design, cell lines, statistical analysis, and reporting. In the study design group of 3 deficiencies, missing power calculation was the most frequent (82.3% [95% Confidence Interval (CI): 69.9-94.6]). Among the 6 cell line deficiencies, mixed contamination was the most frequent (22.4% [95% CI: 10.4-34.3]). Among the 3 statistical analysis deficiencies, the use of chi-square test when expected cells frequency was <5 was the most prevalent (15.7% [95% CI: -3.2-34.7]). In the reporting group of 12 deficiencies, failure to state the number of tails was the most frequent (65% [95% CI: 39.3-90.8]). CONCLUSIONS The results of this study could serve as a general reference when consistently measurable sources of deficiencies need to be identified in research quality improvement.Entities:
Mesh:
Year: 2020 PMID: 32960878 PMCID: PMC7519945 DOI: 10.12659/MSM.922016
Source DB: PubMed Journal: Med Sci Monit ISSN: 1234-1010
Figure 1Search Strategy of the Meta-Analysis. (PRISMA, 2009).
Baseline characteristics of the included studies.
| Author | Year | Country | Deficiency Group | Sample type | Size | Author | Year | Country | Deficiency Group | Sample type | Size |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Armstrong [ | 2010 | USA | Mycoplasma contamination | Human & animal cell cultures | 38225 | Mariotti [ | 2008 | Italy | Mycoplasma contamination & misidentification | Human & animal cell lines | 37 |
| Avram [ | 1985 | USA | Design, statistics, & reporting | Anesthesia articles | 243 | McGarrity [ | 1986 | USA | Mycoplasma contamination | Cell cultures | 2589 |
| Azari [ | 2007 | Iran | Misidentification | Human cell lines | 100 | McGuigan [ | 1995 | UK | Design, statistics, & reporting | Psychiatry research articles | 164 |
| Berglind [ | 2008 | Sweden | Misidentification | Human cancer cell lines | 384 | McKinney [ | 1989 | USA | Design, statistics, & reporting | Medical research articles | 56 |
| Bölske [ | 1988 | Sweden | Mycoplasma contamination | Cell cultures | 1424 | Mirjalili [ | 2005 | Iran | Mixed contamination | Human and animal cell lines | 138 |
| Capes-Davis [ | 2010 | Australia | Misidentification | Human cell lines | 360 | Neville [ | 2006 | USA | Design, statistics, & reporting | Dermatology research articles | 155 |
| Capes-Davis [ | 2013 | Australia | Misidentification | Human cell lines | 1157 | Nour-Eldein [ | 2016 | Egypt | Design, statistics, & reporting | Medical Research articles | 60 |
| Cobo [ | 2007 | Spain | Mixed contamination | Stem cell cultures | 151 | Olarerin-George [ | 2015 | USA | Mycoplasma contamination | Human & animal cell cultures | 484 |
| Didion [ | 2014 | USA | Cross-, mixed contamination, & misidentification | Mouse cell lines | 99 | Oliver [ | 1989 | Australia | Design, statistics, & reporting | Surgery research articles | 240 |
| Drexler [ | 1999 | Germany | Cross-contamination | Human hematopoietic cell lines | 189 | Onwuegbuzie [ | 2002 | USA | Design, statistics, & reporting | Educational Research | 36 |
| Drexler [ | 2002 | Germany | Contamination, cross-contamination, & false | Human leukemia lymphoma cell lines | 1404 | Patel [ | 2014 | India | Design, statistics, & reporting | Basic Medical articles | 128 |
| Drexler [ | 2003 | Germany | Misidentification | Human leukemia lymphoma cell lines | 550 | Pienkowska [ | 1998 | Canada | Viral contamination | Human cell lines | 75 |
| Drexler [ | 2010 | Germany | Mycoplasma contamination & misidentification | Human leukemia lymphoma cell lines | 1331 | Pilčèk [ | 2003 | Czech | Design, statistics, & reporting | Biomedical articles | 171 |
| Drexler [ | 2017 | Germany | Mycoplasma contamination & misidentification | Human leukemia lymphoma cell lines | 330 | Roulland-Dussoix [ | 1994 | France | Mycoplasma contamination | Cell cultures | 372 |
| Ercan [ | 2012 | Turkey | Design, statistics, & reporting | Medical sciences articles | 181 | Schweppe [ | 2008 | USA | Misidentification | Human thyroid cancer cell lines | 40 |
| Ercan [ | 2015 | Turkey | Design, statistics, & reporting | Medical sciences articles | 217 | Šimundić [ | 2009 | Croatia | Design, statistics, & reporting | Medical research articles | 55 |
| Ercan [ | 2017 | Turkey | Design, statistics, & reporting | Veterinary sciences articles | 204 | Spierenburg [ | 1988 | Netherlands | Mycoplasma contamination | Animal cell lines | 115 |
| Felson [ | 1984 | USA | Design, statistics, & reporting | Rheumatology research articles | 74 | Störmer [ | 2009 | Germany | Mycoplasma contamination | Human cell lines | 176 |
| Hanif [ | 2011 | Pakistan | Design, statistics, & reporting | Medical sciences articles | 80 | Strasak [ | 2007 | Austria | Design, statistics, & reporting | Medical sciences articles | 15 |
| Hassan [ | 2015 | India | Design, statistics, & reporting | Medical research articles | 2012 | Strasak [ | 2007 | Austria | Design, statistics, & reporting | Medical sciences articles | 53 |
| Hopert [ | 1993 | Germany | Mycoplasma contamination | Continuous cell lines | 42 | Teyssou [ | 1993 | France | Mycoplasma contamination | Animal cell cultures | 82 |
| Huang [ | 2017 | China | Misidentification & cross-contamination | Tumor cell lines | 278 | Timenetsky [ | 2006 | Brazil | Mycoplasma contamination | Human cell cultures | 301 |
| Hué [ | 2010 | UK | Viral contamination | Human cell lines | 411 | Uchio-Yamada [ | 2017 | Japan | Misidentification | Mouse cell lines | 80 |
| Hukku [ | 1984 | USA | Mixed contamination | Cell cultures | 275 | Uphoff [ | 2002 | Germany | Mycoplasma contamination | Leukemia lymphoma cell lines | 451 |
| Ishikawa [ | 2006 | Japan | Mycoplasma contamination | Cell cultures | 337 | Uphoff [ | 2010 | Germany | Viral contamination | Animal cell lines | 465 |
| Jin [ | 2010 | China | Design, statistics, & reporting | Medical research articles | 2913 | Uphoff [ | 2015 | Germany | Viral contamination | Human cell lines | 577 |
| Jung [ | 2003 | USA | Mycoplasma contamination | Human & animal+ cell lines | 15 | Van Kuppeveld [ | 1994 | Netherlands | Mycoplasma contamination | Human & animal cell cultures | 95 |
| Kazemiha [ | 2009 | Iran | Mycoplasma contamination | Mammalian cell lines | 200 | Welch II [ | 2002 | USA | Design, statistics, & reporting | OB/GYN research articles | 195 |
| Kazemiha [ | 2014 | Iran | Mycoplasma contamination | Human and animal cell lines | 40 | Welch [ | 1996 | USA | Design, statistics, & reporting | OB/GYN research articles | 145 |
| Korch [ | 2012 | USA | Misidentification | Endometrial & ovarian cancer | 51 | Wu [ | 2011 | China | Design, statistics, & reporting | Medical research articles | 2145 |
| Kurichi [ | 2006 | USA | Design, statistics, & reporting | Surgery research articles | 187 | Ye [ | 2015 | China | Cross-contamination | Human cell lines | 380 |
| Lucena [ | 2011 | Spain | Design, statistics, & reporting | Dentistry research studies | 226 | Yim [ | 2010 | Korea | Design, statistics, & reporting | Medical research articles | 139 |
| MacArthur [ | 1984 | USA | Design, statistics, & reporting | Medical sciences articles | 64 | Yoshino [ | 2006 | Japan | Misidentification | Human cell lines | 400 |
| MacLeod [ | 1999 | Germany | Cross-contamination | Human tumor cell lines | 252 | Zhao [ | 2011 | USA | Cross-contamination | Human cell lines | 122 |
Figure 2Simplified illustration of the aggregation of information. (A) Description of study levels; (B) pyramid of aggregating information about research deficiencies; and (C) illustrative study statements at each level.
Figure 3Frequency estimates of 3 (A–C) study design deficiencies in original research articles.
Figure 4Frequency estimates of 6 (D–I) cell line defects.
Figure 5Frequency estimates of 3 (J–L) statistical analysis deficiencies in original research articles.
Figure 6Frequency estimates of 12 (M–X) reporting deficiencies in original research articles.
Subgroup analysis of the estimated variation of reproducibility deficiencies in high-income and middle-income countries.
| Deficiency concepts | Studies | Sample | Combined I2 % | High-income I2 % | Middle-income I2 % |
|---|---|---|---|---|---|
| Sample/power calculation deficiency | 16 | 1486 | 81.78% | 84.17% | 85.40% |
| Misidentified cell lines | 18 | 5610 | 94.45% | 93.14% | 97.96% |
| Mycoplasma contamination in cell lines | 30 | 57052 | 99.39% | 99.08% | 69.99% |
| Parametric test for non-parametric data vice versa | 9 | 753 | 78.73% | 88.09% | 83.71% |
| Related data independent test or vice versa | 10 | 1695 | 91.82% | 95.68% | 95.26% |
| Mean (SD) used for non-normal or ordinal data | 8 | 3331 | 73.99% | 38.17% | 10.16% |
| Failure to report the exact p-value | 12 | 4094 | 98.13% | 74.62% | 99.22% |
| P-value significance level not defined | 6 | 434 | 0.00% | 69.01% | 0.00% |
| Name of statistical software not mentioned | 8 | 758 | 82.15% | 70.89% | 92.63% |
| Number of tails not stated | 8 | 608 | 84.46% | 85.63% | 74.92% |
Figure 7Funnel plots. (A) Eligibility criteria not mentioned or inappropriate (B) Randomization deficiency (C) Sample/power calculation deficiency (D) Cell line bacterial contamination other than mycoplasma (E). Cell line cross-contamination (F). Misidentified cell lines (G). Mixed contamination of cell lines (H). Mycoplasma cell line contamination (I). Viral contamination of cell lines (J). Chi-square test used when expected cells frequency are <5 (K). Parametric test for non-parametric data and vice versa (L). Related data independent test and vice versa (M). Mean(SD) used for non-normal or ordinal data (N). variability description +/− notation undefined (O). Failure to report exact p-value (P). p-value significance level not defined (Q). p-value reported without statistical test (R). Significance stated without providing statistical test (S). Statistical software not mentioned (T). Statistical test name incorrect (U). Study population baseline characteristics not described (V). Number of tails not stated (W). Reporting of “Where appropriate statement” (X). Statistical test used for dataset not specified.