| Literature DB >> 30082302 |
Martijn J Schuemie1,2, Patrick B Ryan3,2,4, George Hripcsak3,4,5, David Madigan3,6, Marc A Suchard3,7,8,9.
Abstract
Concerns over reproducibility in science extend to research using existing healthcare data; many observational studies investigating the same topic produce conflicting results, even when using the same data. To address this problem, we propose a paradigm shift. The current paradigm centres on generating one estimate at a time using a unique study design with unknown reliability and publishing (or not) one estimate at a time. The new paradigm advocates for high-throughput observational studies using consistent and standardized methods, allowing evaluation, calibration and unbiased dissemination to generate a more reliable and complete evidence base. We demonstrate this new paradigm by comparing all depression treatments for a set of outcomes, producing 17 718 hazard ratios, each using methodology on par with current best practice. We furthermore include control hypotheses to evaluate and calibrate our evidence generation process. Results show good transitivity and consistency between databases, and agree with four out of the five findings from clinical trials. The distribution of effect size estimates reported in the literature reveals an absence of small or null effects, with a sharp cut-off at p = 0.05. No such phenomena were observed in our results, suggesting more complete and more reliable evidence.This article is part of a discussion meeting issue 'The growing ubiquity of algorithms in society: implications, impacts and innovations'.Entities:
Keywords: medicine; observational research; publication bias; reproducibility
Year: 2018 PMID: 30082302 PMCID: PMC6107542 DOI: 10.1098/rsta.2017.0356
Source DB: PubMed Journal: Philos Trans A Math Phys Eng Sci ISSN: 1364-503X Impact factor: 4.226
Figure 1.High-throughput observational study design with empirical calibration, applied to the comparison of depression treatments. We apply this design to four large insurance claims databases. (Online version in colour.)
Treatments and outcomes of interest.
| treatments of interest | outcomes of interest |
|---|---|
| amitriptyline | acute liver injury |
| bupropion | acute myocardial infarction |
| citalopram | alopecia |
| desvenlafaxine | constipation |
| duloxetine | decreased libido |
| electroconvulsive therapy | delirium |
| escitalopram | diarrhoea |
| fluoxetine | fracture |
| mirtazapine | gastrointestinal haemorrhage |
| paroxetine | hyperprolactinaemia |
| psychotherapy | hyponatraemia |
| sertraline | hypotension |
| trazodone | hypothyroidism |
| venlafaxine | insomnia |
| vilazodone | nausea |
| open-angle glaucoma | |
| seizure | |
| stroke | |
| suicide and suicidal ideation | |
| tinnitus | |
| vent. arr. and sudden cardiac death | |
| vertigo |
Negative control outcomes. Outcomes not believed to be caused by any of the exposures of interest.
| acariasis | ingrowing nail |
| amyloidosis | iridocyclitis |
| ankylosing spondylitis | irritable bowel syndrome |
| aseptic necrosis of bone | lesion of cervix |
| astigmatism | Lyme disease |
| Bell's palsy | malignant neoplasm of endocrine gland |
| benign epithelial neoplasm of skin | nononeuropathy |
| chalazion | onychomycosis |
| chondromalacia | osteochondropathy |
| Crohn's disease | paraplegia |
| croup | polyp of intestine |
| diabetic oculopathy | presbyopia |
| endocarditis | pulmonary tuberculosis |
| endometrial hyperplasia | rectal mass |
| enthesopathy | sarcoidosis |
| epicondylitis | scar |
| Epstein–Barr virus disease | seborrhoeic keratosis |
| fracture of upper limb | septic shock |
| gallstone | Sjogren's syndrome |
| genital herpes simplex | Tietze's disease |
| haemangioma | tonsillitis |
| Hodgkin's disease | toxic goitre |
| human papilloma virus infection | ulcerative colitis |
| hypoglycaemic coma | viral conjunctivitis |
| hypopituitarism | viral hepatitis |
| impetigo | visceroptosis |
Figure 2.Cohort comparability and balance for duloxetine versus sertraline new users from the CCAE database. (a) Propensity score distributions for each cohort. (b) Absolute values of the standardized difference of the mean (SDM) before and after stratification for the 59 038 covariates established at baseline.
Figure 3.Evaluation of effect estimation between duloxetine and sertraline new users after stratification on the propensity scores before (top) and after (bottom) calibration. Each dot represents the hazard ratio and corresponding standard error for one of the negative (true hazard ratio = 1) or positive control (true hazard ratio greater than 1) outcomes. (Online version in colour.)
Figure 5.Effect size estimates from the literature (a, b) and the study described in this paper (c). Each dot represents a single estimate, such as relative risk, odds ratio or hazard ratio, and corresponding standard error (linearly related to the width of the asymptotic CI). Estimates below the red dashed line have a CI that excludes 1, suggesting a non-null effect. Plot (a) shows estimates extracted from the abstracts of all observational research papers in MEDLINE, plot (b) shows only the subset of those that are related to depression treatments. Plot (c) shows estimated and calibrated hazard ratios for comparisons between depression treatments for various health outcomes of interest, generated from observational data in a single study using a systematic process. An online interactive visualization enables readers to explore these results in detail, including individual study artefacts for the estimates we generated (http://data.ohdsi.org/SystematicEvidence). (Online version in colour.)
Figure 4.I2 distribution for all 2570 TCO triplets for which there was enough data in all four databases. Blue shows the distribution before calibration, red shows the distribution after calibration. (Online version in colour.)