| Literature DB >> 31373357 |
Ben Van Calster1,2, Laure Wynants1, Dirk Timmerman1,3, Ewout W Steyerberg2, Gary S Collins4,5.
Abstract
There is increasing awareness that the methodology and findings of research should be transparent. This includes studies using artificial intelligence to develop predictive algorithms that make individualized diagnostic or prognostic risk predictions. We argue that it is paramount to make the algorithm behind any prediction publicly available. This allows independent external validation, assessment of performance heterogeneity across settings and over time, and algorithm refinement or updating. Online calculators and apps may aid uptake if accompanied with sufficient information. For algorithms based on "black box" machine learning methods, software for algorithm implementation is a must. Hiding algorithms for commercial exploitation is unethical, because there is no possibility to assess whether algorithms work as advertised or to monitor when and how algorithms are updated. Journals and funders should demand maximal transparency for publications on predictive algorithms, and clinical guidelines should only recommend publicly available algorithms.Entities:
Keywords: artificial intelligence; external validation; machine learning; model performance; predictive analytics
Mesh:
Year: 2019 PMID: 31373357 PMCID: PMC6857503 DOI: 10.1093/jamia/ocz130
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Summary of arguments in favor of making predictive algorithms fully available, hurdles for doing so, and reasons why developers choose to hide and sell algorithms
| Why should predictive algorithms be fully and publicly available? |
Facilitate external validation and assessment of heterogeneity in performance Facilitate uptake of algorithm by researchers and clinicians, avoid research waste Facilitate updating for specific settings For publicly funded research, this makes research results available to the community |
| Recommendations to maximize algorithm availability |
Report the full equation of a predictive algorithm, where possible (eg, regression-based models); this includes reporting of the intercept, or baseline hazard information for time-to-event regression models When making an algorithm available online or via a mobile app, provide relevant and complete background information For complex algorithms (eg, black-box machine learning), provide software to facilitate implementation and large-scale validation studies |
| Potential reasons why developers might choose to hide and sell algorithms |
Generate income for further research More control over how people use an algorithm Facilitate FDA approval or CE certification, because a commercial entity can be identified To install a profitable business model |