Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Approval policies for modifications to machine learning-based software as a medical device: A study of bio-creep.

Literature DB >> 32981103

Approval policies for modifications to machine learning-based software as a medical device: A study of bio-creep.

Jean Feng¹, Scott Emerson¹, Noah Simon¹.

Abstract

Successful deployment of machine learning algorithms in healthcare requires careful assessments of their performance and safety. To date, the FDA approves locked algorithms prior to marketing and requires future updates to undergo separate premarket reviews. However, this negates a key feature of machine learning-the ability to learn from a growing dataset and improve over time. This paper frames the design of an approval policy, which we refer to as an automatic algorithmic change protocol (aACP), as an online hypothesis testing problem. As this process has obvious analogy with noninferiority testing of new drugs, we investigate how repeated testing and adoption of modifications might lead to gradual deterioration in prediction accuracy, also known as "biocreep" in the drug development literature. We consider simple policies that one might consider but do not necessarily offer any error-rate guarantees, as well as policies that do provide error-rate control. For the latter, we define two online error-rates appropriate for this context: bad approval count (BAC) and bad approval and benchmark ratios (BABR). We control these rates in the simple setting of a constant population and data source using policies aACP-BAC and aACP-BABR, which combine alpha-investing, group-sequential, and gate-keeping methods. In simulation studies, bio-creep regularly occurred when using policies with no error-rate guarantees, whereas aACP-BAC and aACP-BABR controlled the rate of bio-creep without substantially impacting our ability to approve beneficial modifications.

Entities: Chemical Disease Gene Species

Keywords: AI/ML-based SaMD; alpha-investing; gate-keeping; group-sequential; online hypothesis testing

Year: 2020 PMID： 32981103 PMCID： PMC7946712 DOI： 10.1111/biom.13379

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

13 in total

1. One-sided tests in clinical trials with multiple endpoints.

Authors: D A Bloch; T L Lai; P Tubert-Bitter
Journal: Biometrics Date: 2001-12 Impact factor: 2.571

2. STATISTICS. The reusable holdout: Preserving validity in adaptive data analysis.

Authors: Cynthia Dwork; Vitaly Feldman; Moritz Hardt; Toniann Pitassi; Omer Reingold; Aaron Roth
Journal: Science Date: 2015-08-07 Impact factor: 47.728

3. Survival model predictive accuracy and ROC curves.

Authors: Patrick J Heagerty; Yingye Zheng
Journal: Biometrics Date: 2005-03 Impact factor: 2.571

4. A combined superiority and non-inferiority approach to multiple endpoints in clinical trials.

Authors: Daniel A Bloch; Tze Leung Lai; Zheng Su; Pascale Tubert-Bitter
Journal: Stat Med Date: 2007-03-15 Impact factor: 2.373

5. Gatekeeping procedures with clinical trial applications.

Authors: Alex Dmitrienko; Ajit C Tamhane
Journal: Pharm Stat Date: 2007 Jul-Sep Impact factor: 1.894

6. The hidden effect of time.

Authors: D G Altman; J P Royston
Journal: Stat Med Date: 1988-06 Impact factor: 2.373

Review 7. Making medicines evergreen.

Authors: Andrew W Hitchings; Emma H Baker; Teck K Khong
Journal: BMJ Date: 2012-11-29

8. Sample size determination in group-sequential clinical trials with two co-primary endpoints.

Authors: Koko Asakura; Toshimitsu Hamasaki; Tomoyuki Sugimoto; Kenichi Hayashi; Scott R Evans; Takashi Sozu
Journal: Stat Med Date: 2014-03-27 Impact factor: 2.373

9. Interim analysis: the alpha spending function approach.

Authors: D L DeMets; K K Lan
Journal: Stat Med Date: 1994 Jul 15-30 Impact factor: 2.373

10. Patent protection strategies.

Authors: Himanshu Gupta; Suresh Kumar; Saroj Kumar Roy; R S Gaud
Journal: J Pharm Bioallied Sci Date: 2010-01

6 in total

1. Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees.

Authors: Jean Feng; Alexej Gossmann; Berkman Sahiner; Romain Pirracchio
Journal: J Am Med Inform Assoc Date: 2022-04-13 Impact factor: 4.497

Review 2. Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare.

Authors: Jean Feng; Rachael V Phillips; Ivana Malenica; Andrew Bishara; Alan E Hubbard; Leo A Celi; Romain Pirracchio
Journal: NPJ Digit Med Date: 2022-05-31

3. Discussion on "Approval policies for modifications to machine learning-based software as a medical device: A study of biocreep" by Jean Feng, Scott Emerson, and Noah Simon.

Authors: Sherri Rose
Journal: Biometrics Date: 2020-10-11 Impact factor: 2.571

Review 4. Lessons learned in transitioning to AI in the medical imaging of COVID-19.

Authors: Issam El Naqa; Hui Li; Jordan Fuhrman; Qiyuan Hu; Naveena Gorre; Weijie Chen; Maryellen L Giger
Journal: J Med Imaging (Bellingham) Date: 2021-10-01

5. A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study.

Authors: Sarah N Dudgeon; Si Wen; Matthew G Hanna; Rajarsi Gupta; Mohamed Amgad; Manasi Sheth; Hetal Marble; Richard Huang; Markus D Herrmann; Clifford H Szu; Darick Tong; Bruce Werness; Evan Szu; Denis Larsimont; Anant Madabhushi; Evangelos Hytopoulos; Weijie Chen; Rajendra Singh; Steven N Hart; Ashish Sharma; Joel Saltz; Roberto Salgado; Brandon D Gallas
Journal: J Pathol Inform Date: 2021-11-15

Review 6. Clinical deployment environments: Five pillars of translational machine learning for health.

Authors: Steve Harris; Tim Bonnici; Thomas Keen; Watjana Lilaonitkul; Mark J White; Nel Swanepoel
Journal: Front Digit Health Date: 2022-08-19

6 in total