Literature DB >> 32981103

Approval policies for modifications to machine learning-based software as a medical device: A study of bio-creep.

Jean Feng1, Scott Emerson1, Noah Simon1.   

Abstract

Successful deployment of machine learning algorithms in healthcare requires careful assessments of their performance and safety. To date, the FDA approves locked algorithms prior to marketing and requires future updates to undergo separate premarket reviews. However, this negates a key feature of machine learning-the ability to learn from a growing dataset and improve over time. This paper frames the design of an approval policy, which we refer to as an automatic algorithmic change protocol (aACP), as an online hypothesis testing problem. As this process has obvious analogy with noninferiority testing of new drugs, we investigate how repeated testing and adoption of modifications might lead to gradual deterioration in prediction accuracy, also known as "biocreep" in the drug development literature. We consider simple policies that one might consider but do not necessarily offer any error-rate guarantees, as well as policies that do provide error-rate control. For the latter, we define two online error-rates appropriate for this context: bad approval count (BAC) and bad approval and benchmark ratios (BABR). We control these rates in the simple setting of a constant population and data source using policies aACP-BAC and aACP-BABR, which combine alpha-investing, group-sequential, and gate-keeping methods. In simulation studies, bio-creep regularly occurred when using policies with no error-rate guarantees, whereas aACP-BAC and aACP-BABR controlled the rate of bio-creep without substantially impacting our ability to approve beneficial modifications.
© 2020 The International Biometric Society.

Entities:  

Keywords:  AI/ML-based SaMD; alpha-investing; gate-keeping; group-sequential; online hypothesis testing

Year:  2020        PMID: 32981103      PMCID: PMC7946712          DOI: 10.1111/biom.13379

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  13 in total

1.  One-sided tests in clinical trials with multiple endpoints.

Authors:  D A Bloch; T L Lai; P Tubert-Bitter
Journal:  Biometrics       Date:  2001-12       Impact factor: 2.571

2.  STATISTICS. The reusable holdout: Preserving validity in adaptive data analysis.

Authors:  Cynthia Dwork; Vitaly Feldman; Moritz Hardt; Toniann Pitassi; Omer Reingold; Aaron Roth
Journal:  Science       Date:  2015-08-07       Impact factor: 47.728

3.  Survival model predictive accuracy and ROC curves.

Authors:  Patrick J Heagerty; Yingye Zheng
Journal:  Biometrics       Date:  2005-03       Impact factor: 2.571

4.  A combined superiority and non-inferiority approach to multiple endpoints in clinical trials.

Authors:  Daniel A Bloch; Tze Leung Lai; Zheng Su; Pascale Tubert-Bitter
Journal:  Stat Med       Date:  2007-03-15       Impact factor: 2.373

5.  Gatekeeping procedures with clinical trial applications.

Authors:  Alex Dmitrienko; Ajit C Tamhane
Journal:  Pharm Stat       Date:  2007 Jul-Sep       Impact factor: 1.894

6.  The hidden effect of time.

Authors:  D G Altman; J P Royston
Journal:  Stat Med       Date:  1988-06       Impact factor: 2.373

Review 7.  Making medicines evergreen.

Authors:  Andrew W Hitchings; Emma H Baker; Teck K Khong
Journal:  BMJ       Date:  2012-11-29

8.  Sample size determination in group-sequential clinical trials with two co-primary endpoints.

Authors:  Koko Asakura; Toshimitsu Hamasaki; Tomoyuki Sugimoto; Kenichi Hayashi; Scott R Evans; Takashi Sozu
Journal:  Stat Med       Date:  2014-03-27       Impact factor: 2.373

9.  Interim analysis: the alpha spending function approach.

Authors:  D L DeMets; K K Lan
Journal:  Stat Med       Date:  1994 Jul 15-30       Impact factor: 2.373

10.  Patent protection strategies.

Authors:  Himanshu Gupta; Suresh Kumar; Saroj Kumar Roy; R S Gaud
Journal:  J Pharm Bioallied Sci       Date:  2010-01
View more
  6 in total

1.  Bayesian logistic regression for online recalibration and revision of risk prediction models with performance guarantees.

Authors:  Jean Feng; Alexej Gossmann; Berkman Sahiner; Romain Pirracchio
Journal:  J Am Med Inform Assoc       Date:  2022-04-13       Impact factor: 4.497

Review 2.  Clinical artificial intelligence quality improvement: towards continual monitoring and updating of AI algorithms in healthcare.

Authors:  Jean Feng; Rachael V Phillips; Ivana Malenica; Andrew Bishara; Alan E Hubbard; Leo A Celi; Romain Pirracchio
Journal:  NPJ Digit Med       Date:  2022-05-31

3.  Discussion on "Approval policies for modifications to machine learning-based software as a medical device: A study of biocreep" by Jean Feng, Scott Emerson, and Noah Simon.

Authors:  Sherri Rose
Journal:  Biometrics       Date:  2020-10-11       Impact factor: 2.571

Review 4.  Lessons learned in transitioning to AI in the medical imaging of COVID-19.

Authors:  Issam El Naqa; Hui Li; Jordan Fuhrman; Qiyuan Hu; Naveena Gorre; Weijie Chen; Maryellen L Giger
Journal:  J Med Imaging (Bellingham)       Date:  2021-10-01

5.  A Pathologist-Annotated Dataset for Validating Artificial Intelligence: A Project Description and Pilot Study.

Authors:  Sarah N Dudgeon; Si Wen; Matthew G Hanna; Rajarsi Gupta; Mohamed Amgad; Manasi Sheth; Hetal Marble; Richard Huang; Markus D Herrmann; Clifford H Szu; Darick Tong; Bruce Werness; Evan Szu; Denis Larsimont; Anant Madabhushi; Evangelos Hytopoulos; Weijie Chen; Rajendra Singh; Steven N Hart; Ashish Sharma; Joel Saltz; Roberto Salgado; Brandon D Gallas
Journal:  J Pathol Inform       Date:  2021-11-15

Review 6.  Clinical deployment environments: Five pillars of translational machine learning for health.

Authors:  Steve Harris; Tim Bonnici; Thomas Keen; Watjana Lilaonitkul; Mark J White; Nel Swanepoel
Journal:  Front Digit Health       Date:  2022-08-19
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.