| Literature DB >> 33734466 |
Olivier Q Groot1, Paul T Ogink2, Amanda Lans1, Peter K Twining1, Neal D Kapoor1, William DiGiovanni1, Bas J J Bindels2, Michiel E R Bongers1, Jacobien H F Oosterhoff1, Aditya V Karhade1, F C Oner2, Jorrit-Jan Verlaan2, Joseph H Schwab1.
Abstract
Machine learning (ML) studies are becoming increasingly popular in orthopedics but lack a critically appraisal of their adherence to peer-reviewed guidelines. The objective of this review was to (1) evaluate quality and transparent reporting of ML prediction models in orthopedic surgery based on the transparent reporting of multivariable prediction models for individual prognosis or diagnosis (TRIPOD), and (2) assess risk of bias with the Prediction model Risk Of Bias ASsessment Tool. A systematic review was performed to identify all ML prediction studies published in orthopedic surgery through June 18th, 2020. After screening 7138 studies, 59 studies met the study criteria and were included. Two reviewers independently extracted data and discrepancies were resolved by discussion with at least two additional reviewers present. Across all studies, the overall median completeness for the TRIPOD checklist was 53% (interquartile range 47%-60%). The overall risk of bias was low in 44% (n = 26), high in 41% (n = 24), and unclear in 15% (n = 9). High overall risk of bias was driven by incomplete reporting of performance measures, inadequate handling of missing data, and use of small datasets with inadequate outcome numbers. Although the number of ML studies in orthopedic surgery is increasing rapidly, over 40% of the existing models are at high risk of bias. Furthermore, over half incompletely reported their methods and/or performance measures. Until these issues are adequately addressed to give patients and providers trust in ML models, a considerable gap remains between the development of ML prediction models and their implementation in orthopedic practice.Entities:
Keywords: machine learning; orthopedics; prediction models
Mesh:
Year: 2021 PMID: 33734466 PMCID: PMC9290012 DOI: 10.1002/jor.25036
Source DB: PubMed Journal: J Orthop Res ISSN: 0736-0266 Impact factor: 3.102
Figure 1PRISMA flowchart of study inclusions and exclusions. ML, machine learning; PI, principal investigator [Color figure can be viewed at wileyonlinelibrary.com]
Characteristics of included studies (n = 59)
|
| |
|---|---|
| Variables | Median (IQR) |
| Sample size | 4782 (616–23.264) |
| Predictors included in final model | 10 (7–14) |
Abbreviations: IQR, interquartile range; ML, machine learning; PROM, patient reported outcome measure; TRIPOD, transparent reporting of a multivariable prediction model for individual prognosis or diagnosis.
The amount of predictors that were included in the final, best performing ML algorithm. In 14% (8/59) this could not be extracted from the study or was unclear.
This includes databases, such as Surveillance, Epidemiology, and End Results (SEER) or American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP).
Figure 2Overall adherence per TRIPOD item. *All items consisted of 59 datapoints, except for item 5c (58), item 11 (4), and item 14b (45) due to the “Not applicable” option. TRIPOD, transparent reporting of a multivariable prediction model for individual prognosis or diagnosis [Color figure can be viewed at wileyonlinelibrary.com]
Individual TRIPOD items sorted by completeness of reporting over 75% and under 25%
|
|
| ||||
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| 4a | Describe the study design or source of data (e.g., randomized trial, cohort, or registry data). | 100 (59) | 10b | Specify type of model, all model‐building procedures (including any predictor selection), and method for internal validation. | 3 (2) |
| 19b | Give an overall interpretation of the results considering objectives, limitations, results from similar studies and other relevant evidence. | 98 (58) | 2 | Provide a summary of objectives, study design, setting, participants, sample size, predictors, outcome, statistical analysis, results, and conclusions. | 3 (2) |
| 18 | Discuss any limitations of the study (such as nonrepresentative sample, few events per predictor, missing data). | 97 (57) | 15a | Present the full prediction model to allow predictions for individuals (i.e., all regression coefficients, and model intercept or baseline survival at a given time point). | 8 (5) |
| 3b | Specify the objectives, including whether the study describes the development of the model. | 95 (56) | 13a | Describe the flow of participants through the study, including the number of participants with and without the outcome and, if applicable, a summary of the follow‐up time. A diagram may be helpful. | 19 (11) |
| 3a | Explain the medical context and rationale for developing the multivariable prediction model, including references to existing models. | 85 (50) | 14a | Specify the number of participants and outcome events in each analysis. | 20 (12) |
| 5b | Describe eligibility criteria for participants. | 83 (49) | 1 | Identify the study as developing a multivariable prediction model, the target population, and the outcome to be predicted. | 20 (12) |
| 5c | Give details of treatments received, if relevant. | 81 (48) | 14b | If done, report the unadjusted association between each candidate predictor and outcome. | 24 (11) |
| 8 | Explain how the study size was arrived at. | 76 (45) | |||
Abbreviation: TRIPOD, transparent reporting of a multivariable prediction model for individual prognosis or diagnosis.
All items consisted of 59 datapoints, except for 5c (58) and 14b (45) due to “Not applicable” option.
Figure 3PROBAST results for all included studies (n = 59). PROBAST, prediction model risk of bias assessment tool [Color figure can be viewed at wileyonlinelibrary.com]