| Literature DB >> 28783069 |
Behnaz Poursartip1,2, Marie-Eve LeBel3,4, Laura C McCracken5, Abelardo Escoto6, Rajni V Patel7,8,9, Michael D Naish10,11,12, Ana Luisa Trejos13,14.
Abstract
Minimally invasive skills assessment methods are essential in developing efficient surgical simulators and implementing consistent skills evaluation. Although numerous methods have been investigated in the literature, there is still a need to further improve the accuracy of surgical skills assessment. Energy expenditure can be an indication of motor skills proficiency. The goals of this study are to develop objective metrics based on energy expenditure, normalize these metrics, and investigate classifying trainees using these metrics. To this end, different forms of energy consisting of mechanical energy and work were considered and their values were divided by the related value of an ideal performance to develop normalized metrics. These metrics were used as inputs for various machine learning algorithms including support vector machines (SVM) and neural networks (NNs) for classification. The accuracy of the combination of the normalized energy-based metrics with these classifiers was evaluated through a leave-one-subject-out cross-validation. The proposed method was validated using 26 subjects at two experience levels (novices and experts) in three arthroscopic tasks. The results showed that there are statistically significant differences between novices and experts for almost all of the normalized energy-based metrics. The accuracy of classification using SVM and NN methods was between 70% and 95% for the various tasks. The results show that the normalized energy-based metrics and their combination with SVM and NN classifiers are capable of providing accurate classification of trainees. The assessment method proposed in this study can enhance surgical training by providing appropriate feedback to trainees about their level of expertise and can be used in the evaluation of proficiency.Entities:
Keywords: arthroscopy; energy-based metrics; machine learning algorithms; sensorized instruments; surgical skills assessment
Mesh:
Year: 2017 PMID: 28783069 PMCID: PMC5579843 DOI: 10.3390/s17081808
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Shoulder simulator and video tower (a), the sensorized arthroscopic probe (b), and the sensorized arthroscopic grasper (c).
Figure 2The arthroscopic tasks investigated in this study: (a) Task 1; (b) Task 2; and (c) Task 3.
Number of subjects with valid data from the instrument, the arthroscope, and both the instrument and the arthroscope for the three studied tasks.
| Task | Instrument | Arthroscope | Instrument and Athroscope | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Novices | Experts | Total | Novices | Experts | Total | Novices | Experts | Total | |
| 1 | 16 | 4 | 20 | 15 | 5 | 20 | 14 | 4 | 18 |
| 2 | 18 | 5 | 23 | 17 | 6 | 23 | 15 | 5 | 20 |
| 3 | 16 | 4 | 20 | 12 | 4 | 16 | 12 | 4 | 16 |
Figure 3The arthroscope’s tip displacement (a,b), and angle (c,d) for a random novice and expert subject over the same time duration.
Figure 4Energy-based metrics for the instrument (left) and arthroscope (right). (a) Task 1, (b) Task 2, and (c) Task 3. In this figure, ** indicates a statistically significant difference with p value less than 0.01 and * indicates a statistically significant difference with p value less than 0.05.
The mean and standard deviation of the normalized energy-based metrics for the novice and expert groups and the corresponding p values. The statistically significant p values are shown in bold. , , , and stand for the normalized potential energy, normalized translational kinetic energy, normalized rotational kinetic energy, and normalized work, respectively. All of the metrics, except of the instrument and the arthroscope for Task 2, which are indicated with an asterisk (*), demonstrated statistically significant differences between novices and experts. The metrics with a normal distribution are marked with † in the p value column.
| Task 1 | Task 2 | Task 3 | ||||||
|---|---|---|---|---|---|---|---|---|
| Metric | Level | Mean ± SD | Mean ± SD | Mean ± SD | ||||
| Instrument | Novice | 5.87 ± 4.47 | 5.15 ± 3.26 | 5.22 ± 5.55 | ||||
| Expert | 0.88 ± 0.44 | 1.18 ± 0.60 | 1.10 ± 0.17 | |||||
| Novice | 6.24 ± 4.65 | 5.92 ± 4.12 | 6.92 ± 6.98 | |||||
| Expert | 1.07 ± 0.51 | 1.37 ± 0.74 | 1.84 ± 1.29 | |||||
| Novice | 145.35 ± 127.06 | 8.05 ± 10.15 | 0.199 * | 5.10 ± 3.33 | ||||
| Expert | 50.56 ± 29.24 | 2.41 ± 2.65 | 1.95 ± 0.78 | |||||
| Novice | 29.43 ± 22.71 | 9.92 ± 10.16 | 7.64 ± 7.89 | |||||
| Expert | 3.49 ± 1.32 | 2.68 ± 2.52 | 0.53 ± 0.21 | |||||
| Arthroscope | Novice | 11.89 ± 9.35 | 7.07 ± 6.10 | 8.34 ± 8.02 | ||||
| Expert | 1.59 ± 0.79 | 2.34 ± 1.93 | 2.23 ± 1.56 | |||||
| Novice | 10.72 ± 8.74 | 6.44 ± 5.47 | 8.38 ± 8.56 | |||||
| Expert | 1.40 ± 0.77 | 1.58 ± 0.79 | 1.84 ± 1.88 | |||||
| Novice | 17.23 ± 23.34 | 10.95 ± 19.03 | 0.062 * | 10.46 ± 11.53 | ||||
| Expert | 1.56 ± 0.79 | 1.44 ± 1.36 | 1.09 ± 1.10 | |||||
Figure 5Accuracy using (a) only the instrument’s metrics; (b) using only the arthroscope’s metrics; and (c) using metrics of both the instrument and the arthroscope. SVM: support vector machine; KNN: K-nearest neighbors; NN: neural network; LDA: linear discriminant analysis.
Accuracy, precision, recall, and F1 score as percentages when the normalized energy-based metrics of both the instrument and the arthroscope are used as inputs of the classifiers.
| Task | Classifier | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|---|
| 1 | SVM | 94.44 | 80.00 | 100.00 | 88.89 |
| KNN | 77.78 | 50.00 | 75.00 | 60.0 | |
| NN | 88.89 | 75.00 | 75.00 | 75.00 | |
| LDA | 72.22 | 42.85 | 75.00 | 54.55 | |
| 2 | SVM | 80.00 | 57.14 | 80.00 | 66.67 |
| KNN | 85.00 | 75.00 | 60.00 | 66.67 | |
| NN | 95.00 | 100.00 | 80.00 | 88.89 | |
| LDA | 90.00 | 80.00 | 80.00 | 80.00 | |
| 3 | SVM | 93.75 | 80.00 | 100.00 | 88.89 |
| KNN | 87.50 | 66.67 | 100.00 | 80.00 | |
| NN | 93.75 | 80.00 | 100.00 | 88.89 | |
| LDA | 81.25 | 60.00 | 75.00 | 66.67 |
Accuracy, precision, recall, and F1 score as percentages when task time, path length of both the instrument and the arthroscope, and maximum bending force of the instrument are used as inputs to the classifiers.
| Task | Classifier | Accuracy | Precision | Recall | F1 score |
|---|---|---|---|---|---|
| 1 | SVM | 66.67 | 25.00 | 25.00 | 25.00 |
| KNN | 61.11 | 0.00 | 0.00 | 0.00 | |
| NN | 83.33 | 66.67 | 50.00 | 57.14 | |
| LDA | 88.89 | 75.00 | 75.00 | 75.00 | |
| 2 | SVM | 80.00 | 66.67 | 40.00 | 50.00 |
| KNN | 75.00 | 50.00 | 20.00 | 28.57 | |
| NN | 95.00 | 100.00 | 80.00 | 88.89 | |
| LDA | 80.00 | 100.00 | 20.00 | 33.33 | |
| 3 | SVM | 81.25 | 60.00 | 75.00 | 66.67 |
| KNN | 87.50 | 75.00 | 75.00 | 75.00 | |
| NN | 93.75 | 100.00 | 75.00 | 85.71 | |
| LDA | 87.50 | 100.00 | 50.00 | 66.67 |
Mean and standard deviation of the running time for different classifiers.
| Classifier | SVM | KNN | NN | LDA |
|---|---|---|---|---|
| Running time (s) (Mean ± SD) | 0.969 ± 0.028 | 0.866 ± 0.039 | 3.290 ± 0.452 | 1.015 ± 0.033 |