Literature DB >> 32812067

Machine learning in neurosurgery: a global survey.

Victor E Staartjes1,2,3, Vittorio Stumpo4,5, Julius M Kernbach6, Anita M Klukowska7,8, Pravesh S Gadjradj9,10, Marc L Schröder7, Anand Veeravagu11, Martin N Stienen4, Christiaan H B van Niftrik4, Carlo Serra4, Luca Regli4.   

Abstract

BACKGROUND: Recent technological advances have led to the development and implementation of machine learning (ML) in various disciplines, including neurosurgery. Our goal was to conduct a comprehensive survey of neurosurgeons to assess the acceptance of and attitudes toward ML in neurosurgical practice and to identify factors associated with its use.
METHODS: The online survey consisted of nine or ten mandatory questions and was distributed in February and March 2019 through the European Association of Neurosurgical Societies (EANS) and the Congress of Neurosurgeons (CNS).
RESULTS: Out of 7280 neurosurgeons who received the survey, we received 362 responses, with a response rate of 5%, mainly in Europe and North America. In total, 103 neurosurgeons (28.5%) reported using ML in their clinical practice, and 31.1% in research. Adoption rates of ML were relatively evenly distributed, with 25.6% for North America, 30.9% for Europe, 33.3% for Latin America and the Middle East, 44.4% for Asia and Pacific and 100% for Africa with only two responses. No predictors of clinical ML use were identified, although academic settings and subspecialties neuro-oncology, functional, trauma and epilepsy predicted use of ML in research. The most common applications were for predicting outcomes and complications, as well as interpretation of imaging.
CONCLUSIONS: This report provides a global overview of the neurosurgical applications of ML. A relevant proportion of the surveyed neurosurgeons reported clinical experience with ML algorithms. Future studies should aim to clarify the role and potential benefits of ML in neurosurgery and to reconcile these potential advantages with bioethical considerations.

Entities:  

Keywords:  Artificial intelligence; Global; Machine learning; Neurosurgery; Technology; Worldwide survey

Mesh:

Year:  2020        PMID: 32812067      PMCID: PMC7593280          DOI: 10.1007/s00701-020-04532-1

Source DB:  PubMed          Journal:  Acta Neurochir (Wien)        ISSN: 0001-6268            Impact factor:   2.216


Introduction

Recent years have witnessed the rise of machine learning applications in the scientific literature, both in basic science and clinical medicine [18, 26]. Neurosurgical practice has always relied on the individual experience of surgeons to carefully balance surgical indications, operative risk and expected outcome [30]. The advent of evidence-based medicine has framed the surgical decision-making process into guidelines based on the results of high-quality data, and of randomized controlled clinical trials—not devoid of several flaws in design themselves [19]. This approach, despite remaining the gold standard, is limited by the oversimplification of patients’ individual characteristics that often do not allow patient-specific analytics. With the exponential growth of data in the era of big data, it is increasingly important to provide clinicians with tools for integrating this individual patient data into reliable prediction models. The latter primarily aims to enhance the surgical decision-making processes and potentially improve outcomes, but predictive analytics furthermore harbour the potential to reduce unnecessary health-care costs [21, 29, 31, 34, 36, 37, 41]. It is often difficult for clinicians to integrate the many described risk factors and outcome predictors into a single workable prognosis [3]. Neurosurgical research and clinical practice is ideal for the application of machine learning (ML), which harbours the potential for predictive analytics to integrate all relevant patient factors in a way that is often too complex for natural intelligence [28, 40]. Moreover, ML can be used to extract deep features from data such as radiological and histological images, or genomic data [16, 38–40, 43]. At present, the neurosurgical literature is increasingly focusing on substituting traditional statistical models with more complex ML models with the aim of improving predictive power [29, 31]. For example, ML has been used in neurosurgery to predict post-operative satisfaction [2], early post-operative complications [41] or cerebrospinal fluid leaks [37]. Despite this encouraging trend and the presence of recent publications reviewing the large range of publications on ML in neurosurgery [28-30], data on the worldwide adoption and perception of ML in our specialty are currently lacking. Our aim was to carry out a worldwide survey among neurosurgeons to assess the adoption of ML algorithms into neurosurgical clinical practice and research and to identify factors associated with their use.

Materials and methods

Sample population

The survey was distributed via the European Association of the Neurosurgical Societies (EANS) and Congress of Neurological Surgeons (CNS) in January, February and March 2019. The EANS is the professional organization that represents European neurosurgeons. An email invitation was sent through the EANS newsletter on January 28, 2019. Furthermore, the membership database of the CNS was searched for email addresses of active members and congress attendants. The CNS is a professional, US-based (US) organization, that represents neurosurgeons worldwide. At the time of the search, the database contained 9007 members from all continents. A total of 7280 neurosurgeons had functioning email addresses and were recipients of the survey. The survey was hosted by SurveyMonkey (San Mateo, CA, USA) and sent by email alongside an invitation letter. Reminders were sent after 2 and 4 weeks to non-responders to increase the response rate. To limit answers to unique site visitors, each email address was only allowed to fill in the survey once. All answers were captured anonymously. No incentives were provided.

Survey content

The online survey was made up of nine or ten compulsory questions, depending on the participants’ choice of whether they had or had not used ML in their neurosurgical practice. A complete overview of survey questions and response options is provided in Table 1. The order in which potential reasons for use/non-use were displayed was randomized to avoid systematic bias. The definition of ML applications that were provided within the survey was: “Any form of artificial intelligence (AI)–based or algorithm-based assistance, including but not limited to (online) prediction models, automated radiographic analysis (i.e. segmentation, classification), diagnostic models, ML-based scoring systems, etc. Logistic and linear regressions are also considered ML. Other common ML algorithms include (deep) neural networks, random forests, decision trees, gradient boosting machines and naïve Bayes classifiers. The survey was developed by the authors based on prior, similar surveys carried out in a similar population [9, 10]. This report was constructed according to the Checklist for Reporting Results of Internet E-Surveys (CHERRIES) guidelines [8].
Table 1

Elements contained within the survey. Depending on the participants’ choice, nine or ten questions were displayed

QuestionResponse optionsResponse type
What is your primary subspecialty?Spine; neurovascular; neuro-oncology; trauma; epilepsy, paediatric; peripheral nerve; neuro-intensive care; functional; otherSingle choice; free text
What setting do you primarily practice in?Academic hospital; non-academic hospital; private practice; otherSingle choice; free text
What is your level of experience?Resident; fellow; board-certified/attending; chairperson; medical student; otherSingle choice; free text
What is your gender?Male; femaleSingle choice
What age group are you in?< 30 years; 30–40 years; 40–50 years; 50–60 years; > 60 yearsSingle choice
What country are you currently based in?ListSingle choice
In your clinical practice, have you ever made use of machine learning?Yes, noSingle choice
If yes:
  What have you used machine learning for? Please select any of the applicableShared decision-making/patient information; outcome prediction; prediction of complications: interpretation/quantification of imaging; grading of disease severity; diagnosis; otherMulti-choice; free text
  Please rate the importance of the following reasons for using machine learning from 1 to 4, based on your own clinical experience
  Improved preoperative surgical  decision-making/treatment selection1 (Not important) to 4 (Highly important)Single choice
  Improved anticipation of complications1 (Not important) to 4 (Highly important)Single choice
  Objectivity in diagnosis/grading/risk  assessment1 (Not important) to 4 (Highly important)Single choice
  Improved shared decision-making/ patient information1 (Not important) to 4 (Highly important)Single choice
  Time savings1 (Not important) to 4 (Highly important)Single choice
If no:
  Please rate the importance of the following reasons for not using machine learning from 1 to 4
  Not personally convinced of added value1 (Not important) to 4 (Highly important)Single choice
  Lack of skilled resources (staff, equipment)  to develop a model1 (Not important) to 4 (Highly important)Single choice
  Lack of data (quantity/quality) to develop a model1 (Not important) to 4 (Highly important)Single choice
  Limited time to implement ML in clinical practice1 (Not important) to 4 (Highly important)Single choice
  Limited affordability1 (Not important) to 4 (Highly important)Single choice
  Difficulties in deciding which processes may  benefit most from application of ML algorithms1 (Not important) to 4 (Highly important)Single choice
  Lack of ML models for my indications1 (Not important) to 4 (Highly important)Single choice
In your research, have you ever made use of machine learning?Yes; No; I do not engage in medical researchSingle choice

ML, machine learning

Elements contained within the survey. Depending on the participants’ choice, nine or ten questions were displayed ML, machine learning

Statistical analysis

Continuous variables are given as means ± standard deviations (SD), whereas categorical variables are reported as numbers (percentages). By use of multivariable logistic regression models, we identified independent predictors of adoption of ML algorithms into clinical practice and research, respectively. Countries were grouped by region (Europe/North America/Latin America/Asia and Pacific/Middle East/Africa) according to a previous worldwide survey by Härtl et al. [10], and response rates per region were calculated. Fisher’s exact test was applied to compare ML implementation rates among regions. The importance of reasons for use or non-use of ML was compared among regions using Kruskal-Wallis H tests. When calculating the ratio of respondents who had applied ML in research, we incorporated both respondents who had never used ML in their research as well as those who do not participate in medical research into the denominator. All analyses were carried out using R version 3.5.2 (the R Foundation for Statistical Computing, Vienna, Austria). A p ≤ 0.05 was considered statistically significant in two-sided tests.

Results

Response rate and respondent characteristics

A total of 7280 CNS/EANS members were sent the survey and 362 complete or incomplete answers were received for analysis. The descriptive data of respondents are provided in Table 2. The most represented age range was 30–40 (32.6%), and 89.2% of the answers were from male participants. Most of surveyed neurosurgeons were specialized in spine surgery (36.2%). As far as the work setting was concerned, more than two-thirds of the neurosurgeons were practicing in an academic hospital (67.4%), followed by non-academic hospitals (15.5%), private practice (15.5%) and other settings (1.7%). We also sought to describe the level of experience of the surveyed population. Participants were mostly board-certified/attending neurosurgeons (59.9%), while residents (19.1%), department chairs (11.3%), fellows (5.0%), medical students (2.2%) and others (2.5%) were less represented. Geographic distribution of the answers was skewed in favour of North America (69.1%) and Europe (18.8%), while less answers were received from surgeons from Asia and Pacific (4.1%), Latin America (5.0%), Middle East (2.5%) and Africa (0.6%), with only two responses for the latter region.
Table 2

Basic demographics of the respondent population

CharacteristicValue (n = 362)
Age groups, n (%) (years)
< 3028 (7.7)
30–40118 (32.6)
40–5096 (26.5)
50–6061 (16.9)
> 6059 (16.3)
Male gender, n (%)323 (89.2)
Specialty, n (%)
Spine131 (36.2)
Neuro-oncology64 (17.7)
Neurovascular49 (13.5)
Paediatric32 (8.8)
Functional27 (7.5)
Trauma16 (4.4)
Epilepsy5 (1.4)
Neuro-intensive care4 (1.1)
Skull base1 (0.3)
Peripheral nerve2 (0.6)
Other31 (8.6)
Work setting, n (%)
Academic hospital244 (67.4)
Non-academic hospital56 (15.5)
Private practice56 (15.5)
Other6 (1.7)
Level of experience, n (%)
Board-certified/attending217 (59.9)
Resident69 (19.1)
Chairperson41 (11.3)
Fellow18 (5.0)
Medical student8 (2.2)
Other9 (2.5)
Geographic origin, n (%)
North America250 (69.1)
Europe68 (18.8)
Asia and Pacific15 (4.1)
Latin America18 (5.0)
Middle East9 (2.5)
Other2 (0.6)
Use of machine learning in clinical practice, n (%)103 (28.5)
Use of machine learning in research, n (%)108 (31.1)
Basic demographics of the respondent population

Machine learning in clinical practice and research

A total of 28.5% and 31.1% of the surveyed population responded positively when asked about the use of ML in clinical practice and in clinical research, respectively. Concerning the use of ML in clinical practice, stratified by region (Table 3), adoption rates of ML were homogenously distributed (p = 0.125), with 25.6% for North America, 30.9% for Europe, 33.3% for Latin America and the Middle East, 44.4% for Asia and Pacific and 100% for Africa, albeit with only two responses. Figure 1 illustrates the worldwide clinical use of ML. We also asked respondents to list the kinds of applications that they employed ML for (Table 4). The most frequently reported uses of ML were for prediction of outcome (60.2%) and complications (51.5%), as well as to interpret or quantify medical imaging (50.5%). In addition, neurosurgeons applied ML to better inform their patients (38.8%), to grade disease severity (37.9%) and for diagnostic analytics (19.4%).
Table 3

Proportions of neurosurgeons who report having used machine learning in clinical practice or clinical research among the responders, stratified by region

DomainRegionp
Overall (n = 362)North America (n = 250)Europe (n = 68)Latin America (n = 15)Asia & Pacific (n = 18)Middle East (n = 9)Africa (n = 2)
Clinical practice, n (%)103/362 (28.5)64 (25.6)21 (30.9)5 (33.3)8 (44.4)3 (33.3)2 (100.0)0.125
Clinical research, n (%)a108/347 (31.1)69/239 (28.9)27/67 (40.3)3/15 (20.0)6/16 (37.5)1/8 (12.5)2/2 (100.0)0.087

aWhile all responders answered the question on machine learning use in clinical practice, a subset did not answer the second question on application of machine learning in clinical research

Fig. 1

Proportions of neurosurgeons who report having used machine learning in their clinical practice among the 362 responders, stratified by region and plotted on a world map (Mercator projection)

Table 4

Reported applications of machine learning in clinical practice

ApplicationFrequency, n (%) (n = 103)
Outcome prediction62 (60.2)
Prediction of complications53 (51.5)
Interpretation/quantification of imaging52 (50.5)
Shared decision-making/patient information40 (38.8)
Grading of disease severity39 (37.9)
Diagnosis20 (19.4)
Proportions of neurosurgeons who report having used machine learning in clinical practice or clinical research among the responders, stratified by region aWhile all responders answered the question on machine learning use in clinical practice, a subset did not answer the second question on application of machine learning in clinical research Proportions of neurosurgeons who report having used machine learning in their clinical practice among the 362 responders, stratified by region and plotted on a world map (Mercator projection) Reported applications of machine learning in clinical practice

Predictors of machine learning use

Multivariate logistic regression analysis (Table 5) was used to investigate independent predictors of ML use in clinical practice and research. Our analysis revealed that none of the studied variables was associated with increased or decreased use of ML in clinical practice, demonstrating the wide and homogenous adoption of ML globally. On the other hand, surgeons specialized in neuro-oncology (odds ratio (OR) = 2.76, 95% confidence interval (CI) = 1.28 to 6.05, p = 0.010), functional neurosurgery (OR = 2.79, 95% CI = 1.03 to 7.47, p = 0.040), trauma (OR = 3.8, 95% CI = 1.44 to 10.02, p = 0.007) and epilepsy (OR = 3.8, 95% CI = 1.14 to 12.9, p = 0.030) were found to be significantly more likely to apply ML for research purposes with respect to the reference group. Also, when referenced to neurosurgeons working in academic hospitals, those working in non-academic centres (OR = 0.23, 95% CI = 0.08 to 0.57, p = 0.003) or in private practice (OR = 0.36, 95% CI = 0.14 to 0.85, p = 0.026) were significantly less likely to engage in ML-based research.
Table 5

Multivariable logistic regression models describing the relationship between covariates and adoption of machine learning into clinical practice and research, respectively

VariableClinical practiceClinical research
OR95% CIp valueOR95% CIp value
Age group
 < 301.210.52 to 2.740.6581.330.55 to 3.190.520
 30–40Reference--Reference--
 40–500.970.41 to 2.20.9381.330.56 to 3.170.520
 50–601.620.71 to 3.70.2480.850.33 to 2.10.730
 > 601.820.47 to 6.930.3823.250.78 to 13.70.110
Male gender0.970.43 to 2.270.9352.190.89 to 5.940.100
Specialty
 SpineReference--Reference--
 Neuro-oncology1.120.53 to 2.320.7632.761.28 to 6.050.010*
 Neurovascular1.130.51 to 2.430.7540.670.26 to 1.610.380
 Paediatric0.580.19 to 1.570.3011.000.33 to 2.850.997
 Functional1.000.37 to 2.500.9962.791.03 to 7.470.040*
 Trauma1.460.55 to 3.680.4253.801.44 to 10.020.007*
 Epilepsy2.270.75 to 6.740.1403.801.14 to 12.90.030*
 Neuro-intensive careNANA0.991NANA0.990
 Peripheral nerveNANA0.9932.820.11 to 75.50.570
 Skull base10.05 to 8.930.9972.010.09 to 20.120.480
 OtherNANA0.995NANA0.990
Setting
 Academic hospitalReference--Reference--
 Non-academic hospital0.670.30 to 1.430.3150.230.08 to 0.570.003*
 Private practice0.590.26 to 1.280.1950.360.14 to 0.850.026*
 Other1.110.13 to 6.890.915NANA0.990
Experience
 Board-certified/attendingReference--Reference--
 Resident1.400.56 to 3.60.4581.140.44 to 3.000.790
 Chairperson1.580.68 to 3.580.2792.030.80 to 5.170.130
 Fellow1.360.38 to 4.630.6280.420.08 to 1.790.270
 Medical student1.180.17 to 7.370.8601.100.17 to 8.040.920
 Other0.770.11 to 3.690.7671.600.27 to 8.070.570
Geographic origin
 North AmericaReference--Reference--
 Europe1.120.57 to 2.160.7381.320.65 to 2.630.440
 Latin America2.480.81 to 7.520.5470.490.10 to 1.830.330
 Asia and Pacific1.430.41 to 4.460.1061.420.350.630
 Middle East1.640.30 to 7.450.5360.160.01 to 1.150.110
 OtherNANA0.992NANA0.999

*p ≤ 0.05

OR, odds ratio; CI, confidence interval

Multivariable logistic regression models describing the relationship between covariates and adoption of machine learning into clinical practice and research, respectively *p ≤ 0.05 OR, odds ratio; CI, confidence interval

Attitudes towards machine learning in neurosurgery

The surveyed population was also asked to rate the importance of the factors that encouraged or prevented the application of ML in neurosurgical clinical practice (Table 6). Among those the surgeons adopting who had already adopted ML into their clinical practice, their most important reasons determining this choice were first improved preoperative surgical decision-making/treatment selection (3.27 ± 0.86), followed by objectivity in diagnosis/grading/risk assessment (3.22 ± 0.84), improved anticipation of complications (3.13 ± 0.92) and improved shared decision-making/patient information (3.07 ± 0.9), while less importance was given to potential time savings (2.62 ± 1.07). These attitudes towards the benefits of ML in clinical practice were compared among regions, with no significant differences between the regions apart from the anticipation of complications (p = 0.048).
Table 6

Tabulation of reasons for use and non-use of machine learning (ML) in clinical practice, stratified per region

Region
AllNorth AmericaEuropeAsia and PacificLatin AmericaMiddle EastAfricap value
Reasons for use
Improved preoperative surgical decision-making/treatment selection3.27 ± 0.863.14 ± 0.923.57 ± 0.63.6 ± 0.553.5 ± 0.763 ± 1.413 ± 1.410.430
Improved anticipation of complications3.13 ± 0.922.92 ± 0.963.57 ± 0.63.2 ± 0.843.62 ± 0.743 ± 1.413 ± 1.410.048*
Objectivity in diagnosis/grading/risk assessment3.22 ± 0.843.25 ± 0.853.05 ± 0.743.4 ± 0.553.5 ± 0.763 ± 1.412.15 ± 2.120.680
Improved shared decision-making/patient information3.07 ± 0.93.06 ± 0.973.14 ± 0.652.8 ± 0.843.38 ± 0.742.5 ± 0.712.5 ± 2.120.720
Time savings2.62 ± 1.072.72 ± 1.032.29 ± 1.12.8 ± 1.12.5 ± 1.23 ± 1.412.5 ± 2.120.720
Reasons for non-use
Not personally convinced of added value2.04 ± 1.052.13 ± 1.051.77 ± 1.072 ± 0.941.56 ± 0.732.5 ± 1.22NA0.070
Lack of skilled resources (staff, equipment) to develop a model3.11 ± 0.983.14 ± 0.973.02 ± 1.073.1 ± 1.12.78 ± 0.833.33 ± 0.82NA0.670
Lack of data (quantity/quality) to develop a model2.67 ± 0.992.67 ± 0.992.72 ± 0.992.8 ± 0.921.78 ± 0.673.33 ± 0.82NA0.160
Limited time to implement ML in clinical practice2.85 ± 0.962.85 ± 0.982.98 ± 0.942.9 ± 0.882.33 ± 0.712.33 ± 0.52NA0.160
Limited affordability2.74 ± 1.082.77 ± 1.062.51 ± 1.162.5 ± 0.853.22 ± 1.093.33 ± 1.03NA0.034*
Difficulties in deciding which processes may benefit most from the application of ML algorithms2.75 ± 0.962.77 ± 0.932.64 ± 1.112.6 ± 0.972.78 ± 0.833 ± 0.89NA0.900
Lack of ML models for my indications2.84 ± 12.82 ± 0.992.79 ± 1.122.7 ± 0.673.44 ± 0.733.33 ± 0.82NA0.250

Continuous variables are presented as mean ± SD. The importance of reasons for use or non-use of robotics was compared among regions using Kruskal-Wallis H tests

*p ≤ 0.05

Tabulation of reasons for use and non-use of machine learning (ML) in clinical practice, stratified per region Continuous variables are presented as mean ± SD. The importance of reasons for use or non-use of robotics was compared among regions using Kruskal-Wallis H tests *p ≤ 0.05 On the other hand, when asked to rate reasons for not using ML, lack of skilled resources (staff, equipment) to develop a model received the highest score (3.11 ± 0.98), followed by time limitations restricting ML application in clinical practice (2.85 ± 0.96), lack of available ML models for the indications of interest (2.84 ± 1), uncertainty concerning which processes may benefit most from application of ML algorithms (2.75 ± 0.96) and, less importantly, lack of data quantity/quality to develop a ML model (2.67 ± 0.99). The lack of personal conviction of the added value of ML scored last (2.04 ± 1.05). The only differences among regions were observed in terms of the affordability of ML applications—this reason for non-use of ML was rated significantly higher in the Middle East and Latin America (p = 0.034).

Discussion

There exists no prior published data on the worldwide adoption of ML in neurosurgery. This global survey reached a diverse cohort of neurosurgeons at different levels of training. Our results indicate that ML has already quickly gained wide acceptance in the neurosurgical community, without notable heterogeneity in its global distribution. Almost a third of neurosurgeons reported having applied ML in either clinical practice or research, a value that exceeded expectations. Furthermore, the most common applications of ML in neurosurgery were for prediction of complications and outcomes, as well as to interpret or automatically quantify imaging. No predictors of clinical ML use were identified, again stressing that the availability and acceptance of readily developed ML tools are not bound by socio-demographic factors. On the other hand, among research-active neurosurgeons, some subspecialties as well as academic surgeons appear to apply ML more frequently for their research. Our study is the first to our knowledge to provide a worldwide overview of the implementation of ML in neurosurgical clinical practice and research. To our surprise, almost a third of respondents stated making use of ML, and this was true for both clinical practice and research. Although this can be partially explained by response bias—with academic surgeons active in the EANS and CNS targeted and with a likely higher response rate to our survey among surgeons interested in ML—our results still indicate that ML is quickly becoming one of the foremost technologies in neurosurgical practice. Importantly, the heterogeneity in adoption rates among regions was relatively low, and adoption of ML into clinical practice was not apparently influenced by limitations in costs or socioeconomic status, as is the case with other less accessible technologies such as robotics [33, 35]. While the development of ML models can often be expensive and resource-intensive, the application of readily trained ML algorithms does not usually require especially high technological standards or expenses. Many ML applications are web-based [25]. For this reason, we expect that ML will increasingly enable enhanced diagnostic, prognostic and predictive analytics around the world, even in the most rural areas. After controlling for potential confounding factors, we could not identify factors associated with increased or decreased use of ML in clinical practice. This again demonstrates how homogenously ML use seems to be distributed among the neurosurgical community. On the other hand, subspecialists in neuro-oncology, functional neurosurgery, trauma and epilepsy were significantly more likely to apply ML in their research. As expected, surgeons working in non-academic centres and private practice were less likely to engage in ML-based neurosurgical applications, consistent with the development of ML models currently being rather confined to academic institutions possessing the resources, protected time, expertise, extensive databases and computational power to create and distribute algorithms. However, it has to be considered that the development of e.g. ML-based prediction models has been massively eased by free software packages released by the major technology companies, which nowadays enable training of simple ML models on even the most basic notebooks. Still, the development of models may be limited by a lack of high-quality, structured datasets [24]. In fact, ML has already been broadly applied to several subspecialties in neurosurgery spanning from cranial [1, 7, 39], vascular [15, 32], spinal [5, 11, 13, 25, 31, 36] and radiosurgery, among others [23, 41]. Several examples of how ML outperforms traditional statistics and prognostic indexes commonly applied in the clinical practice are already available in the medical literature. For example, a recent study by van Niftrik et al. reported the use of a gradient boosting machine to predict early post-operative complications after intracranial tumour surgery [41]. The authors were able to show improved performance with respect to conventional statistical modelling based on logistic regression and interestingly observed that among the variables in their model, features that were not taken into account in the statistical model, such as histology, anatomical localization or surgical access in fact contributed strongly in the ML model [41]. Oermann et al. also showed that artificial neural networks performed better at 1-year survival prediction than more traditional models in patients with brain metastases treated with radiosurgery [22]. The same group also was able to show an improvement in predictions of arteriovenous malformation radiosurgery outcomes [23]. Staartjes et al. found that a deep learning approach was significantly better at predicting intraoperative cerebrospinal fluid leaks and gross total resection in pituitary surgery than logistic regression, while no predictors could be identified using traditional interferential statistics for the former outcome [34, 37]. In spinal neurosurgery, applications of ML have included prediction of outcome in patients with lumbar disc herniation and lumbar spinal stenosis [2, 31, 36], or to predict complications following elective adult spinal deformity procedures [14]. For example, Khor et al. developed a prediction model from a state-wide database to predict clinically relevant improvement after lumbar spinal fusion and integrated their model into a freely available web app, which was then externally validated [13, 25].. Again, this shows that while it may be resource-intensive to develop such models, they can be rolled out to clinicians and patients around the world for free using simple interfaces. Radiological applications are ideally suited to machine learning algorithms given the magnitude and complexity of data extractable from examinations such as CT and MRI scans. Interestingly, ML models can establish a hidden relationship between deep radiological features (“radiomics”) and outcomes of the pathology of interest. Lao et al., for example, were able to stratify patients into different prognostic subgroups based on radiomic features [17]. Similarly, it has been shown that it is possible to identify IDH mutation status in gliomas from radiomic features alone [4]. Finally, more extravagant applications of ML in neuroradiology include e.g. the generation of synthetic CT images—practically indistinguishable from actual CTs—from cranial MRI [6, 42]. Despite these positive results, still many present and future potential ML applications remain unknown to the majority of neurosurgical specialists. Our study determined that the factors deterring the use of ML were, in decreasing order, lack of skilled resources (staff, equipment) to develop a model, time limitations restricting ML application in clinical practice, lack of ML models for the indications of interest, uncertainty concerning which processes may benefit most from the application of ML algorithms, as well as—less importantly—lack of data to develop a model, and lack of personal convincement of the added value of this new technology. Our results warrant some considerations. First, once a ML model with clinical relevance is developed and after it has been externally validated [25], the focus has to shift on making it easy to implement and widely available in clinical practice. Web-based apps that are clinician- or patient-friendly are ideal [12, 13, 25]. Second, while a large proportion of neurosurgeons may already be applying ML in their clinical practice, it is important to foster ML literacy in the neurosurgical community. As with randomized studies forming the basis of evidence-based practice, clinicians should be able to make an informed decision as to which ML models published are likely valid and have applied good methodology, and which ones should probably not be trusted in clinical practice. Lastly, ML relies on the availability of “big data” to be exploited for algorithm training and validation subsequently [21, 24]. A wide and complete collection of patient data in the sense of population-based databases enables more representative ML models. Integrated databases with automated comprehensive data collection that are necessary for such applications are currently few and far between, preventing the development of highly generalizable models [20, 21, 24, 27].

Limitations

Survey-based studies, while able to provide important insights, have inherent limits because of several potential biases. During survey distribution, selection and response bias are frequent. Time constraints on responders may have limited their ability to answer with maximal accuracy, and in fact concerning the adoption of ML into clinical research, we obtained several incomplete or blank answers. The data is mostly based on subjective impressions of surgeons. Knowing this, bias could arise from the fact that surgeons who are more exposed to neurosurgical ML can value it more positively than those who do not routinely make use of it, and vice-versa. However, the reasons for advantages and disadvantages were specifically captured separately for users and non-users. Additionally, the relative percentage of geographic regions was skewed in favour of western countries, limiting the sensitivity of our survey for what concerns regions such as Asia and Pacific, South America and in particular Africa with only two responses.

Conclusions

This study provides a first global overview of the adoption of ML into neurosurgical practice. Machine learning has the potential to improve diagnostic work-up and neurosurgical decision-making by shedding light on radiological interpretation, surgical outcome and complication prediction and as a consequence patients’ quality of life and surgical satisfaction. A relevant proportion of neurosurgeons appears to already have adopted ML into their clinical practice in some form. The homogenous distribution of ML users in neurosurgery is a testimony to the accessibility of readily developed ML algorithms, even in low-resource settings. Still, many structural issues need to be addressed in order for ML to achieve its full potential in neurosurgery. These include easy-to-access resources for surgeons and patients; prospective-integrated data collection systems to allow model development; and surgeon education on ML, all of which can add to the rapid development of ML in neurosurgery while ensuring high quality of the introduced tools and their correct application. Best practice recommendations, external validation and sound methodology are necessary for any ML tool before its application in our high-stakes clinical practice. Furthermore, future trials may be conducted to assess the real clinical impact—and any changes in decision-making—that may be caused by ML algorithms in neurosurgery.
  42 in total

1.  Development and Validation of a Prediction Model for Pain and Functional Outcomes After Lumbar Spine Surgery.

Authors:  Sara Khor; Danielle Lavallee; Amy M Cizik; Carlo Bellabarba; Jens R Chapman; Christopher R Howe; Dawei Lu; A Alex Mohit; Rod J Oskouian; Jeffrey R Roh; Neal Shonnard; Armagan Dagal; David R Flum
Journal:  JAMA Surg       Date:  2018-07-01       Impact factor: 14.766

2.  Are patient-reported outcome measures biased by method of follow-up? Evaluating paper-based and digital follow-up after lumbar fusion surgery.

Authors:  Marc L Schröder; Marlies P de Wispelaere; Victor E Staartjes
Journal:  Spine J       Date:  2018-05-03       Impact factor: 4.166

3.  Machine learning modeling for predicting hospital readmission following lumbar laminectomy.

Authors:  Saisanjana Kalagara; Adam E M Eltorai; Wesley M Durand; J Mason DePasse; Alan H Daniels
Journal:  J Neurosurg Spine       Date:  2018-12-07

4.  Utility of deep neural networks in predicting gross-total resection after transsphenoidal surgery for pituitary adenoma: a pilot study.

Authors:  Victor E Staartjes; Carlo Serra; Giovanni Muscas; Nicolai Maldaner; Kevin Akeret; Christiaan H B van Niftrik; Jorn Fierstra; David Holzmann; Luca Regli
Journal:  Neurosurg Focus       Date:  2018-11-01       Impact factor: 4.047

Review 5.  Role of prospective registries in defining the value and effectiveness of spine care.

Authors:  Matthew J McGirt; Scott L Parker; Anthony L Asher; Dan Norvell; Ned Sherry; Clinton J Devin
Journal:  Spine (Phila Pa 1976)       Date:  2014-10-15       Impact factor: 3.468

6.  External validation of a prediction model for pain and functional outcome after elective lumbar spinal fusion.

Authors:  Ayesha Quddusi; Hubert A J Eversdijk; Anita M Klukowska; Marlies P de Wispelaere; Julius M Kernbach; Marc L Schröder; Victor E Staartjes
Journal:  Eur Spine J       Date:  2019-10-22       Impact factor: 3.134

7.  Machine Learning Algorithm Identifies Patients at High Risk for Early Complications After Intracranial Tumor Surgery: Registry-Based Cohort Study.

Authors:  Christiaan H B van Niftrik; Frank van der Wouden; Victor E Staartjes; Jorn Fierstra; Martin N Stienen; Kevin Akeret; Martina Sebök; Tommaso Fedele; Johannes Sarnthein; Oliver Bozinov; Niklaus Krayenbühl; Luca Regli; Carlo Serra
Journal:  Neurosurgery       Date:  2019-10-01       Impact factor: 4.654

8.  Development of machine learning algorithms for prediction of discharge disposition after elective inpatient surgery for lumbar degenerative disc disorders.

Authors:  Aditya V Karhade; Paul Ogink; Quirina Thio; Marike Broekman; Thomas Cha; William B Gormley; Stuart Hershman; Wilco C Peul; Christopher M Bono; Joseph H Schwab
Journal:  Neurosurg Focus       Date:  2018-11-01       Impact factor: 4.047

Review 9.  A primer on deep learning in genomics.

Authors:  James Zou; Mikael Huss; Abubakar Abid; Pejman Mohammadi; Ali Torkamani; Amalio Telenti
Journal:  Nat Genet       Date:  2018-11-26       Impact factor: 38.330

10.  Machine Learning Application for Rupture Risk Assessment in Small-Sized Intracranial Aneurysm.

Authors:  Heung Cheol Kim; Jong Kook Rhim; Jun Hyong Ahn; Jeong Jin Park; Jong Un Moon; Eun Pyo Hong; Mi Ran Kim; Seung Gyu Kim; Seong Hwan Lee; Jae Hoon Jeong; Sung Won Choi; Jin Pyeong Jeon
Journal:  J Clin Med       Date:  2019-05-15       Impact factor: 4.241

View more
  11 in total

1.  A Brief History of Machine Learning in Neurosurgery.

Authors:  Andrew T Schilling; Pavan P Shah; James Feghali; Adrian E Jimenez; Tej D Azad
Journal:  Acta Neurochir Suppl       Date:  2022

Review 2.  Machine Learning in Neuro-Oncology, Epilepsy, Alzheimer's Disease, and Schizophrenia.

Authors:  Mason English; Chitra Kumar; Bonnie Legg Ditterline; Doniel Drazin; Nicholas Dietz
Journal:  Acta Neurochir Suppl       Date:  2022

Review 3.  Machine Learning in Pituitary Surgery.

Authors:  Vittorio Stumpo; Victor E Staartjes; Luca Regli; Carlo Serra
Journal:  Acta Neurochir Suppl       Date:  2022

Review 4.  Machine Learning Algorithms in Neuroimaging: An Overview.

Authors:  Vittorio Stumpo; Julius M Kernbach; Christiaan H B van Niftrik; Martina Sebök; Jorn Fierstra; Luca Regli; Carlo Serra; Victor E Staartjes
Journal:  Acta Neurochir Suppl       Date:  2022

Review 5.  Machine learning for sperm selection.

Authors:  Jae Bem You; Christopher McCallum; Yihe Wang; Jason Riordon; Reza Nosrati; David Sinton
Journal:  Nat Rev Urol       Date:  2021-05-17       Impact factor: 14.432

6.  Postsurgical functional outcome prediction model using deep learning framework (Prediction One, Sony Network Communications Inc.) for hypertensive intracerebral hemorrhage.

Authors:  Masahito Katsuki; Yukinari Kakizawa; Akihiro Nishikawa; Yasunaga Yamamoto; Toshiya Uchiyama
Journal:  Surg Neurol Int       Date:  2021-05-03

7.  Easily created prediction model using deep learning software (Prediction One, Sony Network Communications Inc.) for subarachnoid hemorrhage outcomes from small dataset at admission.

Authors:  Masahito Katsuki; Yukinari Kakizawa; Akihiro Nishikawa; Yasunaga Yamamoto; Toshiya Uchiyama
Journal:  Surg Neurol Int       Date:  2020-11-06

8.  Preliminary development of a deep learning-based automated primary headache diagnosis model using Japanese natural language processing of medical questionnaire.

Authors:  Masahito Katsuki; Norio Narita; Yasuhiko Matsumori; Naoya Ishida; Ohmi Watanabe; Siqi Cai; Teiji Tominaga
Journal:  Surg Neurol Int       Date:  2020-12-29

Review 9.  Hemodynamic Imaging in Cerebral Diffuse Glioma-Part B: Molecular Correlates, Treatment Effect Monitoring, Prognosis, and Future Directions.

Authors:  Vittorio Stumpo; Lelio Guida; Jacopo Bellomo; Christiaan Hendrik Bas Van Niftrik; Martina Sebök; Moncef Berhouma; Andrea Bink; Michael Weller; Zsolt Kulcsar; Luca Regli; Jorn Fierstra
Journal:  Cancers (Basel)       Date:  2022-03-05       Impact factor: 6.639

10.  Easily Created Prediction Model Using Automated Artificial Intelligence Framework (Prediction One, Sony Network Communications Inc., Tokyo, Japan) for Subarachnoid Hemorrhage Outcomes Treated by Coiling and Delayed Cerebral Ischemia.

Authors:  Masahito Katsuki; Shin Kawamura; Akihito Koh
Journal:  Cureus       Date:  2021-06-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.