Medical imaging is one of the first branches of health science to utilize machine
learning and artificial intelligence (AI) to assist human medical practice.[1] Machine learning allows computers to learn in an analogous way to humans,
extracting patterns or classes based on input experience or a data set. This
parallelism in learning is becoming increasingly close with the continuous
innovations in data science and the progress of machine learning and AI.[2] With the advancement in medical imaging technology and the incorporation of
large data sets, machines can extract features that are arguably beyond the reach of
human perception and cognition. In recent years, a number of machine learning
algorithms have been used in content-based image retrieval systems for improving
efficiency and accuracy.[3],[4]Several computational principles can be used to categorize machine learning
algorithms, such as unsupervised learning, supervised learning, and semi-supervised learning.[4] In supervised learning, a system is provided with input and output features,
and the emphasis is on understanding how these are mapped to each other. In
unsupervised learning, inferences are drawn from data sets comprising input data
without any labeled data. Cluster analysis is the most common unsupervised learning
method, which is used for identifying trends or groups in data through exploratory
data analysis.[5] It functions by grouping sets of unlabeled data into clusters of similar
features without differentiating between dependent and non-dependent variables.
Semi-supervised learning is a compromise between supervised and unsupervised
learning techniques, utilizing some labeled data to leverage the analysis of
unlabeled data. Speech analysis is the most common application of semi-supervised
learning models.[6-8]Presently, a new era of AI in radiology is emerging with, focus on analyzing images
which has been showing promising results for some time. Indeed, expectations the
application of AI to radiological images have increased significantly. This suggests
a need to review the existing literature on the application of machine learning and
AI to radiological modalities, so that their potential effect can be understood. The
purpose of this study is to provide a review on the challenges and barriers
experienced in diagnostic radiology on the basis of the key clinical applications of
machine learning techniques. The following hypothesis will be examined in the
study:= The radiomic information extracted through machine learning programs form
images that improves the prognostic and diagnostic value of data sets.
Methodology
Design and eligibility
This review has carried out a search for studies that explore the challenges and
barriers in diagnostic radiology through the context of machine learning
techniques. Only studies published in 2010–2019 and in English were included.
The setting was hospital-based or clinical-based, and concerned reporting the
effectiveness of machine learning models or AI algorithms on the ability to
detect and interpret radiological findings. Narrative reviews, letters,
preprints, and scientific reports were also included in the review.
Interventions and findings related to home-based settings were excluded, as were
studies on non-human or animal samples or duplicate data were excluded (Table 1). The review
assumes that expert opinion or consensus opinion and standard-of-care diagnoses
are accurate.
Table 1.
Description of inclusion and exclusion criteria.
Inclusion criteria
Exclusion criteria
Publication
Between 2010 to 2019
Before 2010
Setting
Hospital or clinical based
Home based
Outcome
Effectiveness of AI on radiological findings
Effectiveness of AI non-radiological medical equipment
This review has searched EMBASE, Science Citation Index, Conference Proceedings
Citation Index, and Ovid-MEDLINE for studies published from 1 January 2010 to 30
December 2019, in English. The following keywords were used: Machine learning
AND imaging; Machine learning AND Radiology; Deep learning AND algorithms AND
imaging; and AI AND Radiology
Patients and intervention
Studies that include patients diagnosed with any type of disease detected using
machine learning algorithms were selected. Prospective assessments were
undertaken for identifying the effect of these algorithms upon diagnostic yield
and also on therapeutic yield. This review explores the implementation of
machine learning algorithms and its “downstream effects” on the clinical
pathway. The Consolidated Standards of Reporting Trials and Standard Protocol
Items were reviewed for prospective trials.
Data management
A manual search of citations, related articles, and bibliographies of included
studies was undertaken to identify any further relevant articles that might have
been missed during the automated search process. The analysis for studies
providing contingency tables for both machine learning algorithm performance and
health-care professional performance was done using the sample external
validation data sets.
Risk of bias
The recommendations of the Preferred Reporting Items for Systematic Reviews and
Meta-Analyses statement were followed throughout. Methods of analysis and
inclusion criteria were specified in advance. The research question was
formulated based on previously published recommendations for systematic reviews
of prediction models, the CHecklist for critical Appraisal and data extraction
for systematic Reviews of prediction Modeling Studies.
Data synthesis
A single contingency table was selected for each study to report the highest
accuracy of radiology professionals and machine learning algorithms. Binary
diagnostic accuracy data were extracted. Contingency tables of true-negative,
false-negative, true-positive, and false-positive were used for calculating
sensitivity and specificity.
Secondary outcomes
A meta-analysis of studies was based on contingency tables to estimate the
accuracy of machine learning algorithms. This review has assumed contingency
tables to be independent from each other, whether a study provides various
contingency tables for the same or different algorithms. A unified hierarchical
model was used for the meta-analysis of diagnostic accuracy studies and the
plotted summary receiver operating characteristic (ROC) curves.
Results
The search identified 10,758 records, out of which 5035 were screened. The study
found and evaluated 102 full text articles for eligibility; 23 studies[9-32] were included in this
systematic review. Sixteen studies collected retrospective data and seven studies
collected data prospectively. A pre-specified sample size calculation was not
reported in any of the studies. The condition that health-care professionals were
provided with additional clinical information alongside the image was examined in
four studies.An algorithm-plus-clinician condition was undertaken with diagnostic performance in
three studies. One study depended on single expert consensus, five studies used
histopathology for confirmation, four studies used clinical follow-up, five studies
used various models of expert consensus, two studies used clinical trial data; one
study used surgical confirmation, and two studies used clinical care notes or
labels.The present review has also pooled performances of radiology professionals and deep
learning models obtained from matched internally validated samples as an exploratory
analysis. In addition, a single contingency table was selected to report each study
with the highest accuracy.The specificity for all the deep learning models ranged from 39% to 100%, whereas
sensitivity ranged from 85% to 100%. The pooled sensitivity was 89% for the deep
learning algorithms for detecting abnormalities compared to 75% for radiology
experts, when averaging across studies using hierarchical summary ROC curves (Fig. 1). Similarly, the pooled
specificity was 85% for health-care professionals and 91% for deep learning
algorithms (p = 0.000).
Fig. 1.
ROC curves of all studies. CI: confidence interval.
ROC curves of all studies. CI: confidence interval.A comparison between radiology professionals and deep learning algorithms was made in
12 studies. Of these 12 studies, the pooled specificity for detection of
abnormalities was 91% for deep learning models and 85% for radiology professionals
(p < 0.000). The pooled sensitivity was 73% for radiology professionals and 81%
for deep learning algorithms. The pooled sensitivity detection was 82% for
health-care professionals and 83% for deep learning algorithms (p < 0.005).
Discussion
A number of machine learning algorithms, in particular, deep neural networks, have
been implemented in content-based image retrieval systems for improving query
efficiency and accuracy. The processing of radiology text reports is another
application of machine learning in radiology. Large text databases comprise the
accumulated reports of daily radiology practice.[10] Modern information processing technologies are used to exploit these
radiology report databases to enhance retrieval, report search, and assist
radiologists in making accurate diagnoses. Natural language understanding and
natural language processing offer a more effective approach for managing and
retrieving appropriate information hidden in the radiology reports.[11] They can extract meaningful information as well as manage large-scale data in
a more efficient way that is not possible for human readers.[33] The importance of machine learning has grown in recent years owing to these
attributes, being above all a practical way to carry out a text analysis of
radiology report databases.The advantage of rapid technological change in radiology, and the rapidly evolving
field of radiomics, is similar to other fields that have benefited from
transitioning to digital systems, while issues continue to present themselves, such
as those surrounding the perception that machines and computers take jobs away from
humans, often considered to be a cultural barrier in the implementation of AI in radiology.[15] It has been predicted that much of the work of anatomic pathologists and
radiologists will be possible through machine learning in the future, and thus,
human occupations will become threatened. In addition, it is likely that machine
learning techniques will become even more sophisticated in the next 5–10 years,
which may threaten radiology as a thriving human discipline.[18]Medical images are highly heterogeneous at both a population and an individual level,
and so to train AI systems for a given application is a complex task if the number
of available labeled images is restricted.[21] In this regard, there may often be a risk of over-fitting the data, and there
will be a loss of generalizability. Therefore, the practice of radiology may
beneficially integrate AI methods, rather than replace radiologists, and improve the
efficiency of digital imaging methods.[22]A robust source of ground truth for each detection is needed to validate with AI
programs trained on proven or known cases, whether the learning is supervised or unsupervised.[16] Patient outcomes, gold standard testing results, and imaging methods can
provide the source of the ground truth, but this should be comprehensively
elucidated for each AI program that is established and used clinically.[18] Fast computing systems are as yet not generally available in medical
institutions for supplying results in a clinically relevant time frame for urgent or
emergency diagnoses. However, this may not be a practical concern owing to the easy
access to “cloud” computing solutions and the rapid development of graphic
processing units at lower costs.[19]The endeavor for generalizability of results, beyond the patient population in which
the research was performed, is a broad challenge in clinical research. Data mining
can be a complicated and costly process, despite the potential of big data to show
the efficiency of technology with respect to patient outcomes in health care.[24] The lack of structured reporting can be a strong hindrance behind this
technology. Therefore, approved nomenclature and standards could be created by the
radiology industry that provides definition and structure, to locate the same types
of data across reports, irrespective of the format that each facility utilizes.
Information generated on images will not be dictated quantifiably but will be
directed from the image viewing solutions in prospective events.[26] The improvement in accuracy and efficiency of machine learning emerges when
location, anatomical findings, and measurements are developed as a result of the
radiological viewing workflow.The automatic generalizability of health-care knowledge from training data to future
test data is one of the most significant contributions of machine learning. For
instance, a computer can explain and make decisions on masses or microcalcifications
of the human breast in a mammography Computer-Aided Detection (CAD) system.[6] In this respect, then, the knowledge of radiologists of mammography diagnosis
may be said to be transferred to the computer. For data, the input should be
original, the associated problems should be anatomical structure of the patient and
previous knowledge of the object of interest, and the objective should be
segmentation of an object of interest in the image for medical image segmentation.[1]The extraction of useful features and their identification, and designing an adequate
objective function, is the second step in machine learning. Various problems can be
addressed toward the task of fitting the data to the anatomical structures of the
target. Training the algorithm and finding the best parameters for the graph cut
model are the last step in the process. An improved scanning capability is produced
through the trained machine learning segmentation algorithm for deep learning in radiology.[31]Probabilistic models solve segmentation and image content analysis in radiological applications.[5] Various processes are included therein, such as integration or
marginalization of a complete probability model, independent and dependent variable
identification, and probability density function, for making sure distributions meet
the target variable or the objective function.[7] Previous studies have addressed the segmentation problem of brain magnetic
resonance images using a hidden Markov random field model[11] which is a stochastic process generated by Markov chain. Similarly, a hidden
Markov random field model was used to capture the association between unidentified
cluster labels and observations under spatial barriers.Diagnostic imaging is one of the first medical disciplines that has optimally applied
machine learning algorithms toward the automation of health care, while other
medical fields have marked potential in this regard, particularly but not
exclusively, cardiology, dermatology, gastroenterology, and pathology.[10] Machine learning approaches may also further personalize health care to
include a wider spectrum of data, such as genetic, laboratory, imaging, clinical,
and laboratory information.Linear regression (prediction of the dependent variable of the output by fitting a
linear function to correlate the input/output pairs, which have a continuous range
of values),[34] logistic regression (where the prediction is carried out on dependent
categorical variables),[35] artificial neural networks (nonlinear connection of the input to the output,
emulating the biological neurons found in the brain),[36] and decision trees (in which the entry “nodes” are labeled with features that
are arranged to form multiple element “classes,” where a “leaf” of a decision “tree”
can reach a finite discrete target in each pathway) are the algorithms used for
supervised learning.[37] AI algorithms have facilitated significant progress in image-recognition
tasks, specifically through the use of deep learning approaches. These methods vary,
from convolutional neural networks to variational auto-encoders, and have
significant application in medical imaging analysis. In radiology practice, medical
images are traditionally evaluated visually by trained physicians, for detection,
characterization, and monitoring of disease. However, with the application of AI in
radiology modalities, identification of images can carried out with more
accuracy.One of the major challenges of AI radiology is the lack of trust by the radiologists,
when it is regarding answers related to analysis of medical images. The reason
being, many radiologists perceive it as a “black box” due to doubts regarding the
unclear process which gives a conclusive answer. scientific research and test
running of the software in the hospital can help strengthen the radiologist’s trust
in AI. According to an example proposed by one study,[38] similar cases from training databases could be depicted for rendering more
information about data and providing relevant insights to the physicians. For AI
radiology to survive, it is important to have the trust of its users, i.e.,
radiologists. Also radiologists can play a vital role in identifying targeted
clinical cases for which these AI integrated tools can be implemented to test their
effectiveness and sensitivity in clinical practice.[39] They can also play a crucial role of preserving their expertise and keeping
check on the drawbacks of over-reliance on technology.[39]In the future, imaging data may be associated more readily with non-imaging data,
such as those of electronic medical records or other large data sets. Indeed, when
applied to electronic medical record data, deep learning can assist in extracting
patient presentations that may link to clinical predictions and improve clinical
decision support systems. Machine learning may thus play more of significant role in
the prediction of treatment response and prognosis. Initial phases toward this type
of work have already begun. For example, machine learning can accurately estimate
brain tumor response to various therapies. Also, machine learning can be used in the
prediction of longevity of patients by detecting characteristics representative of
overall individual health.This study has several limitations. First, it might have language bias, as only the
studies that were published in English were included. Another limitation was that
the studies varied in their methodology; therefore, meta-analysis was not done.In conclusion, it is necessary for medical technologies to improve value with respect
to the delivery of radiology services and medical care, for reduced time on tasks
for radiologists, increased diagnostic certainty, mitigated costs of care with
effective findings for patients, and faster availability of findings. Significant
time and experience are required to establish whether these advantages have been met
in the implemented technology and to understand comparative benefits, as with any
new technological innovation. If machine learning and AI programs can be developed
that are tolerant of various data acquisition protocols and work in various patient
populations, they will have achieved the required outcomes. Nevertheless, success
will need comprehensive understanding of the conditions under which a particular
program is appropriate. Yet, the ultimate role of machine learning methods in
radiology is still unclear, as is the influence these will eventually have on
radiologists. What is apparent, however, is that machine learning and AI offer a
powerful set of tools to analyze image data that have considerable potency. The
elevated interest in AI in radiomics and radiology in recent years suggests it may
have a primary role in the near future.
Authors: Garry Choy; Omid Khalilzadeh; Mark Michalski; Synho Do; Anthony E Samir; Oleg S Pianykh; J Raymond Geis; Pari V Pandharipande; James A Brink; Keith J Dreyer Journal: Radiology Date: 2018-06-26 Impact factor: 11.105
Authors: John Zech; Margaret Pain; Joseph Titano; Marcus Badgeley; Javin Schefflein; Andres Su; Anthony Costa; Joshua Bederson; Joseph Lehar; Eric Karl Oermann Journal: Radiology Date: 2018-01-30 Impact factor: 11.105
Authors: Manisha Bahl; Regina Barzilay; Adam B Yedidia; Nicholas J Locascio; Lili Yu; Constance D Lehman Journal: Radiology Date: 2017-10-17 Impact factor: 11.105