Literature DB >> 32133724

Artificial intelligence in oncology.

Hideyuki Shimizu1, Keiichi I Nakayama1.   

Abstract

Artificial intelligence (AI) has contributed substantially to the resolution of a variety of biomedical problems, including cancer, over the past decade. Deep learning, a subfield of AI that is highly flexible and supports automatic feature extraction, is increasingly being applied in various areas of both basic and clinical cancer research. In this review, we describe numerous recent examples of the application of AI in oncology, including cases in which deep learning has efficiently solved problems that were previously thought to be unsolvable, and we address obstacles that must be overcome before such application can become more widespread. We also highlight resources and datasets that can help harness the power of AI for cancer research. The development of innovative approaches to and applications of AI will yield important insights in oncology in the coming decade.
© 2020 The Authors. Cancer Science published by John Wiley & Sons Australia, Ltd on behalf of Japanese Cancer Association.

Entities:  

Keywords:  artificial intelligence; deep learning; machine learning; oncology; personalized medicine

Mesh:

Year:  2020        PMID: 32133724      PMCID: PMC7226189          DOI: 10.1111/cas.14377

Source DB:  PubMed          Journal:  Cancer Sci        ISSN: 1347-9032            Impact factor:   6.716


INTRODUCTION

Artificial intelligence (AI) is a field of research in which computers are applied to mimic human intelligence. Machine learning is a subfield of AI in which mathematical and statistical approaches are applied to improve the performance of computers. Deep learning is a subfield of machine learning characterized by the operation of multilayered artificial neural networks (Figure 1). The term “deep learning” refers to a set of new techniques that together have shown marked improvements in performance compared with existing best‐in‐class machine learning algorithms in several disciplines. For instance, these methods have revolutionized image classification and speech recognition as a result of their flexibility and high accuracy. These breakthroughs have allowed deep learning to be adopted as an approach that can efficiently solve various problems in biomedicine. The application of deep learning to the diagnosis of diseases on the basis of the classification of radiological or pathological images has demonstrated a performance that equals or actually exceeds that of clinical experts. , Deep learning has also proved highly accurate in the detection of retinopathy from fundus photographs. With high expectations for this technology, it is now being applied to the field of drug discovery.
Figure 1

Artificial Intelligence (AI), machine learning and deep learning. AI refers to a broad range of computational methods that mimic human intelligence. Machine learning is a subfield of AI that relies on statistical methods to detect hidden patterns within a dataset. Deep learning is a subfield of machine learning that harnesses the power of multilayered networks

Artificial Intelligence (AI), machine learning and deep learning. AI refers to a broad range of computational methods that mimic human intelligence. Machine learning is a subfield of AI that relies on statistical methods to detect hidden patterns within a dataset. Deep learning is a subfield of machine learning that harnesses the power of multilayered networks Biology and medicine are rapidly becoming data‐intensive. Stephens et al (2015) claimed that the application of AI to genomics alone will equal or exceed that to social media, online videos and other data‐intensive disciplines with regard to data generation and analysis within the next 10 years. Automated algorithms that extract meaningful patterns can provide practical knowledge and change the way in which treatments are developed, patients are classified and diseases are studied. In contrast, AI may infringe on privacy because of potential access to personal information such as genomic sequences during data processing. Given that large datasets with appropriate data annotation are required for the application of deep learning technology, it is essential that both medical professionals and biological scientists possess a basic knowledge of deep learning, including its applications and potential drawbacks, when collaborating with AI researchers and using deep learning in their projects. Cancer is the most common cause of death in developed countries, and it is estimated that the number of cases will increase further in aging populations. , In Japan, almost 1 million individuals are diagnosed with cancer and nearly 400 000 die of the disease every year. Cancer research will, thus, continue to be a top priority for the saving of lives in the coming decade. In this review, we focus on the application of deep learning to cancer. For more comprehensive information on deep learning, including its mathematical aspects, we recommend several recent reviews , , , , , and a book.

DEEP LEARNING ESSENTIALS

The deep learning approach evolved from the study of artificial neurons, being first proposed in 1943 as a model for the processing of information by neurons in the biological brain. In a neural network, the input is provided to an input layer, which transfers its computed value to one or more hidden layers that are linked to an output layer. A layer consists of a set of nodes, known as “units” or “features,” that are connected through edges to both the previous and the next layers. Each unit transforms the data in a nonlinear manner by applying an activation function (Figure 2A). A deep neural network possesses multiple (sometimes > 100) hidden layers (Figure 2B). The training process allows deeper layers in the network to combine high‐level features coming from the previous layer and build more such features. As a result, these algorithms can automatically design features that are appropriate for solving the task at hand.
Figure 2

Common architectures of neural networks. A, The simplest neural network comprises three layers: an input layer, a hidden layer and an output layer. Each node has some value and transmits its signal to the next layer. It first sums all weighted inputs and then transmits the resulting value to an activation function (rectified linear unit, or ReLU, in this case). B, A typical deep neural network, also known as a dense neural network, has multiple hidden layers, the nodes of each of which calculate values in the same manner as shown in (A). C, A convolutional neural network (CNN) applies multiple convolution layers before feeding the data into a dense neural network. The convolution layers apply filters (or kernels) to grid‐based data. D, A fully convolutional network (FCN) is a variant of a CNN in that it lacks densely connected layers. E, A recurrent neural network (RNN) is a special network designed for time‐series data. Each hidden layer holds certain variables and transmits them to the next time step. F, An autoencoder resembles a dense neural network but is trained to output signals that are identical to the inputs. Such networks offer a means to encode and decode data, with encoded features being stored in the hidden layer. G, A generative adversarial network (GAN) consists of two independent neural networks: a generator and a discriminator. The generator attempts to create new data (false data) that resemble the true data. In contrast, the discriminator discriminates real data from artificial data created by the generator. Alternate training of these two networks helps to decipher the complex rules underlying the data

Common architectures of neural networks. A, The simplest neural network comprises three layers: an input layer, a hidden layer and an output layer. Each node has some value and transmits its signal to the next layer. It first sums all weighted inputs and then transmits the resulting value to an activation function (rectified linear unit, or ReLU, in this case). B, A typical deep neural network, also known as a dense neural network, has multiple hidden layers, the nodes of each of which calculate values in the same manner as shown in (A). C, A convolutional neural network (CNN) applies multiple convolution layers before feeding the data into a dense neural network. The convolution layers apply filters (or kernels) to grid‐based data. D, A fully convolutional network (FCN) is a variant of a CNN in that it lacks densely connected layers. E, A recurrent neural network (RNN) is a special network designed for time‐series data. Each hidden layer holds certain variables and transmits them to the next time step. F, An autoencoder resembles a dense neural network but is trained to output signals that are identical to the inputs. Such networks offer a means to encode and decode data, with encoded features being stored in the hidden layer. G, A generative adversarial network (GAN) consists of two independent neural networks: a generator and a discriminator. The generator attempts to create new data (false data) that resemble the true data. In contrast, the discriminator discriminates real data from artificial data created by the generator. Alternate training of these two networks helps to decipher the complex rules underlying the data There are many architectures of deep neural networks. A convolutional neural network (CNN) can be applied to data that have a grid topology, such as images, and uses multiple filters to detect patterns within the data (Figure 2C). A fully convolutional network (FCN) differs from a CNN in that fully connected layers of the CNN are replaced by upsampling and deconvolution layers, which are considered the inverse of pooling and convolution layers, respectively (Figure 2D). An FCN generates a score map for each class instead of one probability score. This map is the same precise size as the input image and classifies the image by pixels. These new layers have been used to develop deep learning algorithms in many applications. Finally, a recurrent neural network (RNN) is useful for the analysis of sequential data, such as language and genomic sequences (Figure 2E). These neural networks correspond to supervised learning methods, which require answers to train the network. In contrast, no labels are necessary for unsupervised learning. An autoencoder is a representative example of such an unsupervised method that attempts to provide output signals that are identical to the original input (Figure 2F). A generative adversarial network (GAN) actually consists of two networks: a generator and a discriminator. The generator neural network creates fake data to cheat the discriminator network, whereas the discriminator distinguishes the false data from true data. Training these networks alternately allows the generator to finally learn how to generate fake data indistinguishable from real data (Figure 2G). Although we do not dive into detail here, various novel architectures, such as graphical convolution , and capsuleNet, are continually being proposed and applied in biomedicine. Given that each type of neural network is specialized in its specific data structure, the combination of multiple existing frameworks or the development of new architectures is expected to greatly improve the feasibility of using neural networks for interpretation of complex phenomena.

ARTIFICIAL INTELLIGENCE FOR CANCER IMAGES

Application to image analysis

Early detection of cancer is key to saving the lives of affected individuals. Deep learning has revolutionized image analysis since its spectacular win in the image recognition contest ILSVRC (ImageNet Large Scale Visual Recognition Challenge) in 2012, with many researchers and physicians having attempted to harness the power of AI (or, more precisely, CNN) for application to clinical radiology and pathology because it obviates the need to generate detailed features by craftsmanship. One example of high impact is the successful classification of dermoscopy images. , AI was, thus, found to be able to annotate skin lesions (including melanoma) as precisely as were expert dermatologists (with an area under the curve [AUC] of 0.94‐0.96). Given that smartphones extend the reach of dermatologists outside of the clinic, this achievement has the potential to provide universal access to dermatologist‐level diagnoses. AI has also achieved a level of accuracy similar to that of medical specialists in the interpretation of mammograms for breast cancer screening (AUC for AI of 0.840, compared with 0.814 for physicians). In addition, deep neural networks have been able to detect enlarged lymph nodes or colonic polyps in computed tomography images. The multiple applications of deep learning in radiology are covered in more detail elsewhere. , Whole‐slide imaging is becoming routine in developed countries, which has resulted in the accumulation of digital pathology images and allowed the application of deep learning to pathological diagnosis. One proposed model extracts features with a CNN and supplies them to a support vector machine, another machine learning algorithm, to train for the detection of breast mitosis. A deep learning system has been used to detect areas of cancer in whole‐slide images of radical prostatectomy specimens and to automatically assign the Gleason score with an accuracy of 0.70 (which is superior to that of general pathologists). A CNN has also been applied to pathological slides from The Cancer Genome Atlas (TCGA) for the automatic detection of tumor‐infiltrating lymphocytes (TIL), and the features extracted in this model (TIL maps) were found to be prognostic factors for 13 different cancer types, including breast, lung and colorectal tumors. In addition, given that deep learning can also output a map of where potential cancer cells are present on a slide and determine with what likelihood the cells are cancerous, pathologists may need only to evaluate slides for which AI is not able to provide a clear answer. Deep learning is also able to elucidate the molecular status of a tumor from pathological data. In addition to being able to assess and score the expression of tumor marker proteins such as HER2 in tissue slides, a CNN model (inception v3) successfully predicted which genes in tumor tissue harbor mutations (with AUC from 0.733 to 0.856). Another CNN model, designated HE2RNA, can predict transcriptome profiles from pathological images, without expert annotation. Collectively, these achievements show that deep learning has already been used to perform versatile tasks, such as cancer diagnosis, at a level equal to or sometimes greater than that of expert physicians. One of the remaining obstacles in automatic histopathology analysis is the establishment of standard protocols. Differences in color tone on pathology slides may occur among institutions as a result of differences in staining reagents, protocols and section thickness. It will, therefore, be necessary to standardize color tones in digital slides for the development of accurate AI algorithms. Some automated CNN‐based tools, such as HistoQC and DeepFocus, , have been developed to standardize the quality of whole‐slide imaging. GAN‐based image generators were also recently proposed to tackle this obstacle. In this case, a neural network receives a grayscale image, instead of noise data, as input, and a normalized hematoxylineosin‐stained image can be generated. GAN‐based approaches have also been adopted beyond diagnosis. For example, they have already been used to correct feature segmentation and to score levels of the immune‐checkpoint protein PD‐L1 in needle biopsy specimens of non–small cell lung cancer. Hematoxylineosin–stained specimens have also been successfully converted to immunohistologically stained images for cytokeratins 18/19. These various examples indicate that AI can be of great help in reducing the burden on medical staff involved in tumor assessment. In the near future, deep learning will become an important support tool for pathologists that will improve the accuracy and efficiency of histopathologic diagnosis and thereby inform treatment selection. For further reading, we recommend two recent reviews in this field. ,

Limitations and their potential solutions in medical image analysis

In almost all cases, the number of medical images available for training is <1 million, which is many orders of magnitude smaller than the natural image collection. Researchers have developed several strategies to overcome this limitation. Data augmentation, whereby images are randomly cropped, tilted, inverted or flipped to increase their number, is one effective strategy for dealing with the small size of training sets. An example of this approach is provided by a series of papers on the analysis of mammograms. , A second strategy, known as transfer learning, involves reuse for the new purpose of medical imaging of features extracted by deep learning from natural image datasets such as ImageNet. The potential to diagnose diabetic retinopathy from fundus photographs has attracted many deep learning researchers since 2015, when a large, labeled dataset of images was released at a machine learning competition called Kaggle. For example, researchers reused a 48‐layer CNN architecture known as inception v3 that was pretrained with natural images and succeeded in exceeding the specificity and sensitivity of the then state‐of‐the‐art model. These approaches have also been exploited in training AI to detect melanoma, the most deadly skin cancer. , Such findings motivate researchers to share image data and make it freely usable among communities. Some useful resources of tumor images are summarized in Table 1. The increasing rate of adoption of AI in oncology research will result in an increase in the number of medical images becoming available, thus allowing researchers to build more robust and sophisticated algorithms.
Table 1

Examples of available image datasets related to cancer

DatasetCancerURL
AIDA‐EStomach https://isbi-aida.grand-challenge.org/
BraTSBrain https://www.med.upenn.edu/sbia/brats2018/data.html
BreakHisBreast https://web.inf.ufpr.br/vri/databases/breast-cancer-histopathological-database-breakhis/
BACHBreast https://iciar2018-challenge.grand-challenge.org/Dataset/
CAMELYONLymph node https://camelyon17.grand-challenge.org/Data/
CCSDCervix https://www.kaggle.com/c/intel-mobileodt-cervical-cancer-screening/data
Histopathologic Cancer DetectionLymph node https://www.kaggle.com/c/histopathologic-cancer-detection/data
ISIC 2018Skin https://challenge.kitware.com/#challenge/5aab46f156357d5e82b00fe5
INbreastBreast http://medicalresearch.inescporto.pt/breastresearch/index.php/Get_INbreast_Database
KiTSKidney https://kits19.grand-challenge.org/data/
LIDC‐IDRILung https://wiki.cancerimagingarchive.net/display/Public/LIDC-IDRI#940027f1a8a845d0a61a1b5b5083567e
MED‐NODESkin http://www.cs.rug.nl/imaging/databases/melanoma_naevi/
mini‐MIASBreast http://peipa.essex.ac.uk/info/mias.html
mitos‐atypiaBreast https://mitos-atypia-14.grand-challenge.org/dataset/
PROMISE12Prostate https://promise12.grand-challenge.org/Home/
PAIPLiver https://paip2019.grand-challenge.org/Dataset/
TCIAMiscellaneous https://www.cancerimagingarchive.net/
18F‐FDGHead and Neck https://www.kaggle.com/c/pet-radiomics-challenges/data
Examples of available image datasets related to cancer

ARTIFICIAL INTELLIGENCE FOR CANCER GENOMICS

With the cost of genome sequencing declining, the use of a supercomputer to analyze genomic data from cancer patients often results in the identification of between 1000 and 100 000 genomic mutations for each tumor sample. However, it is necessary to clarify the association of each of these mutations with clinical phenotypes, which is currently a bottleneck for genomic medicine. The clinical interpretation of genetic variants depends mainly on information in the scientific and medical literature. In other words, researchers need to find the relevant literature to link the identified genomic mutations to information on disease states, effective drugs and prognosis. With more than 200 000 new cancer‐related articles being published in 2019 alone, human resources are not sufficient for manual curation. An example of a database that summarizes the relation between genomic variation and disease is the COSMIC database provided by the Sanger Center. COSMIC version 90, released in September 2019, extracted 9 733 455 mutations in gene coding regions from 26 829 papers. The use of AI will, therefore, become increasingly indispensable. Before the application of AI to genomic data, the sequence is transformed into a binary table (one‐hot encoding) that shows the presence or absence of each of the four bases at each position. Several filters are then applied one‐dimensionally for convolution of the table (Figure 3A). There are two main reasons why deep learning is useful in cancer genomics. First, in addition to single‐task learning, it allows multitask learning, in which AI learns multiple different tasks simultaneously by sharing parts of a model (Figure 3B). User‐defined losses (differences in predicted and actual values for each task) are minimized during the training process by an appropriate optimizer algorithm. Second, it allows multimodal learning, a method for integration of different types of data (such as sequence and chromatin accessibility) and their entry as inputs. In this process, AI automatically learns how to combine these different kinds of data. Given that cancer is a complex disease, it is preferable to integrate multilayered data. The application of AI to analysis of a large amount of “omics” (exome, transcriptome, and epigenome) data as well as data on the susceptibility of patients with acute myeloid leukemia to anticancer drugs resulted in the identification of drug‐susceptibility genes. Watson for Genomics also analyzed 323 patients to identify genomic alterations with potential clinical effects that were not recognized by the conventional Molecular Tumors panel.
Figure 3

Deep learning for cancer genomics. A, Sequence data are converted to a binary map (one‐hot encoding) and several filters are applied (1D‐CNN), resulting in the transformation of genomics data into numerical vectors. The remaining procedure is the same as for common tasks (updating of weights to minimize loss). B, A typical workflow receives one type of data and outputs prediction. In multitask learning, multiple types of prediction (such as clinical impact of a mutation and biological activity of a gene) are generated by the shared network and specified networks for each task. Conversely, in multimodal learning, the network integrates different types of information (such as sequence data and chromatin accessibility) and outputs prediction

Deep learning for cancer genomics. A, Sequence data are converted to a binary map (one‐hot encoding) and several filters are applied (1D‐CNN), resulting in the transformation of genomics data into numerical vectors. The remaining procedure is the same as for common tasks (updating of weights to minimize loss). B, A typical workflow receives one type of data and outputs prediction. In multitask learning, multiple types of prediction (such as clinical impact of a mutation and biological activity of a gene) are generated by the shared network and specified networks for each task. Conversely, in multimodal learning, the network integrates different types of information (such as sequence data and chromatin accessibility) and outputs prediction Deep learning is also applied to the variant calling process. In addition to the standard variant detection framework, Google’s DeepVariant exploits the Inception TensorFlow framework, which was initially developed for image classification. This workflow converts variant calling into an image recognition task by changing the BAM file into an image similar to the genome browser snapshot and determining variants based on likelihood. Another algorithm, ExPecto, links genetic mutations with disease prediction. ExPecto predicts the level of gene expression in each tissue on the basis of wide regulatory regions consisting of 40‐kb promoter‐proximal sequences. This framework was built with the use of all publicly available genome‐wide association studies and has been experimentally validated. Estimation of gene expression level by this approach might help to decipher the complex etiology of cancer on the basis of genome‐wide sequence data. More and more clinical cancer genomic data are gradually being accumulated. In 2017, the US FDA approved several genome sequence‐based panels related to oncology, including the Oncomine Dx Target Test, the Praxis Extended RAS Panel, MSK‐IMPACT and FoundationOne CDx. In Japan, the Ministry of Health, Labor, and Welfare set a goal in May 2019 to perform genome‐wide sequencing of 100 000 individuals over the next 3 years through the full‐scale introduction of genome‐wide sequencing to the clinic. AI is, thus, set to play a more important role in the interpretation of cancer genomic findings. Machine learning also has the potential to provide novel biological insights. As one example, a regulatory role for Fbxw7 (one of the most frequently mutated E3 ubiquitin ligases in cancer) in the oxidative metabolism of cancer cells was discovered with the use of a machine learning algorithm (kernelized Bayesian transfer learning). In combination with our previous studies showing that Fbxw7 plays an essential role in the maintenance of quiescence and stemness in cancer stem cells, , , this finding may provide important clues to uncovering the metabolic characteristics of cancer stem cells. AI can also precisely predict RNA splicing. Precursor mRNA transcripts undergo splicing to generate multiple mature mRNA isoforms, and prediction of splicing sites has been a major goal over the past decade. Early studies adopted a naive Bayesian model, but the advent of deep learning allowed the development of more complex models that have provided better predictive accuracy. SpliceAI, a 32‐layer deep CNN, predicts splicing from a given pre–mRNA sequence. Abnormal splicing has been frequently observed in cancer, and the splicing machinery is being targeted for therapeutic purposes. Further advances in this field might allow researchers to predict tissue‐specific and exon‐specific splicing patterns from genomic sequences.

ARTIFICIAL INTELLIGENCE AND PERSONALIZED MEDICINE

A key challenge in medical science is the precise classification of diseases and the development of optimal therapies, which would be expected to improve the outcome of many patients. Current “gold standard” approaches to disease classification in oncology include histological examination by expert pathologists and evaluation of the expression of molecular markers such as cell surface receptors at the protein or mRNA level. One example is the PAM50 classification, in which breast cancer is classified into several subtypes on the basis of the expression of marker genes. There is still considerable heterogeneity within these subtypes, however. Given the increasing amount of available molecular data, it may be possible to identify disease subtypes more comprehensively to predict future disease behavior and treatment response. We recently developed a tool, termed the molecular Prognostic Score (mPS), that is able to predict the prognosis of breast cancer patients precisely (Figure 4A). We integrated almost 6000 breast cancer patients by meta‐analysis, which is mainly applied in the field of epidemiology, and comprehensively identified 184 prognosis‐related genes for breast cancer without any biological information. We built a random forest classifier combined with a neural network and trained the model to predict overall survival with half of the METABRIC cohort. This scoring system stratified prognosis more accurately than existing methods in several independent test datasets. Unlike previous tools, , it can even be applied to estrogen receptor‐negative breast cancer patients (Figure 4B). In addition, the score provides useful information for avoidance of overtreatment (Figure 4C).
Figure 4

Prediction of prognosis of breast cancer patients. A, All human protein‐coding genes were evaluated using a series of methods (log‐rank test, meta‐analysis, machine learning with the random forest method and a neural network). The molecular Prognostic Score (mPS) based on 23 prognostic genes predicts the prognosis of breast cancer patients. B, mPS stratifies not only estrogen receptor (ER)‐positive but also ER‐negative patients, in contrast to existing methods such as MammaPrint and Oncotype. C, Chemotherapy may not be necessary for patients with a low mPS because their prognosis is fairly good

Prediction of prognosis of breast cancer patients. A, All human protein‐coding genes were evaluated using a series of methods (log‐rank test, meta‐analysis, machine learning with the random forest method and a neural network). The molecular Prognostic Score (mPS) based on 23 prognostic genes predicts the prognosis of breast cancer patients. B, mPS stratifies not only estrogen receptor (ER)‐positive but also ER‐negative patients, in contrast to existing methods such as MammaPrint and Oncotype. C, Chemotherapy may not be necessary for patients with a low mPS because their prognosis is fairly good Attempts have also been made to predict the risk of cancer recurrence from pathological images. A digital pathology test thus predicts the risk of recurrence in breast cancer patients with the use of stored samples from a completed clinical trial, ECOG 2197. This AI‐driven approach allows the stratification of patients with a hazard ratio of 2.41 (95% confidence interval, 1.21‐4.79). Artificial intelligence is a key driver of the transformation of health care to precision medicine. Numerous international competitions are held each year to further revolutionize the role of AI in health care (Table 2) and have yielded novel algorithms to tackle complicated tasks. Together with crowdsourcing, such open and innovative challenges will broaden further applications of AI to cancer research. On the basis of recent advances, the FDA has finally begun the approval of clinical medical devices based on deep learning (Table 2). In 2018, the cloud‐based Arterys imaging platform was approved by the FDA as a tool to help physicians track tumors on the basis of MRI and computed tomography scans of lung and liver cancer patients. In 2019, the digital pathology solution PAIGE.AI was designated as a Breakthrough Device by the FDA. Startups such as PAIGE.AI, Proscia, and PathAI use deep learning‐based AI algorithms to detect, diagnose and predict certain cancer types.
Table 2

Summary of open competitions, companies with FDA‐approved artificial intelligence devices, useful tools, and educational resources for beginners

TypeNameURL
Open challengeDream Challenges http://dreamchallenges.org/
Open challengeGrand Challenge https://grand-challenge.org/challenges/
Open challengePrecisionFDA Truth Challenge https://precision.fda.gov/challenges/truth
Open challengeCAGI https://genomeinterpretation.org/
CompanyArterys https://www.arterys.com/
CompanyPAIGE.AI https://paige.ai/
CompanyProscia https://proscia.com/
CompanyPathAI https://www.pathai.com/
LibraryScikit‐learn https://scikit-learn.org/stable/
LibraryTensorFlow https://www.tensorflow.org/
LibraryKeras https://keras.io/ja/
LibraryPyTorch https://pytorch.org/
LibraryDragoNN https://kundajelab.github.io/dragonn/
LibraryKipoi https://kipoi.org/
EducationDeepOncology https://github.com/deeponcology/PyTorchMedicalAI
Educationfast.ai https://www.fast.ai/
EducationMedical‐ai‐course‐materials https://japan-medical-ai.github.io/medical-ai-course-materials/
Summary of open competitions, companies with FDA‐approved artificial intelligence devices, useful tools, and educational resources for beginners

CONCLUSION

Digitization of health care is dramatically changing clinical workflow by providing both healthcare providers and patients with access to information based on big data. Experience‐based medicine is being replaced with an evidence‐based, patient‐centric approach. Rapidly evolving AI technology will continue to have a large impact on the field of cancer in the near future. Both physicians and researchers need to be ready for this coming revolutionary era. Medical education must, therefore, include not only life sciences and clinical medicine, but also advanced statistical and computational skills. Indeed, some medical schools have already begun to include AI education courses in the curriculum. Implementation of AI technology is becoming easier as a result of the availability of various open‐source tools and cloud computing. Some useful resources for beginners in this field are listed in Table 2. Deep learning has gained much attention in recent years, but several obstacles must be overcome before its application to health care and cancer research becomes more widespread. First, maximization of the power of deep learning will require the deposition of medical data with sufficient annotation in large‐scale databases. International collaborative projects (such as The Cancer Imaging Archive [http://www.cancerimagingarchive.net] and Genomic Data Commons Data Portal %5B http://broken-link/) that build large, labeled datasets should make a substantial contribution to meeting this challenge. A second obstacle is that deep learning is now a black box that does not explain the decision‐making process clearly. The large number of parameters involved makes it difficult to understand the details of how deep learning analyzes data and makes decisions. The development of a “white box” approach has become a major research topic in biomedical science. Such innovative approaches in this area are also likely to provide key insights in cancer research. Given how rapidly the field is evolving and the many potential applications of AI in cancer science, AI will revolutionize oncology in the coming decade.

CONFLICT OF INTERESTS

The authors declare no potential conflicts of interest.
  60 in total

Review 1.  The current state of clinical interpretation of sequence variants.

Authors:  Derick C Hoskinson; Adrian M Dubuc; Heather Mason-Suares
Journal:  Curr Opin Genet Dev       Date:  2017-01-31       Impact factor: 5.578

Review 2.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

3.  Compound-protein interaction prediction with end-to-end learning of neural networks for graphs and sequences.

Authors:  Masashi Tsubaki; Kentaro Tomii; Jun Sese
Journal:  Bioinformatics       Date:  2019-01-15       Impact factor: 6.937

4.  Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs.

Authors:  Varun Gulshan; Lily Peng; Marc Coram; Martin C Stumpe; Derek Wu; Arunachalam Narayanaswamy; Subhashini Venugopalan; Kasumi Widner; Tom Madams; Jorge Cuadros; Ramasamy Kim; Rajiv Raman; Philip C Nelson; Jessica L Mega; Dale R Webster
Journal:  JAMA       Date:  2016-12-13       Impact factor: 56.272

Review 5.  The rise of deep learning in drug discovery.

Authors:  Hongming Chen; Ola Engkvist; Yinhai Wang; Marcus Olivecrona; Thomas Blaschke
Journal:  Drug Discov Today       Date:  2018-01-31       Impact factor: 7.851

6.  Cancer statistics for adults aged 85 years and older, 2019.

Authors:  Carol E DeSantis; Kimberly D Miller; William Dale; Supriya G Mohile; Harvey J Cohen; Corinne R Leach; Ann Goding Sauer; Ahmedin Jemal; Rebecca L Siegel
Journal:  CA Cancer J Clin       Date:  2019-08-07       Impact factor: 508.702

7.  Automated Melanoma Recognition in Dermoscopy Images via Very Deep Residual Networks.

Authors:  Lequan Yu; Hao Chen; Qi Dou; Jing Qin; Pheng-Ann Heng
Journal:  IEEE Trans Med Imaging       Date:  2016-12-21       Impact factor: 10.048

8.  Big Data: Astronomical or Genomical?

Authors:  Zachary D Stephens; Skylar Y Lee; Faraz Faghri; Roy H Campbell; Chengxiang Zhai; Miles J Efron; Ravishankar Iyer; Michael C Schatz; Saurabh Sinha; Gene E Robinson
Journal:  PLoS Biol       Date:  2015-07-07       Impact factor: 8.029

Review 9.  To wake up cancer stem cells, or to let them sleep, that is the question.

Authors:  Shoichiro Takeishi; Keiichi I Nakayama
Journal:  Cancer Sci       Date:  2016-06-20       Impact factor: 6.716

10.  Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk.

Authors:  Jian Zhou; Chandra L Theesfeld; Kevin Yao; Kathleen M Chen; Aaron K Wong; Olga G Troyanskaya
Journal:  Nat Genet       Date:  2018-07-16       Impact factor: 38.330

View more
  24 in total

Review 1.  Can artificial intelligence overtake human intelligence on the bumpy road towards glioma therapy?

Authors:  Precilla S Daisy; T S Anitha
Journal:  Med Oncol       Date:  2021-04-03       Impact factor: 3.064

Review 2.  Artificial Intelligence: Review of Current and Future Applications in Medicine.

Authors:  L Brannon Thomas; Stephen M Mastorides; Narayan A Viswanadhan; Colleen E Jakey; Andrew A Borkowski
Journal:  Fed Pract       Date:  2021-11

3.  Artificial intelligence opportunities in cardio-oncology: Overview with spotlight on electrocardiography.

Authors:  Daniel Sierra-Lara Martinez; Peter A Noseworthy; Oguz Akbilgic; Joerg Herrmann; Kathryn J Ruddy; Abdulaziz Hamid; Ragasnehith Maddula; Ashima Singh; Robert Davis; Fatma Gunturkun; John L Jefferies; Sherry-Ann Brown
Journal:  Am Heart J Plus       Date:  2022-04-01

Review 4.  Artificial intelligence in oncology.

Authors:  Hideyuki Shimizu; Keiichi I Nakayama
Journal:  Cancer Sci       Date:  2020-03-21       Impact factor: 6.716

5.  A machine learning-based predictor for the identification of the recurrence of patients with gastric cancer after operation.

Authors:  Chengmao Zhou; Junhong Hu; Ying Wang; Mu-Huo Ji; Jianhua Tong; Jian-Jun Yang; Hongping Xia
Journal:  Sci Rep       Date:  2021-01-15       Impact factor: 4.379

Review 6.  Machine Learning and Radiomics Applications in Esophageal Cancers Using Non-Invasive Imaging Methods-A Critical Review of Literature.

Authors:  Chen-Yi Xie; Chun-Lap Pang; Benjamin Chan; Emily Yuen-Yuen Wong; Qi Dou; Varut Vardhanabhuti
Journal:  Cancers (Basel)       Date:  2021-05-19       Impact factor: 6.639

7.  A pharmacometrician's role in enhancing medication use in pregnancy and lactation.

Authors:  Sara K Quinney; Peter L Bonate
Journal:  J Pharmacokinet Pharmacodyn       Date:  2020-08       Impact factor: 2.745

Review 8.  Artificial Intelligence in Health Care: Current Applications and Issues.

Authors:  Chan Woo Park; Sung Wook Seo; Noeul Kang; BeomSeok Ko; Byung Wook Choi; Chang Min Park; Dong Kyung Chang; Hwiyoung Kim; Hyunchul Kim; Hyunna Lee; Jinhee Jang; Jong Chul Ye; Jong Hong Jeon; Joon Beom Seo; Kwang Joon Kim; Kyu Hwan Jung; Namkug Kim; Seungwook Paek; Soo Yong Shin; Soyoung Yoo; Yoon Sup Choi; Youngjun Kim; Hyung Jin Yoon
Journal:  J Korean Med Sci       Date:  2020-11-02       Impact factor: 2.153

9.  Machine and Deep Learning Based Radiomics Models for Preoperative Prediction of Benign and Malignant Sacral Tumors.

Authors:  Ping Yin; Ning Mao; Hao Chen; Chao Sun; Sicong Wang; Xia Liu; Nan Hong
Journal:  Front Oncol       Date:  2020-10-16       Impact factor: 6.244

10.  Noncoding RNAs and Deep Learning Neural Network Discriminate Multi-Cancer Types.

Authors:  Anyou Wang; Rong Hai; Paul J Rider; Qianchuan He
Journal:  Cancers (Basel)       Date:  2022-01-12       Impact factor: 6.575

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.