Literature DB >> 31871830

Opportunities for Artificial Intelligence in Advancing Precision Medicine.

Fabian V Filipp1,2.   

Abstract

PURPOSE OF REVIEW: We critically evaluate the future potential of machine learning (ML), deep learning (DL), and artificial intelligence (AI) in precision medicine. The goal of this work is to show progress in ML in digital health, to exemplify future needs and trends, and to identify any essential prerequisites of AI and ML for precision health. RECENT
FINDINGS: High-throughput technologies are delivering growing volumes of biomedical data, such as large-scale genome-wide sequencing assays; libraries of medical images; or drug perturbation screens of healthy, developing, and diseased tissue. Multi-omics data in biomedicine is deep and complex, offering an opportunity for data-driven insights and automated disease classification. Learning from these data will open our understanding and definition of healthy baselines and disease signatures. State-of-the-art applications of deep neural networks include digital image recognition, single-cell clustering, and virtual drug screens, demonstrating breadths and power of ML in biomedicine.
SUMMARY: Significantly, AI and systems biology have embraced big data challenges and may enable novel biotechnology-derived therapies to facilitate the implementation of precision medicine approaches.

Entities:  

Keywords:  AI; DL; DNN; Deep learning; Digital health; Digital pathology; ML; Machine learning; Multi-omics; Precision medicine; Single-cell transcriptomics; Spatial transcriptomics; Systems biology

Year:  2019        PMID: 31871830      PMCID: PMC6927552          DOI: 10.1007/s40142-019-00177-4

Source DB:  PubMed          Journal:  Curr Genet Med Rep        ISSN: 2167-4876


Introduction

In the past decade, advances in genetic disease and precision oncology have resulted in an increased demand for predictive assays that enable the selection and stratification of patients for treatment [1]. The enormous divergence of signaling and transcriptional networks mediating the cross talk between healthy, diseased, stromal, and immune cells complicates the development of functionally relevant biomarkers based on a single gene or protein. Unexpectedly, the conclusion of the human genome did not translate into a burst of new drugs. The pharmaceutical industry rather announced a declining output in terms of the number of new drugs approved despite increasing commercial efforts of drug research and development [2, 3]. In contrast, machine learning (ML) as well as network and systems biology are innovating with impactful discoveries and are now starting to be seamlessly integrated into the biomedical discovery pipeline [4]. A major ambition of medical artificial intelligence (AI) lies in translating patient data to successful therapies. Machine learning models face particular challenges in biomedicine such as the size of the library to train the model, data input conversion problems, transfer, overfitting, ignorance of confounders, and many more [5-7]. They may require new infrastructures, while making possibly just recently established workflows obsolete. On the other hand, deep neural network (DNN) approaches may offer distinct benefits. Such opportunities for deep learning (DL) in biomedicine include scalability, handling of extreme data heterogeneity, and the ability to transfer learning [8], or if wanted even the possibility not to depend on data supervision at all [9]. The goal of this work is to show progress in ML in digital health and exemplify needs, trends, and requirements for AI and ML for precision medicine. Digital image recognition, single-cell analysis, and virtual screens demonstrate breadths and power of ML in biomedicine (Fig. 1).
Fig. 1

Machine learning applications using big data in precision health

Enabling Synergies Between Artificial Intelligence and Digital Pathology

Advances in pattern recognition and image processing have enabled synergies between AI technology and modern pathology [10, 11•]. In particular, DL architectures such as deep convolutional neural networks have achieved unprecedented performance in image classification and gaming tasks [13-16]. The expression “digital pathology” was coined when referring to advanced slide-scanning techniques in combination with AI-based approaches for the detection, segmentation, scoring, and diagnosis of digitized whole-slide images [17]. In pathology, quantifying and standardizing clinical outcome remains a challenge. Accurate grading, staging, classifying, and quantifying response to treatment by computer-assisted technologies are important recent initiatives [12, 18]. Neural network algorithms perform well in a setting where either large amounts of input data or high-quality training sets are provided. Using a digital archive of more than 100,000 clinical images of skin disease such prerequisites were fulfilled and a deep convolutional neural network was successfully trained to classify skin lesions comparable with current quality standards in pathology [19]. Given such an intuitive image-based analysis, a mechanistic understanding of the convoluted layers is not necessary and the approach could be transferred to patient-based mobile phone platforms to enhance early detection and cancer prevention [20-22]. In the future, specific DNN modules will replace selected steps of the traditional pathology workflow. By looking at different computational image-recognition tasks, already today, particularly strong performance of DL is already observed in segmentation tasks nuclei, epithelia or tubules, immune infiltration by lymphocyte classification, cell cycle characterization and mitosis quantification, and grading of tumors. Over time, the transition toward the digital pathology lab will lead to more accurate drug response prediction and prognosis of this underlying disease [23].

Digital Healthcare and Clinical Health Records

ML can learn from almost any data type, even unstructured medical text, such as patient records, medical notes, prescriptions, audio interview transcripts, or pathology and radiology reports. Future day-to-day applications will embrace ML methods to organize a growing volume of scientific literature, facilitating access and extraction of meaningful knowledge content from it [24]. In the clinic, ML can harness the potential of electronic health records to accurately predict medical events [25]. By implementing a ranking function in the content network, one can overcome heterogeneity of clinical or healthcare provider–specific electronic health records, inherent to the current medical practice around the world [26].

Multi-omics Integration

A defined goal of precision medicine is to predict the best treatment strategy for the patient. Drug responses in combination with genomic, epigenomic, transcriptomic, proteomic, metabolomic profiling data provide accurate network prediction to the perturbation. Using multi-omics data, including somatic copy number alterations, somatic exome mutations, methylomes, and transcriptomes of 1000 cell lines, ML can be utilized in a modeling exercise to predict genomic features for process and drug response prediction [27]. Top-performing methods exploit ML, integrate multiple profiling data sets, and enhance scoring by regression models to predict drug sensitivities [28-30]. Given convolution and non-linear relationship between transcriptomic, epigenomic, and metabolic functions, future ML applications can be challenged to resolve intricate multi-omics patterns [31]. Precision oncology has been showcased by implementing patient-derived cancer cell lines [32]. Such bench-to-bedside models can provide real-time drug response predictions and often create massive knowledge banks accessible to ML workup. In the future, the ability to screen patient-derived avatars will inform about resistance mechanisms and facilitate evidence-based medicine, even of complex traits [33].

Machine Detection of Resistance Signatures

Somatic alterations in cancer frequently escape the recognition by the endogenous immune system, creating resistance [34]. Even though excellent efficacy and some complete remissions have been seen in a limited number of melanoma patients, some of whom may be regarded as cured of cancer, many malignancies show resistance or lack of response of long duration with these agents. Predicting tumor responses to immune checkpoint blockade remains a major challenge and an active field of research fueled by systems biology and AI approaches [18].

Deciphering Epigenomic Networks

Epigenomics of oncogenic networks has an ability to accurately predict regulome function, epigenomic-transcriptomic cooperation, and disease progression [35]. Then again, epigenetic modifications on chromatin, DNA, and RNA are complex and often context-specific, making their mechanistic understanding challenging. Elastic net is a shrinkage method hybrid of ridge and lasso regularization (preventing overfitting) able to handle ultra-high dimensional regression and suitable for epigenomic data [36]. Using such methods, metabolic and epigenomic data have been used to establish biomarkers and to predict clocks in aging [37, 38]. Enhanced by ML methods, epigenetic marks including promoter methylation are utilized as a continuous readout of transcriptional accessibility and molecular processes that guide development, tissue maintenance, disease states, and eventually aging. Given progress in multiplex barcoding, new data challenges in the field of epigenomics are quickly at hand. Frontiers include processing and machine integration of sequencing and chromatin accessibility information derived from the transcriptome and epigenome of the same cell [39•].

Visualizing and Exploring Cellular Heterogeneity at Single-Cell Resolution

In single-cell biology, ML and DL are frequently utilized to investigate the diversity and complexity of cell populations. In cancer, single-cell methods provide a view of heterogeneity that recognizes the impact of diverse cell states and types surrounding the tumor microenvironment. Further, cancer is a dynamic and highly heterogeneous disease composed of a mix of clones characterized by distinct genotypes pushing bulk sequencing methods to their limits. Profiling of copy numbers, transcripts, or chromatin accessibility together with cluster analysis can uncover differences, even in seemingly homogenous tissues and resolve subclonal complexity. Dimensionality reduction and clustering are typical ML techniques employed to visualize single-cell transcriptomics (scRNA-Seq) data. In particular, the clustering algorithm Louvain community detection is robust for high-dimensional data like scRNA-Seq matrices. The human cell atlas [40], whose primary goal is to establish, discover, and catalogue different cell populations ab initio, creates unsupervised maps, serving as a resource for subsequent disease-directed studies. In addition, it is possible to predict cycle, disease progression, and perturbation responses using deep network approaches [41•, 42•, 43–45]. Spatial transcriptomics (spRNA-Seq) combines the benefits of traditional histopathology with single cell gene expression profiling. The ability to connect the spatial organization of molecules in cells and tissues with their gene expression state enables mapping of specific disease pathology [46, 47]. ML has the ability to decode molecular proximities from sequencing information and construct images of gene transcripts at sub-cellular resolution [48].

Artificial Intelligence in Chemical Informatics and Drug Discovery

Chemical informatics has an ability to predict novel drug targets, quantify ADME and toxicology, match drugs with targets and biological activities, model physicochemical properties, accelerate data mining, predict biological targets for compounds on a large scale, design new chemicals and syntheses [49], and analyze large virtual chemical spaces [50]. Such a new paradigm enables medicinal chemists to process billions of molecules in virtual screens [51, 52]. By tightly integrating database knowledge, AI, and lab automation, it is possible to accelerate the drug discovery pipeline and select structures that can be prepared on automated systems and made available for biological testing, allowing for timely hypothesis testing and validation. Computational analyses of drug-perturbation assays have the ability to predict the activities of the compounds on seemingly unrelated biological processes [53]. ML can provide insight into drug mechanism, create correlative bridges between disjoint nodes, establish biomarkers, repurpose existing drugs, optimize drug candidates, design clinical trials, and even recruit for clinical trials. Image-based drug fingerprints were demonstrated to enable biological activity prediction for drug discovery, even when a chemical library in combination with high-content image screening was repurposed. Potential applications of predictions delivered by implemented computational models were far beyond the intended target of the original compound screen [54•].

Conclusion

Biomedical science of genomic signatures, image processing, and drug discovery rapidly adopted big data opportunities and new learning-based technologies. From traditional approaches relying on leads from nature to brute-force screening using robotics, following the introduction of several other disruptive technologies, artificial intelligence is yet another pivotal moment toward a rationalized, data-driven process in healthcare and pharmaceutical industry. Machine intelligence and deep networks are changing our approach to medical bioinformatics at an unprecedented speed. As a result, the decision-making processes in precision medicine will shift from an algorithm-centric to a data-centric insight.
  50 in total

Review 1.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

2.  Artificial intelligence: Learning to see and act.

Authors:  Bernhard Schölkopf
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

3.  Unsupervised word embeddings capture latent knowledge from materials science literature.

Authors:  Vahe Tshitoyan; John Dagdelen; Leigh Weston; Alexander Dunn; Ziqin Rong; Olga Kononova; Kristin A Persson; Gerbrand Ceder; Anubhav Jain
Journal:  Nature       Date:  2019-07-03       Impact factor: 49.962

4.  Three pitfalls to avoid in machine learning.

Authors:  Patrick Riley
Journal:  Nature       Date:  2019-08       Impact factor: 49.962

5.  Pharmacogenomic landscape of patient-derived tumor cells informs precision oncology therapy.

Authors:  Jin-Ku Lee; Zhaoqi Liu; Jason K Sa; Sang Shin; Jiguang Wang; Mykola Bordyuh; Hee Jin Cho; Oliver Elliott; Timothy Chu; Seung Won Choi; Daniel I S Rosenbloom; In-Hee Lee; Yong Jae Shin; Hyun Ju Kang; Donggeon Kim; Sun Young Kim; Moon-Hee Sim; Jusun Kim; Taehyang Lee; Yun Jee Seo; Hyemi Shin; Mijeong Lee; Sung Heon Kim; Yong-Jun Kwon; Jeong-Woo Oh; Minsuk Song; Misuk Kim; Doo-Sik Kong; Jung Won Choi; Ho Jun Seol; Jung-Il Lee; Seung Tae Kim; Joon Oh Park; Kyoung-Mee Kim; Sang-Yong Song; Jeong-Won Lee; Hee-Cheol Kim; Jeong Eon Lee; Min Gew Choi; Sung Wook Seo; Young Mog Shim; Jae Ill Zo; Byong Chang Jeong; Yeup Yoon; Gyu Ha Ryu; Nayoung K D Kim; Joon Seol Bae; Woong-Yang Park; Jeongwu Lee; Roel G W Verhaak; Antonio Iavarone; Jeeyun Lee; Raul Rabadan; Do-Hyun Nam
Journal:  Nat Genet       Date:  2018-09-27       Impact factor: 38.330

Review 6.  Applications of machine learning in drug discovery and development.

Authors:  Jessica Vamathevan; Dominic Clark; Paul Czodrowski; Ian Dunham; Edgardo Ferran; George Lee; Bin Li; Anant Madabhushi; Parantu Shah; Michaela Spitzer; Shanrong Zhao
Journal:  Nat Rev Drug Discov       Date:  2019-06       Impact factor: 84.694

Review 7.  DNA methylation-based biomarkers and the epigenetic clock theory of ageing.

Authors:  Steve Horvath; Kenneth Raj
Journal:  Nat Rev Genet       Date:  2018-06       Impact factor: 53.242

8.  Exploiting machine learning for end-to-end drug discovery and development.

Authors:  Sean Ekins; Ana C Puhl; Kimberley M Zorn; Thomas R Lane; Daniel P Russo; Jennifer J Klein; Anthony J Hickey; Alex M Clark
Journal:  Nat Mater       Date:  2019-04-18       Impact factor: 43.841

9.  Integrating biomedical research and electronic health records to create knowledge-based biologically meaningful machine-readable embeddings.

Authors:  Charlotte A Nelson; Atul J Butte; Sergio E Baranzini
Journal:  Nat Commun       Date:  2019-07-10       Impact factor: 14.919

10.  Discrete Changes in Glucose Metabolism Define Aging.

Authors:  Silvia Ravera; Marina Podestà; Federica Sabatini; Monica Dagnino; Daniela Cilloni; Samuele Fiorini; Annalisa Barla; Francesco Frassoni
Journal:  Sci Rep       Date:  2019-07-17       Impact factor: 4.379

View more
  10 in total

1.  Meeting Report: 68th Montagna Symposium on the Biology of Skin "Decoding Complex Skin Diseases: Integrating Genetics, Genomics, and Disease Biology".

Authors:  Johann E Gudjonsson; James T Elder
Journal:  J Invest Dermatol       Date:  2020-06-27       Impact factor: 8.551

2.  Cell-type modeling in spatial transcriptomics data elucidates spatially variable colocalization and communication between cell-types in mouse brain.

Authors:  Francisco Jose Grisanti Canozo; Zhen Zuo; James F Martin; Md Abul Hassan Samee
Journal:  Cell Syst       Date:  2021-10-08       Impact factor: 10.304

Review 3.  A critical review of datasets and computational suites for improving cancer theranostics and biomarker discovery.

Authors:  Gayathri Ashok; Sudha Ramaiah
Journal:  Med Oncol       Date:  2022-09-29       Impact factor: 3.738

4.  Artificial Intelligence-Aided Precision Medicine for COVID-19: Strategic Areas of Research and Development.

Authors:  Enrico Santus; Nicola Marino; Davide Cirillo; Emmanuele Chersoni; Arnau Montagud; Antonella Santuccione Chadha; Alfonso Valencia; Kevin Hughes; Charlotta Lindvall
Journal:  J Med Internet Res       Date:  2021-03-12       Impact factor: 5.428

Review 5.  Artificial intelligence in cancer research: learning at different levels of data granularity.

Authors:  Davide Cirillo; Iker Núñez-Carpintero; Alfonso Valencia
Journal:  Mol Oncol       Date:  2021-02-20       Impact factor: 6.603

6.  Human-interpretable image features derived from densely mapped cancer pathology slides predict diverse molecular phenotypes.

Authors:  James A Diao; Jason K Wang; Wan Fung Chui; Andrew H Beck; Hunter L Elliott; Amaro Taylor-Weiner; Victoria Mountain; Sai Chowdary Gullapally; Ramprakash Srinivasan; Richard N Mitchell; Benjamin Glass; Sara Hoffman; Sudha K Rao; Chirag Maheshwari; Abhik Lahiri; Aaditya Prakash; Ryan McLoughlin; Jennifer K Kerner; Murray B Resnick; Michael C Montalto; Aditya Khosla; Ilan N Wapinski
Journal:  Nat Commun       Date:  2021-03-12       Impact factor: 14.919

7.  Isabl Platform, a digital biobank for processing multimodal patient data.

Authors:  Juan S Medina-Martínez; Juan E Arango-Ossa; Max F Levine; Yangyu Zhou; Gunes Gundem; Andrew L Kung; Elli Papaemmanuil
Journal:  BMC Bioinformatics       Date:  2020-11-30       Impact factor: 3.169

Review 8.  Translational precision medicine: an industry perspective.

Authors:  Dominik Hartl; Valeria de Luca; Anna Kostikova; Jason Laramie; Scott Kennedy; Enrico Ferrero; Richard Siegel; Martin Fink; Sohail Ahmed; John Millholland; Alexander Schuhmacher; Markus Hinder; Luca Piali; Adrian Roth
Journal:  J Transl Med       Date:  2021-06-05       Impact factor: 5.531

9.  scGCN is a graph convolutional networks algorithm for knowledge transfer in single cell omics.

Authors:  Qianqian Song; Jing Su; Wei Zhang
Journal:  Nat Commun       Date:  2021-06-22       Impact factor: 14.919

Review 10.  Application of Big Data and Artificial Intelligence in COVID-19 Prevention, Diagnosis, Treatment and Management Decisions in China.

Authors:  Jiancheng Dong; Huiqun Wu; Dong Zhou; Kaixiang Li; Yuanpeng Zhang; Hanzhen Ji; Zhuang Tong; Shuai Lou; Zhangsuo Liu
Journal:  J Med Syst       Date:  2021-07-24       Impact factor: 4.460

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.