| Literature DB >> 36009026 |
Peng-Chan Lin1,2, Yi-Shan Tsai3, Yu-Min Yeh1, Meng-Ru Shen4,5,6.
Abstract
To provide precision medicine for better cancer care, researchers must work on clinical patient data, such as electronic medical records, physiological measurements, biochemistry, computerized tomography scans, digital pathology, and the genetic landscape of cancer tissue. To interpret big biodata in cancer genomics, an operational flow based on artificial intelligence (AI) models and medical management platforms with high-performance computing must be set up for precision cancer genomics in clinical practice. To work in the fast-evolving fields of patient care, clinical diagnostics, and therapeutic services, clinicians must understand the fundamentals of the AI tool approach. Therefore, the present article covers the following four themes: (i) computational prediction of pathogenic variants of cancer susceptibility genes; (ii) AI model for mutational analysis; (iii) single-cell genomics and computational biology; (iv) text mining for identifying gene targets in cancer; and (v) the NVIDIA graphics processing units, DRAGEN field programmable gate arrays systems and AI medical cloud platforms in clinical next-generation sequencing laboratories. Based on AI medical platforms and visualization, large amounts of clinical biodata can be rapidly copied and understood using an AI pipeline. The use of innovative AI technologies can deliver more accurate and rapid cancer therapy targets.Entities:
Keywords: artificial intelligence; bioinformatics; cancer genomics; high-performance computing; next-generation sequencing; precision medicine
Mesh:
Year: 2022 PMID: 36009026 PMCID: PMC9405970 DOI: 10.3390/biom12081133
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Clinical practice for precision cancer genomics and artificial intelligence-powered bioinformatic technologies. Artificial intelligence techniques, software platforms, and high-performance computation have been used extensively to provide improved cancer care via clinical patient data, such as electronic medical records, physiological measurements, biochemistry, computerized tomography scans, digital pathology, and the genetic landscape of cancer tissue.
AI-based prediction models for the pathogenicity of genetic variants.
| Methods | Categorical Prediction | Algorithms | Author |
|---|---|---|---|
|
| |||
| VEST4 | Higher scores are more deleterious | RF | Carter et al., 2013 [ |
| MetaSVM | Higher scores are more deleterious | Radial kernel SVM | Dong et al., 2015 [ |
| REVEL | Higher scores are more deleterious | Ensemble methods/RF | Ioannidis et al., 2016 [ |
| Primate AI | Higher scores are more deleterious | Convolutional neural network | Sundaram et al., 2018 [ |
| CADD | Higher scores are more deleterious | Linear kernel SVM | Rentzsch et al., 2019 [ |
| Splice AI | Higher scores are more deleterious | Deep neural network | Jaganathanet al., 2019 [ |
| 3Cnet | Higher scores are more deleterious | Recurrent neural network | Won et al., 2021 [ |
| CoLaSp | Higher scores are more deleterious | Latent space matrix | Abdollahi et al., 2021 [ |
| MVP | Higher scores are more deleterious | ResNets | Qi et al., 2021 [ |
| VARITY | P: Pathogenicity; B: Benign | XGBoost | Wu et al., 2021 [ |
| EvoDiagnostics | P: Pathogenicity; B: Benign | RF | Labes et al., 2022 [ |
|
| |||
| PROVEAN | D: Deleterious; N: Neutral | Delta alignment score | Choi et al., 2015 [ |
| ProtVec | P: Pathogenicity; B: Benign | NLP/SVM | Asgari et al., 2015 [ |
| BioSeq-Analysis 2.0 | P: Pathogenicity; B: Benign | RF/SVM | Liu et al., 2019 [ |
| Rhapsody | Pathogenicity probability | RF | Ponzoni et al., 2020 [ |
| LYRUS | P: Pathogenicity; B: Benign | XGBoost | Lai et al., 2021 [ |
| LightGBM | P: Pathogenicity; B: Benign | LightGBM | Wu et al., 2021 [ |
|
| |||
| Modelling ACMG | P: Pathogenicity; B: Benign | Bayesian classification | Tavtigian et al., 2018 [ |
| CharGer | Higher scores are more deleterious | Databases and criteria-based | Scott et al., 2019 [ |
| VarSome | P: Pathogenicity; B: Benign | Databases and criteria-based | Kopanos et al., 2019 [ |
| Clinvitae | P: Pathogenicity; B: Benign | Penalized logistic regression | Nicora et al., 2022 [ |
Notes: RF: random forest; SVM: support vector machine; and NLP: natural language processing.
AI tools in bioinformatics for mutational analysis.
| Methods | DATA | Algorithms | Author |
|---|---|---|---|
|
| |||
| COSMIC Signatures | SNV/indels | Non-negative matrix factorization | Alexandrov et al., 2020 [ |
| DeconstructSigs | SNV/indels | Multiple linear regression model | Rosenthal et al., 2016 [ |
| DeaminationSigs | SNV/indels | Non-negative matrix factorization | Bhagwate et al., 2019 [ |
| SparseSignatures | SNV | Non-negative matrix factorization | Lal et al., 2021 [ |
| Musicatk | SNV | Non-negative matrix factorization/LDA | Chevalier et al., 2021 [ |
|
| |||
| LICHeE | SNV/CNV | Directed acyclic graph | Popic et al., 2015 [ |
| SCHISM | SNV/CNV | Directed acyclic graph | Niknafs et al., 2015 [ |
| Canopy | SNV/CNV | Bayesian mixture models | Jiang et al., 2016 [ |
| ClonEvol | SNV/CNV | Bootstrap resampling | Dang et al., 2017 [ |
| PACTION | SNV/CNV | Mixed integer linear programming | Sashittal et al., 2022 [ |
| DeCiFering | SNV | Descendant cell fraction | Satas et al., 2022 [ |
Notes: ACMG, American College of Medical Genetics and Genomics; SNV, single-nucleotide variant; CNV, copy number variations; and LDA, latent Dirichlet allocation.
Single-cell genomics and computational biology.
| Methods | Goal | Algorithms | Author |
|---|---|---|---|
|
| |||
| MOFA+ | Sparse data | Stochastic version of the algorithm | Argelaguet et al., 2020 [ |
| sCCA | Sparse data | Sparse canonical correlation analysis (CCA) | Rodosthenous et al., 2020 [ |
| Unicom | Distance matrix | Unsupervised topological alignment | Cao et al., 2020 [ |
| sGCN | High-dimensional data | Graph convolutional networks | Song et al., 2020 [ |
| PIntMF | Sparse data | Penalized integrative matrix factorization | Pierre-Jean et al., 2021 [ |
|
| |||
| ACTINN | Immune cell | Neural Networks | Ma et al., 2020 [ |
| Ikarus | Tumor cell | Logistic regression/network propagation | Dohmen et al., 2022 [ |
|
| |||
| CellRouter | Tree methods | Context likelihood of relatedness | Lummertz et al., 2018 [ |
| STREAM | Graph methods | Gaussian process latent variable model | Chen et al., 2019 [ |
| TinGA | Graph methods | Growing neural graph algorithm | Todorov et al., 2020 [ |
| ELPIgraphy | Cyclic methods | Elastic energy functional and topological graph | Albergante et al., 2020 [ |
| CStreet | Graph methods | k-nearest neighbors graph | Zhao et al., 2021 [ |
| Tree methods | Euclidean minimum spanning tree | Tenha et al., 2022 [ | |
Text-mining model for cancer-associated genes.
| Relationships | Name | Algorithms | Author |
|---|---|---|---|
| Mutation–Gene | MuGeX | Naïve Bayes/Rocchio algorithm-TF-IDF | Erdogmus et al., 2007 [ |
| Disease–Mutation | C4.5 decision tree | Singhal et al., 2016 [ | |
| Protein–Mutation | EnzyMiner | Probabilistic indexing | Yeniterzi et al., 2009 [ |
| Variants–Literature | tmVar 2.0 | Conditional random fields | Wei et al., 2018 [ |
| Variant–Disease–Gene | MAGPEL | Sentence co-occurrence scoring | Saberian et al., 2020 [ |
| Cancer–Genes | Hypergeometric test | Chen et al., 2021 [ |
Notes: TF: term frequency; and IDF: inverse document frequency.
High-performance computing systems for cancer genome research.
| Name | Computing System | Clinical Practice | Author |
|---|---|---|---|
| NVIDIA | GPUs | Mutational signature | Haradhvala et al., 2018 [ |
| Critical care | Gorzynski et al., 2022 [ | ||
| DRAGEN | FPGAs | TSO500 FFPE pipeline | Wei et al., 2022 [ |
| TSO500 ctDNA pipeline | Pommergaard et al., 2022 [ |
Notes: GPUs: graphics processing units; FPGAs: field-programmable gate arrays; and TSO 500: TruSight Oncology 500 assay.