| Literature DB >> 34026829 |
Sandra Brasil1,2, Cátia José Neves1,2, Tatiana Rijoff1,2, Marta Falcão3, Gonçalo Valadão4,5,6, Paula A Videira1,2,3, Vanessa Dos Reis Ferreira1,2.
Abstract
More than 7,000 rare diseases (RDs) exist worldwide, affecting approximately 350 million people, out of which only 5% have treatment. The development of novel genome sequencing techniques has accelerated the discovery and diagnosis in RDs. However, most patients remain undiagnosed. Epigenetics has emerged as a promise for diagnosis and therapies in common disorders (e.g., cancer) with several epimarkers and epidrugs already approved and used in clinical practice. Hence, it may also become an opportunity to uncover new disease mechanisms and therapeutic targets in RDs. In this "big data" age, the amount of information generated, collected, and managed in (bio)medicine is increasing, leading to the need for its rapid and efficient collection, analysis, and characterization. Artificial intelligence (AI), particularly deep learning, is already being successfully applied to analyze genomic information in basic research, diagnosis, and drug discovery and is gaining momentum in the epigenetic field. The application of deep learning to epigenomic studies in RDs could significantly boost discovery and therapy development. This review aims to collect and summarize the application of AI tools in the epigenomic field of RDs. The lower number of studies found, specific for RDs, indicate that this is a field open to expansion, following the results obtained for other more common disorders.Entities:
Keywords: artificial intelligence; epigenetics; epigenomic; machine learning; personalized medicine; rare diseases (RD)
Year: 2021 PMID: 34026829 PMCID: PMC8131862 DOI: 10.3389/fmolb.2021.648012
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1Compilation of the information obtained in this review regarding the (A) number of articles published from 2013 to 2020, (B) the medical areas covered, (C) the number of publications per country, highlighting research collaboration and (D) percentage of unsupervised and supervised AI tools reported.
List of available AI and ML-based tools used for epigenetic studies in RDs.
| Annotates and prioritizes non-coding regulatory variants | FunSeq2 | Scoring scheme, using conservation, regulatory, and other measures | Medulloblastoma | Supervised/Unsupervised | |
| Discover variants associated to specific Mendelian disorders | Genomiser | ReMM framework/RF classifier | Beckwith-Wiedemann syndrome (ORPHA:116), beta thalassemia (ORPHA:848), Marie Unna hereditary hypotrichosis (ORPHA:444) | Supervised | |
| Causal variant analysis and identification | PICS | Bayesian approaches | Immune disorders | Supervised | |
| Predict the effect of regulatory variation | Delta SVM | SVM classifier | Blood cell traits | Supervised | |
| Genes and gene sets prediction | GeneMANIA | Fast heuristic algorithm derived from ridge regression | RVF | Supervised/Unsupervised | |
| miRNA target prediction and functional annotation | miRDB | MirTarget | |||
| Detect statistically significant interaction events in Capture HiC data | CHiCAGO ( | Convolution background model | Waldenstrom macroglobulinemia | Supervised | |
| Identifies the precise location of active TREs | dREG.HD | Epsilon SVR with a Gaussian kernel | Human glioblastoma | Supervised | |
| Genotype/phenotype data analysis | PLINK ( | Linear regression model | EOC, sMTC and PTC, leukemia | Supervised | |
| miRNA-disease associations | NBMDA | Gaussian interaction profile kernel similarity/KNN | Esophageal, breast, and colon neoplasms | Supervised | |
| Learning and characterization of chromatin states | ChromHMM | HMM | CRC | Supervised | |
| Analysis of high-throughput sequencing data (ChIP-seq, RNA-seq, MNase-seq) | DeepTools | AML | Unsupervised |