| Literature DB >> 31905969 |
Ryuji Hamamoto1,2, Masaaki Komatsu1,2, Ken Takasawa1,2, Ken Asada1,2, Syuzo Kaneko1.
Abstract
To clarify the mechanisms of diseases, such as cancer, studies analyzing genetic mutations have been actively conducted for a long time, and a large number of achievements have already been reported. Indeed, genomic medicine is considered the core discipline of precision medicine, and currently, the clinical application of cutting-edge genomic medicine aimed at improving the prevention, diagnosis and treatment of a wide range of diseases is promoted. However, although the Human Genome Project was completed in 2003 and large-scale genetic analyses have since been accomplished worldwide with the development of next-generation sequencing (NGS), explaining the mechanism of disease onset only using genetic variation has been recognized as difficult. Meanwhile, the importance of epigenetics, which describes inheritance by mechanisms other than the genomic DNA sequence, has recently attracted attention, and, in particular, many studies have reported the involvement of epigenetic deregulation in human cancer. So far, given that genetic and epigenetic studies tend to be accomplished independently, physiological relationships between genetics and epigenetics in diseases remain almost unknown. Since this situation may be a disadvantage to developing precision medicine, the integrated understanding of genetic variation and epigenetic deregulation appears to be now critical. Importantly, the current progress of artificial intelligence (AI) technologies, such as machine learning and deep learning, is remarkable and enables multimodal analyses of big omics data. In this regard, it is important to develop a platform that can conduct multimodal analysis of medical big data using AI as this may accelerate the realization of precision medicine. In this review, we discuss the importance of genome-wide epigenetic and multiomics analyses using AI in the era of precision medicine.Entities:
Keywords: DNA methylation; deep learning; epigenetics; histone modifications; machine learning; precision medicine
Mesh:
Substances:
Year: 2019 PMID: 31905969 PMCID: PMC7023005 DOI: 10.3390/biom10010062
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1The summarized figure of epigenetic regulations and technologies for epigenetics analysis. Image credit: Shutterstock.com/ellepigrafica.
List of main technologies for epigenetics and chromatin analyses.
| Method Name | Purpose | Methodology | Era | Ref. |
|---|---|---|---|---|
| Chromatin immunoprecipitation (ChIP) assay | Analysis of histone modification and transcription factor binding status | A type of immunoprecipitation experimental technique used to investigate the interaction between proteins and DNA in the cell. It aims to determine whether specific proteins are associated with specific genomic regions, and also aims to determine the specific location in the genome that various histone modifications are associated with. | 1985 | [ |
| Bisulfite sequencing (BS-Seq) | DNA methylation analysis | Treatment of DNA with bisulfite converts cytosine residues to uracil, but leaves 5-methylcytosine residues unaffected. Hence, DNA that has been treated with bisulfite retains only methylated cytosines. | 1992 | [ |
| Histone acetyltransferase (HAT) assay | Assay for histone acetyltransferase activity | Multiple biochemical HAT assays have been described; these assays measure HAT activity by detecting either the acetylated histone-based product (direct) or the free CoA product (indirect). | 1995 | [ |
| DNA methylation array: differential methylation hybridization (DMH) | DNA methylation analysis | A DNA array-based method, called differential methylation hybridization (DMH), to identify hypermethylated sequences in tumor cells by simultaneously screening many CpG island loci derived from a genomic library, CGI. | 1999 | [ |
| ChIP-on-chip | Genome-wide analysis of histone modification and transcription factor binding status | A technology that combines chromatin immunoprecipitation (ChIP) with DNA microarray (chip). It allows the identification of the cistrome, the sum of binding sites, for DNA-binding proteins on a genome-wide basis. | 1999 | [ |
| Histone methyltransferase (HMT) assay | Assay for histone methyltransferase activity | Radiometric Assays, Mass Spectrometry, Anti-Methylation Antibody-Based Detection, Enzyme-Coupled SAH Detection, Protease-Coupled Detection, Competition Binding. | 2000 | [ |
| Histone demethylase (HDMT) assay | Assay for histone demethylase activity | Measuring the release of radiolabeled formaldehyde from 3H-labeled methylated histone substrates, by monitoring the change in methylation levels of histone substrates by immunoblotting with site-specific methyl-histone antibodies, or by using mass spectrometry to detect reductions in histone peptide masses that correspond to methyl groups. | 2004 | [ |
| Reduced Representation Bisulfite Sequencing (RRBS) | Genome-wide DNA methylation analysis | An efficient and high-throughput technique for analyzing the genome-wide methylation profiles on a single nucleotide level; it combines restriction enzymes and bisulfite sequencing to enrich for areas of the genome with a high CpG content. | 2005 | [ |
| ChIP-loop | Chromosome conformation capture technique | This method combines the standard 3C protocol with a routine ChIP protocol; it allows the selective identification of long-range chromatin interactions between loci that are bound to specific proteins of interest. | 2005 | [ |
| ChIP-sequencing (ChIP-seq) | Genome-wide analysis of histone modification and transcription factor binding status | By combining chromatin immunoprecipitation (ChIP) assays with next-generation sequencing (NGS), ChIP sequencing (ChIP-seq) is a powerful method for identifying genome-wide DNA binding sites for transcription factors and other proteins. | 2007 | [ |
| Whole Genome Bisulfite Sequencing (WGBS) | Genome-wide DNA methylation analysis | A NGS technology used to determine the DNA methylation status of single cytosines by treating the DNA with sodium bisulfite before sequencing. | 2009 | [ |
| Hi-C | Chromosome conformation capture technique | A genome-wide chromatin conformation capture protocol using proximity ligation. The technology is of special interest for three-dimensional genome organization in the nucleus and de novo genome assemblies. | 2009 | [ |
| ChIA-PET | Determination of de novo long-range chromatin interactions genome-wide | The ChIA-PET method combines ChIP-based methods, and Chromosome conformation capture (3C), to extend the capabilities of both approaches. | 2009 | [ |
| ATAC-seq | Identification of accessible DNA regions | This method relies on NGS library construction using the hyperactive transposase Tn5. NGS adapters are loaded onto the transposase, which allows simultaneous fragmentation of chromatin and integration of those adapters into open chromatin regions. | 2013 | [ |
| Capture Hi-C (CHi-C) | Identification of higher resolution mapping of chromatin interactions | The CHi-C is a new technique for assessing genome organization based on chromosome conformation capture coupled to oligonucleotide capture of regions of interest like gene promoters. | 2014 | [ |
Figure 2The summarized figure of artificial intelligence development.
Figure 3Advantages of machine learning and deep learning technologies in medical research. (A) An example of multimodal learning analysis using multiomics data including epigenetic data. (B) An example of multitask learning analysis using gene mutation data, DNA methylation data and gene expression data. This is a modified figure from reference [113]. (C) An example of semi-supervised learning using epigenetic data. This is a modified figure from reference [114].