| Literature DB >> 29669513 |
Ja-Rang Lee1, Dong-Sung Ryu2, Sang-Je Park3, Se-Hee Choe3,4, Hyeon-Mu Cho3,4, Sang-Rae Lee3,4, Sun-Uk Kim5,4, Young-Hyun Kim6,7, Jae-Won Huh8,9.
Abstract
BACKGROUND: The characterization of genomic or epigenomic variation in human and animal models could provide important insight into pathophysiological mechanisms of various diseases, and lead to new developments in disease diagnosis and clinical intervention. The African green monkey (AGM; Chlorocebus aethiops) and cynomolgus monkey (CM; Macaca fascicularis) have long been considered important animal models in biomedical research. However, non-human primate-specific methods applicable to epigenomic analyses in AGM and CM are lacking. The recent development of methyl-capture sequencing (MC-seq) has an unprecedented advantage of cost-effectiveness, and further allows for extending the methylome coverage compared to conventional sequencing approaches.Entities:
Keywords: African green monkey; Animal model; Cynomolgus monkey; DNA methylation; Methyl-capture sequencing
Mesh:
Year: 2018 PMID: 29669513 PMCID: PMC5907189 DOI: 10.1186/s12864-018-4666-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Definitions of various genomic regions used for methyl-capture sequencing in this study. Targeted genomic regions can be separated into regulatory regions and intragenic regions. Regulatory regions included CG islands (CGIs) with the surrounding CGI shore, CGI shelf, and promoter region. Intragenic regions are organized into all exon regions, including the coding sequence (CDS), excluding introns. US, upstream; DS, downstream; TSS, transcription start site
Fig. 2The redefined target region comprising the homologous probe region (HPR) and orthologous promoter region (OPR). To determine inter-species homologous genome sequences, we defined the HPR as alignment regions greater than the empirical thresholds (e-value and identity of the BLAT alignment result). For promoter regions not covered by the HPR, the OPR was also considered, consisting of the gene symbols that match between the non-human primate (NHP) genomes and human targets for which the promoter regions overlap with the probe targeted-region by more than 60%
Fig. 3CpG content distribution according to each genomic region in redefined targeted regions. The pie charts show the percentages of sequences identified with the SureSelect human toolkit that could be aligned to the a African green monkey (AGM) and b cynomolgus macaque (CM) genome with an identity threshold of 85% and an e-value < e− 10. c The height of bar graphs represents the number of CpGs covered by the homologous promoter region (HPR) and orthologous promoter region (OPR) methods
Fig. 4Read statistics. a Bars show the numbers of mapped reads and uniquely mapped reads, and the line represents the number of raw reads. Each vertical axis shows the number of reads in millions. b Bars represent the rate of coverage of on-targeted read by genomic regions at a calling depth ≥ 5-fold. AGM, African green monkey; CM, cynomolgus macaque; HPR, homologous promoter region; OPR, orthologous promoter region
Fig. 5Comprehensive distribution maps of redefined targeted regions in African gene monkey (a) and cynomolgus macaque (b). Each numerical distribution according to the genomic region was calculated against a 500-kb bin on the whole genome. Black peaks, distribution of CpG sites on the whole genome; green peaks, distribution of target homologous promoter regions (HPRs) + orthologous promoter regions (OPRs) greater than 500 bp; blue peaks, distribution of target HPRs greater than 500 bp; red peaks, distribution of the representative transcript region; purple peaks, distribution of the promoter region; orange peaks, distribution of CGIs and flanking regions
Fig. 6Average methylation levels according to each genomic region in the African green monkey (a) and cynomolgus macaque (b) genomes. The boxes and error bars indicate the mean and 95% confidence intervals, respectively
Fig. 7Schematic of the procedures for genome-wide DNA methylation analysis. 1) Fragmentation of genomic DNA by sonication or restriction enzyme digestion. 2) Target genomic DNA enrichment using MBD protein, methylation antibody, or target probe. 3) Bisulfite conversion or 4) direct genome-wide DNA methylation analysis by microarray or a next-generation sequencing platform. MBD, methyl-CpG-binding domain; MeDIP, methylated DNA immunoprecipitation; Infinium or HM450, Illumina, Infinium Human Methylation 450 BeadChips; RRBS, reduced-representation-bisulfite-sequencing; MC-seq, methyl-capture sequencing; WGBS, whole-genome bisulfite sequencing
Summary of experimental approaches for genome-wide DNA methylation profiling
| aSpecies | bDNA amount | Reads | Genome Coverage | cInfluencing factor | Resolution | Bioinformatics requirement | Cost | |
|---|---|---|---|---|---|---|---|---|
| Array-based methodsd | ||||||||
| MeDIP-chip | limited | 5 | – | depends on array | A, B, C, D | ~ 150 bp | ++ | ++ |
| MBD-chip | limited | 5 | – | depends on array | A, B, C, D | ~ 150 bp | ++ | ++ |
| Infinium | limited | 0.5–1 | – | 485 K | C, E, G | Single base | + | + |
| NGS-based methodse | ||||||||
| MeDIP-seq | any | 0.3–5 | 50 M | ~ 23 M | A, B, D | ~ 150 bp | +++ | ++ |
| MBD-seq | any | 1–3 | 30 M | ~ 23 M | A, B, D | ~ 150 bp | +++ | ++ |
| RRBS | any | 0.01–2 | 10 M | ~ 2 M | E, F, G | Single base | +++ | ++ |
| MC-seq | limited | 1–3 | 50 M | 3.7 M | E, G | Single base |
|
|
| WGBS | any | 1–5 | > 500 M | > 28 M | E, G | Single base | +++++ | +++++ |
Abbreviations: MeDIP methylated DNA immunoprecipitation, MBD methyl-CpG-binding domain, NGS next-generation sequencing, RRBS reduced-representation-bisulfite-sequencing, MC-seq methyl-capture sequencing, WGBS whole-genome bisulfite sequencing, M million, K thousand, + very low, ++ low, +++ moderate, ++++ high, +++++ very high
aSpecies: the range of applications varies according to the methylome profiling method adopted; methods are limited to species with commercially available arrays, or species with a complete reference genome available
bDNA input varies depending on the protocol
cInfluencing factors represent the potential sources of genomic region bias. A, CG content; B, CpG density; C, probe hybridization; D, copy number variation; E, bisulfite conversion rate; F, enzyme recognition sites; G, bisulfite PCR bias
dReferences: MeDIP-Chip [39, 40]; MBD-Chip [40, 41]; Infinium [19, 34, 42–44]
eReferences: MeDIP-seq [32, 34, 45]; MBD-seq [33, 34, 44, 46]; RRBS [26, 34, 44, 47, 48]; MC-seq [21, 26, 34]; WGBS [20, 26, 34, 44, 49]