| Literature DB >> 26358297 |
Minkyeung Choi1, Jongin Lee1, Min Thong Le1, Dinh Truong Nguyen1, Suhyun Park1, Nagasundarapandian Soundrarajan1, Kyle M Schachtschneider2, Jaebum Kim1, Jin-Ki Park3, Jin-Hoi Kim1, Chankyu Park4.
Abstract
DNA methylation plays a major role in the epigenetic regulation of gene expression. Although a few DNA methylation profiling studies of porcine genome which is one of the important biomedical models for human diseases have been reported, the available data are still limited. We tried to study methylation patterns of diverse pig tissues as a study of the International Swine Methylome Consortium to generate the swine reference methylome map to extensively evaluate the methylation profile of the pig genome at a single base resolution. We generated and analysed the DNA methylome profiles of five different tissues and a cell line originated from pig. On average, 39.85 and 62.1% of cytosine and guanine dinucleotides (CpGs) of CpG islands and 2 kb upstream of transcription start sites were covered, respectively. We detected a low rate (an average of 1.67%) of non-CpG methylation in the six samples except for the neocortex (2.3%). The observed global CpG methylation patterns of pigs indicated high similarity to other mammals including humans. The percentage of CpG methylation associated with gene features was similar among the tissues but not for a 3D4/2 cell line. Our results provide essential information for future studies of the porcine epigenome.Entities:
Keywords: DNA methylation; RRBS; epigenetics; methylome; pig
Mesh:
Substances:
Year: 2015 PMID: 26358297 PMCID: PMC4596400 DOI: 10.1093/dnares/dsv017
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Comparison between porcine and other reduced representation (RR) genomes
| Genome | ||||
|---|---|---|---|---|
| Human | Mouse | Zebrafish[ | Pig | |
| Size of genome (Gb) | 3.2 | 2.8 | 1.41 | 2.8 |
| Size of RR genome (Mb) | 74 | 38 | 31 | 73 |
| Per cent of whole genome | 2.3 | 1.4 | 2.2 | 2.6 |
| GC content of whole genome (%) | 40.9 | 41.7 | 36.5 | 42.3 |
| Number of fragments (RR genome) | 647,626 | 333,104 | 264,598 | 664,080 |
| Number of CpG sites (RR genome) | 4,068,947 | 1,506,712 | 1,430,390 | 5,586,410 |
| Per cent of total genomic CpG sites | 13.5 | 7 | 5.3 | 18.30 |
All RR genomes were simulated by selecting DNA fragment in a 40- to 220-bp window after in silico restriction enzyme digestion.
Details of mapping of porcine RRBS libraries
| Clean read (M) | Clean read (Gb) | Mapped rate (%) | Uniquely mapped rate (%) | Multiple mapping (%) | CpG methylation (%) | Non-CpG methylation (%) | |
|---|---|---|---|---|---|---|---|
| 3D4/2 | 66.3 | 3.25 | 60.83 | 55.08 | 5.74 | 53.05 | 1.71 |
| Neocortex | 76.8 | 3.76 | 62.29 | 56.89 | 5.40 | 50.87 | 2.30 |
| OE | 67.6 | 3.31 | 58.48 | 53.56 | 4.92 | 49.19 | 1.63 |
| Spleen | 72.6 | 3.56 | 62.38 | 57.16 | 5.22 | 52.84 | 1.66 |
| Liver | 111.84 | 5.48 | 56.17 | 50.77 | 5.40 | 47.83 | 1.67 |
| Muscle | 85.1 | 4.17 | 57.20 | 51.76 | 5.45 | 48.06 | 1.69 |
OE: olfactory epithelium.
Figure 1.Correlation analysis of the CpG methylation patterns among different tissues. Scatter plots of percentage methylation values for pairwise comparisons in six libraries [pulmonary alveolar macrophage (PAM) cells, neocortex, spleen, liver, muscle and olfactory epithelium (OE)]. Colours in the scatter plot indicate the number of CpG sites with identical methylation pattern (methylated or non-methylated); yellow denotes large number of correlations, blue denotes lack of correlation and green denotes different methylation patterns. Numbers in the upper right side represent the pairwise Pearson's correlation scores. Histograms on the diagonal are methylation distribution of CpG sites for each sample. Comparisons of global CpG methylation level (A), methylation level of CpG in CGI regions (B) and CGI shore (C).
Figure 2.Hierarchical cluster analysis according to methylation patterns across different tissues. Distance between tissues according to their methylation patterns was estimated by the Ward's minimum variance method.
Figure 3.Comparison of the percentage methylation level of CpG and non-CpG sites among different tissues. Methylation level was indicated according to different criteria including global, 2 kb upstream of TSS (up2 kb) and CGI. Non-CpG methylation was separately evaluated for mCHG and mCHH. PAM, pulmonary alveolar macrophage; OE, olfactory epithelium. Percentage methylation of CpG sites (A), non-CpG (B), CHG (C) and CHH (D) for global, up2 kb and CGI. ** denotes P < 1 × 10−5, * denotes P < 1 × 10−3.
Figure 4.Comparison of the methylation levels in gene-related regions in relation to the numbers of genes and CpGs on individual pig chromosomes. The gene annotation data of the pig genome were obtained from Ensemble. The total number of CpG for each chromosome was obtained from the current pig genome assembly (Sscrofa10.2). Gene body (A), 2 kb (B) and 5 kb (C) upstream regions of TSS. This figure is available in black and white in print and in colour at DNA Research online.
Figure 5.The pattern of CpG methylation of imprinted genes from six tissues in pigs. The result was visualized by heat map. Gene body, exon and intron; up2 kb and up2–10 kb, upstream 2 kb and 2–10 kb regions from TSS, respectively.
Figure 6.The number of tissue-specific differentially methylated cytosine (tDMCs) from five pig tissues. Venn diagrams showing the number of positions of tDMCs from each tissue. The positions of tDMCs depend on tissue type were visualized by Venn diagram. NC: neocortex; OE: olfactory epithelium; LV: liver; MS: muscle; SP: spleen. The 3D4/2 cells were not represented here. Detailed positions of tDMC were summarized in the Supplementary Table S8. This figure is available in black and white in print and in colour at DNA Research online.
Correlation between tDMCs and the level of gene expression from the neocortex, muscle and spleen
| Tissue type | Gene | Chra | Position | Classb | Mc | Ed | Pearson's corr.e | |
|---|---|---|---|---|---|---|---|---|
| Neocortex | NRXN2 | 2 | 6587361 | UP10K | − | + | −0.897000684 | |
| 2 | 6635041 | Genebody | − | + | −0.838015353 | |||
| 2 | 6635090 | Genebody | − | + | −0.886097969 | |||
| 2 | 6635002 | Genebody | − | + | −0.828919384 | |||
| 2 | 6634986 | Genebody | − | + | −0.902681422 | |||
| 2 | 6634955 | Genebody | − | + | −0.802729923 | |||
| 2 | 6634980 | Genebody | − | + | −0.874609127 | |||
| 2 | 6634930 | Genebody | − | + | −0.720944026 | |||
| 2 | 6634961 | Genebody | − | + | −0.837225041 | |||
| 2 | 6634934 | Genebody | − | + | −0.785419851 | |||
| 2 | 6634951 | Genebody | − | + | −0.865604511 | |||
| MAPT | 12 | 17133831 | Genebody | − | + | −0.926451406 | ||
| 12 | 17133834 | Genebody | − | + | −0.922834214 | |||
| SYNE1 | 1 | 16124463 | Genebody | − | + | −0.80723621 | ||
| 1 | 16124489 | Genebody | − | + | −0.750224969 | |||
| Muscle | CCDC88C | 7 | 119491407 | Genebody | + | − | −0.936174813 | |
| EXOC2 | 7 | 75292 | Genebody | CGI shore | + | − | −0.851066176 | |
| 7 | 75340 | Genebody | CGI shore | + | − | −0.832158353 | ||
| FGR | 6 | 78329214 | Genebody | CGI shore | + | − | −0.802870198 | |
| 6 | 78336406 | Genebody | + | − | −0.755515526 | |||
| MAP2K7 | 2 | 71712421 | UP10K | + | − | −0.857718538 | ||
| NR5A2 | 10 | 27361479 | Genebody | − | + | −0.957673269 | ||
| 10 | 27361475 | Genebody | − | + | −0.95434547 | |||
| 10 | 27361468 | Genebody | − | + | −0.921209644 | |||
| 10 | 27361450 | Genebody | − | + | −0.960461808 | |||
| PCOLCE | 3 | 7921803 | Genebody | − | + | −0.995207087 | ||
| RAB22A | 17 | 65753356 | Genebody | CGI shore | + | − | −0.884084122 | |
| SLA-7 | 7 | 27608918 | UP10K | + | − | −0.914258584 | ||
| TUBGCP6 | 5 | 110682055 | Genebody | CGI shore | + | − | −0.90531831 | |
| VGLL2 | 1 | 49880654 | Genebody | CGI shore | − | + | −0.753634499 | |
| 1 | 49880666 | Genebody | CGI shore | − | + | −0.805639146 | ||
| XPO4 | 11 | 268954 | Genebody | CGI shore | + | − | −0.909143259 | |
| 11 | 269035 | Genebody | + | − | −0.914121809 | |||
| Spleen | ICAM-1 | 2 | 69496888 | Genebody | CGI shore | − | + | −0.755261243 |
| ALDH3A1 | 12 | 62650896 | UP10K | CGI shore | + | − | −0.788071349 | |
| 12 | 62650898 | UP10K | CGI shore | + | − | −0.752480115 | ||
| KIRREL3 | 9 | 59344774 | Genebody | − | + | −0.957798851 | ||
| GTF2IRD1 | 3 | 11271020 | UP10K | − | + | −0.917047466 | ||
| UBA52 | 2 | 59052151 | UP10K | − | + | −0.953385635 | ||
| CD163 | 14 | 153630638 | Genebody | CGI shore | − | + | −0.860727255 | |
| HMHA1 | 2 | 77641888 | UP10K | CGI shore | + | − | −0.861581753 | |
| RPS21 | 17 | 69165427 | UP10K | CGI shore | − | + | −0.829232698 | |
| 17 | 69165425 | UP10K | CGI shore | − | + | −0.811905734 | ||
| 17 | 69169540 | UP10K | − | + | −0.852778602 | |||
| 17 | 69169556 | UP10K | − | + | −0.743101278 | |||
| SLC9A3R2 | 3 | 42231210 | Genebody | CGI shore | − | + | −0.752184061 | |
| ZNF646 | 3 | 17732629 | Genebody | CGI shore | − | + | −0.828529507 | |
| ADAMTS4 | 4 | 97117737 | Genebody | − | + | −0.802729052 | ||
| 4 | 97117751 | Genebody | − | + | −0.8029031 | |||
| 4 | 97117778 | Genebody | − | + | −0.806509102 | |||
| 4 | 97117512 | UP10K | − | + | −0.763107181 | |||
| 4 | 97117534 | UP10K | − | + | −0.814870957 | |||
| TBKBP1 | 12 | 23838338 | Genebody | − | + | −0.89415394 | ||
| 12 | 23838110 | Genebody | CGI shore | − | + | −0.975048371 | ||
| 12 | 23838349 | Genebody | − | + | −0.821418354 | |||
aChromosome.
bClass represents genomic location of tDMC; exon and intron region, 10 kb upstream region from TSS and CGI Shore were marked as Genebody, UP10K and CGI shore, respectively.
cMethylation pattern, + and − represent higher and lower methylation over other tissues, respectively.
dExpression level, + and − represent higher and lower expression levels over other tissues, respectively.
ePearson's correlation score.