Literature DB >> 32878957

The Landscape of Genomic Imprinting at the Porcine SGCE/PEG10 Locus from Methylome and Transcriptome of Parthenogenetic Embryos.

Jinsoo Ahn1, In-Sul Hwang2, Mi-Ryung Park2, In-Cheol Cho3, Seongsoo Hwang4, Kichoon Lee5.   

Abstract

In mammals, imprinted genes often exist in the form of clusters in specific chromosome regions. However, in pigs, genomic imprinting of a relatively few genes and clusters has been identified, and genes within or adjacent to putative imprinted clusters need to be investigated including those at the SGCE/PEG10 locus. The objective of this study was to, using porcine parthenogenetic embryos, investigate imprinting status of genes within the genomic region spans between the COL1A2 and ASB4 genes in chromosome 9. Whole-genome bisulfite sequencing (WGBS) and RNA sequencing (RNA-seq) were conducted with normal and parthenogenetic embryos, and methylome and transcriptome were analyzed. As a result, differentially methylated regions (DMRs) between the embryos were identified, and parental allele-specific expressions of the SGCE and PEG10 genes were verified. The pig imprinted interval was limited between SGCE and PEG10, since both the COL1A2 and CASD1 genes at the centromere-proximal region and the genes between PPP1R9A and ASB4 toward the telomere were non-imprinted and biallelically expressed. Consequently, our combining analyses of methylome, transcriptome, and informative polymorphisms revealed the boundary of imprinting cluster at the SGCE/PEG10 locus in pig chromosome 9 and consolidated the landscape of genomic imprinting in pigs.
Copyright © 2020 Ahn et al.

Entities:  

Keywords:  PEG10; RNA sequencing; SGCE; differentially methylated region; embryo; genomic imprinting; parthenogenetic; porcine; whole-genome bisulfite sequencing

Mesh:

Year:  2020        PMID: 32878957      PMCID: PMC7642923          DOI: 10.1534/g3.120.401425

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


In mammals, imprinted genes often exist in the form of clusters in specific chromosome regions (Lewis and Reik 2006). Therefore, detailed comparative characterization of large imprinted clusters is important to understand epigenetic regulation of genomic imprinting. In pigs, progress has been made in identifying imprinted genes (Li ; Bischoff ; Oczkowicz ; Tian 2014; Ahn ); however, still a relatively few genes and clusters have been identified and genes within or adjacent to putative imprinted clusters need to be investigated, including those at the SGCE/PEG10 locus. In the mouse, a large cluster of imprinted genes within a 1-Mb region between Col1a2 and Asb4 in proximal (centromeric) chromosome 6 (6A1) was previously reported (Ono ). Of nine murine genes belonging to the above interval (Col1a2, Casd1, Sgce, Peg10, Ppp1r9a, Pon1, Pon3, Pon2, and Asb4), two genes, Sgce and Peg10, are paternally expressed imprinted genes (Piras ; Ono ). This paternal expression appears critical in mammalian development as evidenced by early embryonic lethality of mice with maternal duplication of the proximal chromosome 6 cluster and with a deficiency of Peg10 (Ono ). Three genes (Ppp1r9a, Pon3, and Pon2) from the same cluster were maternally expressed in the embryonic stages with some exceptions, and the Asb4 gene showed a strong maternal expression (Mizuno ; Ono ). Expressions of Col1a2 and Casd1 were biallelic, and Pon1 expression was low in the mouse embryo and undetectable in human-derived cell lines (Mizuno ; Okita ; Ono ). Although the way that the epigenetic germline mark affects the expression of genes in cis remains to be determined, a single cis-acting imprinting control region (ICR) which may establish and maintain imprinting throughout the Col1a2-Asb4 cluster was identified at the SGCE/PEG10 differentially methylated region (DMR) (Ono ; Monk ). This DMR/ICR acquired allele-specific DNA methylation in the maternal germline. In humans, on chromosome 7 (7q21.3), paternal expressions of SGCE and PEG10 was conserved (Ono ; Müller ; Grabowski ). Human PPP1R9A gene was found to be imprinted and maternally expressed mainly in skeletal muscle tissues (Nakabayashi ). In ovine fetuses, a putative imprinted interval was reported to be extended to between CASD1 (maternally expressed) and PON3 (paternally expressed), while the paternal expression of SGCE and PEG10 and maternal expression of PPP1R9A were conserved (Duan ). In this study, the imprinted cluster containing the SGCE/PEG10 locus was determined by comparing parthenogenetic (PA) and normal control (CN) porcine embryos using whole-genome bisulfite sequencing (WGBS) and RNA sequencing (RNA-seq). In addition, to confirm the expression pattern of PPP1R9A, a single nucleotide polymorphism-based assay was performed. We report that the orthologous pig imprinted interval was between SGCE and PEG10, since both the centromere-proximal genes (COL1A2 and CASD1) and the telomeric genes between PPP1R9A and ASB4 were non-imprinted and biallelically expressed.

Materials And Methods

Ethics statement

All the treatments and experiments related to pigs were performed in accordance with study protocols and standard operating procedures that were approved by the Institutional Animal Care and Use Committee of the National Institute of Animal Science, Rural Development Administration (RDA) of Korea (approval number NIAS2015-670).

In vitro maturation of porcine oocytes and production of parthenogenetic embryos

Pig oocytes were collected and in vitro maturation (IVM) was performed as described in our previous reports (Kwon ; Ahn ). In brief, from a local slaughterhouse (Nonghyup Moguchon, Gimje, Korea), ovaries from Landrace x Yorkshire x Duroc (LYD) pigs were obtained, and then immediately transported to our laboratory in saline solution supplemented with penicillin and maintained at 30–35° in a thermos. After collection of cumulus-oocyte complexes (COCs), they were washed in Tyrode’s lactate-Hepes containing 0.1% (w/v) polyvinyl alcohol. Oocytes surrounded by several layers of cumulus cells were collected and washed three times in TCM-199 (GIBCO, Grand Island, NY, USA) which is supplemented with 0.1% polyvinyl alcohol (w/v), 3.05 mM D-glucose, 0.91 mM sodium pyruvate, 0.57 mM cysteine, 0.5 µg/ml luteinizing hormone, 0.5 µg/ml follicle stimulating hormone, 10 ng/ml epidermal growth factor, 10% porcine follicular fluid (pFF), 75 µg/ml penicillin G, and 50 µg/ml streptomycin. For IVM, in each well of five four-well dishes (Nunc, Roskilde, Denmark) containing 500 µL of maturation medium, 50 COCs were placed and the oocytes were matured for 40–42 h at 38.5° in an incubator containing 5% CO2. Cumulus cells were then removed, and oocytes having the first polar body were selected and activated. In a fusion chamber with 250 µm diameter wire electrodes (BLS, Budapest, Hungary) covered with 0.3 M mannitol solution containing 0.1 mM MgSO4, 1.0 mM CaCl2, and 0.5 mM Hepes, selected oocytes were placed and fusion was achieved by two DC pulses (1 sec interval) of 1.2 kV/cm for 30 µs using an LF101 Electro Cell Fusion Generator (Nepa Gene Co., Ltd., Chiba, Japan). After electric stimulation of oocytes followed by 2 h of stabilization period, 200 parthenogenetic embryos were placed into oviducts of each of the two LYD surrogate gilts aged 12 months at onset of estrus (i.e., a total of 400 parthenogenetic embryos for two surrogates) to produce developed parthenogenetic embryos.

Collection of both fertilized control (CN) embryos and parthenogenetic (PA) embryos

To produce fertilized control (CN) embryos, during the natural heat period at the onset of estrus (day 0) two LYD gilts were naturally mated with boars twice with a 6 h interval. Both 12 CN embryos and six PA embryos were recovered at day 21 from the gilts and surrogates, respectively, that had been confirmed pregnant by an ultrasound examination. Regarding the recovery, it was considered that some imprinted genes in pigs (e.g., H19, IGF2, XIST and PEG10) does not exhibit monoallelic expression until the blastocyst stage (Park ), and porcine parthenogenetic embryos undergo morphological changes at approximately day 30 of gestation (Bischoff ) and ceased fetal development with degenerated organs having a high rate of apoptosis of cells at day 35 of gestation (Moss ). After the gilts and surrogates were killed, their reproductive tracts were sectioned and placenta were isolated from the uterus. Then, embryos separated from the surrounding placenta were placed on a piece of cleaning tissue in order to dry liquid on the surface of the embryos. Morphologically intact embryos with comparable sizes were selected for further experiments (lengths of 2.2, 2.1, and 2.1 cm for CN and 2.0, 2.0, and 1.6 cm for PA), and stored in liquid nitrogen until further use.

Genomic DNA isolation and whole-genome bisulfite sequencing (WGBS) analysis

From the whole collected embryos of CN and PA (n = 3 for each), genomic DNA was isolated and fragmented. For optimization of bisulfite conversion of genomic DNA, Accel-NGS Methyl-Seq DNA Library Kit (Swift Biosciences, Inc. Ann Arbor, MI, USA) was used according to the manufacturer`s instructions. PCR was conducted with adapter primers and Diastar EF-Taq DNA polymerase (Solgent, Daejeon, Korea) to amplify the bisulfite-treated DNA, under the following thermal conditions: 3 min at 95° followed by 35 cycles of 30 sec at 95°, 30 sec at 60°, and 30 sec at 72°, and a final extension for 5 min at 72°. After bead-based clean-up, the PCR products were sequenced using HiSeqX sequencer operated by Macrogen (Seoul, Korea). For each of the WGBS libraries, the number of raw reads were approximately 848M (CN1), 864M (CN2), 868M (CN3), 842M (PA1), 859M (PA2), and 851M (PA3). After trimming adapter sequences and filtering out reads shorter than 20bp using Trim Galore (v0.4.5), cleaned reads for each library [approximately 847M (CN1), 862M (CN2), 866M (CN3), 840M (PA1), 857M (PA2), and 849M (PA3)] were retrieved (Table S1). After mapping to the reference genome (Sscrofa11.1/susScr11) with BSMAP (v2.87), the clean mapped reads were sorted and indexed. Using ‘methylatio.py’ script in BSMAP, extraction of the methylation ratio of every single cytosine from the mapping results was conducted. DMRs were identified by the program metilene (v0.2-8) (Juhling ) using the methylation ratios at all CpGs. The criteria for DMR were a presence of more than 10 CpGs, a genomic distance of less than 300 bp between CpGs, and a mean methylation difference of more than 0.2 between groups. DMRs with false discovery rate (FDR) < 0.05 were obtained as significant DMRs (Table S2). To visualize methylation ratios and significant DMRs throughout the genomic coordinates of the imprinted cluster, the R/Bioconductor package Gviz (v1.28.3) (Hahne and Ivanek 2016) was used.

RNA extraction, cDNA library construction, and RNA sequencing (RNA-seq)

Total RNA from whole CN embryos (n = 3) and whole PA embryos (n = 3) samples was isolated with TRIzol reagent (Sigma-Aldrich, USA) following the manufacturer’s instructions. DNase I was treated to the RNA samples to avoid genomic DNA contamination. The RNA samples were electrophoresed in 1.2% agarose gels to evaluate the integrity of RNA, which was then confirmed by more than 2 of 28S/18S rRNA ratio and more than 7 of RNA integrity number (RIN) using an Agilent 2100 BioAnalyzer. Using the ratios of A260/A280 and A260/A230 (1.8–2.0), the concentrations of RNA were assessed. One ug of total RNA was used to construct cDNA libraries with the TruSeq RNA Sample Prep Kit v.2 (Illumina, San Diego, CA, USA). The final cDNA library was produced using the protocol consisted of polyA-selected RNA extraction, RNA fragmentation, random hexamer primed reverse transcription and amplification. Quantification of the cDNA libraries was done by quantitative Real-Time PCR (qPCR) according to the qPCR Quantification Protocol Guide, and qualification of the libraries was assessed using the Agilent 2100 Bioanalyzer. The Illumina HiSeq2500 RNA-seq platform was used to sequence the library products (100nt paired-end). The number of raw reads generated for each RNA-seq library were approximately 77.3M (CN1), 73.5M (CN2), 77.7M (CN3), 80.7M (PA1), 79.9M (PA2), and 81.0M (PA3). After adapter trimming and quality filtering, the number of remaining cleaned reads were approximately 76.8M (CN1), 73.0M (CN2), 77.2M (CN3), 80.0M (PA1), 79.3M (PA2), and 80.3M (PA3) (Table S1). From the Assembly database at the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/assembly/GCF_000003025.6), the reference genome sequence of Sus scrofa (Sscrofa11.1/susScr11) and annotation files were downloaded. Then, using HISAT2 (v2.1.0) (Kim ) with default parameter settings, the clean sequencing reads were aligned to the reference genome, except for the parameter–dta regarding tailored transcriptome assembly. BAM format containing RNA-seq alignments were produced and sorted by SAMtools (v1.9), and then BAM file-derived read coverage (or depth) was plotted and visualized using the R/Bioconductor package Gviz (v1.28.3) (Hahne and Ivanek 2016).

Analysis of differential gene expression

The htseq-count script (Anders ) was used to count aligned RNA-seq reads using each BAM file and an annotation file as inputs, and a count matrix was generated. Using the R/Bioconductor package DESeq2 (v1.26.0) (Love ), read counts from individual fetal samples were normalized for sequencing depth and differential expression between CN embryos and PA embryos was calculated. To obtain significant differentially expressed genes (DEGs), the combined criteria of FDR < 0.05 and the absolute log2-fold change > 1 was used, where a fold change is defined as the expression in samples of PA divided by the expression in samples of CN.

Polymorphism-based assays

Potential single nucleotide polymorphism (SNP) sites on the last exon of the porcine PPP1R9A gene was obtained from the Ensembl Genome Browser (http://www.ensembl.org). The gDNAs of boar (Korean native pig, KNP), sow (KNP), and offspring (neonates, 2-day-old) were isolated from ear tissue. RNA was extracted from six tissues (heart, kidney, liver, lung, muscle, and spleen) from neonates (2-day-old) and reverse-transcribed to cDNA. A primer set for amplification of a genomic DNA (gDNA) fragment containing those potential SNPs (316 bp) was designed on the last exon: forward, 5′-AGGCTCTTGGGATGACATCAT-3′ and reverse, 5′-ACGGTTCCCCAG TCTGCCAAAG -3′. To amplify a corresponding cDNA fragment (350 bp) while avoiding genomic DNA contamination, a forward primer (5′-ACTGGAGAGCAACTCCTGCAGT-3′) was designed on the exon coming just before the last exon and was coupled with the above reverse primer on the last exon. The conditions for PCR amplification were 95° for 2 min, appropriate cycles with linear amplification ranges of 95° for 20 s, 60° (gDNA) or 54° (cDNA) for 40 s, 72° for 30 s, with an additional extension step at 72° for 5 min. The amplified gDNA fragment was sequenced to identify heterozygous offspring with potential SNPs. The informative SNP sequences in gDNA of offspring were compared with sequences in amplified cDNA of offspring to determine maternal and/or paternal expression. Those SNP sequences of gDNA in the boars and sows were compared with above sequences to determine parent-of-origin-specific gene expression.

Data availability

The WGBS and RNA-seq datasets generated in this study have been submitted to the NCBI BioProject database under accession number PRJNA657886 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA657886) and PRJNA658544 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA658544), respectively. DMRs called by the metilene software are included in Table S2. Supplemental material available at figshare: https://doi.org/10.25387/g3.12845840.

Results

Differentially methylated regions were only located at the SGCE/PEG10 locus within the COL1A2-ASB4 region in pigs

Nine genes (from COL1A2 to ASB4) in pig chromosome 9 were mapped to an approximate 1-Mb region between 74.17 and 75.20 Mb according to the genome sequence of Sus scrofa (Sscrofa11.1/ susScr11). The order of those genes in this locus is conserved in the human chromosome 7 (7q21.3) and mouse chromosome 6 (6 A1). In the mouse, DNA methylation in this region was examined in four CpG islands located in the promoter regions of the Casd1, Sgce-Peg10, Ppp1r9a, and Pon2 genes, and bisulfite sequencing analyses between reciprocal F1 hybrids revealed that only the promoter region of Sgce-Peg10 is differentially methylated in a parent-of-origin-specific manner (maternally hypermethylated) and the other three regions were non-methylated (Ono ). Similarly, in pigs, the promoter regions of the CASD1, SGCE-PEG10, PPP1R9A, and PON2 genes contained CpG islands, and only the areas containing the promoter regions of SGCE and PEG10 displayed DMR (Figure 1). DNA methylation of each of the three parthenotes and three controls was exhibited, and there were two CpG islands: one located in the 1st exons and intergenic region of SGCE and PEG10 and the other located in the 2nd exon of PEG10 (Figure 1). The genomic DNA around the latter CpG island was hypermethylated in all CN and PA embryos indicating non-DMR; whereas, hypomethylation of CN was found at the 1st exons of SGCE and PEG10, intergenic region, and introns where GC content is high (Figure 1). As a result, maternally hypermethylated DMR in PA was identified.
Figure 1

High-throughput DNA methylation profiling of the SGCE/PEG10 locus in porcine embryos determined by WGBS. GeneRegionTrack represents an approximate 1.03-Mb region between the COL1A2 and ASB4 genes on porcine chromosome 9 (chr9:74,174,484 – 75,203,450). Nine genes are indicated by brown boxes for protein-coding transcripts (short boxes: noncoding region; tall boxes: protein-coding region), and directions are marked by black horizontal arrows. From I track to PA-CN track: I, CpG islands; GC%, GC content; PA, mean methylation ratio of parthenotes (pink histogram lines); CN, mean methylation ratio of controls (blue histogram lines); and PA-CN, mean methylation ratio of parthenotes subtracted by that of controls (green histogram lines). In PA-CN track, hypermethylated DMR in PA (FDR < 0.05) is overlaid with red histogram lines. Yellow highlighted area, an area containing the promoter regions of SGCE and PEG10 and two CpG islands that is zoomed in. In the bottom plot, PA1 - 3, methylation ratios of three different parthenotes; CN1 - 3, methylation ratios of three different controls; PA, mean ratio; CN, mean ratio; PA-CN, PA subtracted by CN; and DMR, differentially methylated region (FDR < 0.05).

High-throughput DNA methylation profiling of the SGCE/PEG10 locus in porcine embryos determined by WGBS. GeneRegionTrack represents an approximate 1.03-Mb region between the COL1A2 and ASB4 genes on porcine chromosome 9 (chr9:74,174,484 – 75,203,450). Nine genes are indicated by brown boxes for protein-coding transcripts (short boxes: noncoding region; tall boxes: protein-coding region), and directions are marked by black horizontal arrows. From I track to PA-CN track: I, CpG islands; GC%, GC content; PA, mean methylation ratio of parthenotes (pink histogram lines); CN, mean methylation ratio of controls (blue histogram lines); and PA-CN, mean methylation ratio of parthenotes subtracted by that of controls (green histogram lines). In PA-CN track, hypermethylated DMR in PA (FDR < 0.05) is overlaid with red histogram lines. Yellow highlighted area, an area containing the promoter regions of SGCE and PEG10 and two CpG islands that is zoomed in. In the bottom plot, PA1 - 3, methylation ratios of three different parthenotes; CN1 - 3, methylation ratios of three different controls; PA, mean ratio; CN, mean ratio; PA-CN, PA subtracted by CN; and DMR, differentially methylated region (FDR < 0.05).

Promoter regions of genes in the COL1A2-ASB4 region mostly showed non-methylation

A close view of promoter regions of seven genes other than SGCE and PEG10 was obtained. The promoter regions close to transcription start sites and first exons of the CASD1, PPP1R9A, and PON2 genes enclosed CpG islands and high GC contents. These promoter regions were almost non-methylated in both CN and PA as shown by a regional hypomethylation (Figure 2). It suggests that these regions are biallelically active and the CASD1, PPP1R9A and PON2 genes are not imprinted. The promoter region of the COL1A2 gene also showed a similar hypomethylation, suggesting active and non-imprinting status of this gene (Figure 2). In addition, the promoter region of the PON1 gene contained a relatively narrow range of hypomethylation and the promoter regions of the PON3 and ASB4 genes did not possess hypomethylated areas in both CN and PA, suggesting a relatively inactive status of these promoters and non-imprinting of these genes (Figure 2)
Figure 2

Promoter regions of the seven genes other than SGCE and PEG10 and CpG islands in the COL1A2-ASB4 region and methylation profiles. GeneRegionTrack represents regions containing a transcription start site and the first exon of the seven genes. From I track to PA-CN track: I, CpG islands; GC%, GC content; PA1 - 3, methylation ratios of three different parthenotes; CN1 - 3, methylation ratios of three different controls; PA, mean methylation ratio of parthenotes; CN, mean methylation ratio of controls; and PA-CN, mean methylation ratio of parthenotes subtracted by that of controls.

Promoter regions of the seven genes other than SGCE and PEG10 and CpG islands in the COL1A2-ASB4 region and methylation profiles. GeneRegionTrack represents regions containing a transcription start site and the first exon of the seven genes. From I track to PA-CN track: I, CpG islands; GC%, GC content; PA1 - 3, methylation ratios of three different parthenotes; CN1 - 3, methylation ratios of three different controls; PA, mean methylation ratio of parthenotes; CN, mean methylation ratio of controls; and PA-CN, mean methylation ratio of parthenotes subtracted by that of controls.

Allele-specific expression in the porcine COL1A2-ASB4 region showed a partly conserved pattern

Expression levels of each gene in the porcine COL1A2-ASB4 region were investigated with RNA-seq analysis. The SGCE and PEG10 genes were expressed only in CN, but not in PA (Figure 3). Because CN has one paternal and one maternal allele and PA has two maternal alleles, the sole expression in CN, but not in PA, could be derived from expression in the paternal allele (‘paternal expression’) which is consistent with previous reports (Bischoff ; Park ). Among the genes between PPP1R9A and ASB4 toward the telomere, PON2 was expressed biallelically as described previously (Zhou ) and expression of ASB4 was also biallelic (Figure 3). Expression of PON1 was low, but appeared to be biallelic, and PON3 expression was not detectable. Regarding expression of PPP1R9A, previous studies showed that it is expressed biallelically in tissues from 2-month piglets, but maternally in placental tissues (Bischoff ; Zhang ), and it appeared to be biallelic (Figure 3). In addition, expressions of both the COL1A2 and CASD1 genes at the centromere-proximal region were biallelic. Taken together, compared with humans and mice, only SGCE and PEG10 genes showed a conserved paternal allele-specific expression pattern.
Figure 3

Allelic gene expression in porcine SGCE/PEG10 locus. Nine genes in the porcine COL1A2-ASB4 region are displayed in a gene order (from top left to bottom right). The RNA-seq read coverages, or depths, extracted from BAM files are shown as counts (Y axis) in each track of PA embryo (n = 3) and CN embryo (n = 3). PA, parthenote; CN, control.

Allelic gene expression in porcine SGCE/PEG10 locus. Nine genes in the porcine COL1A2-ASB4 region are displayed in a gene order (from top left to bottom right). The RNA-seq read coverages, or depths, extracted from BAM files are shown as counts (Y axis) in each track of PA embryo (n = 3) and CN embryo (n = 3). PA, parthenote; CN, control. To validate the above expression patterns, DEGs were screened. Among the nine genes in the porcine COL1A2-ASB4 region, the SGCE and PEG10 genes were differentially expressed and showed significantly higher expression in CN indicating their paternal allele-specific expression (Table 1). In contrast, the expression of PPP1R9A tended to be higher in PA, but it was statistically nonsignificant indicating its expression is biallelic. To confirm the expression pattern of PPP1R9A, a SNP-based assay was performed as presented below (Figure 4). The rest of detected genes, the COL1A2, CASD1, PON1, PON2, and ASB4 genes, did not show a difference between CN and PA indicating their biallelic expression and expression of the PON3 gene was not detected (Table 1).
Table 1

Analysis of differentially expressed genes between parthenogenetic and control embryos

IDGeneM.Counta (CN)M.Count (PA)log2FoldChange (PA/CN)P-ValueP.adjb
ENSSSCG00000015326COL1A260178.3361307.490.030.800.94
ENSSSCG00000015327CASD11433.211551.010.110.320.65
ENSSSCG00000015328SGCE1426.9635.66−5.322.01E-253.97E-23
ENSSSCG00000036049PEG1040529.7727.35−10.550.00E+00c0.00E+00
ENSSSCG00000015329PPP1R9A586.36733.520.320.100.38
ENSSSCG00000015332PON172.2176.730.090.770.92
ENSSSCG00000029515PON3NDNDNDNDND
ENSSSCG00000015331PON2895.78855.09−0.070.690.89
ENSSSCG00000015333ASB4109.7983.32−0.400.130.43

Test for differentially expressed genes was conducted by the R/Bioconductor package DESeq2. aMean of read counts which is presented as mean ± SEM bBenjamini–Hochberg (BH) adjusted p-value. cValue of 0.00E+00 represents that the significance was not measurable because it reached a maximum significance set by the program. ND, not detectable.

Figure 4

Polymorphism-based analysis of parent-of-origin dependent allele-specific expression of the porcine PPP1R9A gene. Amplified gDNA and cDNA fragments containing an informative SNP (rs329892462) in the last exon of PPP1R9A gene were sequenced. The downward arrows (in orange) point out the informative (heterozygous) SNP in gDNA of offspring for which cDNA of offspring was examined. Two different combinations of boar and sow and their offspring were analyzed (A and B). All cDNA from multiple tissues of offspring showed biallelic expression of A/G at the SNP site.

Test for differentially expressed genes was conducted by the R/Bioconductor package DESeq2. aMean of read counts which is presented as mean ± SEM bBenjamini–Hochberg (BH) adjusted p-value. cValue of 0.00E+00 represents that the significance was not measurable because it reached a maximum significance set by the program. ND, not detectable. Polymorphism-based analysis of parent-of-origin dependent allele-specific expression of the porcine PPP1R9A gene. Amplified gDNA and cDNA fragments containing an informative SNP (rs329892462) in the last exon of PPP1R9A gene were sequenced. The downward arrows (in orange) point out the informative (heterozygous) SNP in gDNA of offspring for which cDNA of offspring was examined. Two different combinations of boar and sow and their offspring were analyzed (A and B). All cDNA from multiple tissues of offspring showed biallelic expression of A/G at the SNP site.

SNP-based analysis revealed biallelic expression of PPP1R9A

Among SNPs in the last exon of the porcine PPP1R9A gene, a SNP rs329892462 (G/A) was heterozygous (informative) in our gDNA samples from offspring (Figure 4). The A allele in gDNA of offspring was derived from either sow or boar. Subsequently, cDNA from six tissues including heart, kidney, liver, lung, muscle and spleen of offspring were sequenced to investigate whether or not allele-specific expression occurs. In all tested tissues of both offspring from two different combinations of boar and sow, allele-specific patterns were not shown and biallelic expressions with an approximate A:G ratio of 1:1 were detected (Figure 4).

Comparative approach highlighted diversity and universality of CALCR and ASB4 imprinted cluster

Imprinted expression of the genes located between CALCR and ASB4 was compared between mice and humans (Table 2). In general, expression patterns in the porcine embryos within the syntenic region were closer to the expression in humans than that of mice, but in the pig the PPP1R9A gene showed biallelic expression in all tested tissues [embryonic and 2-day-old in the current study, and 2-month-old as reported in (Zhang )]. This biallelic expression of porcine PPP1R9A was different from the mouse and human which showed maternal expression in embryonic or fetal skeletal muscle tissues (Nakabayashi ; Monk ) and ovine fetuses that showed maternal expression in the kidney (Duan ). Our polymorphism-based analysis revealed biallelic expression of PPP1R9A in the muscle and kidney in 2-day-old neonates (Figure 4). In addition, based on this study and literature, the imprinting pattern was illustrated by displaying differences and similarities among various species: selected mammals (pig, human, mouse, and sheep), the marsupial tammar wallaby with imprinting of PEG10 but not SGCE (Suzuki ), and the avian (chicken) deficient in imprinting (Frésard ) (Figure 5).
Table 2

Biallelic and/or parental-origin-dependent expression of genes in the genomic region between CALCR and ASB4 in mice and humans

Mouse chr6A1Human chr7q21.3
CALCRBi: all embryonic tissues (E12.5 and 14.5) (except brain)Sequencing[1]Bi: all fetal tissues (8-18 wk) (except brain)Sequencing[1]
M: embryonic brain (E15.5), adult brain (Brain-specific)RFLP[2]monoallelic: fetal brainns[1]
GNGT1Bi: embryonic tissues, placenta, yolk sac (E12.5 and 14.5)ns[1]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
TFPI2Bi: embryonic tissues (E12.5 and 14.5)Sequencing[1]absence of expression in fetus (8-18 wk)RT-PCR[1]
M: placenta, yolk sac (E12.5 and 14.5)Sequencing[1]M: placenta (polymorphic†††)Sequencing[1]
GNG11Bi: embryonic tissues, placenta, yolk sac (E12.5 and 14.5)ns[1]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
BET1Bi: embryonic tissues, placenta, yolk sac (E12.5 and 14.5)ns[1]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
COL1A2Bi: embryo (all tested tissues, E15.5)ns[3]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
CASD1Bi: embryo, placenta, yolk sac (E10 and 13)RFLP[4]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
Bi: neonate (brain)RFLP[4]
SGCEP: embryo at E13RFLP[5]P: patUPD7 cell line/ familyRT-PCR/ sequencing[7]
P: adult (brain, heart, kidney)RFLP[4]
PEG10P: embryo at E10Sequencing[4]P: embryonic villiRFLP[8]
P: patUPD7 cell lineRT-PCR[7]
P: fetus (8-18 wk), placentaSequencing[1]
PPP1R9ABi: embryo (brain, liver, lung, eye) (E15.5) & neonate (brain)Sequencing/ SNaPshot[6]Bi: all fetal tissues (8-18 wk) except muscleSequencing[1]
Mp: embryo (tongue), placenta, yolk sac & neonate (limb, tongue††)Sequencing/ SNaPshot[6]M: fetal muscle (8-18 wk), 1st trimester placentaSequencing[1]
PON1low expression in embryo, placenta, yolk sacns[4]undetectable (lymphoblasts)ns[9]
Bi: Neonate (liver, lung)RFLP[4]
PON3Mp: embryo (E10), placenta (E10) & yolk sac (E13)RFLP[4]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
Bi: Neonate (liver, lung)RFLP[4]
PON2Mp: placenta (E10) & yolk sac (E13)RFLP[4]Bi: all fetal tissues (8-18 wk), placentaSequencing[1]
Bi: Neonate (all tested tissues)RFLP[4]
ASB4M: embryo at E15.5 (All tested tissues)RFLP[3]Bi: fetus (8-18 wk) (liver, heart), placentaSequencing[1]

ns: data not shown, P: paternal expression, M: maternal expression, M: preferential maternal expression, Monoallelic: monoallelic expression (Parental origin was not determined due to absence of parental samples), E: embryonic day. †Quantitative allele frequency measurement for SNPs. ††Limb & tongue represent skeletal muscle tissues, while placenta & yolk sac represent extra-embryonic tissues. †††Some placenta showed maternal expression and others showed biallelic expression. [1] Monk ; [2] Hoshiya ; [3] Mizuno ; [4] Ono ; [5] Piras ; [6] Nakabayashi ; [7] Grabowski ; [8] Ono ; [9] Okita .

Figure 5

Simplified schematic representation of methylation and gene expression within the region spans between COL1A2 and ASB4 (or PDK4) genes in various species. Filled red lollipops indicate paternally hypermethylated DMRs. Paternal (blue), maternal (pink), weak maternal (light pink), biallelic (gray), tissue and/or stage-dependent maternal or biallelic (green) and low (blank with dotted gray perimeter) expressions are denoted. The direction of transcription is indicated with arrows. The genomic distance was scaled based on the NCBI gene database (http://www.ncbi.nlm.nih.gov/gene), except the tammar which was based on Suzuki et al., 2007.

ns: data not shown, P: paternal expression, M: maternal expression, M: preferential maternal expression, Monoallelic: monoallelic expression (Parental origin was not determined due to absence of parental samples), E: embryonic day. †Quantitative allele frequency measurement for SNPs. ††Limb & tongue represent skeletal muscle tissues, while placenta & yolk sac represent extra-embryonic tissues. †††Some placenta showed maternal expression and others showed biallelic expression. [1] Monk ; [2] Hoshiya ; [3] Mizuno ; [4] Ono ; [5] Piras ; [6] Nakabayashi ; [7] Grabowski ; [8] Ono ; [9] Okita . Simplified schematic representation of methylation and gene expression within the region spans between COL1A2 and ASB4 (or PDK4) genes in various species. Filled red lollipops indicate paternally hypermethylated DMRs. Paternal (blue), maternal (pink), weak maternal (light pink), biallelic (gray), tissue and/or stage-dependent maternal or biallelic (green) and low (blank with dotted gray perimeter) expressions are denoted. The direction of transcription is indicated with arrows. The genomic distance was scaled based on the NCBI gene database (http://www.ncbi.nlm.nih.gov/gene), except the tammar which was based on Suzuki et al., 2007.

Discussion

In this study, we report a comprehensive assessment of DNA methylation and allelic expression in the large chromosomal interval between COL1A2 and ASB4 in pigs. Our generation of parthenogenetic porcine embryos followed by strategies that combine whole-genome DNA methylation, transcriptome, and SNP-based allelic expression enabled the large-scale imprinting profiling on a genomic level. The methylome and transcriptome were constructed independently for three individuals in each control and parthenogenetic group to reduce genetic variability, and SNP analysis further evaluated allelic expression of the PPP1R9A gene in neonatal pigs. This experimental design was appropriate to detect, not only paternal expression which showed an all-or-none fashion (i.e., SGCE and PEG10 expressions in only control embryos having a paternal allele), but also allelic expression which was verified by the ensuing SNP analysis, followed by determining the boundary of imprinting cluster. Genomic imprinting operates in mammals via several gene silencing mechanisms: e.g., i) direct silencing of a promoters where the non-transcribed allele is heavily methylated, ii) inhibiting long-range communication between promoters and enhancers by binding of an insulator on the unmethylated allele and forming chromatin loops, iii) repressing a long noncoding RNA (ncRNA) transcript by methylation only on one of the parental alleles and silencing the other allele by the non-repressed long ncRNA. iv) isoform-dependent silencing originated from alternative promoter usage where different isoforms are silenced or active depending on the influence of DNA methylation (Barlow and Bartolomei 2014; Ahn ). Regarding the third mechanism, unlike other known imprinted clusters, there was no evidence of a long ncRNA influenced by the porcine SGCE/PEG10 DMR as reported in the mouse and human (Monk ). Also, alternative promoter usage in the fourth mechanism was not likely to occur in this genomic interval because there was no evidence of expression of isoforms other than the displayed main transcripts according to our screening of transcriptome data using Integrative Genomics Viewer (IGV). Within the porcine COL1A2-ASB4 region, maternal alleles of SGCE/PEG10 locus appears inactive through direct silencing of promoters by maternal hypermethylation as described in the first mechanism, resulting in paternal expression. In addition, this maternal hypermethylation (and an insulator binding on unmethylated allele) could be involved in inhibition of long-range communication between a promoter and an enhancer as described in the second mechanism. However, genes other than SGCE and PEG10 including PPP1R9A did not show an allele-specific expression pattern, but a biallelic expression, and therefore the insulator-mediated mechanism was not the mode of action in the SGCE/PEG10 imprinting cluster in pigs. A previous study showed, using semiquantitative RT-PCR, expression of porcine PPP1R9A was greater in the parthenote than the control placental sample (a parthenote: control ratio of 1.7), suggesting maternal expression, but similar in the brain, fibroblast, and liver (Bischoff ). Another study revealed that expression of PPP1R9A is biallelic in multiple tissues of 2-month-old piglets, except skeletal muscle, fat, and testis in which expression of PPP1R9A was not detectable (Zhang ). To confirm whether or not allele-specific expression of PPP1R9A occurs in neonatal pigs, polymorphism-based assays were conducted for both parents and offspring and it revealed a biallelic expression of PPP1R9A which is different from its maternal expression in the mouse, human, and sheep (Ono ; Nakabayashi ; Duan ). The telomeric (distal) boundary of the large imprinted cluster organized around Sgce/Peg10 locus in the mouse proximal region toward the centromere of chromosome 6 (6A1) reaches the Asb4 gene, based on previous reports on maternally expression of Ppp1r9a, Pon3, Pon2, and Asb4 in mice (Piras ; Mizuno ; Ono ; Nakabayashi ). However, the boundary spans between CASD1 and PON3 in sheep (Duan ), while in the human it spans between SGCE and PPP1R9A (Ono ; Grabowski ; Okita ; Monk ). In addition, there are reports about monoallelic expression of mouse Calcr and Tfpi2 and human CALCR (in brain) and TFPI2 toward the centromeric (proximal) region, but the expression of those genes was restricted to brain and placenta, respectively (Hoshiya ; Monk ). Our porcine parthenogenetic embryos showed an imprinting boundary between SGCE and PEG10 which is narrower than the human one, and expression of CALCR and TFPI2 was not monoallelic in this study (Figure S1). As illustrated, genomic imprinting of SGCE/PEG10 is conserved among mammals, but not in the marsupial and avian (Figure 5). The tammar wallaby showed a distinct imprinting pattern with a DMR located only at the promoter region of PEG10, but not within the promoter region of SGCE (Suzuki ), while the chicken has been found to be deficient in genomic imprinting (Shin ; Renfree ; Frésard ). Regarding the role of imprinting, both the murine Sgce and Peg10 genes are expressed in the embryonic tissues and their paternal expression is essential for embryonic development, while maternal uniparental inheritance of the genomic region containing Sgce and Peg10 is associated with embryonic lethality (Piras ; Ono ). As shown in Peg10 knockout mice, deletion of Peg10 gave rise to growth retardation and an early embryonic lethal phenotype (Ono ). It has been reported that polymorphisms in PEG10 genes are associated with carcass traits and fat deposition in pigs (Qiao ), and SGCE and PEG10 are candidate genes for piglet birth weight (Zhang ). Taken together, genomic imprinting in the SGCE-PEG10 cluster may play an important role in growth and development of pigs. As shown in Table 2, parent-of-origin-dependent expression is often a developmental stage- and tissue-specific. In the mouse, embryonic days of analyzed embryos were between E10 and E15.5 which corresponds to Carnegie Stage (CS) 11-22, and the d21 porcine embryos used in this study were at CS15 (Hikspoors ). Human fetal tissues collected in a previous study were from 8- to 18-week-old fetuses (which are in or after CS23) that were obtained at the termination of pregnancies (Monk ). Therefore, the developmental stage of murine and porcine embryos was earlier than the human fetuses which requires careful attention. In addition, although our study did not include in vitro matured oocytes undergone in vitro fertilization (IVF), a previous study using these IVF oocytes showed that more exposure to in vitro culture condition might produce an adverse effect on dynamics of DNA methylation and increase DNA methylation during embryonic development (Deshmukh ). This can suggest that there is a possibility of increased DNA methylation during in vitro handling even though our PA oocytes were transferred into the oviduct instead of culturing in vitro after activation. However, the degree of increase could be low, because in our previous reports almost complete absence of DNA methylation in PA was found in Nesp DMR which is paternally hypermethylated (Ahn ). Additionally, there were some embryonic losses due to abnormal shaping but, among well-developed d21 PA embryos, we selected PA embryos that can be compared with control embryos based on intact morphology and sizes. Furthermore, the same chromosomal content was compared between control and PA embryos by genotyping the sex of control embryos and selecting female control embryos (i.e., control embryos with 36,XX were compared with PA embryos with a double of 18,X). In conclusion, porcine parthenogenetic embryos were used to explore a putative imprinted genomic interval, and this study revealed that the porcine SGCE/PEG10 locus may not be clustered with a nearby gene locus. This cluster contains a single cis-acting DMR/ICR at the SGCE/PEG10 locus, and direct silencing of promoters by hypermethylation is the mechanism underlying genomic imprinting of SGCE/PEG10. Our findings provide a comprehensive overview of one of the important imprinted clusters of pig chromosome 9 that may be implicated in the regulation of pig growth and development. Additionally, our dataset and forthcoming publication will greatly facilitate analyses on other imprinted loci in the pig genome.
  38 in total

1.  Characterization of conserved and nonconserved imprinted genes in swine.

Authors:  Steve R Bischoff; Shengdar Tsai; Nicholas Hardison; Alison A Motsinger-Reif; Brad A Freking; Dan Nonneman; Gary Rohrer; Jorge A Piedrahita
Journal:  Biol Reprod       Date:  2009-07-01       Impact factor: 4.285

2.  Deletion of Peg10, an imprinted gene acquired from a retrotransposon, causes early embryonic lethality.

Authors:  Ryuichi Ono; Kenji Nakamura; Kimiko Inoue; Mie Naruse; Takako Usami; Noriko Wakisaka-Saito; Toshiaki Hino; Rika Suzuki-Migishima; Narumi Ogonuki; Hiromi Miki; Takashi Kohda; Atsuo Ogura; Minesuke Yokoyama; Tomoko Kaneko-Ishino; Fumitoshi Ishino
Journal:  Nat Genet       Date:  2005-12-11       Impact factor: 38.330

Review 3.  Genomic imprinting in mammals.

Authors:  Denise P Barlow; Marisa S Bartolomei
Journal:  Cold Spring Harb Perspect Biol       Date:  2014-02-01       Impact factor: 10.005

4.  Zac1 (Lot1), a potential tumor suppressor gene, and the gene for epsilon-sarcoglycan are maternally imprinted genes: identification by a subtractive screen of novel uniparental fibroblast lines.

Authors:  G Piras; A El Kharroubi; S Kozlov; D Escalante-Alcalde; L Hernandez; N G Copeland; D J Gilbert; N A Jenkins; C L Stewart
Journal:  Mol Cell Biol       Date:  2000-05       Impact factor: 4.272

5.  Genomic imprinting of PPP1R9A encoding neurabin I in skeletal muscle and extra-embryonic tissues.

Authors:  K Nakabayashi; S Makino; S Minagawa; A C Smith; J S Bamforth; P Stanier; M Preece; L Parker-Katiraee; T Paton; M Oshimura; P Mill; Y Yoshikawa; C C Hui; D Monk; G E Moore; S W Scherer
Journal:  J Med Genet       Date:  2004-08       Impact factor: 6.318

6.  The epsilon-sarcoglycan gene (SGCE), mutated in myoclonus-dystonia syndrome, is maternally imprinted.

Authors:  Monika Grabowski; Alexander Zimprich; Bettina Lorenz-Depiereux; Vera Kalscheuer; Friedrich Asmus; Thomas Gasser; Thomas Meitinger; Tim M Strom
Journal:  Eur J Hum Genet       Date:  2003-02       Impact factor: 4.246

7.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

8.  Effects of maternal nutrition on the expression of genomic imprinted genes in ovine fetuses.

Authors:  Jingyue Ellie Duan; Mingyuan Zhang; Kaleigh Flock; Sahar Al Seesi; Ion Mandoiu; Amanda Jones; Elizabeth Johnson; Sambhu Pillai; Maria Hoffman; Katelyn McFadden; Hesheng Jiang; Sarah Reed; Kristen Govoni; Steve Zinn; Zongliang Jiang; Xiuchun Cindy Tian
Journal:  Epigenetics       Date:  2018-09-21       Impact factor: 4.528

9.  Genome wide screening of candidate genes for improving piglet birth weight using high and low estimated breeding value populations.

Authors:  Lifan Zhang; Xiang Zhou; Jennifer J Michal; Bo Ding; Rui Li; Zhihua Jiang
Journal:  Int J Biol Sci       Date:  2014-02-07       Impact factor: 6.580

10.  metilene: fast and sensitive calling of differentially methylated regions from bisulfite sequencing data.

Authors:  Frank Jühling; Helene Kretzmer; Stephan H Bernhart; Christian Otto; Peter F Stadler; Steve Hoffmann
Journal:  Genome Res       Date:  2015-12-02       Impact factor: 9.043

View more
  4 in total

1.  Genomic Imprinting at the Porcine PLAGL1 Locus and the Orthologous Locus in the Human.

Authors:  Jinsoo Ahn; In-Sul Hwang; Mi-Ryung Park; Seongsoo Hwang; Kichoon Lee
Journal:  Genes (Basel)       Date:  2021-04-08       Impact factor: 4.096

2.  Loss of Monoallelic Expression of IGF2 in the Adult Liver Via Alternative Promoter Usage and Chromatin Reorganization.

Authors:  Jinsoo Ahn; Joonbum Lee; Dong-Hwan Kim; In-Sul Hwang; Mi-Ryung Park; In-Cheol Cho; Seongsoo Hwang; Kichoon Lee
Journal:  Front Genet       Date:  2022-07-22       Impact factor: 4.772

3.  Comparative maternal protein profiling of mouse biparental and uniparental embryos.

Authors:  Fumei Chen; Buguo Ma; Yongda Lin; Xin Luo; Tao Xu; Yuan Zhang; Fang Chen; Yanfei Li; Yaoyao Zhang; Bin Luo; Qingmei Zhang; Xiaoxun Xie
Journal:  Gigascience       Date:  2022-09-03       Impact factor: 7.658

4.  Genomic Imprinting at the Porcine DIRAS3 Locus.

Authors:  Jinsoo Ahn; In-Sul Hwang; Mi-Ryung Park; Seongsoo Hwang; Kichoon Lee
Journal:  Animals (Basel)       Date:  2021-05-03       Impact factor: 2.752

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.