| Literature DB >> 30239702 |
Hyeonsoo Jeong1, Xin Wu1, Brandon Smith1, Soojin V Yi1.
Abstract
Recent genome-wide DNA methylation analyses of insect genomes accentuate an intriguing contrast compared with those in mammals. In mammals, most CpGs are heavily methylated, with the exceptions of clusters of hypomethylated sites referred to as CpG islands. In contrast, DNA methylation in insects is localized to a small number of CpG sites. Here, we refer to clusters of methylated CpGs as "methylation islands (MIs)," and investigate their characteristics in seven hymenopteran insects with high-quality bisulfite sequencing data. Methylation islands were primarily located within gene bodies. They were significantly overrepresented in exon-intron boundaries, indicating their potential roles in splicing. Methylated CpGs within MIs exhibited stronger evolutionary conservation compared with those outside of MIs. Additionally, genes harboring MIs exhibited higher and more stable levels of gene expression compared with those that do not harbor MIs. The effects of MIs on evolutionary conservation and gene expression are independent and stronger than the effect of DNA methylation alone. These results indicate that MIs may be useful to gain additional insights into understanding the role of DNA methylation in gene expression and evolutionary conservation in invertebrate genomes.Entities:
Mesh:
Year: 2018 PMID: 30239702 PMCID: PMC6195173 DOI: 10.1093/gbe/evy203
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Contrasting methylation landscapes found in honey bees and humans. Methylated CpGs are sparse but clustered in honey bees and other hymenopteran insects. These “Methylation Islands” are ∼250 bp in length and stand out in the otherwise lowly methylated insect genomes. In contrast, the human methylation landscape is heavily methylated throughout with breaks of hypomethylated CpG islands that are typically ∼1 kb in length.
Genome Composition of the Species Used in This Study Along with Methylation Statistics
| Species | Genome Size (Mb) | # Protein-Coding Genes | # of mCGs (% of all CGs) | Avg. Fractional Methylation of mCGs |
|---|---|---|---|---|
| 234.07 | 15,314 | 78,846 (0.78%) | 0.584 | |
| 232.68 | 11,042 | 85,746 (0.84%) | 0.635 | |
| 294.46 | 11,838 | 112,212 (0.53%) | 0.662 | |
| 295.78 | 13,354 | 114,261 (0.85%) | 0.737 | |
| 211.21 | 9,876 | 15,744 (0.24%) | 0.386 | |
| 396.02 | 14,451 | 157,829 (0.98%) | 0.526 | |
| 196.22 | 13,200 | 60,298 (0.60%) | 0.345 |
Summary Statistics of MIs Detected in Each Species
| # of predicted MIs | 5,126 | 6,327 | 8,375 | 9,644 | 1,342 | 10,574 | 4,889 |
| # of mCGs in MIs (% of total mCGs) | 29,254 | 47,804 | 78,490 | 85,007 | 8,293 | 112,819 | 30,141 |
| Total MI length (bp) | 1,043,247 | 1,803,969 | 2,969,693 | 3,355,006 | 210,235 | 4,291,930 | 1,136,846 |
| Avg. MI length (bp) | 213.15 | 286.12 | 355.59 | 348.88 | 157.66 | 406.89 | 233.53 |
| Avg. mCG density per MI (# of mCGs/MI length) | 0.03 | 0.02 | 0.03 | 0.02 | 0.07 | 0.03 | 0.03 |
| # of MIs overlapping with genes | 4,958 | 6,082 | 7,845 | 9,079 | 1,020 | 9,843 | 4,603 |
| # of MIs overlapping exclusively with genes | 4,788 | 5,961 | 7,606 | 8,873 | 1,010 | 9,477 | 4,469 |
| # of MIs overlapping with exons/exclusively with exons (% of all MIs) | 4,830/3,117 | 5,763/2,634 | 7,319/2,704 | 8,184/3,381 | 741/524 | 8,839/3,412 | 4,433/2,926 |
| # of MIs overlapping with introns/exclusively with introns (% of all MIs) | 1,794/178 | 3,404/382 | 5,011/592 | 5,739/1,206 | 478/273 | 6,300/1,160 | 1,881/242 |
| # of MIs overlapping with exon–intron boundaries/only one exon–intron boundary (% of all MIs) | 1,637/705 | 3,051/1,312 | 4,461/1,690 | 4,672/1,635 | 205/92 | 5,252/2,123 | 1,649/611 |
| # of MIs overlapping with promoters (% of all MIs) | 172 | 117 | 146 | 199 | 30 | 308 | 213 |
Defined as the region spanning the transcript start site to the transcription termination site.
. 2.—Characterization of MIs across genic regions in seven hymenopteran species. (A) Box plots displaying the coverage of MIs across different types of genic regions (gene body, introns, and exons). (B) Violin plots comparing the relative position of MIs with regards to the TSS for genes with MIs.
. 3.—MIs are enriched in evolutionarily conserved genes. (A) Bar plots illustrating the number of genes in each gene type (all genes [AG], complete orthologous genes [CO], incomplete orthologous genes [IO], and unique genes in each species [UG]). (B) Bar plots indicating the proportion of genes having different types of methylation features.
Statistical Significance of Differences between Odds Ratios (OR) of Genes with and without MI Using Z Approximation
| Species | OR of Genes w/MI | OR of Genes w/o MI but w/mCG | Difference of Log. OR (δ) | SE(δ) | |
|---|---|---|---|---|---|
| 2.31 | 1.65 | 0.34 | 0.07 | 3.8E-07 | |
| 2.79 | 1.33 | 0.74 | 0.07 | 2.2E-16 | |
| 3.23 | 0.93 | 1.25 | 0.08 | 2.2E-16 | |
| 3.29 | 1.14 | 1.06 | 0.09 | 2.2E-16 | |
| 1.44 | 1.23 | 0.16 | 0.10 | 5.7E-02 | |
| 3.84 | 1.19 | 1.17 | 0.07 | 2.2E-16 | |
| 2.97 | 1.74 | 0.53 | 0.08 | 1.2E-11 |
Note.—Odds ratios were calculated and summarized in supplementary table S1, Supplementary Material online.
Odds ratio of the number of genes with MIs and the number of the remaining genes between CO and IO types, respectively, were tested using Fisher’s exact test.
Odds ratio of the number of genes without MIs but with mCGs and the number of the remaining genes between CO and IO types, respectively, were tested using Fisher’s exact test.
. 4.—Effect of MI and DNA methylation on amino acid conservation. Linear mixed models were fitted to estimate the conservation score of amino acid sites using amino acid location (outside-MI or inside-MI) and the existence of corresponding DNA methylation sites (nonmethylated CpGs or methylated CpGs) as the main factors with the interaction and random factors being gene and species information, respectively. The conservation score of each amino acid position is quantified based on the Jensen–Shannon (JS) divergence.
. 5.—Gene expression levels between MI- and non-MI-genes in each gene type. The y-axis is log2-transformed gene expression level (normalized by gene length). The x-axis represents gene types of all genes (AG), complete orthologous genes (CO), incomplete orthologous genes (IO), and unique genes in each species (UG).
Statistical Significance between Pairwise Correlation Coefficients of “Same State MI” and “Different State MI” Genes
| Same State MIs | Different State MIs | ||||
|---|---|---|---|---|---|
| Spearman’s ρ | Number of Genes | Spearman’s ρ | Number of Genes | ||
| 0.607 | 3,590 | 0.557 | 1,768 | 9.30E-03 | |
| 0.374 | 3,587 | 0.301 | 1,779 | 4.50E-03 | |
| 0.468 | 3,927 | 0.351 | 1,431 | 2.20E-16 | |
Note.—The correlation coefficients were estimated between two species’ gene expression level using Spearman’s rho correlations.
. 6.—Comparison of average expression levels between exons located in MIs (MI-exon) and exons located outside of MIs (non-MI-exon) within the same gene. For complete orthologous genes (CO), incomplete orthologous genes (IO), and unique genes in each species (UG), we calculated the expression fold change (log2 transformed) between MI-exons and non-MI-exons within the same gene, where each dot represents one gene. A locally weighted smoothing line is included to map the general direction bias of relative expression change; when the line is >0 it indicates higher expression in MI-exons compared with non-MI-exons and vice versa for when the line dips <0. We repeated this analysis for three species, (A) Apis mellifera, (I) Nasonia vitripennis, and (C) Trichogramma pretiousum.
Summary Statistics of DNA Methylation and MIs in Control and dnmt3 Gene Knockdown Honey Bees
| Control | dnmt3 Gene Knockdown | |
|---|---|---|
| # total mCG sites | 78,846 | 75,897 |
| # genes with mCG sites | 6,308 | 6,277 |
| Avg. # of mCGs per gene | 12.3 | 11.9 |
| # MIs (MI genes) | 5,126 (3,280) | 4,946 (3,207) |
| # MIs only present in group (MI genes) | 501 (222) | 372 (147) |
| # MIs at exon–intron boundary only present in group | 116 | 38 |