| Literature DB >> 32077945 |
Antoine Molaro1, Harmit S Malik1,2, Deborah Bourc'his3.
Abstract
Transcriptional silencing of retrotransposons via DNA methylation is paramount for mammalian fertility and reproductive fitness. During germ cell development, most mammalian species utilize the de novo DNA methyltransferases DNMT3A and DNMT3B to establish DNA methylation patterns. However, many rodent species deploy a third enzyme, DNMT3C, to selectively methylate the promoters of young retrotransposon insertions in their germline. The evolutionary forces that shaped DNMT3C's unique function are unknown. Using a phylogenomic approach, we confirm here that Dnmt3C arose through a single duplication of Dnmt3B that occurred ∼60 Ma in the last common ancestor of muroid rodents. Importantly, we reveal that DNMT3C is composed of two independently evolving segments: the latter two-thirds have undergone recurrent gene conversion with Dnmt3B, whereas the N-terminus has instead evolved under strong diversifying selection. We hypothesize that positive selection of Dnmt3C is the result of an ongoing evolutionary arms race with young retrotransposon lineages in muroid genomes. Interestingly, although primates lack DNMT3C, we find that the N-terminus of DNMT3A has also evolved under diversifying selection. Thus, the N-termini of two independent de novo methylation enzymes have evolved under diversifying selection in rodents and primates. We hypothesize that repression of young retrotransposons might be driving the recurrent innovation of a functional domain in the N-termini on germline DNMT3s in mammals.Entities:
Keywords: DNA methylation; chromatin modifications; diversifying selection; gene conversion; retrotransposons
Mesh:
Substances:
Year: 2020 PMID: 32077945 PMCID: PMC7306680 DOI: 10.1093/molbev/msaa044
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.Dnmt3C duplicated in the last common ancestor of all muroids. (A) Schematic of the genomic locus encoding Dnmt3 genes in representative species of Glires. Estimated divergence times based on Timetree analyses (Hedges et al. 2015) are indicated at each major node. Dnmt3 genes (colors) and corresponding neighboring genes (gray) are indicated with arrows. “?” denotes incomplete assembly, whereas “X” symbols denote absence of coding sequence. (B) Maximum likelihood nucleotide phylogeny of Dnmt3 genes in Glires. Bootstrap values and average pairwise dN/dS are indicated for each clade. (C) Maximum likelihood phylogeny of all identified Dnmt3B and Dnmt3C genes. Incomplete sequences are indicated with “inc,” whereas cases where Dnmt3B and Dnmt3C orthologs from the same species unexpectedly group together are highlighted with “*.” Bootstrap support values >50% are reported. In addition included are Dnmt3Bs from rabbit (Oryctolagus cuniculus), marmot (Marmota marmota), 13-lined ground squirrel (Ictidomys tridecemlineatus), and 3 Mus species (mouse, Mus musculus; caroli, Mus caroli; spretus, Mus spretus) as well as selected Dnmt3As as an outgroup. Abbreviations and species names: Rat, Rattus norvegicus; M.B.mole.rat, mountain blind mole rat (Spalax judaei); D.mouse, deer mouse (Peromyscus maniculatus); C.hamster, Chinese hamster (Cricetulus griseus); F.vole, field vole (Microtus agrestis); B.vole, bank vole (Myodes glareolus); P.vole, prairie vole (Microtus ochrogaster).
. 2.Gene conversion between Dnmt3C and Dnmt3B in muroids. (A) Recombination map of Dnmt3C CDS. Breakpoints (arrowheads) identified by GARD (see Materials and Methods) and recombination segments (“A” through “D”) are indicated on the Dnmt3C CDS with its predicted protein domains. (B) Maximum likelihood phylogenies of recombination segments A and B. Bootstrap supports >50% are shown. (C) Average pairwise percent identities between muroid DNMT3C and DNMT3B proteins before and after the PWWP domain. (D) Map of the mouse Dnmt3B genomic locus with annotated exons (boxes). Exons missing in Dnmt3C are shown with dashed lines. Exons are colored according to the domain they encode: pink, N-terminus; gray, PWWP; black, ADD; purple, methyltransferase. Percent nucleotide identity with Dnmt3C is plotted (y axis) using mVISTA (see Materials and Methods). Recombination segments (top) and predicted domains (bottom) are also shown.
Summary of Selection Tests across Muroid Dnmt3 Genes.
| Seg. | nb. Species | Length (bp) | Tree Length | PAML—M7 vs. M8 | PAML—M8a vs. M8 | M(0) d | % Sites d | Sites BEB ≥ 90% | |
|---|---|---|---|---|---|---|---|---|---|
|
|
| 11 | 471 | 3.19 | 0.002 | 0.004 | 0.886 | 49 (1.62) | 54 (T), 57 (Q), 95 (P), 96 (L) |
|
| 8 | 135 | 1.35 | 0.509 | 0.965 | 0.176 | — | N/A | |
|
| 8 | 546 | 1.2 | 1.000 | 0.827 | 0.114 | — | N/A | |
|
| 10 | 666 | 1.46 | 1.000 | 0.907 | 0.184 | — | N/A | |
|
|
| 8 | 2,052 | 1.32 | 0.170 | 0.475 | 0.116 | 1 (1.59) | N/A |
|
|
| 9 | 2,718 | 0.83 | 0.823 | 0.463 | 0.022 | — | N/A |
|
|
| 9 | 1,218 | 2.17 | 0.546 | 0.732 | 0.271 | — | N/A |
Note.—Recombination segments of Dnmt3C were analyzed independently, whereas full-length sequences were used for other Dnmt3s. P values are for likelihood ratio tests between substitution models allowing or not allowing for positive selection using codeml (PAML). Colored boxes highlight P values <0.05. See text and Materials and Methods for details.
. 3.DNMT3C N-terminus is subject to positive selection. (A) Schematic representation of positive selection test results for all recombination segments of DNMT3C CDS. “***” denotes the finding of positive selection in segment A (see text and Materials and Methods for details), whereas “∅” indicates no support for positive selection. (B) Amino-acid alignments (positions 50–61 and 91–100) of muroid DNMT3Cs showing four positively selected sites identified with PAML (red arrowheads). Sequences are arranged according to segment A phylogeny with species names on the right. (C) Amino-acid logos of DNMT3C and DNMT3B around the positively selected sites (arrowheads). Backslashes indicate sequences not shown. The gray box denotes an alignment gap between DNMT3C and DNMT3B.
Summary of Selection Tests across Primate DNMT3s.
| Segment | nb. Species | Length (bp) | Tree Length | PAML—M7 vs. M8 | PAML—M8a vs. M8 | M(0) d | % Sites d | Sites BEB ≥ 90% | |
|---|---|---|---|---|---|---|---|---|---|
|
| Whole | 20 | 2,724 | 0.509 | 0.022 | ≤0.001 | 0.047 | 2.02 (1.8) | 66 (P), 81 (A) |
|
| Whole | 21 | 2,481 | 0.828 | 0.149 | 0.430 | 0.080 | 1.81 (1.7) | — |
|
| Whole | 20 | 1,155 | 1.7 | 0.018 | 0.124 | 0.204 | 5.29 (1.6) | — |
Note.—Codeml (PAML) analyses using the accepted species phylogeny. P values are for likelihood ratio tests between substitution models allowing or not allowing for positive selection. Colored boxes highlight P values <0.05.
. 4.Positive selection of N-terminal domain of primate DNMT3A. Amino-acid alignments (positions 58–90) of primate DNMT3As showing the two positively selected sites identified with PAML (arrowheads). Sequences are arranged according to the accepted species phylogeny with species names on the right.