| Literature DB >> 29668012 |
Gilberto Cavalheiro Vieira1,2, Marícia Fantinel D'Ávila3, Rebeca Zanini2, Maríndia Deprá1,4, Vera Lúcia da Silva Valente1,3,4.
Abstract
The DNA methyltransferase 2 (DNMT2) protein is the most conserved member of the DNA methyltransferase family. Nevertheless, its substrate specificity is still controversial and elusive. The genomic role and determinants of DNA methylation are poorly understood in invertebrates, and several mechanisms and associations are suggested. In Drosophila, the only known DNMT gene is Dnmt2. Here we present our findings from a wide search for Dnmt2 homologs in 68 species of Drosophilidae. We investigated its molecular evolution, and in our phylogenetic analyses the main clades of Drosophilidae species were recovered. We tested whether the Dnmt2 has evolved neutrally or under positive selection along the subgenera Drosophila and Sophophora and investigated positive selection in relation to several physicochemical properties. Despite of a major selective constraint on Dnmt2, we detected six sites under positive selection. Regarding the DNMT2 protein, 12 sites under positive-destabilizing selection were found, which suggests a selection that favors structural and functional shifts in the protein. The search for new potential protein partners with DNMT2 revealed 15 proteins with high evolutionary rate covariation (ERC), indicating a plurality of DNMT2 functions in different pathways. These events might represent signs of molecular adaptation, with molecular peculiarities arising from the diversity of evolutionary histories experienced by drosophilids.Entities:
Year: 2018 PMID: 29668012 PMCID: PMC5913717 DOI: 10.1590/1678-4685-GMB-2017-0056
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Estimates of evolutionary divergence between sequence pairs of different Drosophilidae species groups. The numbers of amino acid differences per site from the average over all sequence pairs between groups are given below the diagonal. The measures of nucleotide evolutionary divergence are provided above the diagonal. The p-distances are given in percentages.
| Groups | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 |
| 25.31 | 26.39 | 12.89 | 10.70 | 26.08 | 30.53 | 23.66 | 29.58 | 23.87 | 15.66 | 22.22 | 33.33 | |
| 2 |
| 23.32 | 23.30 | 24.23 | 25.77 | 26.77 | 30.83 | 15.12 | 29.32 | 16.62 | 25.62 | 16.20 | 32.18 | |
| 3 |
| 24.22 | 18.83 | 25.31 | 26.13 | 26.23 | 28.62 | 22.79 | 27.11 | 23.15 | 26.39 | 20.37 | 33.95 | |
| 4 |
| 10.09 | 21.08 | 23.54 | 15.15 | 25.23 | 29.27 | 22.97 | 26.72 | 23.46 | 13.87 | 21.14 | 34.07 | |
| 5 |
| 7.92 | 24.81 | 23.92 | 11.29 | 25.36 | 30.54 | 23.56 | 29.15 | 24.74 | 17.16 | 22.89 | 32.95 | |
| 6 |
| 17.94 | 20.63 | 21.08 | 17.60 | 18.76 | 29.89 | 25.69 | 28.50 | 27.93 | 25.31 | 25.08 | 30.83 | |
| 7 |
| 25.16 | 25.41 | 24.22 | 24.63 | 25.95 | 23.56 | 29.87 | 23.72 | 29.78 | 29.93 | 29.24 | 32.72 | |
| 8 |
| 20.78 | 15.70 | 19.13 | 18.68 | 21.72 | 19.58 | 25.29 | 28.46 | 15.35 | 24.34 | 15.64 | 33.64 | |
| 9 |
| 25.56 | 26.01 | 22.57 | 24.29 | 24.81 | 23.02 | 19.11 | 25.26 | 28.91 | 28.10 | 26.23 | 32.61 | |
| 10 |
| 21.82 | 14.95 | 18.09 | 19.88 | 22.77 | 20.40 | 25.15 | 11.61 | 25.66 | 25.31 | 18.57 | 33.02 | |
| 11 |
| 12.00 | 23.09 | 23.99 | 10.48 | 12.86 | 18.89 | 25.26 | 20.67 | 24.96 | 21.56 | 22.69 | 34.47 | |
| 12 |
| 18.39 | 11.66 | 15.25 | 15.92 | 18.98 | 15.70 | 22.22 | 10.31 | 21.52 | 12.41 | 17.94 | 30.56 | |
| 13 |
| 31.17 | 31.17 | 28.48 | 30.72 | 31.76 | 27.91 | 29.55 | 31.84 | 27.20 | 30.19 | 31.22 | 28.92 |
Figure 1Evolutionary rate from all Drosophilidae DNMT2 complete sequences, Sophophora and Drosophila subgenus separately. The rates are scaled such that the average evolutionary rate across all sites is 1. All positions containing gaps and missing data are eliminated.
Figure 2Bayesian phylogenetic inference of the Dnmt2 gene in Drosophilidae species based on amino acid sequences alignment. Tree generated using the JTT model with a gamma distribution. Sequence of Spodoptera frugiperda was used as outgroup.
Figure 3Multiple alignment of DNMT2 sequences used in the positive selection analysis with no gaps. Black boxes represent conserved 100% in all sequences, dark grey 80% and light grey 60%. Red arrows indicate sites under positive selection and purple arrows sites under positive-destabilizing selection. Roman letters indicate the conserved (cytosine-5) MT2 motifs. TRD indicates the target recognition domain.
Figure 4ML phylogenetic tree of Dnmt2 used in analysis for detection of selection. The dotted frame highlights the foreground branch. The genetic distances are in nucleotide substitution per codon (number below the branches).
Parameters estimates and log likelihood values for branch-site M1a, MA and restricted MA model.
| Model | Parameters | lnL | ||
|---|---|---|---|---|
| M1a | ω0 = 0.04651 | p0 = 0.86743 | -11 046.650097 | |
|
| ω1 = 1.00000 | p1 = 0.13257 | ||
| Restricted MA (ω2 = 1 fixed) | ω0 =0.04520 | ω2 = 1.000 | -11 039.638028 | |
|
| p0 = 0.82646 | p1 = 0.12587 | (p2a + p2b) = 0.04767 | |
| MA | ω0 = 0.04596 | ω2 = 50.90099 | -11 032.575669 | |
|
| p0 = 0.84965 | p1 = 0.12542 | (p2a + p2b) = 0.02493 |
Comparison of null and alternative models by LRT and positively selected sites estimated by Bayes Empirical Bayes.
| Test | Contrast | LRT | D.F. | χ2- Probability | Positively selected sites |
|---|---|---|---|---|---|
|
| M1a MA | 25.20 | 2 |
| |
|
| Restricted MA MA | 12.41 |
|
| 85, 94, 109, 192, 257, 311 |
Amino acid physicochemical properties under positive destabilizing selection in DNMT2.
| Physicochemical property | Goodness-of-fit (neutral expectation) | radical change category (6, 7 and 8) | z-score |
|---|---|---|---|
| Alpha-helical tendencies ( | 33.339 | 8 | 4.292 |
| Average number of surrounding residues
( | 119.136 | 6.442 | |
| Beta-structure tendencies ( | 30.909 | 2.804 | |
| Bulkiness ( | 35.052 | 3.104 | |
| Buriedness ( | 54.407 | 2.580 | |
| Chromatographic index ( | 107.109 | 3.190 | |
| Coil tendencies ( | 18.839 | - | |
| Composition ( | 43.179 | 3.898 | |
| Compressibility ( | 22.548 | - | |
| Equilibrium constant (ionization of COOH)
( | 79.39 | 8 | 2.828 |
| Helical contact area ( | 81.079 | 3.732 | |
| Hydropathy ( | 83.046 | 3.885 | |
| Isoelectric point ( | 45.283 | 4.261 | |
| Long-range non-bonded energy ( | 78.459 | 5.712 | |
| Mean r.m.s. fluctuation displacement
( | 117.708 | 5.736 | |
| Molecular volume ( | 77.479 | 3.151 | |
| Molecular weight ( | 69.846 | 3.577 | |
| Normalized consensus hydrophobicity
( | 57.459 | 2.062 | |
| Partial specific volume ( | 61.97 | 2.986 | |
| Polar requirement ( | 29.801 | 7 | 2.341 |
| Polarity ( | 59.713 | 2.522 | |
| Power to be at the C-terminal ( | 118.516 | 6 | 5.758 |
| Power to be at the middle of alpha-helix
( | 49.271 | 7 | 3.657 |
| Power to be at the N-terminal ( | 32.449 | 7 | 3.159 |
| Refractive index ( | 47.548 | 3.632 | |
| Short and medium range non-bonded energy (Esm) | 67.785 | 3.743 | |
| Solvent accessible reduction ratio (Ra) | 83.159 | 2.942 | |
| Surrounding hydrophobicity ( | 69.397 | 2.345 | |
| Thermodynamic transfer hydrohphobicity
( | 50.917 | 2.785 | |
| Total non-bonded energy ( | 109.548 | 5.654 | |
| Turn tendencies ( | 109.136 | 6 | 6.198 |
p < 0.05
p < 0.01
p < 0.001
Figure 5DNMT2 structure predictions. (a) Sliding window plots of the z-scores of radically changed properties showing regions under positive-destabilizing selection in Dnmt2. Dashed horizontal line indicates the Bonferroni-corrected significant limit (z-score = 3.09, p < 0.001). Alpha-helical tendencies (P ) – red; Equilibrium constant (ionization of COOH) (pK’) – orange; Polar requirement (P ) – green; Power to be at the C-terminal (α ) – blue; Power to be at the middle of alpha-helix (αm) – pink; Power to be at the N-terminal (αn) – black; Turn tendencies (Pt) – purple. (b) Structure of sfDNMT2 (PDVB: 4H0N) (Li ), ribbon (up) and surface (down) representations. The respective sites under positive selection in drosophilids are represented in Spodoptera frugiperda structure. Sites under positive selection colored red, positive-destabilizing selection purple, motif CFT from TRD and the catalytic cysteine yellow and cofactor s-adenosyl methionine (SAM) cyan.
Gene annotation and sequence location from STRING and ERC webserver searching from D. melanogaster genome. Genes with high ERC value (> 0.400) and p-value < 0.05 are colored gray.
| Gene | Annotation | Description |
|---|---|---|
| Atac3 | CG32343 | Contributes to histone acetyltransferase activity. Involved chromatin remodeling. |
| Cap | CG18408 | Interacts selectively and non-covalently with vinculin, a protein found in muscle, fibroblasts, and epithelial cells that binds actin and appears to mediate attachment of actin filaments to integral proteins of the plasma membrane. |
| CG10262 | CG10262 | Proliferating Cell Nuclear Antigen (PCNA) domain. These polymerase processivity factors play a role in DNA replication and repair. |
| CG13035 | CG44836 | Uncharacterized protein involved in sensorial perception of pain. |
| CG14618 | CG14618 | Belongs to the class IV-like SAM-binding methyltransferase superfamily. tRNA (guanine(9)-N(1))-methyltransferase TRM10. Enzyme catalyzes the conversion of a guanosine residue to N1-methylguanine in position 37, next to the anticodon, in tRNA. |
| CG14906 | CG14906 | N-6 adenine-specific DNA methylases. Specifically methylate the amino group at the C-6 position of adenines in DNA. |
| CG16863 | CG16863 | The protein has zinc finger, BED-type, which is thought to be involved in chromatin insulation at BEAF (boundary element-associated factor) and DREF, a transcriptional regulator. |
| CG17124 | CG17124 | PKC-activated protein phosphatase-1 inhibitor. Stops, prevents or reduces the activity of a protein phosphatase, an enzyme that hydrolyzes phosphate groups from phosphorylated proteins. |
| CG6712 | CG6712 | Probably RNA binding inferred from sequence or
structural similarity with |
| CG7470 | CG7470 | Predicted gamma-glutamyl phosphate reductase with delta1-pyrroline-5-carboxylate synthetase activity, glutamate 5-kinase activity. An inner mitochondrial membrane enzyme, is essential to the de novo synthesis of the amino acids proline and arginine. Involved in epithelium development and germarium-derived egg chamber formation. |
| CTCF | CG8591 | CTCF is a ubiquitous transcription factor that binds to insulators and domain boundaries. It mediates insulator function and blocks enhancers by binding to Cp190. It contributes to long-range chromatin interaction, organizes chromatin domain boundaries and coordinates nuclear architecture. |
| Dnmt2 | CG10692 | Methyltransferase 2 is a (cytosine-5) DNA/tRNA methyltransferase. It is involved in regulation of gene expression by cytosine-5 methylation. The major protein role is the modifications that protects tRNAs against endonucleolytic cleavage and contributes to stress resistance, protein translation and small RNA-mediated gene regulation. |
| Eggless | CG12196 | Belongs to the class V-like SAM-binding methyltransferase superfamily. Histone-lysine methyltransferase family. Involved in negative regulation of gene expression. |
| Eno | CG17654 | Responsible for the catalysis of the conversion of 2-phosphoglycerate (2-PG) to phosphoenolpyruvate (PEP), the ninth and penultimate step of glycolysis |
| Haywire | CG8019 | Helicase Ercc3, core RNA polymerase binding transcription factor activity. Involved in regulation of alternative mRNA splicing, via spliceosome, cell proliferation and growth. |
| Homer | CG11324 | Involved in the positive regulation of circadian sleep/wake cycle, sleep; response to ethanol, behavioral response to ethanol, regulation of locomotion and adult behavior. Activity in stress response. |
| hop | CG1594 | Signal transduction-non receptor tyrosine kinase. Members of the Janus kinase (JAK) family of cytoplasmic protein tyrosine kinases physically associate with ligand-bound receptors. The JAK-STAT pathway regulates the expression of pair rule gene even-skipped (a transcriptional repressor of a number of genes) early in embryogenesis. |
| Hsp22 | CG4460 | A key player in cell-protection mechanisms against oxidative injuries and aging in Drosophila. Activated in late third-instar larvae of Drosophila melanogaster in the absence of heat stress |
| MBD-like | CG8208 | Methyl Binding Protein 2/3, a co-repressor and an integral component of the nucleosome remodelling and deacetylase (NuRD) complex. Negative regulation of transcription, involved in organism development |
| mei-S332 | CG5303 | Acts to maintain sister-chromatid cohesion before anaphase II of meiosis in both males and females. |
| mms4 | CG12936 | Methyl methanesulfonate sensitivity 4 is the non-catalytic subunit of the mus81-mms4 structure-selective endonuclease that functions in DNA repair. |
| mus209 | CG9193 | Belongs to the PCNA family. Involved in eggshell chorion gene amplification, DNA replication, mismatch repair, neurogenesis and antimicrobial humoral response. |
| Nsun2 | CG6133 | tRNA (cytosine-5-)-methyltransferase activity. |
| Orc2 | CG3041 | Origin recognition complex subunit 2. It is involved in eggshell chorion gene amplification, chromatin silencing, cell proliferation, DNA replication initiation, chromosome condensation and neurogenesis. |
| Pnt | CG17077 | It is a sequence-specific DNA binding transcription factor activity, repressing transcription factor binding, involved in positive regulation of transcription. It is involved in organism developmental process, open tracheal system development, post-embryonic organ morphogenesis, regulation of developmental process, sensory organ development, cardiovascular system development, regulation of RNA metabolic process, positive regulation of cell proliferation, anterior/posterior axis specification, multi-organism reproductive process, muscle cell differentiation, compound eye photoreceptor development, regulation of neurogenesis. |
| Rel | CG11992 | Relish is a transcription factor and the downstream component of the Immune Deficiency pathway, which regulates the antibacterial response and other less characterized cellular processes. |
| RhoGEF4 | CG8606 | Rho family small GTPases act as molecular switches that regulate neuronal morphogenesis, including axon growth and guidance, dendritic spine formation, and synapse formation. These proteins are positively regulated by guanine nucleotide exchange factors (GEFs) of the Dbl family. Findings suggest that DRhoGEF4 may participate in cytoskeleton-related cellular events by specifically activating RhoA in neuronal morphogenesis. |
| Rpd3 | CG7471 | Histone deacetylase 1. Catalyzes the deacetylation of lysine residues on the N-terminal part of the core histones (H2A, H2B, H3 and H4). Histone deacetylation may constitute a tag for epigenetic repression and plays an important role in transcriptional regulation, cell cycle progression and developmental events. For instance, deacetylation of histone H3 may be a prerequisite for the subsequent recruitment of the histone methyltransferase Su(var)3-9 to histones. Involved in position-effect variegation (PEV). |
| Rpp20 | CG33931 | A subunit of the RNase P and RNase MRP holoenzymes, has interaction with the Drosophila SMN protein. Immunofluorescence results indicate that Rpp20 is diffusely distributed throughout the cytoplasm with higher concentration observed in the nucleus. However, in response to stress, SMN forms aggregates and redistributes Rpp20 into punctuated cytoplasmic SMN granules. |
| Sna | CG3956 | Snail is a transcription factor that contributes to embryonic mesoderm development, epithelial to mesenchymal transition and asymmetric cell division. |
| Su(var)2-5 | CG8409 | Suppressor of variegation 205 is a heterochromatin protein associated with the pericentric heterochromatin and telomeres in Drosophila. It is involved in the positive autoregulatory expression and can bind directly to nucleosomes. |
| Su(var)2-10 | CG8068 | It is responsible for establishing and maintaining chromosome organization in interphase nuclei, promoting chromosome structure and function. |
| Su(var)3-3 | CG17149 | Probable histone demethylase that specifically demethylates ‘Lys-4’ of histone H3, a specific tag for epigenetic transcriptional activation, thereby acting as a corepressor. Required for heterochromatic gene silencing |
| Su(var)3-9 | CG43664 | Histone methyltransferase that specifically trimethylates ‘Lys-9’ of histone H3 using monomethylated H3 ‘Lys-9’ as substrate. H3 ‘Lys-9’ trimethylation represents a specific tag for epigenetic transcriptional repression by recruiting Su(var)205/HP1 to methylated histones. Mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric regions. Involved in heterochromatic gene silencing including the modification of position-effect-variegation. |
| Tet | CG9973 | Ten-Eleven Translocation (TET) family protein. Involved in positive regulation of DNA demethylation, inducing positive regulation of transcription from RNA polymerase II promoter. |
| Thor | CG8846 | Eukaryotic translation initiation factor 4E binding protein, controlled by tor. It contributes to translation regulation, response to environmental stress and cell growth regulation. |
ERC values between potential DNMT2 protein-protein partners. The matrix shows all pairwise ERC values between genes below the diagonal and respectively p-values above the diagonal. Cells are shaded red according to the intensity of their deviation from the null expectation.
| Protein | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | pnt | N/A | 0.0010 | 0.1445 | 0.0781 | 0.4435 | 0.1082 | 0.0950 | 0.1429 | 0.3571 | 0.1486 | 0.1924 | 0.3492 | 0.0145 | 0.4435 | 0.0207 | 0.0170 | 0.0241 | 0.0760 |
| 2 | Su(var)2-10 | 0.811 | N/A | 0.0578 | 0.0168 | 0.0143 | 0.0868 | 0.0891 | 0.0912 | 0.1791 | 0.1679 | 0.3720 | 0.5330 | 0.0185 | 0.0912 | 0.0493 | 0.0572 | 0.0383 | 0.2213 |
| 3 | CG14618 | 0.375 | 0.480 | N/A | 0.0093 | 0.0398 | 0.0541 | 0.0117 | 0.0249 | 0.0220 | 0.1015 | 0.1111 | 0.1780 | 0.0123 | 0.0732 | 0.0489 | 0.1203 | 0.0775 | 0.4843 |
| 4 | CG14906 | 0.473 | 0.597 | 0.731 | N/A | 0.0089 | 0.0125 | 0.0025 | 0.0008 | 0.0387 | 0.0459 | 0.0879 | 0.2154 | 0.0266 | 0.1378 | 0.1770 | 0.1251 | 0.0431 | 0.4492 |
| 5 | CG32343 | 0.000 | 0.608 | 0.593 | 0.782 | N/A | 0.0032 | 0.0070 | 0.0057 | 0.1500 | 0.0492 | 0.1193 | 0.3912 | 0.0180 | 0.0818 | 0.2100 | 0.1325 | 0.1437 | 0.3912 |
| 6 | CTCF | 0.426 | 0.429 | 0.555 | 0.758 | 0.801 | N/A | 0.0004 | 0.0032 | 0.0435 | 0.0238 | 0.1094 | 0.1260 | 0.0641 | 0.1074 | 0.1074 | 0.1087 | 0.0153 | 0.1688 |
| 7 | CG16863 | 0.447 | 0.424 | 0.713 | 0.844 | 0.763 | 0.932 | N/A | 0.0009 | 0.0304 | 0.0262 | 0.0448 | 0.0905 | 0.0267 | 0.0693 | 0.0804 | 0.0794 | 0.0165 | 0.2525 |
| 8 | sna | 0.377 | 0.421 | 0.650 | 0.874 | 0.773 | 0.833 | 0.912 | N/A | 0.0149 | 0.1107 | 0.0146 | 0.0721 | 0.0278 | 0.0655 | 0.0518 | 0.0411 | 0.0127 | 0.3520 |
| 9 | mei-S332 | 0.087 | 0.311 | 0.662 | 0.650 | 0.386 | 0.643 | 0.698 | 0.796 | N/A | 0.1072 | 0.0903 | 0.0312 | 0.0248 | 0.4685 | 0.0943 | 0.1754 | 0.0643 | 0.5163 |
| 10 | homer | 0.370 | 0.323 | 0.457 | 0.622 | 0.587 | 0.708 | 0.713 | 0.524 | 0.457 | N/A | 0.0026 | 0.0788 | 0.0196 | 0.1536 | 0.1112 | 0.0661 | 0.0397 | 0.0903 |
| 11 | Rpp20 | 0.307 | 0.109 | 0.441 | 0.513 | 0.440 | 0.506 | 0.651 | 0.797 | 0.489 | 0.777 | N/A | 0.0127 | 0.0186 | 0.1018 | 0.0495 | 0.0950 | 0.0703 | 0.1915 |
| 12 | RhoGEF4 | 0.097 | -0.025 | 0.337 | 0.313 | 0.000 | 0.480 | 0.541 | 0.602 | 0.638 | 0.498 | 0.716 | N/A | 0.0322 | 0.0241 | 0.0250 | 0.1788 | 0.1488 | 0.1462 |
| 13 | Mt2 | 0.644 | 0.590 | 0.711 | 0.693 | 0.699 | 0.589 | 0.712 | 0.741 | 0.663 | 0.650 | 0.680 | 0.577 | N/A | 0.0179 | 0.0049 | 0.0339 | 0.0392 | 0.1800 |
| 14 | Orc2 | 0.000 | 0.421 | 0.512 | 0.425 | 0.509 | 0.508 | 0.585 | 0.616 | 0.000 | 0.375 | 0.470 | 0.609 | 0.686 | N/A | 0.0044 | 0.0428 | 0.4939 | 0.1165 |
| 15 | hay | 0.609 | 0.497 | 0.572 | 0.369 | 0.297 | 0.508 | 0.563 | 0.653 | 0.481 | 0.440 | 0.582 | 0.605 | 0.771 | 0.741 | N/A | 0.0028 | 0.0082 | 0.0493 |
| 16 | mms4 | 0.631 | 0.481 | 0.426 | 0.445 | 0.418 | 0.507 | 0.565 | 0.686 | 0.344 | 0.521 | 0.483 | 0.313 | 0.622 | 0.542 | 0.831 | N/A | 0.0041 | 0.0273 |
| 17 | hop | 0.596 | 0.524 | 0.503 | 0.629 | 0.399 | 0.747 | 0.762 | 0.805 | 0.540 | 0.588 | 0.528 | 0.351 | 0.602 | 0.000 | 0.774 | 0.845 | N/A | 0.0940 |
| 18 | mus209 | 0.476 | 0.264 | 0.020 | 0.052 | 0.000 | 0.409 | 0.295 | 0.185 | -0.010 | 0.479 | 0.336 | 0.355 | 0.349 | 0.397 | 0.621 | 0.703 | 0.551 | N/A |