| Literature DB >> 34480573 |
Juliet M Wong1, Jose M Eirin-Lopez1.
Abstract
The methyltransferase-like (METTL) proteins constitute a family of seven-beta-strand methyltransferases with S-adenosyl methionine-binding domains that modify DNA, RNA, and proteins. Methylation by METTL proteins contributes to the epigenetic, and in the case of RNA modifications, epitranscriptomic regulation of a variety of biological processes. Despite their functional importance, most investigations of the substrates and functions of METTLs within metazoans have been restricted to model vertebrate taxa. In the present work, we explore the evolutionary mechanisms driving the diversification and functional differentiation of 33 individual METTL proteins across Metazoa. Our results show that METTLs are nearly ubiquitous across the animal kingdom, with most having arisen early in metazoan evolution (i.e., occur in basal metazoan phyla). Individual METTL lineages each originated from single independent ancestors, constituting monophyletic clades, which suggests that each METTL was subject to strong selective constraints driving its structural and/or functional specialization. Interestingly, a similar process did not extend to the differentiation of nucleoside-modifying and protein-modifying METTLs (i.e., each METTL type did not form a unique monophyletic clade). The members of these two types of METTLs also exhibited differences in their rates of evolution. Overall, we provide evidence that the long-term evolution of METTL family members was driven by strong purifying selection, which in combination with adaptive selection episodes, led to the functional specialization of individual METTL lineages. This work contributes useful information regarding the evolution of a gene family that fulfills a variety of epigenetic functions, and can have profound influences on molecular processes and phenotypic traits.Entities:
Keywords: METTL; epigenetics; metazoan; methyltransferase; phylogenetics; selection
Mesh:
Substances:
Year: 2021 PMID: 34480573 PMCID: PMC8662637 DOI: 10.1093/molbev/msab267
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
METTL Genes Separated by Type (i.e., DNA/RNA-modifying, protein-modifying, and unknown).
| Gene Name | NCBI ID | Aliases | Preferred Name | Target |
|---|---|---|---|---|
| DNA/RNA-modifying METTLs | ||||
| METTL1 | 4234 | C12orf1, TRM8, TRMT8, YDL201w | tRNA (guanine-N(7)-)-methyltransferase | m7G (tRNA, mRNA, miRNA) |
| METTL2 (METTL2A/B) | 339175/55798 | METL, PSENIP1 | tRNA N(3)-methylcytidine methyltransferase METTL2; methyltransferase-like protein 2 | m3C (tRNA) |
| METTL3 | 56339 | IME4, M6A, MT-A70, Spo8, hMETTL3 | N6-adenosine-methyltransferase catalytic subunit | m6A (mRNA) |
| METTL4 | 64863 | HsT661 | N(6)-adenine-specific methyltransferase METTL4 | m6A (DNA)/m6Am (snRNA) |
| METTL5 | 29081 | HSPC133, MRT72 | rRNA N6-adenosine-methyltransferase METTL5 | m6A (rRNA) |
| METTL6 | 131965 | None | tRNA N(3)-methylcytidine methyltransferase METTL6 | m3C (tRNA) |
| METTL8 | 79828 | TIP | mRNA N(3)-methylcytidine methyltransferase METTL8 | m3C (mRNA) |
| METTL14 | 57721 | None | N6-adenosine-methyltransferase noncatalytic subunit | m6A (mRNA) |
| METTL15 | 196074 | METT5D1 | 12S rRNA N4-methylcytidine (m4C) methyltransferase | m4C (rRNA) |
| METTL16 | 79066 | METT10D | RNA N6-adenosine-methyltransferase METTL16 | m6A (ncRNA, pre-mRNA, mRNA) |
| METTL19 | 152992 | TRMT44, C4orf23, TRM44 | Probable tRNA (uracil- | 2′-O-methyluridine (tRNA) |
| METTL25B | 51093 | RRNAD1, C1orf66, CGI-41 | Ribosomal RNA adenine dimethylase domain containing 1 | m6A (rRNA) |
|
| ||||
| Protein-modifying METTLs | ||||
| METTL9 | 51108 | CGI-81, DREV, DREV1, PAP1 | Methyltransferase-like protein 9 | Unknown |
| METTL10 | 399818 | EEF1AKMT2, C10orf138, Efm4 | EEF1A lysine methyltransferase 2 | Lysine (EEF1A) |
| METTL11A | 28989 | NTMT1, AD-003, C9orf32, HOMT1A, NRMT, NRMT1, NTM1A | N-terminal Xaa-Pro-Lys N-methyltransferase 1 | N-terminal |
| METTL11B | 149281 | C1orf184, HOMT1B, NTM1B | Alpha N-terminal protein methyltransferase 1B | N-terminal |
| METTL12 | 751071 | CSKMT, CS-KMT, U99HG | Citrate synthase-lysine N-methyltransferase CSKMT, mitochondrial | Lysine (citrate synthase) |
| METTL13 | 51603 | EEF1AKNMT, 5630401D24Rik, CGI-01, DFNB26, DFNB26M, DFNM1, KIAA0859, feat | eEF1A lysine and N-terminal methyltransferase | Lysine (EEF1A) |
| METTL18 | 92342 | AsTP2, C1orf156, HPM1 | Histidine protein methyltransferase 1 homolog | Histidine |
| METTL20 | 254013 | ETFBKMT, C12orf72, ETFB-KMT | Electron transfer flavoprotein beta subunit lysine methyltransferase | Lysine (ETFβ) |
| METTL21A | 151194 | FAM119A, HCA557b, HSPA-KMT | Protein N-lysine methyltransferase METTL21A | Lysine (HSPA) |
| METTL21B | 25895 | EEF1AKMT3, FAM119B | EEF1A lysine methyltransferase 3 | Lysine (EEF1A) |
| METTL21C | 196541 | C13orf39 | Protein-lysine methyltransferase METTL21C | Lysine (VCP) |
| METTL21D | 79609 | VCPKMT, C14orf138, VCP-KMT | Protein-lysine methyltransferase METTL21D | Lysine (VCP) |
| METTL21E | 403183 | 4832428D23Rik, Gm991 | Protein-lysine methyltransferase METTL21E | Lysine (VCP) |
| METTL22 | 79091 | C16orf68 | Methyltransferase-like protein 22 | Lysine (Kin17) |
| METTL23 | 124512 | C17orf95, MRT44 | Methyltransferase-like protein 23 | Unknown |
|
| ||||
| METTLs of unknown function | ||||
| METTL7A | 25840 | AAM-B, AAMB | Methyltransferase-like protein 7A | Unknown |
| METTL7B | 196410 | ALDI | Methyltransferase-like protein 7B | Unknown |
| METTL17 | 64745 | METT11D1 | Methyltransferase-like protein 17, mitochondrial | Unknown |
| METTL25 | 84190 | C12orf26 | Methyltransferase-like protein 25 | Unknown |
| METTL26 | 84326 | C16orf13, JFP2 | Methyltransferase-like 26 | Unknown |
| METTL27 | 155368 | WBSCR27 | Methyltransferase-like protein 27 | Unknown |
Note.—The gene names, NCBI IDs, aliases, and preferred names are vertebrate METTL genes.
Fig. 1.METTLs in major metazoan phyla according to NCBI search results. The metazoan cladogram is modified from the Tree of Life web project (http://tolweb.org/tree/phylogeny.html). The matrix displays the detection (gray), probable detection but with uncertain identification (striped-gray), or failure of detection (white) by the search for each METTL gene within each phylum. The number of METTL genes out of 33 total is noted in parenthesis with asterisks indicating phyla in which probable METTL sequences were either located but could not be definitively identified as specific METTL genes, or were located outside of our search in the NCBI GenBank database (e.g., Ensembl Metazoa). METTLs are listed in order of decreasing number of representatives in each phylum (left to right).
Fig. 2.Maximum likelihood tree describing the evolutionary relationships across metazoan METTL proteins. Bootstrap values (1,000 replicates) are displayed at each internal node. Branch labels displayed in blue indicate METTLs that have been shown to target DNA or RNA nucleosides, labels in red are METTLs shown to target protein residues, and labels in purple represent METTLs whose function and targets are unknown. The METTL proteins comprise nine different taxa representing eight different phyla: A sponge (Amphimedon queenslandica, Porifera), a stony coral (Acropora millepora, Cnidaria), a sea hare (Aplysia californica, Mollusca), a priapulid (Priapulus caudatus, Priapulida), a shrimp (Litopenaeus vannamei, Arthropoda), a sea star (Acanthaster planci, Echinodermata), an acorn worm (Saccoglossus kowalevskii, Hemichordata), a lancelet (Branchiostoma belcheri, Chordata), and human (Homo sapiens, Chordata). Subtrees with the same METTL gene from multiple taxa have been collapsed for figure readability.
Conserved Domains within Each METTL Gene Based on Results from CD-Search and Batch CD-Search (NCBI).
| Superfamily | CD | Gene |
|---|---|---|
| AdoMet MTases superfamily | COG2263 | METTL5 |
| DREV | METTL9 | |
| DUF938 | METTL26 | |
| Methyltransf_11 | METTL7A, METTL7B | |
| Methyltransf_12 | METTL2, METTL6 | |
| Methyltransf_16 | METTL21A, METTL21D | |
| Methyltransf_25 | METTL2, METTL6, METTL7A, METTL8, METTL10, METTL12, METTL13, METTL27 | |
| Methyltransf_31 | METTL10 | |
| Methyltransf_32 | METTL25B | |
| Methyltransf_4 | METTL1 | |
| Methyltransf_PK | METTL11A | |
| Nnt1 | METTL20 | |
| PRK00050 | METTL15 | |
| No specific CD (superfamily only) | METTL11B, METTL16, METTL17, METTL18, METTL19, METTL21B, METTL21C, METTL21E, METTL22, METTL23, METTL25 | |
|
| ||
| MT-A70 superfamily | MT-A70 | METTL3, METTL14 |
| No specific CD (superfamily only) | METTL4 | |
Fig. 3.Maximum likelihood phylogeny reconstructed based on the CDs of each METTL protein across Metazoa. Bootstrap values (1,000 replicates) are displayed at each node. Branch label colors are identical to figure 2.
Selection Results, Variance Estimations, and Best-Fit ML Models for All METTLs and Each Individual METTL Gene.
| Gene |
|
|
|
|
|
| ENC | ModelNT | ModelAA |
|---|---|---|---|---|---|---|---|---|---|
| All | 0.5 | 5.8 | 0.65 ± 0.01 | 0.82 ± 0.02 | 0.74 ± 0.005 | 0.62 ± 0.02 | 52.45 | GTR + G + I | LG + G |
| METTL1 | 0.8 | 23.38 | 0.36 ± 0.01 | 0.32 ± 0.02 | 0.83 ± 0.02 | 0.23 ± 0.02 | 53.60 | GTR + G + I | WAG + G |
| METTL2 | 0.7 | 22.02 | 0.4 ± 0.01 | 0.39 ± 0.02 | 0.79 ± 0.01 | 0.28 ± 0.02 | 51.09 | GTR + G + I | LG + G |
| METTL3 | 0.7 | 27.28 | 0.4 ± 0.01 | 0.4 ± 0.02 | 0.82 ± 0.01 | 0.28 ± 0.01 | 53.89 | GTR + G | LG + G |
| METTL4 | 0.6 | 12.93 | 0.49 ± 0.01 | 0.55 ± 0.02 | 0.78 ± 0.02 | 0.4 ± 0.02 | 50.78 | T92 + G + I | LG + G + I |
| METTL5 | 0.8 | 18.17 | 0.4 ± 0.01 | 0.42 ± 0.02 | 0.81 ± 0.02 | 0.28 ± 0.02 | 53.16 | T92 + G | LG + G |
| METTL6 | 0.8 | 20.0 | 0.39 ± 0.01 | 0.4 ± 0.02 | 0.81 ± 0.02 | 0.27 ± 0.02 | 51.25 | GTR + G | LG + G |
| METTL7A | 0.6 | 12.02 | 0.52 ± 0.01 | 0.61 ± 0.02 | 0.78 ± 0.02 | 0.44 ± 0.02 | 54.10 | K2 + G + I | LG + G + I |
| METTL7B | 0.6 | 10 | 0.53 ± 0.01 | 0.61 ± 0.03 | 0.8 ± 0.02 | 0.45 ± 0.02 | 54.92 | JC + I | LG + I |
| METTL8 | Vertebrates only | 50.53 | |||||||
| METTL9 | 0.7 | 14.15 | 0.49 ± 0.01 | 0.54 ± 0.02 | 0.81 ± 0.01 | 0.39 ± 0.02 | 52.31 | K2 + G + I | LG + G + I |
| METTL10 | 0.7 | 12.85 | 0.47 ± 0.01 | 0.55 ± 0.02 | 0.75 ± 0.02 | 0.38 ± 0.02 | 54.78 | T92 + G + I | LG + G + I |
| METTL11A | 0.7 | 18.84 | 0.45 ± 0.01 | 0.49 ± 0.02 | 0.8 ± 0.01 | 0.35 ± 0.02 | 51.79 | K2 + G + I | LG + G + I |
| METTL11B | Vertebrates only | 49.58 | |||||||
| METTL12 | 0.6 | 9.02 | 0.54 ± 0.01 | 0.63 ± 0.03 | 0.76 ± 0.02 | 0.46 ± 0.02 | 53.23 | K2 + I | LG + G + I |
| METTL13 | 0.7 | 24.82 | 0.48 ± 0.01 | 0.56 ± 0.01 | 0.78 ± 0.01 | 0.39 ± 0.01 | 50.84 | GTR + G + I | LG + G |
| METTL14 | 0.7 | 29.64 | 0.37 ± 0.01 | 0.32 ± 0.02 | 0.83 ± 0.01 | 0.23 ± 0.01 | 53.61 | GTR + G | LG + G |
| METTL15 | 0.7 | 18.11 | 0.45 ± 0.01 | 0.48 ± 0.02 | 0.79 ± 0.01 | 0.34 ± 0.02 | 51.84 | HKY + G + I | LG + G + I |
| METTL16 | 0.6 | 18.45 | 0.49 ± 0.01 | 0.57 ± 0.02 | 0.79 ± 0.01 | 0.41 ± 0.01 | 53.99 | HKY + G + I | LG + G + I |
| METTL17 | 0.7 | 17.89 | 0.53 ± 0.01 | 0.61 ± 0.02 | 0.79 ± 0.01 | 0.44 ± 0.01 | 54.27 | GTR + G | LG + G + I |
| METTL18 | 0.7 | 13.1 | 0.5 ± 0.01 | 0.57 ± 0.02 | 0.78 ± 0.02 | 0.41 ± 0.02 | 52.54 | T92 + G + I | LG + G + I |
| METTL19 | 0.7 | 13.24 | 0.49 ± 0.01 | 0.56 ± 0.02 | 0.77 ± 0.01 | 0.41 ± 0.02 | 51.68 | GTR + G + I | LG + G + I |
| METTL20 | 0.7 | 14.86 | 0.45 ± 0.01 | 0.48 ± 0.03 | 0.77 ± 0.01 | 0.35 ± 0.02 | 52.27 | K2 + G + I | LG + G + I |
| METTL21A | 0.6 | 17.06 | 0.48 ± 0.01 | 0.54 ± 0.02 | 0.78 ± 0.01 | 0.39 ± 0.02 | 54.38 | K2 + G + I | LG + G |
| METTL21B | Vertebrates only | 45.13 | |||||||
| METTL21C | Vertebrates only | 54.33 | |||||||
| METTL21D | 0.7 | 13.23 | 0.47 ± 0.01 | 0.54 ± 0.02 | 0.76 ± 0.02 | 0.38 ± 0.02 | 54.89 | T92 + G + I | LG + G + I |
| METTL21E | Chordates only | 53.19 | |||||||
| METTL22 | 0.6 | 15.33 | 0.5 ± 0.01 | 0.58 ± 0.02 | 0.79 ± 0.01 | 0.42 ± 0.02 | 53.77 | T92 + G + I | LG + G + I |
| METTL23 | 0.6 | 18.63 | 0.45 ± 0.01 | 0.5 ± 0.02 | 0.78 ± 0.01 | 0.35 ± 0.02 | 52.42 | K2 + G + I | LG + G |
| METTL25 | 0.6 | 16.26 | 0.5 ± 0.01 | 0.58 ± 0.02 | 0.76 ± 0.01 | 0.42 ± 0.01 | 50.58 | T92 + G + I | LG + G + I |
| METTL25B | 0.7 | 17.41 | 0.5 ± 0.01 | 0.56 ± 0.02 | 0.79 ± 0.01 | 0.4 ± 0.01 | 51.20 | GTR + G + I | LG + G + I |
| METTL26 | 0.7 | 13.38 | 0.44 ± 0.01 | 0.49 ± 0.03 | 0.8 ± 0.02 | 0.34 ± 0.02 | 50.61 | K2 + I | LG + G + I |
| METTL27 | 0.5 | 4.12 | 0.55 ± 0.02 | 0.69 ± 0.03 | 0.68 ± 0.04 | 0.5 ± 0.03 | 48.47 | JC | LG |
Note.—SE, standard error; R, average transition/transversion ratio; dNT, mean nucleotide distance; dAA, mean amino acid distance; dS, synonymous substitution distance; dN, nonsynonymous substitution distance; ENC, effective number of codons (codon bias) ranging from 61 (i.e., no bias) to 20 (i.e., maximum bias); ModelNT, best-fit maximum likelihood (ML) DNA model; ModelAA, best-fit ML protein model.
All SE values were calculated using the Bootstrap method with 1,000 replications.
P value < 0.001 in Z-test of purifying selection (i.e., HA: dN < dS).
Fig. 4.Estimated rates of evolution of METTL proteins. Rates were only estimated for METTLs in which amino acid differences per site could be calculated by comparing the protein between three or more of the preselected metazoan phyla.
Fig. 5.Episodes of diversifying selection shaping the evolution of METTL proteins in vertebrates. For figure readability, “METTL” has been removed from the names of each tip label (e.g., “METTL1” is simply displayed as “1”). (A) The strength of selection at significant branches is represented in red (ω > 5), gray (ω = 1), and blue (ω = 0), with the proportion of sites within each class represented by the color width. Thicker branches have been classified as undergoing episodic diversifying selection at corrected P value < 0.001 (thicker branches) and P value < 0.01 (thinner branches). (B) The physical positions of adaptive selection episodes involved in the diversification of METTL genes. Numbers of synonymous (blue bars) and nonsynonymous (red bars) substitutions at codon positions that are subject to significant episodes of diversifying selection in vertebrates (P value < 0.05). (C) Phylogenetic location of the mutations involved in such episodes. Branches in red account for higher numbers of nonsynonymous mutations, branches in blue indicate higher numbers of synonymous mutations, and branches in green represent cases with the same numbers of nonsynonymous and synonymous mutations. Light blue, light red, and light purple backgrounds behind each branch indicate METTLs that modify DNA/RNA, proteins, or unknown targets, respectively. Subtrees that include several closely related METTLs (i.e., METTL11A and METTL11B; METTL21A, METTL21B, METTL21C, METTL21D, and METTL21E; METTL25 and METTL25B) were collapsed into a single branch for figure readability.
Fig. 6.Episodes of adaptive selection identified upon separately analyzing vertebrate DNA/RNA-modifying METTLs and vertebrate protein-modifying METTLs. (A) The physical positions of adaptive selection episodes involved in the diversification of METTLs that modify DNA/RNA nucleosides (top) and METTLs that modify protein residues (bottom). Numbers of synonymous (blue bars) and nonsynonymous (red bars) substitutions at codon positions that are subject to significant episodes of diversifying selection in vertebrates (P value < 0.05). (B) Phylogenetic location of the mutations involved in such episodes for DNA/RNA-modifying METTLs, indicated by a light blue background, and (C) protein-modifying METTLs, indicated by a light red background. For figure readability, “METTL” has been removed from the names of each tip label (e.g., “METTL1” is simply displayed as “1”). Branches in red account for higher numbers of nonsynonymous mutations, branches in blue indicate higher numbers of synonymous mutations, and branches in green represent cases with the same numbers of nonsynonymous and synonymous mutations.