| Literature DB >> 25389274 |
Dominik Theler1, Cyril Dominguez1, Markus Blatter1, Julien Boudet1, Frédéric H-T Allain2.
Abstract
N(6)A methylation is the most abundant RNA modification occurring within messenger RNA. Impairment of methylase or demethylase functions are associated with severe phenotypes and diseases in several organisms. Beside writer and eraser enzymes of this dynamic RNA epigenetic modification, reader proteins that recognize this modification are involved in numerous cellular processes. Although the precise characterization of these reader proteins remains unknown, preliminary data showed that most potential reader proteins contained a conserved YT521-B homology (YTH) domain. Here we define the YTH domain of rat YT521-B as a N(6)-methylated adenosine reader domain and report its solution structure in complex with a N(6)-methylated RNA. The structure reveals a binding preference for NGANNN RNA hexamer and a deep hydrophobic cleft for m(6)A recognition. These findings establish a molecular function for YTH domains as m(6)A reader domains and should guide further studies into the biological functions of YTH-containing proteins in m(6)A recognition.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25389274 PMCID: PMC4267619 DOI: 10.1093/nar/gku1116
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The YTH domain has an increased affinity for m6A-containing RNA. (A) Top: schematic depiction of the domain organization of R. Norvegicus YT521-B. Nuclear localization signals are represented as white boxes numbered 1–4. E-rich, P-rich and ER-rich stand for sequence stretches enriched in glutamate, proline, glutamate and arginine amino acids, respectively (17). Secondary structure elements based on the presented structure are shown. Bottom: combined chemical shift mapping of the R. Norvegicus YT521-B YTH domain 1H-15N backbone resonances upon 1:1 complex formation with 5′-UGm6ACAC-3′ plotted against the sequence of the used construct. Perturbations were calculated using the formula: Δδ = [(δHN)2 + (δN/6.51)2]1/2. Proline and residues, which could not be assigned in both states, are represented with negative bars. (B) Overlay of 1H-15N HSQC spectra of the YTH domain of YT521-B (blue), the domain in a 1:1 complex with 5′-UGACAC-3′ (red) and 5′-UGm6ACAC-3′ (green). For clarity folded arginine and lysine side-chain resonances were omitted from the overlay. Side-chain resonances of N370 and W380 displaying large chemical-shift perturbations are labeled. (C) Isothermal titration calorimetry data of RNA binding to the YTH domain. Left and right calorimetric titration profiles correspond to the 5′-UGm6ACAC-3′ and 5′-UGACAC-3′ RNA being injected into the YTH protein, respectively. The upper panels show the raw calorimetric data. The bottom plots are integrated heats as a function of the RNA/YTH molar ratio. Black dots indicate the experimental data. The best fit (depicted by a red line) was obtained from a non-linear least-squares method using a one-site binding model. Heats of dilution have been obtained from independent titration experiments. Both reactions are exothermic.
Figure 2.Structure of the YTH domain in complex with 5′-UGm6ACAC-3′. (A) Ensemble of the 20 selected structures superimposed on the structured residues (Table 1). The YTH domain is displayed as a gray ribbon and the RNA in stick representation of the heavy atoms, carbon (yellow), nitrogen (blue), oxygen (red) and phosphate (orange). (B) Electrostatic potential plotted on the surface of the solvent-accessible surface of the YTH domain. RNA is colored as in (A). The surface is displayed partially transparent to visualize m6A3, which is buried in the hydrophobic core. The ±1 kT/e electrostatic potential is shown with the respective color gradient depicted above the structure with red denoting a negative and blue a positive potential. (C) Stereo view of a representative structure of the complex. Depiction as in (A) with the exception of protein carbon atoms shown in green and H-bonds depicted as purple dashed lines. (D) Stereo view of the G2 and m6A3 binding pocket. Same depiction as in (A) and (C).
Structural statistics of the YT521-B YTH domain in complex with 5′-UGm6ACAC-3'
| NMR restraints | |
| Distance restraints | 4760 |
| Protein intramolecular | 4497 |
| intraresidual | 915 |
| sequential (| | 1008 |
| medium range (1 < | | 895 |
| long range (| | 1644 |
| hydrogen bondsa | 35 |
| RNA intramolecular | 49 |
| intraresidual | 30 |
| sequential (| | 19 |
| Complex intermolecular | 214 |
| Torsion anglesb | 241 |
| Protein backbone | 234 |
| RNA sugar pucker (DELTA) | 6 |
| RNA base conformation (CHI; syn) | 1 |
| Energy statisticsc | |
| Average distance constraint violations | |
| 0.1–0.2 Å | 65.2 ± 4.7 |
| 0.2–0.3 Å | 6.1 ± 2.3 |
| 0.3–0.4 Å | 0.6 ± 0.7 |
| > 0.4 Å | 0.1 ± 0.3 |
| Maximal (Å) | 0.32 ± 0.06 |
| Average angle constraint violations | |
| < 5° | 29.3 ± 2.7 |
| > 5° | 0.0 ± 0.0 |
| Maximal (°) | 0.51 ± 0.07 |
| Mean AMBER constraint violation energy (kcal mol−1) | 53.2 ± 2.7 |
| Distance (kcal mol−1) | 52.6 ± 2.7 |
| Torsion (kcal mol−1) | 0.6 ± 0.1 |
| Mean AMBER energy (kcal mol−1) | −5040.4 ± 11.4 |
| Mean deviation from ideal covalent geometry | |
| Bond length (Å) | 0.0036 ± 0.0000 |
| Bond angle (°) | 1.706 ± 0.007 |
| Ramachandran plot statisticsc,d,e | |
| Residues in most favored regions (%) | 91.2 ± 0.7 |
| Residues in additionally allowed regions (%) | 8.8 ± 0.7 |
| Residues in generously allowed regions (%) | 0.0 ± 0.0 |
| Residues in disallowed regions (%) | 0.0 ± 0.0 |
| RMSD to mean structure statisticsc,d | |
| Protein | |
| Backbone atoms | 0.21 ± 0.04 |
| Heavy atoms | 0.51 ± 0.05 |
| RNA | |
| Backbone atoms | 0.59 ± 0.31 |
| Heavy atoms | 0.64 ± 0.26 |
| All molecules | |
| Backbone atoms | 0.29 ± 0.08 |
| Heavy atoms | 0.54 ± 0.06 |
aH-bond constraints were identified from slow exchanging amide protons in D2O.
bProtein backbone angles determined by the program TALOS+ and sugar pucker angles based on coupling efficiency in homonuclear TOCSY.
cStatistics computed for the deposited bundle of 20 violation energy best structure selected out of 30 amber energy best.
dBased on structured residue range as defined by user: protein: 351–498, chain ID: A (sequence range: 347–502) RNA : 1–6, chain ID: B (sequence range: 1–6).
eRamachandran plot as defined by the program PROCHECK.
Figure 3.Alignment of YTH domains and homology models. (A) Clipped sequence alignment of a representative selection of YTH domain containing proteins. Regions shown include amino acids involved in RNA binding. Hydrophobic residues are colored gray, aromatic residues (F,Y,H,W) yellow, blue, green and red, respectively. Top: secondary structure representation of selected regions; first β-strand (β1), second α-helix (α2), second β-strand (β2) and the two variable loops (v-loop1 and v-loop2). Position of selected residues involved in m6A recognition are marked with arrows. (B) Homology models of the YTH domains of Homo sapiens YTHDF1 and S. cerevisiae MRB1 binding m6A. The corresponding perspective from the presented structure is shown for comparison. Representation as in (Figure 2A and C), except that the protein ribbon of YTHDF1 is in purple and the one of MRB1 in blue.