| Literature DB >> 35327467 |
Ana Rita Marques1,2, João Xavier Santos1,2, Hugo Martiniano1,2, Joana Vilela1,2, Célia Rasga1,2, Luísa Romão2,3, Astrid Moura Vicente1,2.
Abstract
Autism Spectrum Disorder (ASD) is a heterogeneous neurodevelopmental condition with unclear etiology. Many genes have been associated with ASD risk, but the underlying mechanisms are still poorly understood. An important post-transcriptional regulatory mechanism that plays an essential role during neurodevelopment, the Nonsense-Mediated mRNA Decay (NMD) pathway, may contribute to ASD risk. In this study, we gathered a list of 46 NMD factors and regulators and investigated the role of genetic variants in these genes in ASD. By conducting a comprehensive search for Single Nucleotide Variants (SNVs) in NMD genes using Whole Exome Sequencing data from 1828 ASD patients, we identified 270 SNVs predicted to be damaging in 28.7% of the population. We also analyzed Copy Number Variants (CNVs) from two cohorts of ASD patients (N = 3570) and discovered 38 CNVs in 1% of cases. Importantly, we discovered 136 genetic variants (125 SNVs and 11 CNVs) in 258 ASD patients that were located within protein domains required for NMD. These gene variants are classified as damaging using in silico prediction tools, and therefore may interfere with proper NMD function in ASD. The discovery of NMD genes as candidates for ASD in large patient genomic datasets provides evidence supporting the involvement of the NMD pathway in ASD pathophysiology.Entities:
Keywords: autism spectrum disorder; copy number variants; nonsense-mediated mRNA decay; single nucleotide variants
Year: 2022 PMID: 35327467 PMCID: PMC8945030 DOI: 10.3390/biomedicines10030665
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Figure 1Simplified representation of the NMD pathway in mammalian cells. (A) The Exon Junction Complex (EJC; composed by eIF4A3, RBM8A and MAGOH) is formed during splicing and deposited 20–24 nucleotides (nts) upstream of exon–exon junctions, remaining associated with the mRNA during its transport to the cytoplasm. (B) In the cytoplasm, the mRNA is translated by the ribosome. If the mRNA carries a premature translation termination codon (PTC), during the pioneer round of translation, i.e., while the mRNA is still bound to the cap binding complex CBP80/20, the ribosome stops at the PTC. If the PTC is located more than 50–54 nts upstream of the last exon–exon junction, the ribosome is not able to displace downstream EJCs. Instead, when the ribosome stops at the PTC, UPF1 can interact with eRF3a, inducing premature translation termination that triggers NMD. For that, the SURF complex is formed by UPF1, eRF1, eRF3a, the SMG1c kinase complex and DHX34 associated with RUVBL1 and RUVBL2. At this stage, both PYM1 and UPF3A can act as NMD repressors. (C) UPF2 and UPF3B, either diffused in the cytoplasm (in case of EJC-independent NMD) or bound to the EJC, interact with UPF1, favoring its phosphorylation by SMG1 and the formation of the DECID complex. (D) Phosphorylated UPF1 recruits SMG6, which cleaves the mRNA near the PTC, and the SMG5-SMG7 dimer, which recruits the decapping complex DPC2-DCP1a through PNRC2 and the deadenylation complex CCR4-NOT through CNOT8. SRSF1 and SMG5-SMG7 independently recruit PP2A, which dephosphorylates UPF1. NMD-targeted mRNAs are further degraded by 5′-to-3′ and 3′-to-5′ exonucleolytic activities of XRN1 and the exosome, respectively. (E) Model of the ER-NMD pathway. The NMD model was adapted from Nogueira et al., 2021 [29]. Legend: NMD, Nonsense-Mediated mRNA Decay; PTC, Premature Termination Codon; EJC, Exon Junction Complex; SURF, SMG1-UPF1-eRF1-eRF3 complex; DECID, Decay Inducing Complex; PP2A, Protein Phosphatase 2A; ER-NMD, NMD response at the Endoplasmic Reticulum; TC, Termination Codon.
Figure 2Overview of the analysis. The study began with the identification of 46 genes (Table 1) encoding proteins involved in the NMD pathway or in its regulation (yellow) that were used for further analyses. This workflow describes the analysis of SNVs obtained from ASC WES datasets for 1828 ASD patients (1338 BI and 490 BCM, blue) and the analysis of CNVs predicted from SNP genotyping data in 3570 ASD patients from AGP (2446) and SSC (1124) datasets (green). Analysis proceeded separately for CNVs and SNVs to identify variants in NMD genes and ended up with the identification of 270 SNVs (Figure 3) and 38 CNVs (Table 2) in 524 and 38 ASD probands, respectively. Protein domains affected by SNVs or CNVs were then identified (orange), and a total of 136 genetic variants were located within regions required for NMD in 258 ASD patients (Table 3). Legend: SNVs, Single Nucleotide Variants; CNVs, Copy Number Variants; ASC, Autism Sequencing Consortium; AGP, Autism Genome Project; SSC, Simons Simplex Collection; BI, Broad Institute; BCM, Baylor College of Medicine; DGV, Database of Genomic Variant; CDS, coding sequence for protein; MAF, Minor Allele Frequency; NFE, Non-Finnish European; MISPD, missense variants predicted to be damaging and deleterious; LoF, loss-of-function.
List of human genes encoding NMD factors and regulators.
| NMD Group | Gene | Alternative Name (2) | Role in NMD | References |
|---|---|---|---|---|
| EJC |
| eIF4AIII | RNA helicase eukaryotic initiation factor 4A3 is a core EJC factor that | [ |
|
| Y14 | RNA-binding motif protein 8A is a core EJC factor that interacts with eIF4A3 and MAGOH to provide a stable and direct binding site for the UPF3B protein and activate NMD | [ | |
|
| MAGOH1 | Mago nashi homolog protein is a core EJC factor that interacts with eIF4A3 and Y14 to provide a stable and direct binding site for the UPF3B protein and activate NMD | [ | |
|
| MGN2 | Mago nashi protein homolog B is a paralog of MAGOH that interacts with eIF4A3 and Y14 forming the trimeric EJC core to activate NMD | [ | |
|
| RNA-binding protein S1 is a component of the SR-rich EJCs that | [ | ||
|
| MLN51 | The peripheral EJC component CASC3 activates NMD and promotes SMG6-dependent endonucleolytic cleavage | [ | |
|
| KIAA0947 | Component of the little elongation complex promotes the | [ | |
|
| WIGB | Ribosome-associated protein PYM interacts with Y14-MAGOH | [ | |
| SURF-DECID |
| RENT1 | Up-frameshift protein 1 is the central component of the NMD pathway; its helicase and ATPase activities are essential to trigger NMD | [ |
|
| RENT2 | Up-frameshift protein 2 and UPF3B interact with UPF1 favoring its | [ | |
|
| UPF3X | Up-frameshift protein 3B and UPF2 interact with UPF1 favoring its | [ | |
|
| eRF1 | Eukaryotic release factor 1 is part of the eRF1–eRF3 translation | [ | |
|
| eRF3a | Eukaryotic release factor 3 is part of the eRF1–eRF3 translation | [ | |
|
| CBP80 | Component of the cap-binding complex (CBC) directly binds | [ | |
|
| CBP20 | Component of the CBC is essential for the stability of complex | [ | |
|
| eIF4E | Eukaryotic translation initiation factor 4E binds to UPF1 | [ | |
|
| ATX | Suppressor of morphogenesis in genitalia-1 associate with SMG8-SMG9 to form the SMG1c kinase complex that | [ | |
|
| Suppressor of morphogenesis in genitalia-8 and suppressor of | [ | ||
|
| ||||
|
| KIAA0134 | RNA helicase DHX34 binds SMG1 and promotes UPF1 phosphorylation, triggering the conversion from the SURF to the DECID complex | [ | |
|
| RVB1 | AAA-ATPases RUVBL1 and RUVBL2 form a hetero-hexameric ring | [ | |
|
| RVB2 | |||
| mRNA |
| EST1B | Suppressor of morphogenesis in genitalia-5 and -7 form a | [ |
|
| EST1C | |||
|
| EST1A | Endonuclease that interacts both with UPF1 and p-UPF1 | [ | |
|
| POP2 | CCR4-NOT transcription complex subunit 8 is recruited by SMG7 to degrade NMD targets | [ | |
|
| mRNA-decapping enzyme 1A is a decapping activator and | [ | ||
|
| Proline rich nuclear receptor coactivator 2 binds p-UPF1 and | [ | ||
|
| Decapping protein engaged in the 5′→3′ mRNA degradation | [ | ||
|
| gb110 | RNA helicase contributes to degradation of | [ | |
|
| PP2AC | Protein phosphatase 2 (PP2A) promotes dephosphorylation of UPF1; both structural ( | [ | |
|
| PP2AA | |||
|
| Exonuclease involved in 5′→3′ mRNA degradation | [ | ||
|
| DIS3L1 | Core exosome-associated factor involved in the 3′→5′ mRNA | [ | |
|
| Exoribonuclease that degrades mRNA from 3′→5′ | [ | ||
|
| PM/Scl100 Rrp6p | Exosome catalytic subunit involved in the | [ | |
|
| Ribonuclease engaged in the 3′→5′ mRNA degradation | [ | ||
| ER–NMD |
| NAG | Protein involved in Golgi-to-endoplasmic reticulum (ER) | [ |
| Regulator |
| PABP1 | Polyadenylate-binding protein 1 inhibits the interaction of UPF1 with eRF3, repressing NMD | [ |
|
| EIF4G | Eukaryotic initiation factor 4G inhibits NMD | [ | |
|
| UPF3 | Up-frameshift protein 3A compete with UPF3B for | [ | |
|
| FMRP | Fragile X mental retardation protein binds directly to UPF1 | [ | |
|
| INT6 | Eukaryotic translation initiation factor 3 subunit E is a non-core eIF3 subunit that interacts with UPF2 and triggers NMD | [ | |
|
| SFRS1 | Serine/arginine-rich splicing factor 1 promotes NMD by | [ | |
|
| GNL2 and SEC13 are conserved NMD factors that regulate | [ | ||
|
| Ngp-1 |
(1) Gene annotation according to HGNC. (2) Previous SYMBOL or alias reported by HGNC database or identifier found in literature review. Legend: NMD, Nonsense-Mediated mRNA Decay; PTC, Premature Termination Codon; EJC, Exon Junction Complex; SURF, SMG-1–Upf1–eRF1–eRF3 complex; DECID, Decay Inducing Complex; ER-NMD, NMD response at the Endoplasmic Reticulum; p-UPF1, phosphorylated UPF1.
Figure 3Distribution of rare SNVs predicted to be damaging among 38 NMD genes in ASD patients. (A) Variants in NMD genes were identified in 1 to 5% of ASD patients. (B) Detailed distribution of gene variants in <1% of the ASD population sample.
Figure 4Schematic domain architecture of proteins involved in NMD, and representations of the protein alterations encoded by MISPD and nonsense/frameshift variants identified in ASD subjects. The binding regions for interacting proteins are indicated. Proteins were grouped by NMD groups: (A) EJC, (B) SURF-DECID, (C) mRNA decay, (D) ER-NMD and (E) Regulator. Legend: MISPD, missense variants predicted to be damaging and deleterious; BD, Binding Domain; CH, cysteine and histidine-rich domain; SQ, serine- and glutamine-rich domain; RS, arginine and serine-rich domain; RRM, RNA recognition motif; EJC, Exon Junction Complex; PIN, PilT N-terminus; P-UPF1, phosphorylated UPF1; WH, Winged-Helix; CTD, C-terminal domain; NTD, N-terminal domain.
NMD genes disrupted by CNVs identified in a population of 3570 ASD patients.
| NMD Group | Gene | Location | CNV Type | Gene Region | Protein Domains Affected | ASD N (1) |
|---|---|---|---|---|---|---|
| EJC |
| 1q21.1 | Deletion | Whole gene | All domains | 1 |
|
| 1q21.1 | Duplication | Whole gene | All domains | 3 | |
| SURF-DECID |
| 10p14 | Duplication | Partial | MIF4G domains | 1 |
|
| Xq24 | Duplication | Whole gene | All domains | 1 | |
|
| 16p13.13 | Duplication | Whole gene | All domains | 1 | |
|
| 3q29 | Deletion | Whole gene | All domains | 1 | |
|
| 19q13.32 | Deletion | Whole gene | All domains | 1 | |
|
| 19q13.33 | Duplication | Partial | DI domain | 1 | |
| mRNA decay |
| 15q22.31 | Duplication | Whole gene | All domains | 1 |
|
| 2q37.1 | Deletion | Partial | Part of CSD2 and RNB domains | 1 | |
|
| 2q37.1 | Duplication | Partial | RNB and C-terminal S1 domain | 1 | |
|
| 1p36.22 | Deletion | Partial | PMC2NT, EXO1 and HRCD domains | 1 | |
|
| 1p36.22 | Duplication | Partial | EXO and HRCD domain | 1 | |
|
| 1p13.2 | Duplication | Whole gene | All domains | 2 | |
|
| 16p13.12 | Deletion | Partial | All domains | 1 | |
|
| 19q13.41 | Duplication | Partial | PP2A subunit B binding | 1 | |
|
| 17p13.3 | Duplication | Partial | PIN domain | 1 | |
|
| 3q23 | Duplication | Partial | XRN1 SH3-like domain | 13 | |
| ER-NMD |
| 2p24.3 | Duplication | Partial | Sec39-like domain | 1 |
| Regulator |
| Xq27.3 | Deletion | Partial | KH2, NES and RGG domains | 1 |
|
| 13q34 | Duplication | Whole gene | All domains | 2 | |
|
| 13q34 | Duplication | Partial | EJC-binding domain | 1 |
(1) ASD N—number of ASD patients carrying CNVs.
Numbers of SNVs or CNVs identified within protein domains important for NMD in 258 ASD patients. A total of 136 genetic variants in 23 NMD genes were identified.
| NMD Group | Gene | Location | SNVs (1) | CNVs (1) | N ASD (2) | pLI | mis_Z | SFARI (3) | |
|---|---|---|---|---|---|---|---|---|---|
| LoF | MISPD | ||||||||
| EJC |
| 17q25.3 | 1 | 1 | 1.00 | 4.02 | |||
|
| 1q21.1 | 2 | 4 | 0.57 | 2.16 | 1q21.1 | |||
|
| 17q21.1 | 3 | 3 | 0.61 | 1.32 | ||||
|
| 5p15.32 | 2 | 7 | 10 | 1.00 | 0.73 | |||
| SURF-DECID |
| 19p13.11 | 4 | 4 | 1.00 | 5.63 | |||
|
| 10p14 | 1 | 3 | 1 | 11 | 1.00 | 3.19 | category 3 | |
|
| Xq24 | 1 | 3 | 1 | 7 | 0.98 | 1.84 | category 1 | |
|
| 5q31.2 | 1 | 3 | 6 | 1.00 | 4.39 | |||
|
| 16p13.13 | 4 | 1 | 7 | 1.00 | 3.32 | |||
|
| 19q13.32 | 2 | 12 | 1 | 25 | 0.00 | −0.08 | ||
|
| 3q21.3 | 1 | 1 | 1.00 | 3.39 | ||||
|
| 19q13.33 | 1 | 1 | 2 | 1.00 | 3.11 | |||
|
| 16p12.3 | 1 | 12 | 13 | 1.00 | 3.30 | |||
|
| 17q22 | 6 | 8 | 0.01 | 1.73 | ||||
|
| 19q13.31 | 1 | 1 | 4 | 0.00 | 1.60 | |||
| mRNA decay |
| 1q22 | 2 | 2 | 0.01 | 1.13 | |||
|
| 17p13.3 | 2 | 1 | 6 | 0.98 | 0.18 | category 3 | ||
|
| 1q25.3 | 8 | 14 | 1.00 | 2.19 | ||||
| ER-NMD |
| 2p24.3 | 5 | 20 | 1 | 99 | 0.00 | −0.87 | |
| Regulator |
| 13q34 | 1 | 2 | 2 | 11 | 0.00 | −0.60 | |
|
| 17q22 | 1 | 1 | 0.98 | 3.96 | ||||
|
| 3p25.3 | 2 | 2 | 0.02 | 0.62 | ||||
|
| 1p34.3 | 1 | 11 | 17 | 0.00 | 0.28 | |||
(1) Numbers of SNVs and CNVs located within regions necessary for NMD function. (2) Numbers of ASD patients carrying SNVs and CNVs. (3) Genes and CNVs associated with ASD, described in SFARI Gene database (https://gene.sfari.org/ accessed on 8 March 2021). (4) These genes do not have the described NMD domains. Legend: SNVs, Single Nucleotide Variants; CNVs, Copy Number Variants; pLI, probability of being loss-of-function intolerant; mis_Z, missense Z scores.