Yanghai Zhang1, Chunyan Wang1, Mingming Sun2, Yutong Jin3, Camila Urbano Braz1, Hasan Khatib1, Timothy A Hacker4, Martin Liss5, Michael Gotthardt5,6, Henk Granzier7, Ying Ge3,8,9, Wei Guo1,2. 1. Department of Animal and Dairy Sciences, University of Wisconsin-Madison, Wisconsin, USA. 2. Department of Animal Science, University of Wyoming, Laramie, Wyoming, USA. 3. Department of Chemistry, University of Wisconsin-Madison, Madison, Wisconsin, USA. 4. Division of Cardiovascular Medicine, Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, USA. 5. Neuromuscular and Cardiovascular Cell Biology, Max Delbrueck Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany. 6. Charité Universitätsmedizin, Berlin, Germany. 7. Department of Cellular and Molecular Medicine, University of Arizona, Tucson, Arizona, USA. 8. Department of Cell and Regenerative Biology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA. 9. Human Proteomics Program, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA.
Abstract
Arginine-serine (RS) domain(s) in splicing factors are critical for protein-protein interaction in pre-mRNA splicing. Phosphorylation of RS domain is important for splicing control and nucleocytoplasmic transport in the cell. RNA-binding motif 20 (RBM20) is a splicing factor primarily expressed in the heart. A previous study using phospho-antibody against RS domain showed that RS domain can be phosphorylated. However, its actual phosphorylation sites and function have not been characterized. Using middle-down mass spectrometry, we identified 16 phosphorylation sites, two of which (S638 and S640 in rats, or S637 and S639 in mice) were located in the RSRSP stretch in the RS domain. Mutations on S638 and S640 regulated splicing, promoted nucleocytoplasmic transport and protein-RNA condensates. Phosphomimetic mutations on S638 and S640 indicated that phosphorylation was not the major cause for RBM20 nucleocytoplasmic transport and condensation in vitro. We generated a S637A knock-in (KI) mouse model (Rbm20S637A ) and observed the reduced RBM20 phosphorylation. The KI mice exhibited aberrant gene splicing, protein condensates, and a dilated cardiomyopathy (DCM)-like phenotype. Transcriptomic profiling demonstrated that KI mice had altered expression and splicing of genes involving cardiac dysfunction, protein localization, and condensation. Our in vitro data showed that phosphorylation was not a direct cause for nucleocytoplasmic transport and protein condensation. Subsequently, the in vivo results reveal that RBM20 mutations led to cardiac pathogenesis. However, the role of phosphorylation in vivo needs further investigation.
Arginine-serine (RS) domain(s) in splicing factors are critical for protein-protein interaction in pre-mRNA splicing. Phosphorylation of RS domain is important for splicing control and nucleocytoplasmic transport in the cell. RNA-binding motif 20 (RBM20) is a splicing factor primarily expressed in the heart. A previous study using phospho-antibody against RS domain showed that RS domain can be phosphorylated. However, its actual phosphorylation sites and function have not been characterized. Using middle-down mass spectrometry, we identified 16 phosphorylation sites, two of which (S638 and S640 in rats, or S637 and S639 in mice) were located in the RSRSP stretch in the RS domain. Mutations on S638 and S640 regulated splicing, promoted nucleocytoplasmic transport and protein-RNA condensates. Phosphomimetic mutations on S638 and S640 indicated that phosphorylation was not the major cause for RBM20 nucleocytoplasmic transport and condensation in vitro. We generated a S637A knock-in (KI) mouse model (Rbm20S637A ) and observed the reduced RBM20 phosphorylation. The KI mice exhibited aberrant gene splicing, protein condensates, and a dilated cardiomyopathy (DCM)-like phenotype. Transcriptomic profiling demonstrated that KI mice had altered expression and splicing of genes involving cardiac dysfunction, protein localization, and condensation. Our in vitro data showed that phosphorylation was not a direct cause for nucleocytoplasmic transport and protein condensation. Subsequently, the in vivo results reveal that RBM20 mutations led to cardiac pathogenesis. However, the role of phosphorylation in vivo needs further investigation.
alaninealternative 3′ splice sitealternative 5′ splice siteanterior wallnatriuretic peptidescalcium/calmodulin‐dependent protein kinase type II dcollision‐induced dissociationAspartic aciddilated cardiomyopathyDEAD‐box helicase 6differentially expressed genesDulbecco's modified eagle's mediumdifferentially spliced geneselectron capture dissociationejection fractionformic acidfetal bovine serumfalse discovery rateFourier transform ion cyclotron resonanceglycinegene ontologyguide RNAshematoxylin and eosinheterozygoushomozygoushuman phenotype ontologyKyoto Encyclopedia of Genes and Genomesknock‐inknockoutleucineliquid chromatographyleft ventriclemass spectrometrytandem MSmolecular weight cutoffmutually exclusive exonsmyosin heavy chain 6giant N2BAnano‐electrospray ionizationnegative controlnatriuretic peptide precursor bprolinepulmonary arterypercent spliced‐inpost‐translational modificationsposterior wallquantitative PCRquadrupole‐time‐of‐flightRNA‐binding motif 20retained intronmultivariate analysis of transcript splicingRNA‐sequencingribonucleoproteinRNA recognition motifarginine–serine‐rich domainroom temperatureright ventricleryanodine receptor 2skipped exonstress granulesserine‐ and arginine‐rich proteinstricuspid annular plane systolic excursionTAR DNA‐binding protein 43trimmed mean of M‐valuesvertical sodium dodecyl sulfate −1% agarose gel electrophoresis systemvelocity time integralwild type
INTRODUCTION
RNA‐binding motif 20 (RBM20) is a muscle‐specific splicing factor identified in a spontaneous mutant rat model.
,
,
In this rat model, 13 of 14 RBM20 exons (exons 2–14) were deleted, and the absence of RBM20 led to the largest isoform of protein titin resulting from alternative splicing.
,
,
,
,
The role of RBM20 as a primary regulator of Ttn pre‐mRNA splicing was also verified through a gene‐edited mouse model.
In this mouse model, the RNA recognition motif (RRM) in RBM20 was deleted, resulting in the expression of the largest titin isoform in the heart of these mice.
In addition to Ttn pre‐mRNA splicing, at least 30 other genes such as myosin heavy chain 6 (Myh6), ryanodine receptor 2 (Ryr2), and calcium/calmodulin‐dependent protein kinase type II d (Camk2d) have also been found to be regulated by RBM20.
,
,
,
,
Aberrant splicing through RBM20 depletion in animal models and Rbm20 mutations in human patients and animal models was also associated with dilated cardiomyopathy (DCM) and subsequent heart failure development.
,
,
,
,
,
,
Uncovering the splicing mechanism regulated by RBM20 is important to develop new treatments for cardiac dysfunction.RBM20 protein contains the RRM domain located close to the N‐terminus and the arginine–serine‐rich (RS) domain close to the C‐terminus, both of which are commonly found in other conventional splicing factors.
,
,
Both RRM and RS domains play an essential role in defining the splicing sites and the selection of exon/intron boundaries.
,
Phosphorylation of the RS domain impacts localization and activity of splicing factors.
,
,
Both hyper‐ and hypo‐phosphorylation of splicing factors can suppress splicing reactions
and translocation between the nucleus and the cytoplasm.
,
Although serine‐ and arginine‐rich (SR) proteins dephosphorylation is essential for the maturation of splicing catalysis and the spliceosome,
dephosphorylation has also been found important for export of splicing factors and processed mRNA to the cytoplasm for protein translation.
Hence, RS domain phosphorylation and dephosphorylation are thought to be critical for numerous steps in splicing factor subcellular localization and splicing control. Previous studies reported that RS domain was critical for RBM20 nuclear retention and the phosphorylation of RS domain may play a role in the nuclear retention.
However, both studies did not characterize the actual phosphorylation sites and the role of phosphorylation in nuclear retention. In the present study, we determined the post‐translational modifications (PTMs) of RBM20 by middle‐down mass spectrometry (MS). We also investigated the influence of RBM20 PTMs on splicing control and protein trafficking. Particularly, we studied how mutations of the phosphorylation sites in the RS domain affect cardiac function by using a gene‐edited Rbm20S637A mutation knock‐in (KI) mouse model.
METHODS AND MATERIALS
Experimental animals and sample preparation
The Rbm20S637A mouse was generated via CRISP/Cas9 system at the Genetically Engineered Mouse Models Core Facility of the University of Arizona. The CRISPR guide RNAs (gRNAs), used for mouse Rbm20S637A knock‐in, were designed using the CRISPOR web tool (http://crispor.tefor.net/). The gRNAs, CTCATTGGACTTCGAGAACGTGG (gRNA1) and CTTCGAGAACGTGGCCGCTCTGG (gRNA2) (PAM sequence underlined), were designed for the CRISPR knock‐in experiments. Oligos with 30–50 bp homology to sequences on each side of each gRNA‐mediated double‐stranded break were designed. The genetically edited C57BL/6J mice were identified by PCR genotyping (Table S1), as well as Sanger sequencing.Animals were maintained on standard rodent chow. Hearts were removed immediately after euthanasia and the left ventricle (LV), right ventricle (RV), and atria were separated. Samples were collected from 8‐week‐old mice. Tissues were separated for paraffin‐embedding and snap‐frozen in liquid nitrogen and stored in a −80°C freezer until use. All animal experiments were in accordance with the guidelines of the Care and Use of Laboratory Animals of the National Institutes of Health. The protocol was approved by the Institutional Animal Care and Use Committee of the University of Wisconsin‐Madison (#A006262).
Middle‐down mass spectrometry
RBM20 was constructed and expressed with BaculoDirect™ Baculovirus Expression System following manufacture's instruction (Invitrogen, Carlsbad, CA, Cat#12562‐054). The RBM20 concentration was determined by Bradford protein assay prior to enzymatic digestion. For each enzymatic digestion, approximately 50 μg of purified RBM20 was reduced in 6 mM DTT solution at 37°C for 3 h and then alkylated in 5 mM iodoacetic acid at 37°C for 30 min. The reduced and alkylated RBM20 was then digested with an enzyme to protein ratio of either 1:100 (for Lys‐C, Glu‐C, and Asp‐N) or 1:200 (for trypsin) to produce large peptides with mass range of 3–20 kDa. The reactions were incubated in 25 mM ammonium bicarbonate buffer at pH 8 for 30 min at 37°C and then quenched by acidifying the solution with 2 μl 98% formic acid (FA). 10 kDa molecular weight cutoff (MWCO) filters and 0.1% FA in water were used to remove salt and small peptides from proteolytic products. The desalted large peptide mixture was separated by a home‐packed PLRP column (PLRP‐S, 200 mm length × 500 μm i.d., 10 μm particle size, 1,000 Å pore size, Agilent) in a 63‐min gradient from 10% to 90% mobile phase B (mobile phase A: 0.1% FA in water; mobile phase B: 0.1% FA in 1:1 acetonitrile:ethanol) in a nanoACQUITY liquid chromatography (LC) system (Waters, Milford, MA, USA). The nanoACQUITY LC system was coupled with an impact II quadrupole‐time‐of‐flight (q‐TOF) mass spectrometer (Bruker, Bremen, Germany) for online mass spectrometry (MS) and tandem MS (MS/MS) analysis.
,
Mass spectra were collected at 1 Hz over the range of 200–3000 m/z. The isolation window for online MS/MS of auto collision‐induced dissociation (CID) was set to 4 m/z. The collision DC bias was set from 18 to 50 V for CID with nitrogen as collision gas.For offline high‐resolution MS/MS analysis of phosphorylated peptides, the eluates from LC separation were collected every 1 min from retention time of 15–45 min. Then, the peptide fractions were analyzed by a 12‐T SolariX Fourier transform ion cyclotron resonance (FTICR) mass spectrometer (Bruker, Bremen, Germany) equipped with electron capture dissociation (ECD).
The samples were introduced into the mass spectrometer using an automated chip‐based nano‐electrospray ionization (nano‐ESI) source (Triversa NanoMate, Advion Bioscience, Ithaca, NY, USA). The mass spectra were collected over the range of 200–3000 m/z with 2 M transient size (1.153 s) and 28%–30% excitation power. The isolation window for ECD fragmentation was set to 2–3 m/z and the ECD bias was from 0.8 to 1.0 V. The MS/MS spectra were the sum of 200–500 transients.The online LC‐MS and MS/MS data were processed and analyzed by Data Analysis software from Bruker Daltonics. Mass spectra of peptide profiling were deconvoluted using the Maximum Entropy algorithm in the DataAnalysis software. For online MS/MS data, the msalign file of each MS/MS spectrum was output from the DataAnalysis software for identification and characterization of RBM20 peptides using MS‐Align+.
The cutoff of E‐value and p‐value was set to E‐10 for the confident identification of peptides. For offline MS/MS data, the mass and charge list of fragmentation ions were output from DataAnalysis software for peptide sequence identification using MS‐Align+. In‐house developed MASH Suite Pro
,
was used for the manual validation of fragment ions and localization of phosphorylation sites. A minimum fit of 60% and S/N threshold of 3 were set for peak picking. Fragment ions including c, c‐1, z
●, and z
●
+1 ions were validated within 10 ppm mass error. The reported masses for the RBM20 peptides and the corresponding fragment ions are all monoisotopic masses.
Cell cultures, plasmids construction, and transfection
H9c2 rat cardiomyocytes (ATCC, Cat#CRL‐1446), HeLa human cervical cancer cells (ATCC, Cat#CCL‐2), and Human embryonic kidney (HEK293) cells (Life Technologies GmbH, Darmstadt, Germany) were cultured in Dulbecco's Modified Eagle's Medium (DMEM; HyClone, Cytiva, Marlborough, MA, USA; Cat#SH3002201) plus 10% fetal bovine serum (FBS; Sigma‐Aldrich, Merck Kgaa, Darmstadt, Germany; Cat#F2442) with 1% penicillin/streptomycin 10 000 U/ml (Gibco by Life Technologies GmbH, Darmstadt, Germany). Plasmids of pEGFP‐C1‐8*his‐RBM20 wild type (Wt) or mutations were constructed by General Biosystems, Inc. (Durham, NC, USA). Plasmid's transfection was performed by FuGENE® HD Transfection Reagent (Promega, Madison, WI, USA; Cat#E2311) following the manufacturer instructions. Forty‐eight hours after transfection, cells were fixed on coverslips for immunostaining.
Immunocytochemistry (ICC) and cell imaging
H9c2 or HeLa cells were grown on coverslips. Cells were fixed with methanol for 15 min on ice and blocking/permeabilized with 5% goat serum (Sigma‐Aldrich; Cat#G6767), 0.2% Triton‐X100 (Sigma‐Aldrich; Cat#X100) in PBS for 1 h at room temperature (RT) after 48‐h transfection. Then, cells were incubated with primary antibody anti‐RBM20 (rabbit, 1:400) in blocking buffer for 1 h at RT or overnight at 4°C. After washing, cells were incubated with Alexa flour‐conjugated secondary antibody (1:1500; anti‐rabbit, fluor 488, Cat#A32731; anti‐rabbit, flour 568, Cat#A11036; Invitrogen) for 1 h at RT, washed with PBST, and mounted in SlowFade™ Gold Antifade Mountant with DAPI (Invitrogen; Cat#S36938). Cells were visualized and captured with a Zeiss LSM‐710 confocal microscope with 405 and 488 nm laser lines, and Keyence BZ‐X800 microscope (Osaka, Japan).
Titin minigene transfection and RT‐PCR analysis
The plasmid of “pcDNA3.1‐Ex64‐70” expressing titin exons 64–70 was constructed as described previously.
Ttn mini gene (Ex 64–70) was co‐transfected with Rbm20 Wt or mutated plasmids in HeLa cells, respectively, using Lipofectamine 2000 (Invitrogen) according to the manufacturer's instructions. Cells were harvested for RNA preparation at 48 h post co‐transfection. RT‐PCR was performed to detect exon inclusion or exclusion of Ttn mini‐gene under each experimental condition using forward primer (5′‐ACCAGCTGTGCACACAAAGA‐3′) and reverse primer (5′‐TCTTCTTTGCCACAGGAACG‐3′). Results were confirmed in three independent experiments, and the PCR products were analyzed on ethidium bromide agarose gels as described previously.
A double reporter splicing assay in mammalian cell system
For double reporter splicing assay, 25 000 HEK293 cells per well were seeded on 96‐well Nunc F96 MicroWell™ plates (Life Technologies GmbH) and transfected with a 200‐ng plasmid DNA, of which 1 ng was splice reporter (PEVK Ex4‐13)
plus a corresponding amount of Rbm20 Wt, Rbm20 mutations, or control plasmids (pcDNA3.1) in a 20x molar excess. To deliver plasmid DNA, we used the 40‐kDa linear polyethylenimine (Polysciences Europe GmbH, Hirschberg, Germany) at a 1:3 ratio (DNA: PEI40). Cells transfected were at a confluence of 50%–60%. Compounds were applied 24 h post‐transfection at a final DMSO concentration of 1%. Luciferase activity was measured 60 h post‐transfection using the Dual‐Luciferase® Reporter Assay System (Promega) on an Infinite® M200 Pro (TECAN, Maennedorf, Switzerland) plate reader. Ratios of firefly to renilla luciferase activity were normalized to the control (pcDNA3.1). Cell viability was measured 60 h post‐transfection using a resazurin‐based staining of metabolically active cells (PrestoBlue®, Life Technologies GmbH, Darmstadt, Germany).
Western blotting
Western blotting analysis was performed as described previously.
,
Briefly, protein lysed from cells and tissues was separated by SDS‐PAGE gel and transferred onto a PVDF membrane. The membrane was probed with primary antibodies against RBM20 (1:1500, rabbit, homemade
), phosphorylated RBM20 S637 (1:1000, rabbit; Abmart Inc. Shanghai, China), GAPDH (1:1500, rabbit; Cell Signaling Technology, Danvers, Massachusetts, USA; Cat#2118S) served as the protein‐loading control. Then, the membrane was probed with horseradish peroxidase‐coupled secondary antibodies (1:3000, anti‐rabbit; Promega; Cat#W4011) for 1 h. Chemiluminescence images were taken by ChemiDoc system (Bio‐Rad, Hercules, CA).
Chemical histology assessments
Whole hearts from 8‐week‐old male and female Wt, heterozygous (Het), and homozygous (Hom) Rbm20S637A mice (n = 3 per group) were isolated and fixed in 4% paraformaldehyde, paraffin‐embedded, and cut in 5 µm to glass slides. Heart sections were stained with Hematoxylin and Eosin (H&E) and Masson's trichrome (n = 3), respectively. Stained sections were photographed using the Keyence BZ‐X800 microscope, and the fibrotic area was quantified by using ImageJ software (Fiji).
,
The proportion of fibrotic area was calculated as a ratio of the fibrotic area to the total cross‐sectional area.
Echocardiography
For the in vivo cardiac function assessment, transthoracic echocardiography was performed with Wt (seven males, 10 females), Het (six males, seven females), and Hom (eight males, seven females) mice, respectively, at 8 weeks old on a Visual Sonics Vevo 3100 ultrasonograph (Fujifilm, Toronto, ON, Canada) with the MX400 transducer, ~30‐MHz, as detailed previously.
To acquire two‐dimensional guided M‐mode images at the tips of papillary muscles and Doppler studies, mice were sedated by facemask administration of 1% isoflurane, removed chest hair, and maintained on a heated platform. Blood velocities across the mitral, aortic, and pulmonary valves were measured using Doppler pulse wave imaging angling the probe to obtain a nearly parallel orientation to the blood flow. End diastolic and systolic LV diameter and anterior and posterior wall (AW and PW, respectively) thickness were measured online from M‐mode images obtained in a parasternal short‐axis view using the leading edge‐to‐leading edge convention. All parameters were measured over at least three consecutive cardiac cycles and averaged. LV fractional shortening was calculated as [(LV diameterdiastole − LV diametersystole)/LV diameterdiastole] × 100; ejection fraction [(7.0/(2.4 + LV diameterdiastole) (LV diameterdiastole)3 − (7.0/(2.4 + LV diametersystole) (LV diametersystole)3/(7.0/(2.4 + LV diameterdiastole) (LV diameterdiastole)3] × 100 and LV mass was calculated by using the formula [1.05 × ((Posterior Walldiastole + Anterior Walldiastole + LV diameterdiastole)3 − (LV diameterdiastole)3)]. RV wall thicknesses were measured as above from M‐mode images obtained in a right parasternal long‐axis window. Pulmonary artery (PA) diameter was measured just distal to the pulmonary valve. Care was taken to measure PA diameter during the widest part of the ejection cycle. The aortic diameter was measured just distal to the aortic valve. Heart rate was determined from at least three consecutive intervals from the pulse wave Doppler tracings of the LV outflow tract. Ejection time was measured from the same outflow track tracings from the onset of flow to the end of flow. Isovolumic relaxation time was measured as the time from the closing of the aortic valve to the opening of the mitral valve from pulse wave Doppler tracings of the LV outflow tract and mitral inflow region. All images and measures were obtained by the same observer.
Adult mouse intact cardiomyocyte isolation
Cardiomyocytes were isolated as described previously.
,
Briefly, mice were anesthetized under isoflurane. The heart was removed and cannulated via the aorta with a blunted 21‐gauge needle for antegrade coronary perfusion. The heart was perfused for 10 min with Ringer's buffer ([in mol/L] 2 NaCL, 2.5 KCL, 0.25 HEPES, 0.5 NaH2PO4, 0.12 MgSO4, 0.5 Pyruvate, 1.98 g D‐glucose, pH 7.3; filter by 0.22 μm) plus 1 mM CaCl2 and switch to calcium‐free Ringer's buffer for about 5 min, followed by digestion buffer (calcium‐free Ringer's buffer plus 300 U/ml Collagenase (235 U/mg, Worthington, Lakewood, NJ, USA)) for 10 min, continuing to add CaCl2 solution every 5 min to make the final Ca2+ concentration is 0.5 mM for a total of three times. Then, the heart was placed in a new petri dish filled with 0.5 mM Ca2++ Ringer's buffer. The left ventricle was cut into small pieces and triturated several times with a transfer pipette and then filtered through a 100‐μm nylon mesh filter. The cells were spun at 500× g for 45 s. Then the supernatant was discarded, and the pellet was resuspended in 0.5 mM Ca2+ Ringer Buffer. Sitting for 5 min after the resuspension at RT, the supernatant was carefully removed. This process was repeated twice in 0.75 mM and 1 mM Ca2+ Ringer Buffer, respectively. Finally, the cells were resuspended in 1 mM Ca2+ Ringer Buffer and ready for use.
Titin gel electrophoresis
Titin isoforms were resolved by a previously published method using a vertical sodium dodecyl sulfate‐1% agarose gel electrophoresis (VAGE) system.
The frozen left ventricle samples from 8‐week‐old mice were homogenized in urea–thiourea–SDS–dithiothreitol sample buffer using the Geno/Grinder (SPEX SamplePrep, Metuchen, NJ, USA) at 1500 strokes/min for 1 min, repeat for five to seven cycles, and were heated at 55°C for 10 min. The denatured protein samples were loaded into 1% SDS‐agarose gel, which is run at 30 mA constant current for 3.5 h. The agarose gel was fixed in 50% methanol, 12% glacial acetic acid, and 5% w/v glycerol for 1 h and then dried overnight at 37°C. The dried gel was scanned by the silver staining method.
RNA preparation and sequencing
For analysis of differential expression and splicing genes, total RNA was extracted from the LV of male mice at 8 weeks old (Wt, Het and Hom, n = 3 per group) using TRIzol Reagent (Life Technologies, Waltham, MA, USA), treated with DNase I (Lucigen, Middleton, WI; Cat#D9905K), and determined by Nanodrop One (Thermo Fisher Scientific, Waltham, MA, USA) and electrophoresis before RNA‐sequencing (RNA‐seq). The RNA‐seq procedure was performed at the Biotechnology Center of University of Wisconsin‐Madison. Quality check of the raw data was performed using FastQC software.
Low quality reads and adapter sequences were trimmed using Trimmomatic.
Trimmed reads were mapped to the mouse reference genome (Mus musculus GRCm39) using STAR.
The full dataset was uploaded to NCBI GEO database (No. GSE180216; https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE180216).
Differential expression and splicing gene analyses and gene functional annotation
For gene‐level expression, gene counts were estimated using “‐‐quantMode GeneCounts” option in STAR.
The R package edgeR
was used to normalize gene counting based on trimmed mean of M‐values (TMM) method.
Only expressed genes with at least 15 counts in more than three samples were evaluated, resulting in 14 729 genes for further analysis. Differential expression analysis was carried out for all three pairwise comparisons between Wt and Het, between Wt and Hom, and between Het and Hom based on negative binomial generalized linear model using edgeR package.
The statistical tests were corrected for multiple testing and only genes with a false discovery rate (FDR) less than .05 were considered significant.
Volcano plots, heat maps, and violin plots were plotted using “ggplot2” package v3.3.4.
For alternative splicing analysis for the three pairwise comparisons between Wt and Het, between Wt and Hom, and between Het and Hom, Multivariate Analysis of Transcript Splicing (rMATS) software for replicates was used.
Five types of alternative splicing events were analyzed including skipped exon (SE), mutually exclusive exons (MXE), alternative 5′ splice site (A5SS), alternative 3′ splice site (A3SS), and retained intron (RI). The output from rMATS was filtered using FDR ≤ 0.05 and absolute value of ΔPSI (percent spliced‐in) ≥0.1 as the cutoff criterion to identify significant alternative splicing events.Functional enrichment analysis using differentially expressed and spliced genes was performed using DAVID Functional Annotation Analysis v6.8
,
and g:Profiler
based on Gene Ontology (GO; biological process, molecular function, and cellular component), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and Human Phenotype Ontology (HPO).
Quantitative PCR (qPCR) and RT‐PCR
The cDNA was synthesized from the same RNA used for RNA‐seq by iScript™ Reverse Transcription Supermix (Bio‐Rad; Cat#1708841) according to the manufacturer's protocol. To determine gene expression, qPCR was performed in a 10‐μl reaction, 384‐well format, and SsoAdvanced™ Universal SYBR® Green Supermix (Bio‐Rad; Cat#1725272) on CFX384™ real‐time system (Bio‐Rad). The reaction program was as follows: 95°C for 2 min, 40 cycles of 95°C for 15 s, and 60°C for 30 s; finally, the melt curve analysis was performed as default. The qPCR primers were listed in Table S2. Except for explanation, three biological samples were measured in each experimental group in duplicate, with a minimum of two independent experiments. The relative amount of target mRNA normalized to Gapdh was calculated according to the 2−∆∆Ct method.For the detection of splicing events, RT‐PCR was performed using EconoTaq PLUS GREEN 2X Master Mixes (Lucigen, Middleton, WI, USA) with 94°C for 2 min, then 35 cycles of 94°C for 15 s, 60°C for 30 s, and 72°C for 30 s. The Splicing RT‐PCR primers were listed in Table S3. PCR products were resolved on agarose gels and imaged by ChemiDoc system (Bio‐Rad).
Statistical analysis
GraphPad Prism software v9 (San Diego, CA, USA) was used for statistical analysis. Results were expressed as mean ± SEM. Statistical significance between groups was determined using one‐way ANOVA with Tukey's multiple comparisons test. The significance levels were *p ≤ .05, **p ≤ .01, ***p ≤ .001, ****p ≤ .0001.
RESULTS
Characterization of RBM20 phosphorylation with middle‐down MS and evaluation of phosphorylation on titin splicing
RS domains of splicing factors are frequently phosphorylated and phosphorylation plays a key role in pre‐mRNA splicing. Although previous study showed that RBM20 RS domain could be phosphorylated with anti‐phospho‐RBM20 antibody,
the actual phosphorylation sites have not been identified yet. To unambiguously identify RBM20 phosphorylation sites, we performed middle‐down MS with the in vitro expressed (sf9 insect cells) and purified rat RBM20. Sixteen phosphorylation sites were identified with two of them (amino acid S638 and S640 in rats) located in the RSRSP stretch of the RS domain and the rest located outside of the RSRSP stretch at S643, S645, S652, S729, S789, S879, S881, S999, S1034, S1046, S1057, S1096, S1190, and S1192 in rats (Figure 1A). A representative MS result was shown in Figure 1B and a summary of sequence coverage of RBM20 (Figure S1) and other MS results were presented in supplementary figures (Figure S2A–J). Among the 16 identified phosphorylation sites, two residues S638 (S635 in human and S637 in mice) and S640 (S637 in human and S639 in mice) were substituted by alanine (A) and glycine (G), respectively, found in human patients with DCM.
,
,
,
,
,
,
We expected that substitute of S to A or G might reduce the phosphorylation level in RS domain, and thus interrupt the splicing function of RBM20. We mutated 16 phosphorylation sites to non‐phosphorylatable amino acid A except for S640 that was mutated to G to mimic mutation in human patients with DCM, and constructed the individual mutations into a pEGFP‐C1 vector (Figure 1C). These mutated constructs were co‐transfected into HeLa cells with Ttn mini‐gene plasmid including exons 64–70. Plasmid‐carrying Wt RBM20 was used as a positive control and empty plasmid was used as negative control (NC). RNA was purified from each treatment 48 h after co‐transfection and RT‐PCR was performed to detect the splicing pattern of Ttn mini‐gene. In the absence of RBM20 (NC), a single smaller band was detected; while in the presence of RBM20 (Wt), two additional larger bands were present, which reflects RBM20 as a splicing repressor
(Figure 1D). The size and splicing pattern of the three bands were indicated in a schematic diagram (Figure 1D). Fourteen of 16 mutated RBM20 showed the same splicing pattern as Wt RBM20; however, two mutations S638A and S640G showed the same splicing pattern as the NC (Figure 1D). Thus, the in vivo cell splicing assay revealed that mutations on phosphorylation sites in the RSRSP stretch diminished the RBM20 splicing function. To further confirm the observation, a dual luciferase‐splicing reporter assay was performed with individual RBM20 mutations. The reporter construct contained exons 4–13 in titin PEVK region as described previously
(Figure 1E). High ratios of firefly to renilla luciferase reflected suppression of splicing. In the absence of RBM20 showed in the schematic diagram in Figure 1E, the two mutations S638A and S640G revealed the same splicing pattern as the NC, supporting the RT‐PCR results. The ratio of other mutations was significantly different from the NC, though they had tendency to the splicing pattern as the NC.
FIGURE 1
Identification of RBM20 phosphorylation and the role of phosphorylation in Ttn splicing. (A) Schematic diagram of the 16 phosphorylation sites on RBM20 that were found. Glu, glutamic acid; P, phosphorylation; Pro, proline; RRM, RNA recognition motif; RS, arginine/serine; ZnF, Zinc finger domain. (B) Mass spectra of six representative phosphorylated rat RBM20 peptides, D[629–659]A, K[723–750]E, D[765–925]K, Q[951–1016]R, M[985–1070]E, and A[1155–1203]E. Peptide sequences were assigned based on the MS/MS data from either CID or ECD fragmentation. The mass difference between each peak is 79.97 Da, indicating the occurrence of phosphorylation. The number above each peak shows the number of phosphorylation sites. Expt'l is the experimental monoisotopic mass based on data obtained from MS experiments. Calc'd represents the calculated monoisotopic mass based on amino acid sequences; (C) schematic diagram of mutated Rbm20 plasmids constructs and co‐transfection in cells. (D) PCR detection of Ttn mini‐gene splicing pattern after transfecting with different mutated Rbm20 plasmids and schematic diagram of Ttn mini‐gene (Exon 64–70) construct and potential alternative splicing pattern. (E) The schematic diagram of Ttn mini‐gene (PEVK Exon 4–13) splice reporter and dual luciferase splicing assay in HEK293 cells. Fluc, firefly luciferase; NC, negative control with empty plasmid; Rluc, renilla luciferase; Wt, Rbm20 wild type; Mean ± SEM (n = 3); *p < .05
Identification of RBM20 phosphorylation and the role of phosphorylation in Ttn splicing. (A) Schematic diagram of the 16 phosphorylation sites on RBM20 that were found. Glu, glutamic acid; P, phosphorylation; Pro, proline; RRM, RNA recognition motif; RS, arginine/serine; ZnF, Zinc finger domain. (B) Mass spectra of six representative phosphorylated rat RBM20 peptides, D[629-659]A, K[723-750]E, D[765-925]K, Q[951-1016]R, M[985-1070]E, and A[1155-1203]E. Peptide sequences were assigned based on the MS/MS data from either CID or ECD fragmentation. The mass difference between each peak is 79.97 Da, indicating the occurrence of phosphorylation. The number above each peak shows the number of phosphorylation sites. Expt'l is the experimental monoisotopic mass based on data obtained from MS experiments. Calc'd represents the calculated monoisotopic mass based on amino acid sequences; (C) schematic diagram of mutated Rbm20 plasmids constructs and co‐transfection in cells. (D) PCR detection of Ttn mini‐gene splicing pattern after transfecting with different mutated Rbm20 plasmids and schematic diagram of Ttn mini‐gene (Exon 64–70) construct and potential alternative splicing pattern. (E) The schematic diagram of Ttn mini‐gene (PEVK Exon 4–13) splice reporter and dual luciferase splicing assay in HEK293 cells. Fluc, firefly luciferase; NC, negative control with empty plasmid; Rluc, renilla luciferase; Wt, Rbm20 wild type; Mean ± SEM (n = 3); *p < .05
Phosphomimetic mutations on S638 and S640 did not restore the nuclear localization
To validate whether phosphorylation plays a critical role in the nucleocytoplasmic transport of mutated RBM20, we constructed plasmids with the phosphomimetic mutation from S to D (Aspartic acid) on S638 and/or S640, and transfected the plasmids to H9c2 cells. Our results showed that unphosphorylatable amino acid substitutes S638A and S640G facilitated export of RBM20 to the cytoplasm and assembly of ribonucleoprotein (RNP) granules, and so did the double mutations on S638A/S640G (Figure 2). Unexpectedly, the phosphomimetic mutations S638D and S640D did not restore the nuclear localization (Figure 2), suggesting phosphorylation on S638 and/or S640 was not the major cause for nucleocytoplasmic transport of RBM20 and the assembly of RNP granules. To further confirm this finding, we mutated a non‐phosphorylation site 641 from proline (P) to leucine (L) in the RSRSP stretch and transfected the plasmid to H9c2 cells. We found that P641L mutation also promoted RBM20 relocalization and the assembly of RNP granules (Figure 2). Furthermore, entire RSRSP stretch deletion facilitated the same phenomenon as observed with individual mutations (Figure 2). Quantification for the percent of cells demonstrating nuclear retention and cytoplasmic localization with Wt and various mutant RBM20 was presented in Figure S3A. However, mutations on phosphorylation sites outside of the RSRSP stretch in the RS domain did not facilitate RBM20 trafficking and granules formation in the cytoplasm (Figure S3B). These results imply that phosphorylation in RS domain is not a key determinant for RBM20 trafficking and RNP granules formation.
FIGURE 2
Immunofluorescence staining of Rbm20 mutations in H9c2 cells. Subcellular localization of RBM20 in H9c2 cells via transfection with various Rbm20 mutated plasmids. Single‐channel images are shown in grayscale, and RBM20 and nucleus are shown as red and blue, respectively, in merge images. Yellow bar = 20 µm and white bar = 50 µm
Immunofluorescence staining of Rbm20 mutations in H9c2 cells. Subcellular localization of RBM20 in H9c2 cells via transfection with various Rbm20 mutated plasmids. Single‐channel images are shown in grayscale, and RBM20 and nucleus are shown as red and blue, respectively, in merge images. Yellow bar = 20 µm and white bar = 50 µm
RBM20 mutation on the phosphorylation site led to reduced phosphorylation, increased RBM20 expression, cytoplasmic localization, and aberrant Ttn splicing in vivo
We next generated a mutation KI mouse model by replacing the phosphorylation site S637 (S638 in rats) in the RSRSP stretch with alanine (Rbm20S637A) through CRSPR/Cas9 gene‐editing technology to evaluate the functional role of Rbm20 mutation in vivo. PCR genotyping and sanger sequencing confirmed the successful mutation at position S637 (Figure 3A). Western blotting using antibody against phospho‐RBM20 on S637 site demonstrated that RBM20 phosphorylation level was dramatically reduced in Het mice and almost undetectable in Hom mice when compared to Wt. Notably, the total RBM20 level was significantly elevated in Het mice and Hom mice, relative to Wt mice (Figure 3B
). We assume that increased RBM20 level could be caused by a feedback mechanism to increase RBM20 transcript level (see detailed discussion below). We then determined the Rbm20 mRNA level using RT‐qPCR and found that the transcription level of Rbm20 significantly increased in Het and Hom heart tissues when compared to Wt (Figure S4A). Since RBM20 is a major splicing regulator of gene Ttn, encoding a giant sarcomeric protein titin that is responsible for ventricular wall stiffness, we performed gel electrophoresis with silver staining to detect titin isoform switching regulated by RBM20. The results revealed that the titin isoform was switched to a giant N2BA (N2BA‐G) isoform in Hom and to an intermediate‐sized N2BA titin in Het heart tissues. Titin isoform switching was consistent with that seen in RBM20 Het and Hom knockout (KO) rats
and was also observed in RBM20 targeted mice (Figures 3C and S4B). We then examined the localization of RBM20 in individual cardiomyocytes with immunocytochemical staining against RBM20 antibody. The results clearly indicated that RBM20 was contained only in the nucleus with two speckles in Wt mice (splicing mediated by RBM20 is known to occur in two nuclear speckles
) whereas the Rbm20S637A mutation reduced the nucleus localization and promoted RBM20 granule formation in the cytoplasm (Figure 3D, Videos [Link], [Link], [Link]). Furthermore, the fluorescent intensity of RBM20 in cardiomyocytes was quantified and it significantly increased in Het and Hom cardiomyocytes when compared to Wt (Figure S4C) which is consistent with the protein level detected by western blotting (Figure 3B).
FIGURE 3
Generation and characterization of Rbm20S637A KI mouse model. (A) Schematic diagram of Rbm20 gene structure and generation of a S637A mutation knock‐in mouse (Rbm20S637A) through CRISPR/Cas9 gene‐editing technology. The schematic structure indicates the mouse Rbm20 gene and coded protein domains. The mutation hot spot in the RS domain was indicated as “RSRSP.” The target amino acid is highlighted in yellow in “RSRSP” stretch, and the point mutation site is in red. The sgRNA is underlined in black and the orange nucleotides were the silent mutation site. The desired switch from TCT (S) to GCT (A) introduced an NotI restriction site and facilitates the genotyping. The edited Rbm20 alleles in mice were confirmed by Sanger sequencing and PCR genotyping. There are two PCR reactions A and B to detect offspring genotyping. For Wt mouse, only “A” reaction is positive; for Het mouse: “A” and “B” are positive; and for Hom mice, only “B” reaction is positive. (B) Phosphorylation of RBM20 S637 site and total RBM20 expressions in Wt, Het, and Hom mice by western blotting with anti‐phosphoRBM20 S637 and anti‐pan RBM20. GAPDH is a loading control. Box plots showing the quantification of phosphoRBM20 S637 to total RBM20, and the relative RBM20 expression level by normalizing to the GAPDH, respectively. Mean ± SEM (n = 4). *p < .05, **p < .001 and ***p < .0001. (C) Titin isoforms were detected in Wt, Het, and Hom heart tissues through vertical 1% SDS‐agarose gel electrophoresis (VAGE) system. N2BA‐G (giant N2BA), N2BA, and N2B represent different sizes of titin isoforms. Titin isoform expression in RBM20 KO rats were used as references. (D) Immunocytochemical staining of isolated Wt, Het, and Hom cardiomyocytes. Black bar = 50 µm and white bar = 20 µm. BF, Bright field; Het, heterozygotes; Hom, homozygotes; Wt, wild type
Generation and characterization of Rbm20S637A KI mouse model. (A) Schematic diagram of Rbm20 gene structure and generation of a S637A mutation knock‐in mouse (Rbm20S637A) through CRISPR/Cas9 gene‐editing technology. The schematic structure indicates the mouse Rbm20 gene and coded protein domains. The mutation hot spot in the RS domain was indicated as “RSRSP.” The target amino acid is highlighted in yellow in “RSRSP” stretch, and the point mutation site is in red. The sgRNA is underlined in black and the orange nucleotides were the silent mutation site. The desired switch from TCT (S) to GCT (A) introduced an NotI restriction site and facilitates the genotyping. The edited Rbm20 alleles in mice were confirmed by Sanger sequencing and PCR genotyping. There are two PCR reactions A and B to detect offspring genotyping. For Wt mouse, only “A” reaction is positive; for Het mouse: “A” and “B” are positive; and for Hom mice, only “B” reaction is positive. (B) Phosphorylation of RBM20 S637 site and total RBM20 expressions in Wt, Het, and Hom mice by western blotting with anti‐phosphoRBM20 S637 and anti‐pan RBM20. GAPDH is a loading control. Box plots showing the quantification of phosphoRBM20 S637 to total RBM20, and the relative RBM20 expression level by normalizing to the GAPDH, respectively. Mean ± SEM (n = 4). *p < .05, **p < .001 and ***p < .0001. (C) Titin isoforms were detected in Wt, Het, and Hom heart tissues through vertical 1% SDS‐agarose gel electrophoresis (VAGE) system. N2BA‐G (giant N2BA), N2BA, and N2B represent different sizes of titin isoforms. Titin isoform expression in RBM20 KO rats were used as references. (D) Immunocytochemical staining of isolated Wt, Het, and Hom cardiomyocytes. Black bar = 50 µm and white bar = 20 µm. BF, Bright field; Het, heterozygotes; Hom, homozygotes; Wt, wild type
Effect of mutating the phosphorylation site on cardiac remodeling and pathogenesis
We then assessed cardiac function in the RBM20S637A KI mice at the age of 8 weeks. Isolated Het and Hom hearts were flabby, flimsy, and flatten under their own weight in both male and female mice (Figure 4A). Gross anatomy and H&E staining revealed that the LV chamber was enlarged and the LV walls were thinner in KI male mice when compared to Wt mice, but not in the female mice (Figure 4A,B,J). Quantitative analysis of trichrome staining reveals no significant changes of fibrosis in KI mice compared to Wt mice at 8 weeks old (Figures 4C and S5A,B). The ratio of heart weight to body weight was not different between KI and Wt mice with an increased tendency in Hom mice (Figures 4D and S5C). This could be related to the enlarged atrium observed in Hom mice (Data not shown). About 34% of Hom KI mice died in the first 100 days with no significant difference between male and female as calculated by Kaplan–Meier analysis (Figure 4E). Echocardiography further revealed that both male and female KI mice had a DCM‐like phenotype with reduced ejection fraction (EF), and an enlarged LV chamber in both diastole and systole (Figure 4F–H, and Table S4). The echocardiographic data showed that aorta ejection time, ventricular wall thickness (Figure 4I), and velocity time integral (VTI) as well as PA ejection time were decreased in male KI mice. The tricuspid annular plane systolic excursion (TAPSE) depth was significantly reduced in both male and female Hom mice compared to control Wt mice (Table S4).
FIGURE 4
Histology and in vivo cardiac functional assessment of Rbm20S637A KI mice. (A) Gross anatomy of the heart with upper and side views, and slice anatomy at the midventricular level of Wt, Het, and Hom hearts in male and female mice. Bar = 0.5 cm. (B) Representative images of hematoxylin and eosin staining of cross section of mouse hearts at the age of 8 weeks in both male and female mice. Bar = 0.5 cm. (C) Representative images of Masson's trichrome staining in male mice at 8 weeks old. Bar = 100 µm. (D) Heart weight versus body weight at the age of 8 weeks in both male and female mice (n = 32 in Wt, 25 in Het, and 20 in Hom). (E) Kaplan–Meier survival curves of Wt, Het and Hom male and female mice. Statistical significance was determined by log‐rank (Mantel–Cox) test with two degrees of freedom (n = 42 in Wt, 30 in Het, and 31 in Hom). (F) Ejection fraction (EF), (G) left ventricular diastolic diameter (LVDd), (H) left ventricular systolic diameter (LVDs), and (I) left ventricular wall thickness (measured by 2*LVPW(d)/LVDd; mm) from echocardiographic imaging of mice at the age of 2 months (n = 7 males and 10 females in Wt, n = 6 males and 7 females in Het, and n = 8 males and 7 females). Mean ± SEM. Ns, not significance, *p < .05, **p < .001, and ***p < .0001
Histology and in vivo cardiac functional assessment of Rbm20S637A KI mice. (A) Gross anatomy of the heart with upper and side views, and slice anatomy at the midventricular level of Wt, Het, and Hom hearts in male and female mice. Bar = 0.5 cm. (B) Representative images of hematoxylin and eosin staining of cross section of mouse hearts at the age of 8 weeks in both male and female mice. Bar = 0.5 cm. (C) Representative images of Masson's trichrome staining in male mice at 8 weeks old. Bar = 100 µm. (D) Heart weight versus body weight at the age of 8 weeks in both male and female mice (n = 32 in Wt, 25 in Het, and 20 in Hom). (E) Kaplan–Meier survival curves of Wt, Het and Hom male and female mice. Statistical significance was determined by log‐rank (Mantel–Cox) test with two degrees of freedom (n = 42 in Wt, 30 in Het, and 31 in Hom). (F) Ejection fraction (EF), (G) left ventricular diastolic diameter (LVDd), (H) left ventricular systolic diameter (LVDs), and (I) left ventricular wall thickness (measured by 2*LVPW(d)/LVDd; mm) from echocardiographic imaging of mice at the age of 2 months (n = 7 males and 10 females in Wt, n = 6 males and 7 females in Het, and n = 8 males and 7 females). Mean ± SEM. Ns, not significance, *p < .05, **p < .001, and ***p < .0001
RBM20S637A mutation altered gene expression associated with cardiac dysfunction and RNP granule assembly
To determine differentially expressed genes (DEGs) affected by the RBM20S637A mutation, we performed transcriptome profiling with deep RNA‐seq in RBM20S637A KI male mice of 8 weeks old. A total of 1206 genes were differentially expressed in Hom LV compared to Wt LV as indicated in a volcano plot (Figures 5A and S6A; Data S5), but only 278 genes in Het LV compared to Wt LV (Figure S6A,B and Data S6). HPO analysis showed that the top 10 ranked terms were associated with cardiomyopathy and arrhythmia (Figure 5B and Data S7). GO enrichment revealed that the downregulated genes in Hom mice were associated with the regulation of heart contraction, whereas the upregulated genes in Hom mice were mostly related to translational processes and mRNA metabolism (Figure 5C and Data S8). All genes in up‐ and downregulated enrichment were indicated in the heat map (Figure 5C). To identify how many genes in DEGs were involved in assembly of stress granules (SG), we compared the known 611 SG proteins (Data S9)
,
with DEGs from our RNA‐seq data and found that 54 of 1079 DEGs were associated with SG assembly (Figures 5D and S6C). GO analysis with these 54 genes revealed that they primarily regulate RNA processing, binding, transport, and translation (Figure 5E). For example, Ddx6 (DEAD‐box helicase 6), an upregulated expression gene in KI Hom mice, is a member of DEAD box protein family and essential for assembly processing bodies (P‐bodies).
,
P‐bodies are one of the cytosolic membraneless ribonucleoprotein granules, involved in translational repression and RNA decay.
Recently, Fenix et al (2021) reported that RBM20 R636S mutant co‐localized with DDX6 in the induced pluripotent stem cells‐cardiac myocytes (iPSC‐CMs), indicating Ddx6 is also involved in RNP granule formation driven by RBM20 mutations. RNA‐seq data were validated with selected genes by qPCR and the results were consistent with RNA‐seq data (Figure 5F). Notably, the Natriuretic peptide precursor b (Nppb) encoding natriuretic peptides (BNP), a biomarker for heart failure
was significantly upregulated in Hom KI mice (Figure 5F).
FIGURE 5
Differentially expressed genes (DEGs) associated with cardiac function, protein localization and stress granules. (A) Volcano plot showing the DEGs between Hom and Wt mice. Log2 fold change (logFC) was plotted against the −log10(FDR) value. Genes with −log10(FDR) > 4 and |log2FC| > 2 were indicated. The genes highlighted in red were validated by quantitative PCR (qPCR). (B) Human phenotype ontology analysis of DEGs in Wt versus Het versus Hom mice, showing the top 10 ranked terms. The numbers inside of each bar represent intersection size versus terms size. (C) Heat maps comparing Wt, Het, and Hom mice. Genes downregulated or upregulated in Hom mice versus Wt and Het were analyzed by DAVID Functional Annotation Analysis. Related GO terms were listed, and the expression of these genes was generated by heat maps, respectively. The genes highlighted in red were validated by qPCR. (D) Venn diagram showing the overlapped genes between known stress granules (SG) genes and DEGs. The overlapping genes were shown as upregulated DEGs in red frame and downregulated DEGs in blue frame in Hom. (E) GO analysis of 54 genes overlapping with known SG genes by DAVID Functional Annotation Analysis. BP, biological process; CC, cellular component; MF, molecular function. (F) Validation of selected genes with qPCR. Mean ± SEM (n = 3). *p < .05, **p < .001, and ***p < .0001
Differentially expressed genes (DEGs) associated with cardiac function, protein localization and stress granules. (A) Volcano plot showing the DEGs between Hom and Wt mice. Log2 fold change (logFC) was plotted against the −log10(FDR) value. Genes with −log10(FDR) > 4 and |log2FC| > 2 were indicated. The genes highlighted in red were validated by quantitative PCR (qPCR). (B) Human phenotype ontology analysis of DEGs in Wt versus Het versus Hom mice, showing the top 10 ranked terms. The numbers inside of each bar represent intersection size versus terms size. (C) Heat maps comparing Wt, Het, and Hom mice. Genes downregulated or upregulated in Hom mice versus Wt and Het were analyzed by DAVID Functional Annotation Analysis. Related GO terms were listed, and the expression of these genes was generated by heat maps, respectively. The genes highlighted in red were validated by qPCR. (D) Venn diagram showing the overlapped genes between known stress granules (SG) genes and DEGs. The overlapping genes were shown as upregulated DEGs in red frame and downregulated DEGs in blue frame in Hom. (E) GO analysis of 54 genes overlapping with known SG genes by DAVID Functional Annotation Analysis. BP, biological process; CC, cellular component; MF, molecular function. (F) Validation of selected genes with qPCR. Mean ± SEM (n = 3). *p < .05, **p < .001, and ***p < .0001
RBM20S637A mutation led to aberrant splicing of genes associated with muscle contractile function and RNA processing
RBM20 is a muscle‐specific splicing factor that regulates splicing of the Ttn gene and an additional ~30 genes.
Splicing mediated by RBM20 occurs in two speckles found in nuclei of cardiomyocytes.
RBM20S637A mutation depleted RBM20 in the nucleus and translocated it in the sarcoplasm, suggesting an interrupted splicing process. We then performed splicing analysis using RNA‐seq, revealing that 135 genes underwent aberrant splicing in Hom mice and 144 genes in Het mice, when compared to Wt (Figure S7A). The differentially spliced genes (DSGs) in Hom KI mice were involved in all five splicing events (97 SE (skipping exon), 26 MXE (mutually exclusive exons), 11 A5SS (alternative 5′ splice site), 24 A3SS (alternative 3′ splice site), 14 RI (retained intron)), indicated in violin and volcano plots (Figure 6A,B). Ttn, Camk2d, and Ryr2 were exhibited in the top significantly spliced events (Figure 6B). These genes were also observed in most significantly spliced events in Het KI mice (Figure S7B). GO analysis showed that the DSGs were associated with cardiac muscle hypertrophy and contractile function, RNA processing, translation, and protein localization (Figure 6C). Furthermore, the Venn diagram between the known SG genes and DSGs in Het and Hom KI mice showed that five DSG genes overlapped with the known SG genes (Figure 6D), suggesting that gene splicing regulates assembly of RNP granules caused by RBM20S637A mutation. The Ttn splicing result was consistent with that of our protein analysis (Figure 3C), confirming that the RBM20S637A mutation promoted expression of large titin isoforms. We selected genes associated with calcium handling and SGs and validated the abnormal splicing of these genes with RT‐PCR (Figure 6E). The splicing variants were quantified with Image J (Figure S7C).
FIGURE 6
Splicing variant analysis involving cardiac function in Rbm20S637A KI mice. (A) Violin plots representing distributions of statistically significant ΔPSI (percent spliced‐in) values for different classes of AS events between Hom and Wt mice (ΔPSI = PSI(Hom) − PSI(Wt)). The lower and upper bounds of the embedded box represented the 25th and 75th percentile of the distribution, respectively. The horizontal line in the box represented the median. The numbers of events are shown below each plot. SE, skipped exon; MXE, mutually exclusive exons; A5SS, alternative 5′ splice site; A3SS, alternative 3′ splice site; RI, retained intron. (B) Volcano plot showing the differentially splicing genes (DSGs) between Hom and Wt mice. ΔPSI was plotted against the −log10(FDR) value. Genes with −log10(FDR) > 5 and |ΔPSI| > 0.1 were indicated. The genes with different colors represent different splicing events. (C) GO analysis of DSGs between Hom and Wt and between Het and Wt mice, respectively. (D) Venn diagram showing overlapping hits from DSGs with known SG proteins. The Tpm2 gene highlighted in red was validated by RT‐PCR. (E) Validation by RT‐PCR for DSGs in Wt, Het, and Hom mice (n = 3). The splicing patterns of target exon were shown on the right panel. Gapdh, a house keeping gene
Splicing variant analysis involving cardiac function in Rbm20S637A KI mice. (A) Violin plots representing distributions of statistically significant ΔPSI (percent spliced‐in) values for different classes of AS events between Hom and Wt mice (ΔPSI = PSI(Hom) − PSI(Wt)). The lower and upper bounds of the embedded box represented the 25th and 75th percentile of the distribution, respectively. The horizontal line in the box represented the median. The numbers of events are shown below each plot. SE, skipped exon; MXE, mutually exclusive exons; A5SS, alternative 5′ splice site; A3SS, alternative 3′ splice site; RI, retained intron. (B) Volcano plot showing the differentially splicing genes (DSGs) between Hom and Wt mice. ΔPSI was plotted against the −log10(FDR) value. Genes with −log10(FDR) > 5 and |ΔPSI| > 0.1 were indicated. The genes with different colors represent different splicing events. (C) GO analysis of DSGs between Hom and Wt and between Het and Wt mice, respectively. (D) Venn diagram showing overlapping hits from DSGs with known SG proteins. The Tpm2 gene highlighted in red was validated by RT‐PCR. (E) Validation by RT‐PCR for DSGs in Wt, Het, and Hom mice (n = 3). The splicing patterns of target exon were shown on the right panel. Gapdh, a house keeping gene
DISCUSSION
We found that RBM20 protein was phosphorylated on 16 serine amino acid residues, two of which were located in the RSRSP stretch in the RS domain. Individual mutations of 14 serine to the unphosphorylatable alanine neither altered splicing function of RBM20 nor altered the location of RBM20 in the nucleus. However, the S638A (S635 in human and S637 in mice) and S640G (S637 in human and S639 in mice) in the RSRSP stretch altered the splicing pattern of RBM20 substrates such as Ttn. Furthermore, they facilitated RBM20 translocation from the nucleus to the cytoplasm and formation of protein aggregation. Phosphomimetic mutations on these two phosphorylation sites did not restore RBM20 translocation and RNP granule formation. Furthermore, RBM20 S637A mutation KI mice showed that the mutated phosphorylation site S637 led to cardiac remodeling and pathogenesis through depletion of RBM20 in the nucleus, formation of granules in the cytoplasm, and dysregulation of gene expression and splicing.Pre‐mRNA splicing is a key mechanism for protein diversity as well as gene regulation and involves dynamic interactions of protein–RNA and protein–protein complexes.
In this tightly regulated process, SR proteins are a group of RNA‐binding proteins containing enriched dipeptide SR repeats (also known as the RS domain) that play a critical role in modulating the binding and assembly of the spliceosome complex and thereby in determining the splicing site selection.
RS domains can mediate protein–protein and protein–RNA interactions during spliceosome assembly.
Splicing factors are a subfamily of SR proteins and act as the trans effects in the regulation of pre‐mRNA splicing. Mutations disrupting the pre‐mRNA splicing have been linked to human diseases.
A wide range of disorders as well as heart diseases are caused by mutations affecting splice consensus sites (cis effects).
However, mutations in trans‐acting factors such as splicing factors have not been broadly studied, particularly not in heart diseases.
Studies in human genetics and animal models showed that mutations in the RS domain of RBM20 protein were associated with DCM, with young age at diagnosis, end‐stage heart failure, and high mortality.
,
,
,
,
,
,
,
,
Two of these mutations identified in human patients were S635A (S638A in rats and S637A in mice) and S637G (S640G in rats and S639G in mice) and both of these were identified as phosphorylation residues in this study. Previous studies indicated that phosphorylation and dephosphorylation of RS domain were critical for numerous steps in subcellular localization and splicing control of splicing factors.
,
,
,
,
,
Our data showed that RBM20 mutations from a phosphorylation residue (S) to an unphosphorylatable residue (A or G) in the RS domain disrupted pre‐mRNA splicing in cells and tissues. Using an in vitro cell splicing assay, we showed that dephosphorylation of a splicing factor through mutations in the RS domain interrupted splicing control in a trans effect manner. Cardiac dysfunction caused by mutations in trans‐acting factor RBM20 was associated with aberrant splicing of genes such as Ttn and camk2d, and thus pathophysiological changes.SR proteins in the spliceosome complex continuously shuttle between the nucleus and the cytoplasm. Localization of these proteins can shift in response to stress signals as well as mutations.
,
,
Changes in distribution of splicing factors in the nucleus and in the cytoplasm can impact splicing as well as gene expression and are associated with a diverse set of diseases.
,
SR protein trafficking/shuttling has been extensively studied in neurodegenerative diseases. For example, TAR DNA‐binding protein 43 (TDP‐43) is one of the RNA‐binding proteins that regulates splicing of the pre‐mRNA of CFTR (the gene mutated in cystic fibrosis). TDP‐43 is normally located in the nucleus. Mutations in the TDP‐43 gene (TARDBP) facilitate cytoplasmic inclusion. Depletion of TDP‐43 from the nucleus and accumulation in the cytoplasm were associated with both sporadic and familial forms of Alzheimer's disease and amyotrophic lateral sclerosis, demonstrating a role for TDP‐43 mutation in the pathogenesis of neurological diseases.
,
,
,
However, the pathogenic mechanisms caused by the mutations are still unclear and might involve loss of splicing factor's nuclear function and/or the toxicity of protein aggregation in the cytoplasm. Earlier work concluded that RBM20‐associated cardiac dysfunction was primarily due to the aberrant Ttn splicing.
,
,
Recent studies suggested that RBM20 shuttling and protein aggregation in the cytoplasm could be a likely major contributor to cardiac dysfunction caused by RBM20 mutations.
,
,
Our current study also revealed that RBM20 mutation in the RS domain caused cardiac dysfunction which confirmed the previous findings in a mouse model with the same mutation site.
However, our results indicated that mutation S637A caused pathogenesis and high mortality rate (~34%) at younger age (less than 8 weeks), which is different from the previous study observed pathogenesis and high death rate (~30%) at later age (~9 months).Lastly, we observed that the S637A RBM20 mutation reduced the phosphorylation level of RBM20 in Rbm20S637A mice, but significantly elevated RBM20 expression in protein level. The increased RBM20 expression might be due to a feedback mechanism, for example, translocation of RBM20 to the cytoplasm that depletes RBM20 from the nucleus might trigger increased RBM20 transcription. In addition, due to impairment of RBM20 nucleocytoplasmic transport, the translated RBM20 will accumulate in the cytoplasm. Depletion of RBM20 in the nucleus is likely to signal the transcription machinery to transcribe more RBM20 to meet the need in the nucleus. We did observe the increased mRNA level with RT‐qPCR in the KI mouse heart tissues. RBM20 protein could be protected in the protein aggregation for degradation, leading to accumulation of the protein which could be another possible reason for increased RBM20 protein level. This will certainly be our future direction to investigate the underpinning mechanisms.Taken together, this study identified the 16 phosphorylation sites on RBM20. We showed that phosphorylation and de‐phosphorylation of the RBM20 RS domain resulted in a titin isoform transition. Mutating phosphorylation sites led to RBM20 shuttling from the nucleus to the cytoplasm and caused protein aggregation in vitro and in vivo. Furthermore, our in vitro study with phospho‐mimetic mutations did not restore translocation of RBM20 and RNP granules formation. Finally, our in vivo study confirmed that the Rbm20S637A mutation resulted in cardiac remodeling and pathogenesis in both male and female mice. How phosphorylation affects RNA splicing, trafficking, and assembly of RNP granules in vivo is an important area for future study.
DISCLOSURES
None.
AUTHOR CONTRIBUTIONS
Wei Guo, Ying Ge, Henk Granzier, Timothy A. Hacker, Michael Gotthardt, Yanghai Zhang, and Chunyan Wang designed the experiments; Yanghai Zhang, Chunyan Wang, Martin Liss, and Mingming Sun performed the experiments; Yutong Jin performed the mass spectrometry experiments; Chunyan Wang and Yanghai Zhang performed the animal study and sample preparation; Camila Urbano Braz and Yanghai Zhang performed RNA‐seq analysis; Yanghai Zhang, Chunyan Wang, Yutong Jin, Timothy A. Hacker, Camila Urbano Braz, Hasan Khatib, Michael Gotthardt, Ying Ge, Wei Guo, and Mingming Sun analyzed and interpreted the data; Yanghai Zhang and Wei Guo wrote the paper; Hasan Khatib, Timothy A. Hacker, Ying Ge, and Henk Granzier revised the paper.Fig S1‐S7Click here for additional data file.Table S1‐S4Click here for additional data file.Video S1Click here for additional data file.Video S2Click here for additional data file.Video S3Click here for additional data file.Data S1Click here for additional data file.Data S2Click here for additional data file.Data S3Click here for additional data file.Data S4Click here for additional data file.Data S5Click here for additional data file.
Authors: Marwan M Refaat; Steven A Lubitz; Seiko Makino; Zahid Islam; J Michael Frangiskakis; Haider Mehdi; Rebecca Gutmann; Michael L Zhang; Heather L Bloom; Calum A MacRae; Samuel C Dudley; Alaa A Shalaby; Raul Weiss; Dennis M McNamara; Barry London; Patrick T Ellinor Journal: Heart Rhythm Date: 2011-10-17 Impact factor: 6.343
Authors: Wenxuan Cai; Jianhua Zhang; Willem J de Lange; Zachery R Gregorich; Hannah Karp; Emily T Farrell; Stanford D Mitchell; Trisha Tucholski; Ziqing Lin; Mitch Biermann; Sean J McIlwain; J Carter Ralphe; Timothy J Kamp; Ying Ge Journal: Circ Res Date: 2019-10-01 Impact factor: 17.367
Authors: Wei Guo; Sebastian Schafer; Marion L Greaser; Michael H Radke; Martin Liss; Thirupugal Govindarajan; Henrike Maatz; Herbert Schulz; Shijun Li; Amanda M Parrish; Vita Dauksaite; Padmanabhan Vakeel; Sabine Klaassen; Brenda Gerull; Ludwig Thierfelder; Vera Regitz-Zagrosek; Timothy A Hacker; Kurt W Saupe; G William Dec; Patrick T Ellinor; Calum A MacRae; Bastian Spallek; Robert Fischer; Andreas Perrot; Cemil Özcelik; Kathrin Saar; Norbert Hubner; Michael Gotthardt Journal: Nat Med Date: 2012-05 Impact factor: 53.440
Authors: Katharine M Brauch; Margaret L Karst; Kathleen J Herron; Mariza de Andrade; Patricia A Pellikka; Richard J Rodeheffer; Virginia V Michels; Timothy M Olson Journal: J Am Coll Cardiol Date: 2009-09-01 Impact factor: 24.094
Authors: Mei Methawasin; Kirk R Hutchinson; Eun-Jeong Lee; John E Smith; Chandra Saripalli; Carlos G Hidalgo; Coen A C Ottenheijm; Henk Granzier Journal: Circulation Date: 2014-03-05 Impact factor: 29.690
Authors: Nicola J Rutherford; Yong-Jie Zhang; Matt Baker; Jennifer M Gass; Nicole A Finch; Ya-Fei Xu; Heather Stewart; Brendan J Kelley; Karen Kuntz; Richard J P Crook; Jemeen Sreedharan; Caroline Vance; Eric Sorenson; Carol Lippa; Eileen H Bigio; Daniel H Geschwind; David S Knopman; Hiroshi Mitsumoto; Ronald C Petersen; Neil R Cashman; Mike Hutton; Christopher E Shaw; Kevin B Boylan; Bradley Boeve; Neill R Graff-Radford; Zbigniew K Wszolek; Richard J Caselli; Dennis W Dickson; Ian R Mackenzie; Leonard Petrucelli; Rosa Rademakers Journal: PLoS Genet Date: 2008-09-19 Impact factor: 5.917