Zhenfei Gao1,2, Anzhao Wang1,2, Yongxu Zhao3, Xiaoxu Zhang1,2, Xiangshan Yuan4, Niannian Li1,2, Chong Xu1,2, Shenming Wang1,2, Yaxin Zhu1,2, Jingyu Zhu1,2, Jian Guan1,2, Feng Liu1,2, Shankai Yin1,2. 1. Department of Otolaryngology Head and Neck Surgery & Center of Sleep Medicine, Otolaryngology Institute of Shanghai Jiao Tong University, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Yishan Road 600, Shanghai 200233, China. 2. Shanghai Key Laboratory of Sleep Disordered Breathing, Yishan Road 600, Shanghai 200233, China. 3. CAS Key Laboratory of Nutrition, Metabolism and Food Safety, Shanghai Institute of Nutrition and Health, Shanghai Institutes for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200231, China. 4. Department of Anatomy and Histoembryology, School of Basic Medical Sciences, State Key Laboratory of Medical Neurobiology and MOE Frontiers Center for Brain Science, Institutes of Brain Science, Fudan University, Shanghai 200231, China.
Abstract
Ubiquitination is a major posttranslational modification of proteins that affects their stability, and E3 ligases play a key role in ubiquitination by specifically recognizing their substrates. BTBD9, an adaptor of the Cullin-RING ligase complex, is responsible for substrate recognition and is associated with sleep homeostasis. However, the substrates of BTBD9-mediated ubiquitination remain unknown. Here, we generated an SH-SY5Y cell line stably expressing BTBD9 and performed proteomic analysis combined with ubiquitinome analysis to identify the downstream targets of BTBD9. Through this approach, we identified four potential BTBD9-mediated ubiquitination substrates that are targeted for degradation. Among these candidate substrates, inosine monophosphate dehydrogenase (IMPDH2), a novel target of BTBD9-mediated degradation, is a potential risk gene for sleep dysregulation. In conclusion, these findings not only demonstrate that proteomic analysis can be a useful general approach for the systematic identification of E3 ligase substrates but also identify novel substrates of BTBD9, providing a resource for future studies of sleep regulation mechanisms.
Ubiquitination is a major posttranslational modification of proteins that affects their stability, and E3 ligases play a key role in ubiquitination by specifically recognizing their substrates. BTBD9, an adaptor of the Cullin-RING ligase complex, is responsible for substrate recognition and is associated with sleep homeostasis. However, the substrates of BTBD9-mediated ubiquitination remain unknown. Here, we generated an SH-SY5Y cell line stably expressing BTBD9 and performed proteomic analysis combined with ubiquitinome analysis to identify the downstream targets of BTBD9. Through this approach, we identified four potential BTBD9-mediated ubiquitination substrates that are targeted for degradation. Among these candidate substrates, inosine monophosphate dehydrogenase (IMPDH2), a novel target of BTBD9-mediated degradation, is a potential risk gene for sleep dysregulation. In conclusion, these findings not only demonstrate that proteomic analysis can be a useful general approach for the systematic identification of E3 ligase substrates but also identify novel substrates of BTBD9, providing a resource for future studies of sleep regulation mechanisms.
Ubiquitination, one of the most important posttranslational modifications
of proteins, occurs in all cells and is essential for numerous aspects
of cell physiology. In addition, the ubiquitin–proteasome system
(UPS) is a common mechanism of protein degradation.[1] In recent years, considerable progress has been made in
elucidating the molecular action of ubiquitin in signaling pathways
and the mechanism by which the UPS leads to the development of distinct
human diseases such as cancer, metabolic syndromes, neurodegenerative
diseases, and sleep disorders.[2−4]Ubiquitin (Ub) is a 76-amino
acid protein with seven lysine residues,
all of which can be ubiquitinated by a three-enzyme cascade consisting
of an E1 Ub-activating enzyme, E2 Ub-conjugating enzyme, and E3 Ub-protein
ligase and subsequently attached to a specific substrate to generate
different polyubiquitin chains.[5] E3s are
the critical components of this cascade because they strictly regulate
both the efficiency and substrate specificity of the ubiquitination
reaction.[6] Cullin-RING ligase (CRL) complexes
are a major group of E3s and are characterized by a RING-finger structure
and an adaptor protein that is responsible for substrate recognition.[7] Increasing studies have shown that CRLs play
an important role in sleep regulation.[3,8,9]BTBD9 is a member of the BTB/POZ protein family
and serves as an
adaptor protein of CULLIN3 (CUL3) to participate in ubiquitination
reactions.[10] Genetic studies have found
that BTBD9 gene polymorphisms are important risk
factors for sleep disorders.[11,12] In addition, studies
using Btbd9 knockout mice and dBTBD9 knockout drosophila have confirmed that BTBD9 is
a sleep regulation factor.[13,14] Systematic quantitative
proteomic studies have revealed the presence of a stable interaction
between BTBD9 and CUL3, but the role of CUL3-mediated ubiquitination
in human circadian rhythms and sleep structure is not clear. However,
there is sufficient evidence that CUL3 is a crucial component of the
Drosophila clock.[15] Overall, these findings
suggest that BTBD9 may regulate sleep or wakefulness through CUL3-mediated
protein ubiquitination. However, the downstream substrate that is
ubiquitinated via the interaction between BTBD9 and CUL3 to influence
sleep regulation remains unknown, and the mechanisms by which BTBD9
and CUL3 exert their effects also need to be further studied.To identify the specific substrates of BTBD9-mediated ubiquitination,
we generated SH-SY5Y cell lines stably expressing BTBD9 and performed proteomic and ubiquitinome analyses. Ubiquitinome
analysis revealed that BTBD9 significantly contributes to the overall
protein ubiquitination state and suggested a potential role for BTBD9
in regulating protein localization and neurodegenerative diseases.
Ubiquitinome analysis in combination with quantitative proteomic analysis
showed that the levels of four candidate substrates are decreased,
with IMPDH2 showing the most significant change in expression. We
further validated that the ubiquitination of IMPDH2 is mediated by
BTBD9 and identified IMPDH2 as a novel sleep dysregulation risk gene.
This is the first study to systematically explore the substrates of
BTBD9 and its combined effects on the substrate proteins. We believe
that this study may help us reveal the mechanisms by which BTBD9 affects sleep and provide new ideas for sleep regulation.
Materials and Methods
Sleep Parameters Recording
of Btbd9PB/PB Mice
Animals
The
specific pathogen-free
(SPF) C57BL/6 J male wild-type (WT) mice (2 months old, 18–20
g) were purchased from Shanghai Laboratory Animal Center, Chinese
Academy of Science (SLAC, Shanghai, China), and Btbd9 mice were kindly gifted from Prof. Wu Xiaohui
(Fudan University) and housed four to five per cage under a constant
temperature (22 ± 0.5 °C), humidity (55 ± 5%), and
an automatically controlled 12 h light/12 h dark cycle (lights on
at 7 a.m.), with access to food and water ad libitum. The study protocol
was approved by the Institutional Animal Care and Use Committee of
Shanghai Jiao Tong University Affiliated Sixth People’s Hospital
(no. 2021-0149).
EEG/EMG Recording Electrode
Implantation
Mice were first anesthetized with a 2–3%
isoflurane/oxygen
mixture. After completion of scalp preparation and sterilization,
the mice’s heads were fixed on a stereotaxic holder and two
small holes (1 mm in diameter) were drilled in the frontal (AP/ML:
+1.50/–0.80 mm) and parietal (AP/ML: −1.50/–1.00
mm) bone surfaces to facilitate the implantation procedure. During
surgery, the EEG electrodes were screwed into the bone holes in the
frontal and parietal lobes, while the EMG electrodes were inserted
into the bilateral oblique muscles. Subsequently, the EEG and EMG
electrodes were attached to a miniature connector to form an electrode
assembly, which was then fixed to the skull surface with tooth-based
acrylic resin. After the surgical mice were housed individually and
recovered for 7 days, monitoring of video polysomnography recordings
was carried out.
Video-Polysomnography
Recordings and Data
Analysis
Video PSG recordings were performed as previously
described.[16] Briefly, mice were connected
to the device and habituated for 3 days prior to formal recording.
Cortical EEG and cervical EMG signals were digitized at a sampling
rate of 512 Hz, amplified, filtered (Biotex), and then recorded through
a CED 1401 digitizer and Spike 2 software (CED, UK). The Spike 2 data
were then converted to appreciable vigilance states using SleepSign
software (Kissei Comtec, Japan). By this method, the alertness states
of the mice (scored every 4 s timing) were automatically classified
as wake, rapid eye movement sleep (REM), and non-rapid eye movement
sleep (NREM). The sleep classification was then manually checked and
corrected in case of incompatibility. After manual calibration and
correction, the number, percentage, transition, and duration of each
alert state were calculated.
Cell
Culture and Transfection
SH-SY5y
cells were kindly provided by the Stem Cell Bank, Chinese Academy
of Sciences and cultured in Dulbecco’s modified Eagle’s
medium (DMEM) plus 10% fetal bovine serum and penicillin and streptomycin
at 37 °C in 5% CO2 (v/v). Cells were seeded in a six-well
plate and grown to 70–90% confluent for transfection. For each
well, 1 μg of the plasmid and 2 μL of the Lipofectamine
2000 reagent were mixed in an Opti-MEM medium and incubated for 25
min at room temperature. The mixture was added to a cell culture medium
in the absence of serum for 6 h and then changed to complete the medium.
Real-Time qPCR
Total RNA was extracted
from cells using an EZ-press RNA purification kit (EZBioscience, MN,
USA), and EZ-press reverse transcriptase kits were used to synthesize
cDNA as the template for qPCR with 1 μg of total RNA. All operations
were performed in accordance with product instruction.Relative
quantitative analysis of the gene expression level was performed in
LightCycler 480 II (ROCHE, USA) according to the operating manual.
The thermocycling procedure was set according to the instruction of
a Power SYBR Green PCR master mix (2×) provided by EZBioscience.
Housekeeping gene ACTIN was set as an IC (internal
control), and each assay was independently repeated three times. The
primers of BTBD9 and ACTIN were
listed below:ACTIN-F:5′-CATGTACGTTGCTATCCAGGC-3′ACTIN-R:5′-CTCCTTAATGTCACGCACGAT-3′BTBD9-F:5′-GGCAACGCTGACAGATGAGAA-3′BTBD9-R:5′-AGGTAGAATCCTCTAGCTCTGGA-3′
Sample Preparation and
MS Data Analysis
The cells were washed with cold 1×
PBS twice before being
collected from the dishes by trypsin. They were then transferred to
a new precooled tube, and liquid nitrogen was added to fully grind
the cells into a powder. The samples of each group were added to 4
times the volume of powdered lysis buffer (8 M urea, 1% “Protease
Inhibitor Cocktail Set I”, 50 μM 3,5-dithiocyanatopyridine-2,6-diamine;
the protease inhibitor was purchased from Merck Millipore (56500))
and lysed by ultrasonication. The samples were first centrifuged at
12,000g for 10 min at 4 °C to remove cell debris.
Then, the supernatant was transferred to a new centrifuge tube, and
the protein concentration was determined using a BCA kit following
the manufacturer’s instructions.The protein concentration
of each sample protein was adjusted to be the same, and then, the
equal volume of these was used for enzymatic hydrolysis and subsequent
analysis. Then, 20% trichloroacetic acid (TCA) was slowly added, vortexed,
and allowed to settle at 4 °C for 2 h. The samples were centrifuged
at 4500g for 5 min, and after removing the supernatant,
the precipitate was washed two to three times with precooled acetone.
After drying the pellet, TEAB was added to a final concentration of
200 mM, the pellet was ultrasonically dispersed, trypsin was added
to digest the protein, and the mixture was hydrolyzed overnight. Dithiothreitol
(DTT) was added to a final concentration of 5 mM and reduced at 56
°C for 30 min. Then, iodoacetamide (IAA) was added to a final
concentration of 11 mM and incubated for 15 min at room temperature
in the dark.The peptides were dissolved in mobile phase A of
liquid chromatography
and then separated using an EASY-nLC 1200 ultrahigh-performance liquid
system. Mobile phase A was an aqueous solution containing 0.1% formic
acid and 2% acetonitrile; mobile phase B was an aqueous solution containing
0.1% formic acid and 90% acetonitrile. Liquid gradient setting: 0–62
min, 4–23% B; 62–82 min, 23–35% B; 82–86
min, 35–80% B; 86–90 min, 80% B. The flow rate was maintained
at 500 nL/min. The peptides were separated by an ultrahigh-performance
liquid system, injected into the NSI ion source for ionization, and
then analyzed by Q Exactive HF-X mass spectrometry. The ion source
voltage was set to 2.1 kV, and the peptide precursor ions and their
secondary fragments were detected and analyzed by a high-resolution
Orbitrap. The scanning range of the primary mass spectrum was set
to 400–1500 m/z, and the
scanning resolution was set to 120,000; the scanning range of the
secondary mass spectrum was set to a fixed starting point of 100 m/z, and the secondary scanning resolution
was set to 15,000. A data-dependent scanning (DDA) program was used
to acquire the mass spectrum data, and after the first level scan,
the first 10 peptide precursor ions with the highest signal intensity
were selected to enter the HCD collision cell, and 28% of the fragmentation
energy was used for fragmentation grade mass spectrometry analysis.
To improve the effection of the mass spectrometer, the automatic gain
control (AGC) was set to 5 × 104, the signal threshold
was set to 2.5 × 105 ions/s, the maximum injection
time was set to 40 ms, and the dynamic rejection time of the tandem
mass spectrometry scan was set to 30 s to avoid repeated scanning
of ions.
Bioinformatic Analysis
Enrichment
of Gene Ontology Analysis
Proteins were classified by GO
annotation into three categories:
the biological process, cellular compartment, and molecular function
according to the Gene Ontology database.[17] For each category, a two-tailed Fisher’s exact test was employed
to test the enrichment of the differentially expressed protein against
all identified proteins. The GO with a corrected p value <0.05 is considered significant.
Enrichment
of Pathway Analysis
The Encyclopedia of Genes and Genomes
(KEGG) database (http://www.kegg.jp/)[18] was used to identify enriched pathways
by a two-tailed
Fisher’s exact test to test the enrichment of the differentially
expressed protein against all identified proteins. The pathway with
a corrected p value <0.05 was considered significant.
These pathways were classified into hierarchical categories according
to the KEGG website.
Enrichment of Protein
Domain Analysis
For each category proteins, the InterPro
database[19] was researched and a two-tailed
Fisher’s exact test
was employed to test the enrichment of the differentially expressed
protein against all identified proteins. Protein domains with a corrected p value <0.05 were considered significant.Over
representation analysis was conducted by “clusterProfiler”
R packages,[20] and gene set enrichment analysis
was performed by GSEA_4.1.0 application.[21]
Immunoprecipitation (IP)
Cells were
washed with cold PBS twice and lysed in chilled lysis buffer supplemented
with a protease inhibitor mixture. They were then incubated on ice
for 30 min. The cell debris was removed after centrifuging at 14,000
rpm for 25 min. The supernatant was subjected to IP with 20 μL
of anti-FLAG M2 affinity resin (Sigma) overnight at 4 °C. Resin-containing
immune complexes were washed with ice-cold lysis buffer followed by
TBS washes. After that, 50 μL of SDS-PAGE sample loading buffer
(2×) was added and it was heated at 95 °C for 10 min to
denature proteins.
Western Blots
Total proteins were
separated by SDS-PAGE (sodium dodecyl sulfate polyacrylamide gel electrophoresis)
and transferred onto PVDF (polyvinylidene fluoride) membranes. Membranes
were blocked in 5% non-fat milk diluted in TBS for 1 h and then were
incubated with the primary antibody diluted in TBST overnight at 4
°C. Membranes were subsequently incubated with the secondary
antibody for 1 h in room temperature after washing with 1× TBS
three times. The signals of the proteins were visualized on LI-COR
Odyssey (Lincoln, NE, USA). The following antibodies were used: the
anti-IMPDH2 antibody (1:2000, Abcam, ab131158), BTBD9 Rabbit poly
antibody (1:2000, Abclonal, A18528), anti-FLAG antibody (1:2000, Sigma,
F1804), anti-HA antibody (1:2000, Cell Signaling Technology, C29F4),
anti-β-actin antibody (1:10000, Abclonal, A2319), anti-GAPDH
antibody (1:10000, Abclonal, AC002), anti-Mouse IgG H&L (1:10000,
Abcam, ab216778), and anti-Rabbit IgG H&L (1:10000, Abcam, ab216776).
Genome-Wide Association Studies (GWAS)
Participants
This GWAS study was
performed in the sleep center of Shanghai Jiao Tong University Affiliated
Sixth People’s Hospital from January 2011 to June 2019 (ongoing
SSHS study) according to the Declaration of Helsinki (registration
number: ChiCTR1900025714). A written informed consent was obtained
from each participant. A total of 725 people were recruited in this
study. All participants were asked to complete a uniform questionnaire
regarding personal characteristics such as weight, height, and medical
histories. Exclusion criteria were as follows: (i) age less than 18
years old; (ii) history of sleep disorders; (iii) psychiatric disturbances,
chronic liver disease, or chronic kidney disease; and (iv) unavailable
clinical data.
Sleep Parameter Recording
Overnight
polysomnography (PSG) monitors were performed in the sleep center
of Shanghai Jiao Tong University Affiliated Sixth People’s
Hospital with an Alice 4 system (Philips Respironics Inc., Pittsburgh,
PA, USA). Sleep parameters were obtained from the electroencephalography
(EEG) result recorded by PSG including N1 sleep (N1), N2 sleep (N2),
N3 sleep (N3), rapid eye movement sleep (REM), wake time (WK), total
sleep time (TST), and sleep period time (SPT). The ratio of N1/TST,
N2/TST, N3/TST, REM/TST, WK/SPT, and sleep efficiency (SE, TST/SPT)
as well as TST was calculated to do the analysis.
Genotyping
Genomic DNA was extracted
from the whole blood by a DNA isolation kit (Qiagen). An Affymetrix
Genome-Wide Human SNP Array 6.0 (SNP6.0) and Affymetrix Axiom Genome-Wide
CHB Array (CHB) were used to detect the specific base sequence of
each participant.[22] Data from Affymetrix
chips were analyzed by using Genotyping Console version 3.1 software
(Affymetrix). Data with a high missing gene call rate (>5%), low
MAF
(<0.01), and significant deviation from Hardy–Weinberg equilibrium
(P < 1 × 10–6) were excluded
from analysis.
Statistic
For
comparisons between
two groups, Student’s t test was used and
a p value of <0.05 was considered statistically
significant.
Results
Mutations
in BTBD9 Cause Sleep Disturbance
To validate the association
of BTBD9 SNPs with
sleep architecture, we performed a small-scale GWAS consisting of
785 individuals with standard PSG monitoring. According to the result
of this analysis, we drew a Manhattan plot[23,24] and found that there was a statistically significant association
between the intronic BTBD9 gene variant rs201664431
(GACA > G) and the wake/sleep time (WK/SPT) (Figure A). We then compared WK/SPT data between
individuals of different rs201664431 genotypes, and the results showed
that individuals with deletion alleles had longer wake durations than
those with reference alleles (Figure B).
Figure 1
Alteration in BTBD9 causes sleep disturbance.
(A) Manhattan plot showing the distribution of SNPs associated with
WK/SPT in people (p < 10–3).
(B) Comparison of WT/SPT between different phenotypes of rs201664431
by the t test (*p < 0.05, n = 725). (C) Diagram for mouse EEG recording. (D) Line
chart that displays the duration of wake time and REM sleep of mice
during the night (1 = 19 p.m., 12 = 7 a.m., n = 3
in each group, error bar: mean ± SEM, *p <
0.05). (E) Average percent of the distinct sleep status of mice during
the recording time (7 p.m. to 7 a.m.).
Alteration in BTBD9 causes sleep disturbance.
(A) Manhattan plot showing the distribution of SNPs associated with
WK/SPT in people (p < 10–3).
(B) Comparison of WT/SPT between different phenotypes of rs201664431
by the t test (*p < 0.05, n = 725). (C) Diagram for mouse EEG recording. (D) Line
chart that displays the duration of wake time and REM sleep of mice
during the night (1 = 19 p.m., 12 = 7 a.m., n = 3
in each group, error bar: mean ± SEM, *p <
0.05). (E) Average percent of the distinct sleep status of mice during
the recording time (7 p.m. to 7 a.m.).To better characterize the functional role of BTBD9 in sleep regulation, a mouse strain with PiggyBac transposase-mediated
gene silencing (the Btbd9 strain)
was used in this study. Homozygous mice were able to be born, grown
to adults, and did not exhibit any apparent abnormalities. The sleep
of three pairs of congenic mice was monitored by EEG (Figure C). Each of the mice underwent
recording for 3 days to obtain stable data. We separated the sleep/wake
statuses of the mice into three categories: WAKE, REM sleep, and NREM
sleep. By comparing the duration that the mice spent in each status,
we found that, from 7 p.m. to 7 a.m., the WAKE durations of the Btbd9 mice were significantly longer than
those of the wild-type controls, while the REM sleep durations of
the Btbd9 mice were significantly
shorter than those of the wild-type controls (Figure D). Accordingly, analysis of the average
durations of wake time and REM sleep from 7 p.m. to 7 a.m. showed
a similar result (Figure E).Taken together, these data confirmed that BTBD9 plays a crucial role in sleep regulation.
Ubiquitinome Profiling of BTBD9 Overexpression
Cells
To systematically analyze the global change in the
ubiquitylated targets of BTBD9 and identify its specific substrate,
we carried out quantitative proteomic and ubiquitinome profiling of BTBD9 overexpression (OE) cells (Figure A). OE cells were generated using neuroblastoma
SH-SY5Y cells, which are commonly used for neuroscience research,
because of their human origin, catecholaminergic neuronal properties,
and ease of maintenance.[25] Cells stably
transfected with an empty vector were used as negative controls (NC).
The efficiency of OE was validated by quantitative RT-PCR (qRT-PCR)
and western blotting (Figure B). To ensure data reproducibility, proteins were extracted
from three biological replicates of each cell line. Correlation analysis
showed that the correlation coefficient between the two groups was
greater than 0.9, indicating good reproducibility (Figure C).
Figure 2
Characterization of protein
ubiquitinome in the BTBD9-overexpression cell line.
(A) Proteome and ubiquitinome project
workflow. (B) Validation of BTBD9 overexpression cell line construction
by qPCR (t test, ****p < 0.0001, n = 3 in each group). (C) Correlation analysis of samples
within two groups in the ubiquitinome project. (D) Statistic of differential
modified lysines and proteins in ubiquitinome. (E) Heatmap displaying
differential ubiquitinated proteins. (F) Volcano plot of differential
ubiquitinated proteins. (The top 10 were labeled-ordered by the p value). (G) Bar plot showing proteins containing both
up- and down-regulated ubiquitylation sites.
Characterization of protein
ubiquitinome in the BTBD9-overexpression cell line.
(A) Proteome and ubiquitinome project
workflow. (B) Validation of BTBD9 overexpression cell line construction
by qPCR (t test, ****p < 0.0001, n = 3 in each group). (C) Correlation analysis of samples
within two groups in the ubiquitinome project. (D) Statistic of differential
modified lysines and proteins in ubiquitinome. (E) Heatmap displaying
differential ubiquitinated proteins. (F) Volcano plot of differential
ubiquitinated proteins. (The top 10 were labeled-ordered by the p value). (G) Bar plot showing proteins containing both
up- and down-regulated ubiquitylation sites.Ubiquitinome profiling identified 8411 ubiquitylated lysine residues
in 2947 proteins, and of 5973 of these ubiquitylated lysine residues,
1738 proteins were found in both the NC and OE groups. The ubiquitylation
of 216 lysine residues in 163 proteins was significantly increased
after BTBD9 overexpression, and the ubiquitylation
of 43 lysine residues in 43 proteins was significantly decreased when
using a fold change = 1.5 and p value <0.05 as
thresholds to define differentially modified targets (Figure D). The global ubiquitination
level was significantly elevated in SH-SY5Y cells by BTBD9 overexpression since there were many more targets that showed increased
ubiquitylation than targets that showed decreased ubiquitylation (Figure E,F); this finding
is consistent with the previously characterized biological functions
of BTBD9.[10] Furthermore, five proteins
contained both residues that showed increased ubiquitylation and residues
that showed decreased ubiquitylation (Figure G). TNFAIP1 is a previously identified substrate
of BTBD9-mediated ubiquitination and is degraded by the proteasome.[26] In our study, although TNFAIP1 was not identified
as a significantly differentially modified protein because it was
only detected in one sample in each group, the level of TNFAIP1 in
these two samples suggested that the ubiquitination of TNFAIP1 was
strongly promoted by BTBD9.To better understand the preference
of enzymes for substrates,
we analyzed the sequence motifs of ubiquitylated peptides in OE and
NC cells with the motif-x algorithm.[27] The
result shows that 20 amino acid residues, from upstream 10 amino acid
sites to the downstream 10 amino sites, around the differential modified
lysine sites could be categorized into 10 motifs. These motifs are
KL, IxK, DK, LK, LxK, VxK, DxK, KxxxV, AxK, and NxK (K is the ubiquitylated
lysine, and X represents a random amino acid residue) (Figure A). Analysis of these motifs
suggested that six amino acids were enriched in the motifs including
four nonpolar aliphatic amino acids (leucine, isoleucine, alanine,
and valine), an acidic amino acid (aspartic acid), and an amide amino
acid (asparagine). Furthermore, there was little enrichment of positively
charged amino acid residues, suggesting that BTBD9-mediated protein
ubiquitination may not be involved in the interaction of proteins
with nuclear acids or phospholipids but mainly participates in protein
interactions through the hydrophobic interface formed by motifs of
BTBD9 targets and their counterparts.
Figure 3
Bioinformatic analysis of ubiquitinome.
(A) Ubiquitylation motif
diagram and heatmap indicating ubiquitination preference of BTBD9
by analyzing the frequency of the amino acid residues surrounding
differential ubiquitylated lysines in each motif. (B) Enrichment analysis
of proteins containing up-regulated ubiquitylation sites based on
the biological process by Gene Ontology. (C) Diagram showing the top
five items of the biological process enriched by proteins containing
up-regulated ubiquitylation sites. (D) GO item interaction network
of up-regulated ubiquitinated proteins. (E) KEGG pathway analysis
of proteins containing up-regulated ubiquitylation sites. (F) Pathway
interaction network of up-regulated ubiquitinated proteins. (G) Circular
network diagram showing the top 10 items of the KEGG pathway enriched
by proteins containing up-regulated ubiquitylation sites.
Bioinformatic analysis of ubiquitinome.
(A) Ubiquitylation motif
diagram and heatmap indicating ubiquitination preference of BTBD9
by analyzing the frequency of the amino acid residues surrounding
differential ubiquitylated lysines in each motif. (B) Enrichment analysis
of proteins containing up-regulated ubiquitylation sites based on
the biological process by Gene Ontology. (C) Diagram showing the top
five items of the biological process enriched by proteins containing
up-regulated ubiquitylation sites. (D) GO item interaction network
of up-regulated ubiquitinated proteins. (E) KEGG pathway analysis
of proteins containing up-regulated ubiquitylation sites. (F) Pathway
interaction network of up-regulated ubiquitinated proteins. (G) Circular
network diagram showing the top 10 items of the KEGG pathway enriched
by proteins containing up-regulated ubiquitylation sites.
Functional Annotation and Pathway Enrichment
Analysis of Differentially Modified Proteins
To understand
the biological functions or signaling pathways associated with the
ubiquitinylated proteins regulated by BTBD9, gene ontology (GO) enrichment
analysis and KEGG pathway analysis were performed. GO enrichment analysis
revealed that the proteins that showed increased ubiquitylation after BTBD9 overexpression were enriched for biological processes
associated with “cellular localization”, “protein
localization”, and “nitrogen compound transport”
(Figure B). Further
analysis of the protein interaction network of the enrichment pathways
revealed that most of the differentially modified proteins were associated
with ″cellular localization″ and ″establishment
of localization in cell″ (Figure C), and these two pathways were the key nodes
of the protein interaction network (Figure D). In addition, KEGG pathway analysis showed
that the proteins that showed increased ubiquitylation were significantly
enriched in 14 pathways (Figure E), including “amyotrophic lateral sclerosis”,
“Huntington’s disease”, and “Parkinson’s
disease”, indicating a potential role for BTBD9-mediated ubiquitination
in the regulation of neurodegenerative diseases. Furthermore, these
pathways formed the majority of the pathway interaction network (Figure F). Interestingly,
most of the enriched signaling pathways were associated with the same
proteins, such as the tumor suppressors p53 and β-tubulin (Figure G), suggesting that
these two proteins are potential targets of BTBD9 and that BTBD9 may
potentially regulate tumor progression and the microtubular cytoskeleton.[28,29]
Proteomic Profiling of BTBD9 Overexpression
Cells
Before we performed quantitative proteomic analysis
of BTBD9 overexpression cells, we carried out quality
control analysis. The correlation analysis showed that both of the
correlation coefficients between two groups were greater than 0.99,
indicating good reproducibility within the groups (Figure A). Overall, 4635 proteins
were identified in this analysis, and 3756 proteins were identified
in both groups. By using a FC = 1.2 and P < 0.05
as the differential expression threshold, we found that the expression
of 29 proteins was up-regulated and that the expression of 29 proteins
was down-regulated in BTBD9 OE cells compared with
NC cells (Figure B).
The heatmap shows that the change in the expression of the differentially
expressed genes was consistent in replicate samples (Figure C).
Figure 4
Bioinformatic analysis
of proteome. (A) Correlation analysis of
samples within two groups in the proteome project. (B) Differential
expressed proteins are shown in the volcano plot. (C) Heatmap displaying
the differential expressed proteins. (D) GSEA of the biological process
of proteome results with the table showing the NES, p value, and FDR of each item. (E) GSEA of the KEGG pathway of the
proteome result.
Bioinformatic analysis
of proteome. (A) Correlation analysis of
samples within two groups in the proteome project. (B) Differential
expressed proteins are shown in the volcano plot. (C) Heatmap displaying
the differential expressed proteins. (D) GSEA of the biological process
of proteome results with the table showing the NES, p value, and FDR of each item. (E) GSEA of the KEGG pathway of the
proteome result.Gene set enrichment analysis
(GSEA) was performed to analyze all
significantly differentially expressed proteins (defined by a p value < 0.05) because the differentially expressed
proteins were not suitable for over representation analysis (ORA)
due to the limited number. The results showed that the differentially
expressed genes were enriched in five categories of biological processes,
including “vascular transport”, “co-translational
protein targeting to membrane”, “protein targeting to
membrane”, and “import across plasma membrane”
(Figure D). Of note,
all these terms are related to protein transport, which is consistent
with the results of ubiquitinome analysis and were all enriched for
down-regulated genes. In addition, KEGG pathway analysis by GSEA revealed
that two pathways were enriched after BTBD9 overexpression:
“Parkinson’s disease” and the “ribosome
pathway” (Figure E).
Identification of Candidate Substrates of
BTBD9 for Degradation by Integrative Analysis
To identify
the substrate that is specifically ubiquitinated by BTBD9, the proteomic
and ubiquitinome profiling results were subjected to integrative analysis.
Through this analysis, we identified six common proteins in both profiles
(Figure A) and found
that four of these proteins, i.e., inosine monophosphate dehydrogenase
1 (IMPDH1), inosine monophosphate dehydrogenase 1 (IMPDH2), CD44,
and 24-dehydrocholesterol reductase (DHCR24), exhibited decreased
protein expression and increased ubiquitination after BTBD9 overexpression (Figure B). Considering that ubiquitinated proteins can be degraded
by the proteasome, these four proteins were candidate targets of BTBD9-mediated
ubiquitination. Among these four potential targets, IMPDH1 and IMPDH2
are isozymes that catalyze the conversion of inosine 5′-phosphate
(IMP) to xanthosine 5′-phosphate (XMP), and they share 96.5%
similarity in their amino acid sequences. Furthermore, the K195 and
K293 residues of both IMPDH1 and IMPDH2 were ubiquitinated (Figure B), suggesting that
these are the preferred residues of BTBD9-mediated ubiquitination.
Figure 5
Integrative
analysis of proteome and ubiquitinome. (A) Venn diagram
showing the overlapping groups of proteins among differential regulated
proteins and proteins with up- or down-regulated ubiquitylated lysine
sites, OE cells versus NC cells. (B) A summary table showing that
the candidate substrates of BTBD9 and their differential modified
lysines were listed. (C) Bioinformatic alignment result of IMPDH2
in distinct species. (D) Pattern diagram displaying the location of
differential ubiquitylated sites on IMPDH2. (E) 3D structure of IMPDH2
showing the distribution of ubiquitylated lysine.
Integrative
analysis of proteome and ubiquitinome. (A) Venn diagram
showing the overlapping groups of proteins among differential regulated
proteins and proteins with up- or down-regulated ubiquitylated lysine
sites, OE cells versus NC cells. (B) A summary table showing that
the candidate substrates of BTBD9 and their differential modified
lysines were listed. (C) Bioinformatic alignment result of IMPDH2
in distinct species. (D) Pattern diagram displaying the location of
differential ubiquitylated sites on IMPDH2. (E) 3D structure of IMPDH2
showing the distribution of ubiquitylated lysine.Previous studies have suggested that IMPDH1 and IMPDH2 are circadian oscillators and that IMPDH2 is a preferred
acetylation target of CLOCK.[30] Considering
this finding, we hypothesized that IMPDH2 has a potential
role in sleep regulation; therefore, we chose this protein for the
subsequent validation step. We identified seven ubiquitinated lysine
residues in IMPDH2: K124, K134, K167, K195, K293, K422, and K436.
Bioinformatic alignment showed that all of these ubiquitinated lysines
are conserved among various species (Figure C). Analysis of IMPDH2 structure data showed
that K124, K134, K167, and K195 are located in the cystathionine β-synthase
(CBS) domain[31] (Figure D), which catalyzes the conversion of IMP
to AMP.[32] Furthermore, K195 is required
for the binding of the CBS domain to the allosteric effectors ATP
and GTP (Figure E).
BTBD9 Promotes IMPDH2 Degradation
To validate
whether BTBD9 regulates IMPDH2 degradation via ubiquitination,
we treated NC and OE cells with the proteasome inhibitor MG132 and
then examined the expression level of IMPDH2. The results showed that
the protein expression of IMPDH2 was dramatically reduced in BTBD9 OE cells and greatly restored by the addition of MG132
(Figure A), indicating
that the stability of IMPDH2 was regulated by the proteasome and that
BTBD9 promoted this degradation. To further determine whether BTBD9
promotes IMPDH2 proteasome degradation through ubiquitination modification, BTBD9 OE cells and NC cells were transfected with 3×
FLAG-tagged IMPDH2 and HA-Ub and then treated with MG132 for 12 h
before being collected. By using immunoprecipitation (IP), we surprisingly
found that the overexpression of BTBD9 decreased
the ubiquitination of IMPDH2 (Figure B). This result is not in agreement with the results
of our omics research. To determine why this phenomenon occurred,
we carried out in vivo ubiquitination assays. The results showed that
IMPDH2 was mainly ubiquitinated at K11, K29, and K63 rather than K48,
which is the most common ubiquitination code that leads to UPS-mediated
degradation[33] (Figure C). This result was validated by three independent
experiments. We suspect that the suppression of the proteasome causes
enhanced deubiquitylation activity that results in decreased ubiquitination
of IMPDH2. To further clarify the ubiquitination mode of IMPDH2 by
BTBD9, we constructed a 3× FLAG-tagged IMPDH2 plasmid with mutations
of the significantly differentially modified sites and co-transfected
cells with this plasmid in combination with a ubiquitin plasmid. The
IP results demonstrated that there was little difference in the ubiquitination
statues of IMPDH2 proteins with different lysine mutations, and to
our surprise, IMPDH2 with a mutation at lysine K422 was not expressed
in our study (Figure D).
Figure 6
BTBD9 promotes IMPDH2 degradation. (A) Abundance of IMPDH2 under
the treatment of MG132 by 4/8 h in NC cells and OE cells, respectively
(fold change was calculated by Empiria Studio Software v1.3.0.83,
File S1). (B) Immunoprecipitation result showing the influence of
BTBD9 on ubiquitination modification of IMPDH2. (C) Immunoprecipitation
result showing the ubiquitination mode of IMPDH2 by co-transfer 3×
FLAG tagged IMPDH2 with distinct lysine mutant (L > R) ubiquitin.
(D) Immunoprecipitation result showing the ubiquitination mode of
IMPDH2 by co-transfer ubiquitin with distinct lysine mutant (L >
R)
IMPDH2.
BTBD9 promotes IMPDH2 degradation. (A) Abundance of IMPDH2 under
the treatment of MG132 by 4/8 h in NC cells and OE cells, respectively
(fold change was calculated by Empiria Studio Software v1.3.0.83,
File S1). (B) Immunoprecipitation result showing the influence of
BTBD9 on ubiquitination modification of IMPDH2. (C) Immunoprecipitation
result showing the ubiquitination mode of IMPDH2 by co-transfer 3×
FLAG tagged IMPDH2 with distinct lysine mutant (L > R) ubiquitin.
(D) Immunoprecipitation result showing the ubiquitination mode of
IMPDH2 by co-transfer ubiquitin with distinct lysine mutant (L >
R)
IMPDH2.
IMPDH2
is a Potential Target of BTBD9 in Sleep
Regulation
It has been proposed that homeostatic sleep factors
act on brain regions and neurons involved in the regulation of sleep
or wakefulness.[34] We first obtained expression
data for BTBD9 and IMPDH2 in distinct
brain regions from the GTEx database and performed a correlation analysis.
The results showed that there was a significant positive correlation
between the expression of BTBD9 and IMPDH2 in regions with high BTBD9 expression, such as
the hippocampus, and in several sleep regulation-related areas, such
as the midbrain, hypothalamus, and basal ganglia (Figure A). This result suggested that BTBD9 and IMPDH2 may participate in some
biological functions in a coordinated manner.
Figure 7
IMPDH2 is a potential
target of BTBD9 in sleep regulation. (A)
Scatter chart showing the correlation analysis result of BTBD9 expression
and IMPDH2 expression in distinct brain area (cortex, midbrain, hippocampal,
basal ganglia, hypothalamus, and amygdala). (B) Circular Manhattan
plot GWAS showing the association of SNPs 2MB up- and downstream of
IMPDH2 with sleep traits (N1/TST, N2/TST, N3/TST, WK/SPT, TST, and
SE from innermost to outside). (C) Manhattan plot showing the association
of SNPs related to N1/TST (p < 10–3).
IMPDH2 is a potential
target of BTBD9 in sleep regulation. (A)
Scatter chart showing the correlation analysis result of BTBD9 expression
and IMPDH2 expression in distinct brain area (cortex, midbrain, hippocampal,
basal ganglia, hypothalamus, and amygdala). (B) Circular Manhattan
plot GWAS showing the association of SNPs 2MB up- and downstream of
IMPDH2 with sleep traits (N1/TST, N2/TST, N3/TST, WK/SPT, TST, and
SE from innermost to outside). (C) Manhattan plot showing the association
of SNPs related to N1/TST (p < 10–3).Next, to test whether IMPDH2 is a potential regulator
of sleep architecture, we used GWAS data to explore the association
between genetic variations in IMPDH2 and sleep parameters.
The GWAS results showed that no SNPs of IMPDH2 reached
locus-wide significance (P < 10–5). We next set a higher threshold for significance (P < 10–3), and three SNPs were found to be associated
with N1/TST (Figure B). These SNPs were rs143016112 (β = 1.01, se = 0.28, p = 0.00041), rs150690392 (β = 0.76, se = 0.23, p = 0.0008), and rs115854006 (β = 0.95, se = 0.28, p = 0.0008), which are between 48.0 and 50.0 Mbp of chromosome
3. This range includes much more than a 2 kb sequence upstream and
downstream of IMPDH2 (Figure C). Although none of the SNPs were reported
to directly interact with IMPDH2, further analysis
revealed that the intronic SNP rs151331523 of P4HTM (prolyl 4-hydroxylase, transmembrane (endoplasmic reticulum)) was
in linkage disequilibrium with rs150690392 (r2 = 0.98), and data from 3DSNP (http://cbportal.org/3dsnp/) suggest that this SNP interacts with the IMPDH2 gene[35] (Figure S1). Altogether, these data suggest that IMPDH2 is a potential target
for sleep regulation.
Discussion
BTBD9 has been identified as a sleep regulation
gene in several genetic studies, but its downstream targets and the
molecular mechanisms through which it is involved in sleep regulation
remain unclear. Our study revealed that the overexpression of the
BTBD9 protein significantly, as an adaptive component of CRL3,[10] enhanced the ubiquitination of intracellular
proteins and identified unique cellular pathways associated with these
changes through quantitative ubiquitinome profiling. Ubiquitinome
profiling combined with quantitative proteomics identified IMPDH2
as a novel substrate for BTBD9-mediated ubiquitination, which was
confirmed by in vivo protein ubiquitination assays.The role
of BTBD9 in sleep regulation has been
validated by several studies on BTBD9-deficient animals,
but most of the downstream targets of BTBD9 have been identified based
on the mechanisms of restless leg syndrome, such as DNM1[14] and IRP2;[36] therefore,
objective and comprehensive studies of the downstream targets of BTBD9
are lacking. In contrast, our study is the first to systematically
explore the targets of BTBD9 through proteomic and ubiquitinome studies.
First, we found that BTBD9 significantly enhanced ubiquitination in
cells, suggesting that it has several ubiquitination targets. This
is consistent with the finding of the current study, showing that
thousands of proteins can be ubiquitinated, but only approximately
600 CRLs have been found.[7] However, the
proteomic results suggest that BTBD9 has a slight effect on protein
expression, suggesting that BTBD9 may be necessary for the normal
function or localization of different ubiquitinated proteins rather
than for the degradation of these proteins. This result is similar
to the findings of quantitative phosphoproteomic studies of sleep-requiring
substrates, showing that PTMs are also essential for sleep control
but that protein abundance is unchanged.[37] On the other hand, enrichment analysis in both analyses showed that
BTBD9 plays an important role in protein transport, especially in
membrane structures, and down-regulates the expression of membrane
proteins such as ATP1A1 and ATP1B1. Since previous studies have highlighted
the important role of ion channels in sleep regulation[38,39] and Atp2b3 knockout mice have been reported to show increased sleep
durations,[40] we hypothesized that BTBD9
may affect sleep homeostasis through ion metabolism, which may be
mediated through regulation of protein localization. Furthermore,
we identified IMPDH2 as a novel specific target whose stability is
affected by BTBD9-mediated ubiquitination as IMPDH2 showed the most
pronounced changes in both the proteomic and ubiquitinome studies.
Furthermore, we found that the ubiquitination modifications of IMPDH2
are mainly K11-, K29-, and K63-dependent, but not the common K48 ubiquitination
modifications. As a matter of fact, K11 and K29, the non-classical
ubiquitination modification mode, have been shown to play a role in
the proteasomal degradation pathway.[41,42] In contrast,
K63 ubiquitination modifications are the most widely studied ubiquitination
modifications other than K48 and have been partially found to have
a role in triggering lysosomal-dependent protein degradation in addition
to mediating signaling transduction.[43] Considering
the function of IMPDH2 in adenosine metabolism,[32,44] we generated an alternative hypothesis that BTBD9 can alter the
function and expression of IMPDH2, subsequently leading to arousal
by promoting the activity of pro-wake neurons due to a decrease in
adenosine levels.[45] A previous study has
shown that glutamatergic neurons in the basal forebrain are closely
related to adenosine concentrations in the basal forebrain and that
the disruption of these neurons leads to impaired sleep homeostasis
regulation in mice,[46] and the results of
analysis of IMPDH2 expression patterns in the brain further support
our hypothesis. In addition, we performed a GWAS to explore the relationship
between variants associated with IMPDH2 and sleep characteristics
and found that an SNP that interacts with IMPDH2 may affect N1 sleep
stages.Although our study made important discoveries and led
to a hypothesis
about the mechanism by which how BTBD9 regulates
sleep, there are some limitations. First, we chose cell lines as our
research subject because they exhibit conserved biological function,
thus allowing better extrapolation. However, brain tissues from BTBD9-deficient animals may better reflect the specific
physiological process of sleep regulation. Second, only the interaction
between IMPDH2 and BTBD9 was analyzed in this study. Although data
from previous studies validated the association between these proteins
in sleep-regulating brain areas, the role of IMPDH2 in sleep regulation
needs to be further validated in animal models. Third, our GWAS data
did not provide convincing results regarding the effect of IMPDH2
on sleep due to the small sample size. The use of a larger cohort
may address this problem.In summary, we comprehensively revealed
the changes in ubiquitination
mediated by BTBD9 and identified a novel target of BTBD9 with a potential
role in sleep regulation; however, evidence from animal models supporting
our hypothesis is still lacking. Given the numerous medical developments
related to IMPDH2, once the role of IMPDH2 in sleep regulation is
validated, targeting this protein may be greatly beneficial for the
treatment of sleep diseases.
Authors: Matthias Blum; Hsin-Yu Chang; Sara Chuguransky; Tiago Grego; Swaathi Kandasaamy; Alex Mitchell; Gift Nuka; Typhaine Paysan-Lafosse; Matloob Qureshi; Shriya Raj; Lorna Richardson; Gustavo A Salazar; Lowri Williams; Peer Bork; Alan Bridge; Julian Gough; Daniel H Haft; Ivica Letunic; Aron Marchler-Bauer; Huaiyu Mi; Darren A Natale; Marco Necci; Christine A Orengo; Arun P Pandurangan; Catherine Rivoire; Christian J A Sigrist; Ian Sillitoe; Narmada Thanki; Paul D Thomas; Silvio C E Tosatto; Cathy H Wu; Alex Bateman; Robert D Finn Journal: Nucleic Acids Res Date: 2021-01-08 Impact factor: 16.971