Rong-Fang Gu1, Terry Fang2, Ashley Nelson2, Stefka Gyoneva2, Benbo Gao1, Joe Hedde2, Kate Henry2, Emily Peterson3, Linda C Burkly2, Ru Wei1. 1. Chemical Biology and Proteomics, Biogen, 225 Binney Street, Cambridge, Massachusetts 02142, United States. 2. Genetic and Neurodevelopmental Disorders Research, Biogen, 225 Binney Street, Cambridge, Massachusetts 02142, United States. 3. Medicinal Chemistry, Biogen, 225 Binney Street, Cambridge, Massachusetts 02142, United States.
Abstract
Novel therapies and biomarkers are needed for the treatment of acute ischemic stroke (AIS). This study aimed to provide comprehensive insights into the dynamic proteome changes and underlying molecular mechanisms post-ischemic stroke. TMT-coupled proteomic analysis was conducted on mouse brain cortex tissue from five time points up to 4 weeks poststroke in the distal hypoxic-middle cerebral artery occlusion (DH-MCAO) model. We found that nearly half of the detected proteome was altered following stroke, but only ∼8.6% of the changes were at relatively large scales. Clustering on the changed proteome defined four distinct expression patterns characterized by temporal and quantitative changes in innate and adaptive immune response pathways and cytoskeletal and neuronal remodeling. Further analysis on a subset of 309 "top hits", which temporally responded to stroke with relatively large and sustained changes, revealed that they were mostly secreted proteins, highly correlated to different cortical cytokines, and thereby potential pharmacodynamic biomarker candidates for inflammation-targeting therapies. Closer examination of the top enriched neurophysiologic pathways identified 57 proteins potentially associated with poststroke recovery. Altogether, our study generated a rich dataset with candidate proteins worthy of further validation as biomarkers and/or therapeutic targets for stroke. The proteomics data are available in the PRIDE Archive with identifier PXD025077.
Novel therapies and biomarkers are needed for the treatment of acute ischemic stroke (AIS). This study aimed to provide comprehensive insights into the dynamic proteome changes and underlying molecular mechanisms post-ischemic stroke. TMT-coupled proteomic analysis was conducted on mouse brain cortex tissue from five time points up to 4 weeks poststroke in the distal hypoxic-middle cerebral artery occlusion (DH-MCAO) model. We found that nearly half of the detected proteome was altered following stroke, but only ∼8.6% of the changes were at relatively large scales. Clustering on the changed proteome defined four distinct expression patterns characterized by temporal and quantitative changes in innate and adaptive immune response pathways and cytoskeletal and neuronal remodeling. Further analysis on a subset of 309 "top hits", which temporally responded to stroke with relatively large and sustained changes, revealed that they were mostly secreted proteins, highly correlated to different cortical cytokines, and thereby potential pharmacodynamic biomarker candidates for inflammation-targeting therapies. Closer examination of the top enriched neurophysiologic pathways identified 57 proteins potentially associated with poststroke recovery. Altogether, our study generated a rich dataset with candidate proteins worthy of further validation as biomarkers and/or therapeutic targets for stroke. The proteomics data are available in the PRIDE Archive with identifier PXD025077.
Entities:
Keywords:
DH-MCAO; MCAO; acute ischemic stroke (AIS); immune response; inflammation; proteomics; stroke; stroke biomarkers; stroke recovery; tandem mass tags (TMT)
Acute
ischemic stroke (AIS) is the second leading cause of death
worldwide and over half of affected patients suffer persistent long-term
disability from the neurologic injury.[1] Only one drug, recombinant tissue plasminogen activator (tPA), is
approved for treating AIS, and it must be administrated within a few
hours. A deeper understanding of the pathophysiological mechanisms
underlying stroke would enable the identification of targets and development
of therapeutics to prevent further brain injury and facilitate neurological
recovery, as well as biomarker discovery to improve diagnosis, guide
treatment, and monitor response to therapy. Animal models are essential
tools in these activities. The distal hypoxic-middle cerebral artery
occlusion (DH-MCAO) model features permanent occlusion at the distal
middle cerebral artery, followed by 1 h of hypoxia, which results
in reliable and reproducible infarcts, neuroplasticity changes, and
behavioral deficits.[2−4] Moreover, the permanent nature of the artery occlusion
may better mimic humanstroke, in which revascularization is rare.[2] DH-MCAO is thus advantageous for assessments
of therapeutic strategies.Several proteomic analyses of AIS
using MCAO models[5−9] have been reported. They were conducted on transient MCAOmouse
or rat models and generally focused on the acute phase poststroke.
Law et al.[10] also used a transient model
but evaluated differentiated recovery of eight cynomolgus monkeys
at 28 days after MCAO-induced stroke. To our knowledge, no proteomic
analysis of AIS using a permanent MCAO occlusion model has been reported.
This study aimed to characterize the longitudinal proteomic changes
caused by ischemic stroke using the DH-MCAOmouse model.
Experimental
Procedures
Study Design
The proteomics study was carried out using
45 male mice (C57Bl/6, 16-week old), divided into five groups. Each
group contained nine mice, six of which underwent MCAO surgery (strokemice) and three underwent sham surgery, i.e., the same surgical procedure
but without cauterization of the MCA. All stroke or sham mice were
followed by 1 h hypoxia treatment. The five groups of animals were
sacrificed on day 1, 3, 7, 14, and 28, respectively, after surgery
and hypoxia treatment to capture both initial injury and recovery
processes. At each time point, brain cortex tissues were collected
from both ipsilateral and contralateral hemispheres of strokemice
(2 samples per mouse) and from ipsilateral hemispheres of sham mice
(1 sample per mouse), resulting in 15 tissue samples in each group
(time point): 6 biological replicates of stroke ipsilateral, 6 biological
replicates of stroke contralateral, and 3 biological replicates of
sham ipsilateral, hereafter referred to as “strk_Ip”,
“strk_Ct”, and “sham_Ip”, respectively.To fit tandem mass tag (TMT) 10-plex labeling design, 70 tissue
samples were used in this study: all 15 samples collected on days
1, 3, 7, and 14, and 10 samples collected on day 28 (4 strk_Ip, 4
strk_Ct, and 2 sham_Ip). Additionally, plasma samples were collected
from all mice at corresponding time points for cytokine measurements.
DH-MCAO Sample Generation
All experimental procedures
using mice were approved by the Biogen Institutional Animal Care and
Use Committee (IACUC). Male, 16-week old, C57BL/6J mice were used
for this study. Throughout the duration of this study, water and chow
were provided to mice ad libitum.All mice were subjected to
MCAO or sham surgery as described by Doyle et al.,[4] followed by 1 h hypoxia treatment [see details in the Supplementary Methods, Supporting Information
(SI)]. At the end of the hour in hypoxia, mice were returned to their
home cage. At either 1, 3, 7, 14, or 28 days post-surgery and hypoxia
treatment, mice were humanely euthanized in a CO2 chamber.
Blood was collected via cardiac puncture into K2EDTA tubes
and stored on ice. Within 0.5–1 h of collection, the blood
samples were centrifuged at 3000g for 5 min at 4
°C, and plasma was separated, aliquoted, and stored at −80
°C until analysis. Mice were subsequently transcardially perfused
with cold PBS. Brains were removed and dissected to obtain both the
infarcted region on the ipsilateral cortex and a corresponding area
of tissue on the contralateral cortex. All tissues were snap-frozen
and stored at −80 °C until processing.
Tissue Processing
and Cytokine and Albumin Analysis
Brain cortical tissue was
homogenized with metal beads in 20×
volume to weight (e.g., 200 uL for 10 mg) using cold 1× Tris
buffer (25 mM, pH 7.4) with cOmplete Mini protease inhibitor tablets
(1 tablet per 10 mL of buffer). Homogenized samples were centrifuged
at 1000g for 10 min at 4 °C to generate supernatant
(S1) for proteomic analysis. The S1 supernatant was further centrifuged
at 6000g for 1 h at 4 °C to generate supernatant
(S2) for cytokine and albumin analysis. Protein concentration of the
S2 supernatant was determined with the BCA assay, and samples were
next normalized to 1.5 mg/mL protein concentration.For cytokine
analysis, brain S2 supernatants (1.5 mg/mL) and plasma samples were
diluted in Diluent 41 at 1:4 and 1:20, respectively. Fifty microliters
of diluted samples was added to MSD V-plex ELISAs (K15048D, K15245D)
and incubated overnight with shaking at 4 °C. Secondary antibody
addition and development were performed according to the manufacturer’s
instructions. Western blot analysis was used to measure albumin concentrations
in brain tissue using S2 supernatants (see details in the Supplementary Methods, SI).
Proteomics
Tissue Lysis and Protein Digestion
The S1
supernatants were further lysed in ice-cold lysis buffer (8 M urea
in 50 mM Tris-HCl, pH 7.4) with protease inhibitor cocktails (Sigma)
by sonicating at 150 W for 10 s in 4 °C water bath using an E220
focused-ultrasonicator (Covaris, MA). The lysates were next clarified
by centrifuging at 4 °C and 18 000g for
20 min and protein concentrations were measured using the Bradford
assay. The protein extracts were subsequently reduced (dithioerythritol,
15 mM, 45 min at 37 °C), alkylated (iodoacetamide, 30 mM, 30
min at room temperature in the dark), and diluted to 1 M urea in 50
mM ammonium bicarbonate followed by tryptic digestion (protein-to-protease
ratio at 30:1 w/w) for 16 h at 37 °C.
Tandem Mass Tag (TMT) 10-plex
Labeling and Peptide Fractionation
The digests were desalted
using C18 StageTip with three-punch C18
mesh (Empore), SpeedVac dried, and stored at −20 °C. Peptide
concentrations were measured on a NanoDrop 2000 (Thermo). For each
sample, 20 μg of peptides was TMT-labeled according to the vendor’s
instruction. TMT tags were randomized within each of TMT 10-plex set.
The efficiency of the TMT labeling was determined to be greater than
99%. The TMT-labeled samples were then pooled at equal peptide ratios
into seven TMT 10-plex sets (Supplementary Methods, Table S1, SI), followed by vacuum centrifugation
to near dryness. Subsequently, the resulting peptide mixtures were
each high-pH fractionated and concatenated into 12 fractions of similar
complexities, resulting in 84 subsamples. The peptides were desalted
using C18 StageTip, SpeedVac dried, and kept at −20 °C
until nanoLC–MS/MS analysis. The detailed labeling procedure
and fractionation steps are provided in the Supplementary Methods (SI). To obtain relatively homogeneous proteomes among
seven TMT 10-plex sets, we assumed that day 1 and 3 samples have relatively
similar proteomes, as do those of day 7 and 14; thus, day 1 and 7
samples were pooled in three sets, day 3 and 14 samples were pooled
in three sets, and day 28 samples were one set.
nanoLC–MS/MS
Analysis
Prior to analysis, the
peptide fractions were reconstituted in 2% acetonitrile/0.2% formic
acid and analyzed on a nanoLC–MS/MS platform composed of an
EASY nLC 1200 coupled to a Q Exactive HF mass spectrometer (Thermo).
Peptides were separated on a Thermo EASY-Spray C18 column (50 cm ×
75 μm, 2 μm) over a 120 min effective gradient at a flow
rate of 275 nL/min. Mass spectrometric data were acquired at data-dependent
acquisition mode with a scan range of 200 to 2000 m/z, and the top 10 peptide ions were subjected to
MS2. A 60 000 resolution with 3e6 AGC and 20 ms Maximum IT
was set for MS1, and a 60 000 resolution with 1e6 AGC and 120
ms Maximum IT was set for MS2. The isolation window was set to 0.7
Th, an NCE of 33% was used, and the first mass was fixed at 100 m/z. In addition, unassigned and singly
or >7+ charged species were excluded from MS2 analysis, and the
dynamic
exclusion was set to 30 s.The mass spectrometric data were
searched using MaxQuant (version 1.5.3.8) against the Swiss-Prot mouse
database (UniProtKB Release 2016_04) with cysteine carbamidomethylation
as a fixed modification and methionine oxidation and protein N-term
acetylation as variable modifications. A mass error of 20 ppm and
up to two miscleavages were allowed. The false discovery rates (FDRs)
at the protein and peptide levels were both set at 1%.
Data Analysis
Protein lists exported from MaxQuant
were used for statistical analysis. The data normalization, principal
component analysis (PCA) of proteomes, K-means clustering
analysis on significantly changed proteins, Pearson correlation and
hierarchical clustering between proteomics and biochemical readouts
of cytokines and albumin, and statistical analysis of proteomics data
were performed using R (version 3.6.1, www.r-project.org). The raw
data were normalized to the median intensity of all proteins within
each TMT 10-plex set. The normalized intensity for each protein was
then transformed to a relative ratio by dividing by the mean normalized
intensity of the protein, achieving the mean value of 1 for each protein
across all TMT sets. The proteins detected in at least 30% of samples
were included in statistical analysis that was performed using the
LIMMA package. The p-values were adjusted for multiple
comparisons using the Benjamini–Hochberg (BH) method. A protein
was considered significantly different between groups if it had a
BH-adjusted p-value <0.05. Only strk_Ip and sham_Ip
data, which are most relevant to plasma cytokine measurement, were
used in the correlation analysis. Correlation was considered as weak,
moderate, or high when the correlation coefficient (r) is 0.33–0.5, 0.5–0.75, or >0.75, respectively.
The
relevant number of K-mean clusters was defined using
NbClust (R package). Pathway enrichment analysis was conducted using
MetaCore (Clarivate Analytics, version 20.3.70200) with p < 0.05 and FDR < 0.05 as a cutoff. A resulting enriched pathway
for the changed-protein clusters (see results section later) was considered to be due to detection bias if the FDR-based
enrichment ranking was the same as or lower than the enrichment from
using the detected proteome (7318 proteins).
Data Availability
The mass spectrometry proteomics
data have been deposited to the PRIDE Archive (http://www.ebi.ac.uk/pride/archive/) via the PRIDE partner repository with the data set identifier PXD025077.
Results and Discussion
Proteomic analysis of a total of
70 samples, comprised of stroke
ipsilateral, stroke contralateral control, and sham ipsilateral control
samples over a time course of 28 days (collected at days 1, 3, 7,
14 and 28), identified and quantified 7609 protein groups, 7318 of
which were detected in at least 30% of samples and included in data
analysis (referred hereafter as the detected proteome). The relative
abundances of the quantified proteins, by TMT reporter ions, span
more than 5 orders of magnitude, indicating a broad dynamic range
in our quantitative measurement. Variation in protein quantitation
among biological replicates is relatively low, with a percent coefficient
variation (%CV) for the majority of the quantified proteins in stroke
or control groups at each time point of less than 15%. It is worthy
to note that to enhance performance of the proteomic analysis for
identifying proteins changed in stroke, two control groups, ipsilateral
samples from the sham animals (sham_Ip) and contralateral samples
from the strokemice (strk_Ct), were used in this study. The TMT-labeling
set design was balanced between obtaining confidence in stroke-changed
proteins versus control samples and capturing kinetic changes of proteins
over the time course.
Stroke-Induced Temporal Proteome Shift
To assess stroke-induced
proteome changes over time, we performed principal component analysis
(PCA) on the stroke and control samples across the five time points
poststroke (days 1, 3, 7, 14, 28). This analysis showed a clear temporal
separation of strk_Ip samples over the first 2 weeks, as indicated
by PC-1 (41%), PC-2 (17%), and PC-3 (8%), and a slight overlap of
the day 14 with the day 28 proteome (Figure A). Notably, both day 14 and 28 strk_Ip proteomes
have smaller separation from day 1 and 3 as compared to day 7, suggesting
potential recovery at the proteome scale starting by day 14. In contrast,
there was minimal proteome separation of sham_Ip samples over this
time course (Figure B), indicative of low biological sample variability and the high
quality of the proteomic analysis. Temporal separation of strk_Ct
proteomes was observed, but only on days 3 and 7 and at a smaller
scale (Figure C) as
compared to that of strk_Ip samples, consistent with the notion that
the contralateral area also exhibits stroke-induced changes, likely
at delayed time and smaller scale. We next compared the proteomes
of stroke to control samples (strk_Ct or sham_Ip) at each time point.
Large separation of the strk_Ip proteome from both strk_Ct and sham_Ip
control proteomes, by PC1 (46%–69%), was clearly evident at
each time point, and minimal or no separation was observed between
the two control proteomes, i.e., strk_Ct and sham_Ip (Figure S1, SI). Most importantly, the PCA revealed
large proteome changes in response to stroke, with the largest alteration
occurring on day 7.
Figure 1
Stroke-induced temporal proteome shift. Principal component
analysis
of proteomes of day 1 (turquoise), day 3 (orange), day 7 (purple),
day 14 (pink) and day 28 (green) in (A) stroke ipsilateral (strk_Ip),
(B) sham ipsilateral (sham_Ip), and (C) stroke contralateral (strk_Ct)
samples. The ovals are for better visualization.
Stroke-induced temporal proteome shift. Principal component
analysis
of proteomes of day 1 (turquoise), day 3 (orange), day 7 (purple),
day 14 (pink) and day 28 (green) in (A) stroke ipsilateral (strk_Ip),
(B) sham ipsilateral (sham_Ip), and (C) stroke contralateral (strk_Ct)
samples. The ovals are for better visualization.
Proteins Significantly Changed by Stroke
We next identified
the proteins significantly changed by stroke, namely “hits”,
at each time point. We performed two separate data analyses using
strk_Ct and sham_Ip as control, respectively. Comparison of strk_Ip
and strk_Ct samples generated 1893 (day 1), 2590 (day 3), 3538 (day
7), 3216 (day 14), and 1083 (day 28) hits, while strk_Ip and sham_Ip
comparison generated 1300 (day 1), 2273 (day 3), 2694 (day 7), 2728
(day 14), and 242 (day 5) hits (Figure A). Notably, using strk_Ct as a control yielded a higher
number of hits. This could result from the smaller inter-animal variation
as strk_Ct samples were collected from the same set of mice that generated
strk_Ip samples, while sham_Ip samples were from a different set of
mice. Nearly 46% more hits were identified for day 1 when using strk_Ct
as a control. This is likely due to “unchanged” strk_Ct
proteome on day 1 in the contralateral area as afore-described and/or
an effect of craniotomy surgery in the ipsilateral area of sham mice.
With the fact that 71–80% of hits from strk_Ip and sham_Ip
comparison were also identified as stroke hits when comparing against
strk_Ct (Figure A),
this study suggests that strk_Ct could serve as a control with the
advantages of skipping sham animals and potentially having the smaller
inter-animal variation, particularly for studies focused on the acute
phase of stroke.
Figure 2
Proteins significantly changed by stroke. (A) Number of
significantly
changed proteins (adjusted p-value <0.05) vs stroke
contralateral (strk_Ct) only (orange), vs sham ipsilateral (sham_Ip)
only (purple), and vs both strk_Ct and sham_Ip (green). The overlap
in changed proteins using either strk_Ct or sham_Ip control are considered
as the hits (green) in this study. (B) Number of up-regulated (red)
or down-regulated (blue) proteins among the hits (green bars in A).
Proteins significantly changed by stroke. (A) Number of
significantly
changed proteins (adjusted p-value <0.05) vs stroke
contralateral (strk_Ct) only (orange), vs sham ipsilateral (sham_Ip)
only (purple), and vs both strk_Ct and sham_Ip (green). The overlap
in changed proteins using either strk_Ct or sham_Ip control are considered
as the hits (green) in this study. (B) Number of up-regulated (red)
or down-regulated (blue) proteins among the hits (green bars in A).In this study, we focused on high-confidence protein
hits generated
by both above-described analyses with criteria of adjusted p-value <0.05 and the same direction of changes at each
time point. This resulted in 1044 (day 1), 1631 (day 3), 2174 (day
7), 1941 (day 14), and 199 (day 28) proteins (represented as green
bars in Figure A),
yielding a total of 3625 proteins, hereafter referred to as “hits”
(Table S2, SI). Notably, nearly half of
the detected proteome responded to stroke, and most (57–76%)
of the 3625 hits were down-regulated by stroke in the first four time
points (Figure B).
Proteome Change Patterns and Associated Pathways
To
understand the molecular pathways underlying the observed proteomic
changes, we first performed clustering of the 3625 protein hits on
expression means across 15 biological groups (strk_Ip, sham_Ip and
strk_Ct at five time points) and then conducted enrichment analyses
to determine pathways underlying different clusters. The K-mean clustering defined four distinct clusters representing 3.3%
(cluster 1), 5.3% (cluster 2), 19.9% (cluster 3), and 71.5% (cluster
4) of the significantly changed proteome, respectively (Figure A). Cluster 1 showed a stroke-induced
early peak of 119 proteins on day 3, and cluster 2 presented a later
peak of 191 proteins on day 7, referred to as cluster-1_earlyUp and
cluster-2_laterUp, respectively. Importantly, these two clusters together
accounted for only ∼8.6% of the protein hits, and they are
all increased by stroke at the aforementioned time points compared
to controls, with relatively large changes mostly greater than 2-fold.
Cluster 3 also showed stroke-induced up-regulation in the expression
of 722 proteins but with smaller amplitudes. This cluster contains
proteins mostly peaked at day 7 with less than 1.5-fold, referred
to as cluster-3_smUp. Cluster 4, composed of 2593 proteins, is the
largest and the only one exhibiting stroke-induced down-regulation,
mostly peaked on day 7 with relatively small magnitudes (<1.5-fold),
referred to as cluster-4_smDw.
Figure 3
Stroke-induced proteome change patterns
and associated pathways.
(A) K-mean clusters of 3625 stroke-changed protein
hits. Each line represents a pattern of one protein’s relative
mean expression level across 15 sample groups. Cluster-1_earlyUp:
119 (3.3%) proteins mostly showed a peak on day 3 in strk_Ip with
a >2-fold increase. Cluster-2_laterUp: 191 (5.3%) proteins mostly
peaked on day 7 in strk_Ip with a >2-fold increase. Cluster-3_smUp:
722 (19.9%) proteins mostly peaked on day 7 in strk_Ip with an increase
<1.5-fold. Cluster-4_smDw: 2593 (71.5%) proteins mostly peaked
on day 7 in strk_Ip with a decrease <1.5-fold. Note that the fold
changes were compared to protein mean expression levels in sham_Ip
and/or strk_Ct. (B) Top 10 enriched pathways (FDR < 0.05, ranked
by FDR) underlying the four clusters. * indicates that the enrichment
is likely due to detection bias (see the methods). a indicates that the full pathway name is “Putative
pathways of MHC class I-dependent postsynaptic long-term depression
in major depressive disorder”.
Stroke-induced proteome change patterns
and associated pathways.
(A) K-mean clusters of 3625 stroke-changed protein
hits. Each line represents a pattern of one protein’s relative
mean expression level across 15 sample groups. Cluster-1_earlyUp:
119 (3.3%) proteins mostly showed a peak on day 3 in strk_Ip with
a >2-fold increase. Cluster-2_laterUp: 191 (5.3%) proteins mostly
peaked on day 7 in strk_Ip with a >2-fold increase. Cluster-3_smUp:
722 (19.9%) proteins mostly peaked on day 7 in strk_Ip with an increase
<1.5-fold. Cluster-4_smDw: 2593 (71.5%) proteins mostly peaked
on day 7 in strk_Ip with a decrease <1.5-fold. Note that the fold
changes were compared to protein mean expression levels in sham_Ip
and/or strk_Ct. (B) Top 10 enriched pathways (FDR < 0.05, ranked
by FDR) underlying the four clusters. * indicates that the enrichment
is likely due to detection bias (see the methods). a indicates that the full pathway name is “Putative
pathways of MHC class I-dependent postsynaptic long-term depression
in major depressive disorder”.We next conducted pathway enrichment analysis using MetaCore (Clarivate
Analytics) on each cluster. Of note, we also conducted the same analysis
on all 7318 proteins to make sure the observed enrichment was not
due to detection bias. More than 50 pathways were significantly enriched
at FDR < 0.05 (Table S3 SI) for almost
all clusters. We examined the top 10 pathways enriched from each cluster.
As shown in Figure B, the most enriched pathways on cluster-1_earlyUp proteins are blood
coagulation, immune responses via complement pathways, and plasmin
signaling, reflecting early acute responses to stroke-induced diaschisis
and edema. Cluster-2_laterUp proteins are enriched in both innate
and adaptive immune responses. The later immune responses are mostly
through B and T cell activation, resulting from phagocytosis, antigen
presentation, and immunological synapse formation and involving IL-5
signaling. Differently, cluster-3_smUp proteins are mostly enriched
in actin cytoskeleton remodeling, via Rho GTPases, and integrin-mediated
cell adhesion and migration. In contrast to the three up-regulated
clusters, the pathway enrichment of cluster-4_smDw proteins revealed
that neurophysiological processes involving synaptic vesicle recycling,
AMPA receptor removal and delivery, and GABA-A receptor life cycle
are the most enriched pathways, reflecting disrupted synaptic transmission
and plasticity with both glutaminergic and GABAergic signaling being
affected. Associated with neuronal activity and angiogenesis, respectively,
the roles of CDK5 and angiotensin II signaling via beta-arrestin were
also among the top enriched pathways. Interestingly, “ubiquinone
metabolism” was also highly enriched, and 29 subunits of the
NADH:ubiquinone oxidoreductase (respiratory complex I) were down-regulated
by stroke (Table S3E, SI), signifying the
dysfunction of the mitochondrial electron transport chain.Collectively,
clustering and pathway enrichment analyses revealed
that the relatively large-scale changes in a small number of proteins
are associated with acute and adaptive immune responses, suggesting
their potential roles in the course of stroke via immune regulatory
mechanisms.[11,12] In contrast, the relatively smaller
magnitude changes in the majority of the protein hits are linked to
cytoskeleton remodeling (cluster-3_smUp) and synaptic signaling (cluster-4_smDw),
which could be potentially relevant to spontaneous self-recovery from
stroke-induced disruption.
Pharmacodynamic Biomarker Candidates
In addition to
identifying the stroke-induced proteome changes and deciphering molecular
pathways underlying AIS, we further explored the utilities of this
dataset for drug discovery. We aimed to identify pharmacodynamic (PD)
biomarker candidates from the 3625 protein hits by applying the following
criteria: a PD biomarker candidate must be (1) a hit on at least two
consecutive poststroke time points, (2) with a fold change (FC) ≥
2 in stroke versus one or both controls on at least one of the consecutive
time points, (3) with one of the consecutive time points in the first
week, i.e., day 1, 3, or 7, and (4) identified by minimally two unique
peptides. This resulted in a total of 309 proteins as top hits (referred
to as “top hits” hereafter), of which 115, 106, and
88 proteins started changing on day 1, 3, and 7, respectively (Table , Figure S2, SI). Notably, all 309 top hits are up-regulated
in stroke and mostly belonged to cluster-1_earlyUp or cluster-2_laterUp
(Table S4, SI), likely reflecting the sustained
and relatively large changes in inflammation and immune responses
as observed in the pathway enrichment analyses.
Table 1
Top 309 Proteins as Potential PD Markers
and/or Therapeutic Targets for AISa
Gene
names were used for proteins.
The value in parentheses indicates the number of proteins in the category.
Red/Blue font indicates that the protein is annotated as “secreted”
by UniprotKB subcellular location and/or a signal peptide. Proteins
in red font are detected in human plasma according to PPD.
Gene
names were used for proteins.
The value in parentheses indicates the number of proteins in the category.
Red/Blue font indicates that the protein is annotated as “secreted”
by UniprotKB subcellular location and/or a signal peptide. Proteins
in red font are detected in human plasma according to PPD.To further facilitate selection
of biomarker candidates, we used
UniprotKB[13] to determine whether these
proteins have a secretion annotation. Promisingly, 182 of the 309
top hits are annotated as secreted proteins by their subcellular location
and/or the presence of a signal peptide. Approximately 50% of these
secreted proteins responded to stroke on day 1 and lasted up to a
week, and ∼40% responded on day 3 or 7 and lasted up to 2 or
4 weeks, covering a wide poststroke time range. Remarkably, more than
81% of the 182 secretion-annotated proteins are detected in human
plasma according to the Plasma Proteome Database (PPD) (http://www.plasmaproteomedatabase.org/).[14] Together, we propose that these proteins,
particularly those secreted and/or present in human plasma (Table ), could serve as
PD biomarker candidates for acute, subacute, and chronic phases of
stroke. It should be noted that while individual or a few protein
hits could be further validated via complementary methodologies, a
similar proteomic study using the serum or plasma from this mouse
model would be an efficient approach to validate a large number of
the candidates by correlating the protein changes between plasma and
the brain cortex.It is worth noting that some of these 309
top hits could be considered
as therapeutic candidates as well, for example, proteins in the complement
cascade. It is intriguing that many components of the complement cascade
were among the 309 top hits and showed sustained changes for multiple
time points. Previous work has shown that complement protein is deposited
in the peri-infarct region in models of stroke and in post-mortem
human tissue,[15−19] which has been implicated in subsequent neuronal cell death. Indeed,
genetic deletion of several complement proteins reduces infarct volume
in the acute stages after stroke.[17,19,20] More recently, pharmacological inhibition of the
complement cascade, including when administered within 24 h after
stroke, has been shown to reduce infarct size and promote behavioral
recovery.[18,21−23] These studies suggest
that targeting complement proteins could be a potential neuroprotective
strategy after stroke.
Association of the Top Hits to Inflammatory
Markers
Preceding analyses revealed that cluster-1_earlyUp
and/or cluster-2_laterUp
proteins are highly enriched for inflammatory pathways, and the 309
top hits are mostly components of these two clusters. We thus sought
to further determine the associations of these 309 top hits to known
inflammatory markers in stroke, to identify potential PD biomarkers
for AIS therapeutic approaches targeting inflammation. We quantified
the levels of commonly used PD markers for inflammatory targets, namely,
TNFα, IL-1β, IL-6, IL-10, IP-10, CXCL1, MCP-1, and MIP-1α,
in the same brain cortex tissue analyzed in the proteomics study and
evaluated their correlations to the 309 top hits. We additionally
measured albumin (Alb) (Figure S3, SI)
in the same brain tissues as a surrogate marker of the blood–brain
barrier (BBB) leakage poststroke.Pearson correlation analysis
and hierarchical clustering on correlation coefficients (r) showed that the 309 top hits fell into two subgroups, one composed
of about one-third (referred to as the small subgroup) and the other
about two-thirds of the top hits (referred to as the large subgroup),
with each generally correlated to a different subset of cytokines
(Figure ). Most of
the top hits in the small subgroup were highly correlated to Alb (r > 0.6), suggesting potential associations with BBB
disruption,
and correlated to a set of five cytokines (CXCL1, MCP-1, TNFα,
IL-6, and MIP-1α) at different strengths. In contrast, most
of proteins in the large subgroup had no correlation to Alb (|r| < 0.3) and were highly correlated to IP-10, moderately
to highly correlated to MIP-1α and IL-1β, and weakly to
moderately negatively correlated to IL-6. Notably, most proteins in
the small subgroup belong to cluster-1_earlyUp and most from the large
subgroup are part of cluster-2_laterUp (Table S4, SI). This suggests that CXCL1, MCP-1, and TNFα represent
acute responses, and IP-10, MIP-1α and IL-1β reflect the
late response to stroke. Remarkably, at least 15 top hits overlap
with “promising” stroke biomarker candidates (including
MCP-1, TNFα, and IL-6) repeatedly investigated in other studies[24,25] and/or clinical trials[26] for differentiating
AIS from controls and/or subtypes of stroke. Eight of the 15 top hits,
namely, Apoa1, Apoc1, Cd14, Egfr, Fn1, Orm1, Rbp4, and S100a9, belong
to the small subgroup described above and 7 (Arg1, Cybb, Gbp4, Gfap,
Iqgap1, Lag3, and Npl) to the large subgroup (Table S4, SI), indicating that they may play roles in immune
responses at early or later stages poststroke, respectively. The fact
that the top hits from this study recapitulate many promising stroke
biomarkers not only further supports their potential as biomarkers
but also endorses this study’s other novel top hits, with similar
or stronger correlations to cytokines. For example, 10 proteins (Mag,
Igfbp7, Lifr, C8b, C8a, Il1rap, Serpina3k, Serping1 and Serpinf2,
Serpina3n) had the highest correlations to MCP-1 (r > 0.77) and were moderately to highly correlated to CXCL1, IL-6,
TNFα, and Alb (Figure A), and a different set of 10 proteins (Fcer1g, Itgam, Cd68,
Lgals9, Itgb5, Olfml3, Ly86, Lgmn, Hexb, Fcgr1) had the highest correlations
to IP-10 (r > 0.85) and were also correlated to
MIP-1α,
IL-1β, and TNFα, but not to Alb and the other four cytokines
(Figure B).
Figure 4
Association
of the top hits to inflammatory markers. The heatmap
shows pairwise correlation with a color gradient of blue (negative, r = −1) to red (positive, r = 1),
indicating the strength. The dendrograms are hierarchical clustering
on the Pearson correlation coefficients (r); x-axis, the 309 top hits from the proteomic analysis; y-axis, the eight cortical cytokines and albumin by biochemical
analysis. The two big clusters of the top dendrogram are referred
to as “the large subgroup” and “the small subgroup”,
as described in text. Abbreviations: ctx, cortex; BC, biochemical
method. Only strk_Ip and sham_Ip data were used in correlation analysis.
Figure 5
Exemplified differential correlation of top hits to cytokines
and
Alb. Example top hits (y-axis) from (A) the small
subgroup or (B) the large subgroup (see Figure ) were correlated, at cutoff r ≥ 0.33 (r value shown), to different cytokines
(x-axis) and/or Alb. Top hits that had the highest
correlations to MCP-1 or IP-10 were selected for demonstration of
the differentiated correlations to cytokines and Alb. The empty eclipses
indicate no correlation (r < 0.33).
Association
of the top hits to inflammatory markers. The heatmap
shows pairwise correlation with a color gradient of blue (negative, r = −1) to red (positive, r = 1),
indicating the strength. The dendrograms are hierarchical clustering
on the Pearson correlation coefficients (r); x-axis, the 309 top hits from the proteomic analysis; y-axis, the eight cortical cytokines and albumin by biochemical
analysis. The two big clusters of the top dendrogram are referred
to as “the large subgroup” and “the small subgroup”,
as described in text. Abbreviations: ctx, cortex; BC, biochemical
method. Only strk_Ip and sham_Ip data were used in correlation analysis.Exemplified differential correlation of top hits to cytokines
and
Alb. Example top hits (y-axis) from (A) the small
subgroup or (B) the large subgroup (see Figure ) were correlated, at cutoff r ≥ 0.33 (r value shown), to different cytokines
(x-axis) and/or Alb. Top hits that had the highest
correlations to MCP-1 or IP-10 were selected for demonstration of
the differentiated correlations to cytokines and Alb. The empty eclipses
indicate no correlation (r < 0.33).The correlation analysis also divulged other intriguing observations.
IL-6, an important mediator of inflammatory processes in the acute
phase of stroke and reportedly a neurotrophic factor in the later
phase,[27] was observed in this study to
positively or negatively correlate to cluster-1_earlyUp or a subset
of cluster-2_laterUp proteins respectively, suggesting potential roles
of some of cluster-2_laterUp top hits in neurogenesis. Additionally,
proteins from both the small and large subgroups were, moderately
and weakly, respectively, correlated to TNFα, a proinflammatory
cytokine acting both locally in the brain and systemically in the
periphery.[28] This may reflect that some
of the top hits in the large subgroup are related to infiltration
of peripheral immune cells. Furthermore, fewer proteins of the 309
top hits were correlated to IL-10, the only tested anti-inflammatory
cytokine, suggesting that those IL-10-correlated top hits may have
protective roles in stroke. Concertedly, the proteins highly correlated
with cortical cytokines (Table S4, SI)
could serve as PD biomarker candidates for stroke therapeutics modulating
acute or chronic inflammation.In contrast, the correlation
analysis showed that the 309 top hits
were not correlated to the measured cytokine levels in plasma, except
for many of the small subgroup top hits that have weak negative correlations
(r = −0.33 to −0.43) to plasma TNFα
(Table S4, SI), possibly due to increased
peripheral TNFα entering the brain. In addition, the cytokines
measured in plasma were not correlated to their counterparts in brain,
except for a weak correlation (r = 0.36) between
brain and plasma IP-10 (Table S5, SI).
These findings are consistent with the observation that the plasma
levels of these cytokines are, in general, not good indicators of
central nervous system inflammation in this DH-MCAO model.[29] Our results support that plasma levels of these
cytokines do not have the desired specificity as stroke biomarkers.Taken together, our analyses revealed that many of the stroke-changed
top protein hits were highly correlated to one or more brain cytokines
and could therefore be further validated as biomarkers for drug discovery
aimed at early or later stages poststroke or other diseases with underlying
inflammation.
Proteins Associated with Neurophysiological
Processes
As the K-mean clustering analysis
showed, the majority
of stroke-changed proteins belong to cluster-3_smUp (approximately
20%) and cluster-4_smDw (over 70%), and none of these proteins met
the criteria for the 309 top hits due to the small magnitudes of their
changes. Cluster-3_smUp is enriched in proteins involved in cytoskeleton
remodeling (RhoA, RhoC, Cdc42, Rac1, vinculin, different myosin subunits),
as well as integrins (ITGB1, ITGA5) that participate in interactions
with the extracellular matrix. We speculate that the changes in these
proteins may underly pathological or adaptive changes in neurons to
adjust to the damage in their microenvironment, by retracting damaged
neurites or extending new ones. These responses would need to be characterized
functionally to determine whether they are maladaptive or aimed at
restoring homeostasis.The pathway analysis for cluster-4_smDw
showed that many neurophysiological processes were highly enriched
for this cluster of proteins (Table S3E, SI) but not for the other three clusters (Table S3B–D, SI). Given that the initial phase of damage after
stroke is followed by repair processes (e.g., increases in synaptic
tone and axonal sprouting and synaptogenesis) aimed to restore normal
neurological function, we speculate that the significantly changed
proteins in the enriched neurophysiological processes could potentially
play a role in recovery from stroke. We therefore had a closer look
at the top five most enriched (ranked by FDR) neurophysiological process
pathways for cluster-4_smDw, excluding those that were likely due
to detection bias (see methods). The pathways
we examined are synaptic vesicle fusion and recycling in nerve terminals
(rank 1); activity-dependent synaptic AMPA receptor removal (rank
2); constitutive and activity-dependent synaptic AMPA receptor delivery
(rank 6); GABA-A receptor life cycle (rank 10); and constitutive and
regulated NMDA receptor trafficking (rank 12). Among the protein hits
involved in these pathways (Table S3E),
we found 57 proteins that were decreased by stroke on at least 2 consecutive
time points and identified by at least 2 unique peptides (Figure ). Notably, most
of the 57 proteins showed the strongest decrease at day 7, followed
by a return to baseline, which might indicate potential recovery from
stroke injury.
Figure 6
Profiles of 57 stroke-decreased proteins associated with
top enriched
neurophysiological process pathways. x-Axis, sample
collection day poststroke. y-Axis, normalized protein
expression level. Color legend: red, strk_Ip; blue, sham_Ip; and green,
strk_Ct. The letters a–e in parentheses indicate the top enriched
pathway in which the protein was involved: (a) synaptic vesicle fusion
and recycling in nerve terminals (rank 1), (b) activity-dependent
synaptic AMPA receptor removal (rank 2), (c) constitutive and activity-dependent
synaptic AMPA receptor delivery (rank 6), (d) GABA-A receptor life
cycle (rank 10), and (e) constitutive and regulated NMDA receptor
trafficking (rank 12). The numbers 1–5 in parentheses indicate
on which day the protein was significantly changed poststroke: (1)
days 1, 3, 7, and 14, (2) days 1, 3, 7, 14, and 28, (3) days 3 and
7, (4) days 3, 7, and 14, (5) days 7 and 14.
Profiles of 57 stroke-decreased proteins associated with
top enriched
neurophysiological process pathways. x-Axis, sample
collection day poststroke. y-Axis, normalized protein
expression level. Color legend: red, strk_Ip; blue, sham_Ip; and green,
strk_Ct. The letters a–e in parentheses indicate the top enriched
pathway in which the protein was involved: (a) synaptic vesicle fusion
and recycling in nerve terminals (rank 1), (b) activity-dependent
synaptic AMPA receptor removal (rank 2), (c) constitutive and activity-dependent
synaptic AMPA receptor delivery (rank 6), (d) GABA-A receptor life
cycle (rank 10), and (e) constitutive and regulated NMDA receptor
trafficking (rank 12). The numbers 1–5 in parentheses indicate
on which day the protein was significantly changed poststroke: (1)
days 1, 3, 7, and 14, (2) days 1, 3, 7, 14, and 28, (3) days 3 and
7, (4) days 3, 7, and 14, (5) days 7 and 14.Stroke changed the abundance of many proteins related to the synaptic
expression of neurotransmitter receptors, such as proteins in AMPA
receptor removal and delivery, NMDA receptor trafficking, and GABA-A
receptor life cycle. Included are many phosphatase subunits of PP2A,
PP2B, and PP1B, components of clustering of AMPA-type glutamate receptors
(Nptx1, Nptx2, Nptxr), multiple PKC subunits (Prkca, Prkce, Prkg,
Prkcz), the Arp2/3 complex (Arpc1a), cytoskeleton remodeling (Actr3b,
Gripap1, Kif5a, Kif5c) and postsynaptic components or those related
(Dlgap1, Shank1), and protein kinases (Cdk5, Prkar1b, Prkar2a, Csnk2a1,
Csnk2a2). Multiple α- and β-tubulin subunits (Tuba4a,
Tuba8, Tubb2a, Tubb3, Tubb4a, Tubb4b, Tubb5), changed at different
time points, were among 13 stroke-decreased proteins associated with
the GABA-A receptor life cycle, indicating continued cytoskeleton
remodeling via its major component microtubules. These changes in
abundance of proteins related to AMPA and GABA-A receptors are consistent
with recent attempts to target these receptor systems to promote stroke
recovery. For example, potentiating the function of AMPA receptors
and increasing their cell surface expression via the cytoskeletal
protein CRMP2 promote the recovery of motor function in stroke models.[30,31] Other components of AMPA receptor clustering or cytoskeleton remodeling
identified here may serve as additional therapeutic targets. Similarly,
inhibiting α5-containing GABA receptors has been shown to promote
sensorimotor recovery in mice.[32−34]Additionally, 14 proteins
in synaptic vesicle fusion and recycling
were significantly altered poststroke. These include calmodulin (Calm1),
PP2B catalytic subunits (Ppp3ca, Ppp3cb, Ppp3cc), and PIP5K1-gamma
(Pip5k1c), which are important upstream regulators of the transduction
of intracellular Ca2+-mediated signals, indicating potentially
disrupted neurotransmission and/or plasticity in stroke. Some of these
proteins may also be potential therapeutic targets to promote recovery
poststroke, and further work will be needed to validate them.Taken together, our analysis of the cortex of stroke animals showed
that synaptic function regulation via both glutaminergic and GABAergic
receptor recycling is the most highly enriched underlying neurophysiological
process, and the analysis generated a short list of protein candidates
potentially valuable for neuronal-modulating therapeutics and poststroke
recovery. It is worth pointing out that the changes we observed in
the cortex likely reflect synaptic remodeling mechanisms encompassing
motor and sensory projection neurons, transcallosal projections, and
cortico-thalamic neurons.[35] To obtain a
more complete picture of the recovery pathways, proteomic analysis
of the thalamus and medulla, which are major neuronal relay stations,
would need to be performed. Moreover, because many synaptic and plasticity
events are regulated by phosphorylation, a phosphoproteomic analysis
may provide additional insights about the mechanisms of poststroke
recovery and identify new ways to promote recovery. Finally, it would
be worthwhile to correlate the changes in protein abundance (or phosphorylation)
to other measurements of poststroke recovery, such as an increase
in electrophysiological activity or behavioral improvements.
Conclusion
The DH-MCAO model of stroke is well-suited for drug discovery research
due to its reproducible detection of neurological deficits and molecular
changes. To our knowledge, this is the first in-depth proteomic study
using this model spanning a time course of 4 weeks poststroke. Our
analysis reliably detected stroke-induced changes in the cortex in
nearly half of the measured proteome and showed possible proteome
recovery starting at day 14 poststroke. This study revealed that immune
response pathways were enriched at both acute and chronic phases of
stroke and reflected ∼8.6% of the changed proteome, shedding
new insights into the immune-regulatory mechanisms underlying stroke.
The 309 top hits generated from this study, which overlapped with
many promising stroke biomarkers, temporally responded to stroke at
a relatively large-scale and differentially correlated to cortical
cytokines and mostly secreted proteins, making them good biomarker
candidates for immune-modulating and inflammation-targeting therapies.
The study further divulged that decreases in protein abundances were
relatively small, mostly peaked at 1 week poststroke, and subsequently
returned to baseline, suggesting a process of spontaneous poststroke
recovery. The identification of highly enriched neurophysiological
processes and associated proteins offers new understandings in molecular
mechanisms of AIS and should facilitate efforts to modulate synaptic
receptor systems to promote neurological recovery from stroke.
Authors: Carl Atkinson; Hong Zhu; Fei Qiao; Juan Carlos Varela; Jin Yu; Hongbin Song; Mark S Kindy; Stephen Tomlinson Journal: J Immunol Date: 2006-11-15 Impact factor: 5.422
Authors: Y Chicheportiche; P R Bourdon; H Xu; Y M Hsu; H Scott; C Hession; I Garcia; J L Browning Journal: J Biol Chem Date: 1997-12-19 Impact factor: 5.157
Authors: Ali Alawieh; Andrew Elvington; Hong Zhu; Jin Yu; Mark S Kindy; Carl Atkinson; Stephen Tomlinson Journal: J Neuroinflammation Date: 2015-12-30 Impact factor: 8.322
Authors: Constanze Schanbacher; Michael Bieber; Yvonne Reinders; Deya Cherpokova; Christina Teichert; Bernhard Nieswandt; Albert Sickmann; Christoph Kleinschnitz; Friederike Langhauser; Kristina Lorenz Journal: Int J Mol Sci Date: 2022-01-09 Impact factor: 5.923