Bryna L Fitzgerald1, M Nurul Islam1, Barbara Graham1, Sebabrata Mahapatra1, Kristofor Webb1, W Henry Boom2,3, Stephanus T Malherbe4, Moses L Joloba5, John L Johnson2,3, Jill Winter6, Gerhard Walzl4, John T Belisle1. 1. Mycobacteria Research Laboratories, Department of Microbiology, Immunology and Pathology , Colorado State University , 200 West Lake Street, 0922 Campus Delivery , Fort Collins , Colorado 80523 , United States. 2. Department of Medicine, Tuberculosis Research Unit (TBRU) , Case Western Reserve University , 10900 Euclid Avenue , Cleveland , Ohio 44106 , United States. 3. Uganda-Case Western Reserve University Research Collaboration , 28A Upper Kololo Terrace , Kampala , Uganda. 4. DST/NRF Centre of Excellence for Biomedical Tuberculosis Research and MRC Centre for Molecular and Cellular Biology, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences , Stellenbosch University , P.O. Box 241, Francie van Zijl Drive , Cape Town 8000 , South Africa. 5. School for Biomedical Sciences , Makerere University , P.O. Box 7062, Kampala , Uganda. 6. Catalysis Foundation for Health , 2100 Addison Street , Berkeley , California 94704 , United States.
Abstract
The evaluation of new tuberculosis (TB) therapies is limited by the paucity of biomarkers to monitor treatment response. Previous work detected an uncharacterized urine metabolite with a molecular mass of 874.3547 Da that showed promise as a biomarker for successful TB treatment. Using mass spectrometry combined with enzymatic digestions, the metabolite was structurally characterized as a seryl-leucine core 1 O-glycosylated peptide (SLC1G) of human origin. Examination of SLC1G in urine revealed a significant abundance increase in individuals with active TB versus their household contacts and healthy controls. Moreover, differential decreases in SLC1G levels were observed by week one in TB patients during successful treatment versus those that failed treatment. The SLC1G levels were also associated with clinical parameters used to measure bacterial burden (GeneXpert) and inflammation (positron emission tomography-computed tomography (PET-CT)). These results demonstrate the importance of metabolite identification and provide strong evidence for applying SLC1G as a biomarker of TB treatment response.
The evaluation of new tuberculosis (TB) therapies is limited by the paucity of biomarkers to monitor treatment response. Previous work detected an uncharacterized urine metabolite with a molecular mass of 874.3547 Da that showed promise as a biomarker for successful TB treatment. Using mass spectrometry combined with enzymatic digestions, the metabolite was structurally characterized as a seryl-leucine core 1 O-glycosylated peptide (SLC1G) of human origin. Examination of SLC1G in urine revealed a significant abundance increase in individuals with active TB versus their household contacts and healthy controls. Moreover, differential decreases in SLC1G levels were observed by week one in TB patients during successful treatment versus those that failed treatment. The SLC1G levels were also associated with clinical parameters used to measure bacterial burden (GeneXpert) and inflammation (positron emission tomography-computed tomography (PET-CT)). These results demonstrate the importance of metabolite identification and provide strong evidence for applying SLC1G as a biomarker of TB treatment response.
Tuberculosis
(TB) is the leading cause of mortality worldwide due to an infectious
agent, Mycobacterium tuberculosis (Mtb).[1] A contributing factor to the global
burden of TB is the continued emergence of drug resistant Mtb strains. Efforts to combat such strains have led to
new anti-TB therapies and regimens.[2] Nevertheless,
clinical evaluations of new drugs are protracted and typically include
a six month treatment regimen with patient follow-up for 12 to 18
months to monitor for disease relapse. These parameters increase the
expense of clinical trials and impede the implementation of new drugs
for patient care.[2] Thus, identification
and validation of biomarkers that provide an early surrogate end point
for the long-term outcome of TB treatment has emerged as a research
priority.During the course of Mtb infection
and treatment, metabolic changes occur in the human host as well as
the pathogen.[3] Metabolomics provides a
unique tool to capture changes in the profile of small molecule metabolites
during infection and treatment. In fact, liquid chromatography–mass
spectrometry (LC-MS)-based metabolomics have successfully detected
metabolic alterations during the progression from Mtb infection to active disease and during anti-TB therapy, supporting
the development of metabolite-based biosignatures.[4−6] However, many
of the metabolites detected in untargeted metabolomics studies remain
structurally uncharacterized or only putatively identified and require de novo structural identification.[7,8]Previously, 12 urine metabolites that significantly changed during
successful anti-TB treatment were reported. However, five of these
metabolites did not match with any compound in available databases,
and seven had putative, yet unconfirmed, identifications.[5] The structural characterization of one of these
metabolites as N-acetylisoputreanine has led to the
discovery that alterations in polyamine metabolism occur during TB
and response to treatment,[8] an observation
now confirmed by others.[9] A second unidentified
metabolite designated as molecular feature (MF) 874.3547, on the basis
of its monoisotopic mass of 874.3547 Da, decreased in abundance early
in the treatment of TB and remained at decreased levels until the
end of treatment. In this current study, we applied liquid chromatography
(LC) tandem mass spectrometry (MS/MS) combined with enzymatic degradation
and analyses of synthetic products to elucidate the structure of MF
874.3547 as a seryl-leucine core 1 O-glycosylated
peptide (SLC1G). Evaluation of SLC1G in urine from patients with active
pulmonary TB (index cases) and their healthy household contacts (HHC)
confirmed that this novel urine metabolite is significantly elevated
in active TB. Assessment of SLC1G levels in TB patients undergoing
standard anti-TB therapy revealed changes in metabolite abundances
that are associated with different clinical outcomes (treatment failure,
stable long-term cure, or recurrent TB) and measures of inflammation
and bacterial load.
Results and Discussion
MS/MS Analyses of MF 874.3547
Reveal Glycopeptide Fragmentation Patterns
Initial structural
information pertaining to MF 874.3547 was obtained by LC-MS/MS fragmentation
at different collision energies, with the consistency of the MF 874.3547
fragmentation pattern confirmed using urine samples from six TB patients.
MF 874.3547 also was present in commercial and healthy control (HC)
urine (Figure S1). A representative MS/MS
spectrum of the [M + H]+ adduct (m/z 875.3582) of MF 874.3547 from one TB patient is shown
in Figure A. Manual
interrogation of the MS/MS spectrum revealed diagnostic fragment ions
for N-acetylhexosamine (m/z 204.0856) and N-acetylneuraminic acid
(Neu5Ac) (m/z 292.1045), as well
as neutral losses corresponding to Neu5Ac (291.09 Da), hexose (162.05
Da), and N-acetylhexosamine (203.07 Da) (Figure A). Importantly,
the fragment ions m/z 584.2627, m/z 422.2120, and m/z 219.1328 represented the sequential loss of Neu5Ac, hexose,
and N-acetylhexosamine from the parent ion (m/z 875.3582), respectively. The difference
between the parent ion m/z 875.3582
and the fragment ion m/z 657.2318
represented a neutral loss of 218.1264 Da that corresponded to the
fragment ion m/z 219.1328 (Figure A). These data led
to the hypothesis that MF 874.3547 is a glycoconjugate containing
an oligosaccharide composed of N-acetylhexosamine,
hexose, and Neu5Ac (m/z 657.2318)
attached to an unknown structure with a mass of 218.1264 Da (m/z 219.1328).
Figure 1
MS/MS interrogation of
MF 874.3547 leads to hypothesis of glycopeptide structure. Representative
MS/MS spectrum of MF 874.3547 in TB patient urine obtained using collision
energy of 20 V (A). Horizontal lines depict neutral losses within
the MS/MS spectrum for Neu5Ac (291.09 amu) in red, hexose (162.05
amu) in green, N-acetylhexosamine (203.07 amu) in
blue, and the unknown dipeptide (218.13 amu) in black. Diagnostic
fragment ions for N-acetylhexosamine (m/z 204.0856) and Neu5Ac (m/z 292.1045) were present. (B) A hypothesized structure with
theoretical fragment ions matching those observed in the experimental
MS/MS spectrum.
MS/MS interrogation of
MF 874.3547 leads to hypothesis of glycopeptide structure. Representative
MS/MS spectrum of MF 874.3547 in TB patient urine obtained using collision
energy of 20 V (A). Horizontal lines depict neutral losses within
the MS/MS spectrum for Neu5Ac (291.09 amu) in red, hexose (162.05
amu) in green, N-acetylhexosamine (203.07 amu) in
blue, and the unknown dipeptide (218.13 amu) in black. Diagnostic
fragment ions for N-acetylhexosamine (m/z 204.0856) and Neu5Ac (m/z 292.1045) were present. (B) A hypothesized structure with
theoretical fragment ions matching those observed in the experimental
MS/MS spectrum.Typically, biological
glycoconjugates are glycolipids or glycoproteins; thus, the m/z 219.1328 fragment ion likely represented
a lipid or peptide structure. This m/z value did not match any lipid moiety or fragments of glycolipids
annotated in existing databases,[10] and
the glycosylation pattern of N-acetylhexosamine,
hexose, and Neu5Ac was similar to that of glycoproteins rather than
glycolipids.[11,12] Thus, the m/z 219.1328 fragment ion was hypothesized to represent a
peptide. Glycosylation of glycoproteins typically occurs on asparagine
(Asn) residues (N-linked), as well as serine (Ser)
or threonine (Thr) residues and to a lesser extent tyrosine (Tyr),
hydroxylysine, or hydroxyproline residues (O-linked).[12−14] Six amino acid combinations that included at least one of these
amino acids and possessed a predicted monoisotopic mass of 218.1311
Da were identified: Ser and leucine (Leu) or isoleucine (Ile), or
Thr and valine (Val). This indicated an O-glycosylated
peptide.The most predominant form of protein O-glycosylation is a Ser or Thr residue α linked to N-acetylgalactosamine (GalNAc) followed by subsequent sugar
moieties and terminating with Neu5Ac.[12,15] Using this
information and the fragment ion data (Figure A), we developed a hypothesized structure
for MF 874.3547 (Figure B). In silico analysis of this hypothesized structure
yielded the fragment ions m/z 657.2354,
584.2668, 454.1561, 422.2140, 292.1032, and 219.1346 that were observed
in the experimental spectrum (Figure A).
Confirmation of MF 874.3547 as Seryl-Leucine
Core 1 O-Glycosylated Peptide (SLC1G)
To
confirm that the proposed oligosaccharide structure terminated in
Neu5Ac, MF 874.3547 was enriched from human urine by HPLC and treated
with neuraminidase enzymes.[15] Two neuraminidases,
one with specificity for α2-3 terminal Neu5Ac and one with broad
specificity for α2-3, α2-6, or α2-8, were applied
to MF 874.3547 and commercial standards with α2-3 or α2-6
linked terminal Neu5Ac residues (Figures S2 and S3). LC-MS analyses of the enzymatic digests demonstrated complete
hydrolysis of MF 874.3547 with both enzymes. Note the absence of the m/z 875.3582 product and emergence of new
products, m/z 310.1130 (Neu5Ac)
and m/z 584.2661 (a disaccharide
linked dipeptide) in Figure S2. This revealed
that the terminal sugar was an α2-3 linked Neu5Ac, as the use
of commercial glycopeptide standards demonstrated a linkage other
than α2-3 could not be digested with the α2-3 specific
neuraminidase. Specifically, the α2-3 neuraminidase digested
products with an α2-3 linked Neu5Ac (Figure S3B,C) but not a structure with an α2-6 linked Neu5Ac
(Figure S3A).The disaccharide moiety
remaining after digestion with neuraminidase was proposed to be a
core 1 oligosaccharide (Galβ1-3GalNAc-), a structure that can
be capped with Neu5Ac.[12] To test this,
MF 874.3547 was digested with neuraminidase and an O-glycosidase possessing endo-α-N-acetylgalactosaminidase
activity and specificity for core 1 and core 3 O-linked
disaccharides.[12,15] Treatment of commercial standards
with the O-glycosidase demonstrated activity consistent
with that expected, as this enzyme only digested a commercial standard
possessing the core 1 disaccharide (Figure S3C). LC-MS analyses of enriched MF 874.3547 following neuraminidase
and O-glycosidase treatment resulted in detection
of two ions belonging to a disaccharide (m/z 384.1500) and the putative dipeptide (m/z 219.1328), as well as the products observed with
the neuraminidase only digest (Figure ). These data confirmed that MF 874.3547 contained
either a core 1 or core 3 O-linked disaccharide with
a terminal α2-3 linked Neu5Ac. Since core 3 disaccharides are
represented by GlcNAcβ1-3GalNAc-, only a core 1 disaccharide
was consistent with both the MS/MS spectrum of MF 874.3547 (Figure A) and the neuraminidase
and O-glycosidase treatment data (Figure ). Thus, the glycosyl structure
of MF 874.3547 was confirmed as Neu5Acα2-3Galβ1-3GalNAc.
Figure 2
Enzymatic
deglycosylation and MS confirmation of core 1 glycosylation. MF 874.3547
untreated (A, C, E, G, I) and treated with α2-3,6,8 neuraminidase
and O-glycosidase (B, D, F, H, J) were analyzed by
LC-MS, and the spectra were evaluated by extracted ion chromatogram
(EIC) for intact glycopeptide (A and B), the glycopeptide minus Neu5Ac
(C and D), the Hex–HexNAc disaccharide (E and F), Neu5Ac (G
and H), and the deglycosylated putative diamino acid S/TX (I and J).
The * in panel C indicates a low level of the glycopeptide minus Neu5Ac
(m/z 584.2661) present in the undigested
sample. Note the level of this product was considerably higher following
digestion (G). In source fragmentation of SLC1G yielded the m/z 584.2661 product (# in panel C) at
the same retention time as the undigested glycopeptide.
Enzymatic
deglycosylation and MS confirmation of core 1 glycosylation. MF 874.3547
untreated (A, C, E, G, I) and treated with α2-3,6,8 neuraminidase
and O-glycosidase (B, D, F, H, J) were analyzed by
LC-MS, and the spectra were evaluated by extracted ion chromatogram
(EIC) for intact glycopeptide (A and B), the glycopeptide minus Neu5Ac
(C and D), the Hex–HexNAc disaccharide (E and F), Neu5Ac (G
and H), and the deglycosylated putative diamino acid S/TX (I and J).
The * in panel C indicates a low level of the glycopeptide minus Neu5Ac
(m/z 584.2661) present in the undigested
sample. Note the level of this product was considerably higher following
digestion (G). In source fragmentation of SLC1G yielded the m/z 584.2661 product (# in panel C) at
the same retention time as the undigested glycopeptide.The structure of the dipeptide was determined by
comparing synthetic peptide standards to the retention time and MS/MS
spectrum for the m/z 219.1328 product
resulting from the neuraminidase and O-glycosidase
digestion of MF 874.3547. Only the seryl-leucine standard matched
both the retention time and MS/MS spectrum of the m/z 219.1328 product (Figures and S4). This
confirmed that the entire structure of MF 874.3547 was Neu5Acα2-3Galβ1-3GalNAcα1-O-SerLeu, a SLC1G (Figure E).
Figure 3
Confirmation of seryl-leucine peptide and SLC1G structure.
EIC for m/z 219.1328 in LC-MS spectra
of α2-3,6,8 neuraminidase and O-glycosidase
treated MF 874.3547 (A) and seryl-leucine standard (C). MS/MS of m/z 219.1328 from α2-3,6,8 neuraminidase
and O-glycosidase treated MF 874.3547 (B) and the
seryl-leucine standard (D). The confirmed structure of Neu5Acα2-3Galβ1-3GalNAcα1-O-SerLeu for MF 874.3547 (E).
Confirmation of seryl-leucine peptide and SLC1G structure.
EIC for m/z 219.1328 in LC-MS spectra
of α2-3,6,8 neuraminidase and O-glycosidase
treated MF 874.3547 (A) and seryl-leucine standard (C). MS/MS of m/z 219.1328 from α2-3,6,8 neuraminidase
and O-glycosidase treated MF 874.3547 (B) and the
seryl-leucine standard (D). The confirmed structure of Neu5Acα2-3Galβ1-3GalNAcα1-O-SerLeu for MF 874.3547 (E).Naturally occurring peptides are common in the urine of healthy
individuals and are mainly products of normal proteolytic degradation
that occurs in tissue or body fluids. Altered levels of these peptides
may correspond to changes in protein level or protease activity and
have been associated with a variety of diseases such as diabetes,
chronic kidney disease, and rheumatoid arthritis, as well as aging.[16−19] Although Mtb produces O-linked
glycoproteins, these products possess oligosaccharides of mannose[20] and not the core 1 oligosaccharide structure
defined for SLC1G. Additionally, as demonstrated below, SLC1G was
identified in urine of healthy controls. Thus, we concluded that SLC1G
results from the proteolysis of a host glycoprotein.The elucidation
of MF 874.3547 as SLC1G provides a premise for the structural characterization
of additional metabolites identified via untargeted metabolomics experiments
and supports the use of host metabolites as indicators of infectious
disease. This is not a new concept in TB as host metabolites were
being investigated as biomarkers in the 1950s.[21] More recently, Rhee and colleagues[9] revealed several host urine metabolites that distinguished TB patients
from those with other pulmonary diseases. These metabolites included
sugars contained in the SLC1G structure.
Levels of SLC1G Are Increased
in Patients with Active TB Compared to HHC
To establish that
increased levels of SLC1G are associated with active TB, the relative
abundance of this glycopeptide in active index TB patients and HHCs
was evaluated with urine collected under the Kawempe Community Health
Study (KCHS).[22] The HHC included both tuberculin
skin test (TST) negative (−) and positive (+) individuals.
SLC1G levels were significantly higher in the index patients compared
to that of HHC (index vs TST–, p = 0.0003;
index vs TST+ , p = 0.0255) (Figure A). SLC1G was moderately higher in the TST+
group as compared to the TST– group of HHCs, but a significant
difference (p = 0.2201) was not obtained (Figure A). Using samples
from a separate study, the Catalysis Study,[23] a significant difference between urine SLC1G levels in active TB
patients at the time of diagnosis and healthy controls was observed
(Figure B). It is
noted that the relative abundance of the SLC1G in the TB patients
of the Catalysis Study was similar to that of the SLC1G in the index
patients of the KCHS. The increased level of SLC1G in active TB as
compared to healthy individuals provides strong evidence that this
metabolite is associated with active disease and its initial discovery
as a potential treatment response biomarker via untargeted metabolomics[5] was not due to alteration of host metabolism
by anti-TB drugs.
Figure 4
SLC1G levels are associated with active TB and different
treatment response outcomes. Log2 abundances of SLC1G in
index TB patients, as compared to TST– and TST+ HHCs of the
KCHS (A) and active TB patients at their diagnostic time point and
HCs of the Catalysis Study (B). Log2-fold change in SLC1G
levels over 24 weeks of treatment differed in cured (C) and recurrent
(E) treatment outcome groups as compared to the failed (D) treatment
group. The dashed line indicates a log2-fold change of
−2. Fold changes at each time point are based on SLC1G levels
at the Dx time point. The data are representative of index (n = 10), TST– (n = 14), and TST+
(n = 12) patients (A), 14 healthy controls and TB
patients (n = 36) (B), and cured (n = 15), failed (n = 8), and recurrent patients (n = 12) (C, D, E). One cured patient was not included in
the analysis for (C) due to a missing week 8 time point. Statistical
significance was determined using a one-way ANOVA with Tukey’s
multiple comparison test (A), an unpaired two-tailed t test (B), and a repeated measures one-way ANOVA with Tukey’s
multiple comparison test (C, D, E). Error bars represent the 95% confidence
interval from the mean (*, p < 0.05; **, p < 0.01; ***, p < 0.001; ****, p < 0.0001).
SLC1G levels are associated with active TB and different
treatment response outcomes. Log2 abundances of SLC1G in
index TB patients, as compared to TST– and TST+ HHCs of the
KCHS (A) and active TB patients at their diagnostic time point and
HCs of the Catalysis Study (B). Log2-fold change in SLC1G
levels over 24 weeks of treatment differed in cured (C) and recurrent
(E) treatment outcome groups as compared to the failed (D) treatment
group. The dashed line indicates a log2-fold change of
−2. Fold changes at each time point are based on SLC1G levels
at the Dx time point. The data are representative of index (n = 10), TST– (n = 14), and TST+
(n = 12) patients (A), 14 healthy controls and TB
patients (n = 36) (B), and cured (n = 15), failed (n = 8), and recurrent patients (n = 12) (C, D, E). One cured patient was not included in
the analysis for (C) due to a missing week 8 time point. Statistical
significance was determined using a one-way ANOVA with Tukey’s
multiple comparison test (A), an unpaired two-tailed t test (B), and a repeated measures one-way ANOVA with Tukey’s
multiple comparison test (C, D, E). Error bars represent the 95% confidence
interval from the mean (*, p < 0.05; **, p < 0.01; ***, p < 0.001; ****, p < 0.0001).
Changes in SLC1G Levels Correlate with Different Treatment Response
Outcomes
Our previous study that detected SLC1G as a potential
biomarker anti-TB treatment did not evaluate whether changes in SLC1G
levels during treatment differed between individuals successfully
treated versus those that failed treatment.[5] To demonstrate that SLC1G levels could serve as a prognostic biomarker
of treatment success, the Catalysis study samples were utilized on
the basis of availability of longitudinal patient samples corresponding
to different treatment outcomes. We evaluated urine samples from TB
patients designated as clinically cured (individuals that converted
to sputum culture negative before or at the end of the 24 week treatment
and remained culture negative during a two year follow-up) (n = 15), those designated as treatment failures (individuals
that remained sputum culture positive at the end of the 24 week treatment)
(n = 8), and a group of patients designated as cured
but that developed recurrent disease within 18 months of successful
treatment (n = 12)[23] (Table ). The levels of SLC1G
at each time point in the cured and failed patient groups were compared
using one-way ANOVA with Tukey’s multiple comparison test.
This demonstrated that the only significant difference in SLC1G levels
between these two patient groups was at the week 24 time point (p = 0.038).
Table 2
Patient Characteristics, Catalysis Study
(South Africa)a
TB cases (n = 35)
participant characteristics (n = 48)
cured (n = 15)
recurrence (n = 12)
failure (n = 8)
HC (n = 13)
mean age, years (SD)
33 (9)
35 (10)
34 (15)
32 (12)
male, n (%)
9 (60%)
7 (58%)
5 (63%)
7 (54%)
previous TB, n (%)
6 (40%)
4 (33%)
4 (50%)
NA
mean
treatments missed, n (range of treatments missed)
7 (0–23)
7 (0–32)
43 (0–99)b
NA
NA, not applicable.
The
mean number of missed treatments in the intensive phase (first two
months) and continuation phase (two to six months) of treatment were
8 and 35, respectively.
However, a comparison of SLC1G levels
at baseline (diagnosis) versus treatment time points (1, 4, 8, and
24 weeks) demonstrated a decrease during the first week of treatment
in all patient groups; however, the decrease was only significant
for the cured and recurrent TB groups (p = 0.0010
and 0.0243, respectively) (Figure C–E). Likewise, SLC1G levels at all subsequent
time points of treatment (4, 8, and 24 weeks) in comparison to baseline
levels differed significantly for the cured and recurrent TB groups
but not the treatment failure group (Figure ). The rapid decrease in urine levels of
SLC1G after the start of anti-TB therapy coincides with early periods
of treatment when the largest decrease in bacterial load is observed.[24]The cured patient group also yielded a
mean log2-fold change between baseline and the end of treatment
(week 24) that was significantly larger than the fold change at week
1 of treatment (Figures and S5A–C). The magnitude of change
in SLC1G levels at weeks 4, 8, and 24 in the failed treatment group
was less than in the other groups, with a mean log2-fold
change of around −1. The cured and recurrent TB groups, in
comparison, yielded mean log2-fold changes below −1
(weeks 4 and 8) or below −2 (week 24) (Figures and S5A–C). The lack of a significant decrease in SLC1G levels during the
initial intensive phase of therapy (first two months) could have been
related to a greatly increased number of missed treatments in this
patient group (Table ). However, the vast majority of missed treatments occurred in the
continuation phase, with only one patient having a greater number
of missed treatments during the intensive phase.Overall, SLC1G
levels behaved similarly in cured and recurrent TB patients. This
observation was not unexpected as the recurrent TB patients were initially
designated as cured at the end of treatment (week 24) and developed
recurrent TB within one year of successfully completing treatment.
Further, it is unknown whether recurrent TB in this patient cohort
was due to relapse of treated infections or new infections.[23] The relapse of TB after the completion of anti-TB
therapy is an outcome measured during clinical trials and is an important
parameter for which biomarkers are needed.[2] Additional studies with longitudinal urine from true relapse patients
are required to accurately evaluate the potential value of SLC1G as
a biomarker of relapse.
SLC1G Levels Associate with Clinical Measurements
of Inflammation and Bacterial Burden
The measurements of
bacterial load by GeneXpert MTB/RIF and inflammation by positron emission
tomography-computed tomography (PET-CT) imaging are used as indicators
of treatment efficacy and disease resolution in clinical studies of
anti-TB therapies.[23,25−27] Thus, using
the cured and failed treatment groups of the Catalysis Study, we explored
whether a relationship existed between SLC1G levels and pulmonary
inflammation (PET-CT combined total glycolytic activity (COM TGAI)
score) or bacterial load (GeneXpert Ct value) regardless of treatment
outcome. Using linear mixed models, a 2-fold increase in SLC1G levels
was associated with an increase in the PET-CT COM TGAI score of 0.82
(95% CI: 0.58–1.06) and a decrease in the GeneXpert MTB/RIF
Ct values of 3.2 (95% CI: 2.4–4.5). The COM TGAI provides a
single metric to indicate the total burden of lung lesions as measured
by PET-CT. To visualize the associations determined using linear mixed
models, the COM TGAI scores or GeneXpert MTB/RIF Ct values for all
patients and all time points, regardless of treatment outcome, were
grouped into four quantiles and plotted against SLC1G log2 normalized abundances (Figure A,B). A positive association occurred between SLC1G
levels and increased inflammation (higher COM TGAI scores) (Figure A). Likewise, lower
SLC1G levels were observed in patients with lower bacterial burdens
(i.e., increased GeneXpert MTB/RIF Ct values) (Figure B). These associations are also apparent
in scatter plots of COM TGAI scores or GeneXpert MTB/RIF Ct values
versus SLC1G log2 normalized abundances for all patients
and all time points (Figure S5D,E). The
studies of Malherbe et al.[23] revealed that,
within the cured patient population, only 16% had resolved their tubercle
lesions at the end of treatment. We observed SLC1G levels were increased
in the cured patients who remained PET-CT scan positive (i.e., COM-TGAI
scores >400) at the end of treatment relative to those that were
deemed resolved (Figure C). However, this difference was not significant (p = 0.1079).
Figure 5
SLC1G levels associate with clinical measurements of bacterial
burden and inflammation. Box plot of SLC1G abundances versus quantile
grouped GeneXpert MTB/RIF Ct values (A) and COM TGAI scores (B) for
all cured and failed patients at all time points. SLC1G abundances
trended to increase in cured patients when week 24 COM-TGAI scores
indicate persistent inflammation, COM-TGAI scores >400 (C). SLC1G
abundances differed significantly in the cured patient population
before and after TSCC (D). The data represent patients with COM TGAI >
400 (n = 6) and patients with COM TGAI < 400 (n = 10) (C) and cured patients (n = 13)
(D). Three patients were not included in the analysis for (D) due
to missing time points before or after sputum culture conversion.
Statistical significance was performed with an unpaired two-tailed t test (C) and paired t test (D). Box-and-whiskers
plots have lines at the 25th, 50th (median), and 75th percentiles,
and the whiskers extend to 1.5 times the interquartile range (A, B).
Error bars represent the 95% confidence interval from the mean (C,
D) (**, p < 0.01).
SLC1G levels associate with clinical measurements of bacterial
burden and inflammation. Box plot of SLC1G abundances versus quantile
grouped GeneXpert MTB/RIF Ct values (A) and COM TGAI scores (B) for
all cured and failed patients at all time points. SLC1G abundances
trended to increase in cured patients when week 24 COM-TGAI scores
indicate persistent inflammation, COM-TGAI scores >400 (C). SLC1G
abundances differed significantly in the cured patient population
before and after TSCC (D). The data represent patients with COM TGAI >
400 (n = 6) and patients with COM TGAI < 400 (n = 10) (C) and cured patients (n = 13)
(D). Three patients were not included in the analysis for (D) due
to missing time points before or after sputum culture conversion.
Statistical significance was performed with an unpaired two-tailed t test (C) and paired t test (D). Box-and-whiskers
plots have lines at the 25th, 50th (median), and 75th percentiles,
and the whiskers extend to 1.5 times the interquartile range (A, B).
Error bars represent the 95% confidence interval from the mean (C,
D) (**, p < 0.01).Time to sputum culture conversion is another end point applied
in clinical studies of anti-TB drugs as a measure of bacterial clearance.[28] The SLC1G levels in cured patient samples prior
to being designated culture negative were significantly higher (p = 0.0166) than the SLC1G levels in those same patients
following conversion to culture negative (Figure D). These data establish strong evidence
that SLC1G levels were linked with bacterial load and lung pathology
in the population study and could be applied to monitor the resolution
of disease and infection during treatment.
Plasma Protease C1 Inhibitor:
A Potential Source of SLC1G
To establish a potential origin
of SLC1G, the UniProt database was queried for documented glycosylated
proteins, resulting in a list of 4508 proteins. Nine proteins with O-glycosylation of a Ser-Leu motif were identified. On the
basis of previous literature, seven of these proteins were determined
to have the glycan structure of SLC1G (Table S1). Whole blood RNaseq data for the Catalysis Study TB patients[29] were interrogated to determine whether any of
the levels of the transcripts for these seven proteins followed the
same trend as that observed for SLC1G during treatment. The transcript
belonging to plasma protease C1 inhibitor (C1INH) decreased approximately
4-fold during treatment, following a similar trend to that of SLC1G
(Table S1). One study has reported changes
in expression of C1INH transcripts during anti-TB treatment that are
similar to the changes we observed in urine levels of SLC1G.[36] Additionally, expression of C1INH transcripts
also increases in active TB and household contacts that progress to
active disease.[34,35] On the basis of these results,
C1INH was highlighted as a plausible precursor protein of the glycosylated
dipeptide.C1INH is an abundant circulating protease inhibitor
that regulates complement and coagulation cascades and increases during
inflammation.[30] One of the O-glycosylation sites in C1INH (Ser-42) possesses an O-linked oligosaccharide matching that of SLC1G and is followed by
a Leu residue.[31] Other studies have detected
C1INH peptides in human urine but not peptides containing the SLC1G
motif.[32,33] In addition to C1INH, other proteins possessing
the SLC1G glycosylation motif, but that are not annotated as such
in the UniProt database, could also be a source of the glycopeptide.
Specifically, a glycosylated peptide in the C-terminus
of fibrinogen alpha chain isoform 2 with the sequence GKPSLSP contains
the SLC1G motif. This larger glycopeptide has been identified in urine
of individuals with febrile, complicated urinary tract infections.[37] Increased levels of serum fibrinogen and fibrinogen
degradation products also are noted to be associated with active TB.[38−40] Although metabolomics measures the cumulative outcome of metabolic
processes, changes in the level of small peptides will be influenced
by alterations in the production and turnover of transcripts and proteins.
This, along with the potential for multiple proteins giving rise to
the same peptide, elevates the complexity of studies required to determine
the molecular events that influence alterations in the level of specific
peptides. Thus, further studies are required to determine whether
abundance changes in the SLC1G glycopeptide arise from one or more
proteins with this motif or changes in the activity of proteases as
has been described to occur during active TB.[41,42]
Conclusion
Host metabolic responses provide a means
to measure disease severity and patient health. Unfortunately, the
implementation of metabolic biomarkers is impeded by a lack of structural
identification of potential diagnostic or prognostic metabolites identified
via untargeted metabolomics, as well as a scarcity of data associating
host metabolites to pathogen load or inflammation. The elucidation
of a previously uncharacterized seryl-leucine glycopeptide and measurement
of this metabolite in human urine demonstrated that SLC1G levels decreased
in association with a reduction of bacterial load and inflammation
in patients being treated for TB providing evidence for a specific
relationship. The magnitude of change in relative levels of glycopeptide
differed between patients that were cured versus those that failed
TB treatment, and this difference occurred within the first 2 months
of treatment. A limitation of this study was the small number of samples
utilized. Thus, future efforts with larger cohorts are necessary to
validate SLC1G as a prognostic marker of treatment outcome. It is
noted that an absolute quantification of the SLC1G concentration in
urine also was not determined. This is part of ongoing efforts to
establish quantifiable assays to measure the abundance change in SLC1G
and other metabolites during TB treatment. Nevertheless, the data
presented provide a basis for development of host metabolite biomarkers
to track the early treatment response of TB patients and underscore
the value of continuing to structurally elucidate endogenous human
metabolites.
Methods
Study Population
A subset of samples from the DMID 08-0023/TB Trials Consortium NAA2m
study conducted in Uganda[5] were used for
structural elucidation of MF 874.3547. Patient urine for metabolite
quantification was obtained from the TBRU Kawempe Community Health
Study (KCHS) conducted in Uganda (http://www.case.edu/affil/tbru/research_dmid01005.html) and the Catalysis Study (Stellenbosch University Human Research
Ethics Committee approval number N10/01/013) conducted in Cape Town,
South Africa.[23] The KCHS urine samples
represented a subset of two cohorts[22] and
consisted of 10 index cases of TB and 26 HHC that were TST–
(n = 14) and TST+ (n = 12) and were
without HIV comorbidity (Table ). None of the KCHS HHC went on to develop TB. The Catalysis
Study urine samples[23] consisted of 14 HC
and 35 TB patients with different treatment outcomes: 8 failed treatment,
12 recurrences, and 15 cured (Table ). All of the available
failed and recurrent patients were utilized; however, only a subset
of the total cured patient population was utilized to maintain similar
sample numbers in each patient group. A posthoc power analysis was
performed using the mean SLC1G abundance and standard deviation observed
in the HHC samples to ensure adequate power (>80%) for the sample
sizes chosen (http://clincalc.com/stats/Power.aspx). All of the parent studies were reviewed and approved by their
respective institutional review boards or ethics committees in South
Africa, Uganda, and the United States. All participants gave informed
consent for study participation and sample retention for future research
use.
Table 1
Patient Characteristics, Kawempe Community
Household Study (Uganda)
HHC (n = 26)
participant characteristics (n = 36)
TB cases (n = 10)
TST– (n = 14)
TST+ (n = 12)
mean age, years (SD)
35 (11)
24 (16)
28 (10)
male, n (%)
1 (10%)
5 (36%)
6 (50%)
location
Uganda
Uganda
Uganda
mean PPD induration, mm (SD)
14 (6)
2 (3)
16 (1)
NA, not applicable.The
mean number of missed treatments in the intensive phase (first two
months) and continuation phase (two to six months) of treatment were
8 and 35, respectively.
Urine
Processing and Osmolality
Patient urine samples (40 to 45
μL) from the KCHS and Catalysis Study were diluted 1:3 in LC-MS
grade water. To produce quality control samples for LC-MS, aliquots
(5 μL) of individual urine samples from either KCHS or Catalysis
Study were pooled and prepared for LC-MS identical to the individual
samples. Diluted urine (20 μL) osmolality[46] was measured by freezing point depression using a Micro-Osmometer
Model 210 (Advanced Instruments, Norwood, MA). The instrument was
calibrated using both 55 and 850 mOsm standards (Advanced Instruments).
The accuracy of the instrument was verified with a Clintrol 290 standard
(Advanced Instruments).
MF 874.3547 Isolation and Digestion
MF 874.3547 was isolated from human urine (Gemini Bio-Products, West
Sacramento, CA) using the LC conditions previously described.[5] Fractions of 0.25 mL were collected from 18 replicate
LC runs. Fractions within ±1 min of the target retention time
for MF 874.3547 were analyzed by LC-MS for the presence of MF 874.3547.
Fractions containing the MF 874.3547 were dried by SpeedVac and stored
at −20 °C. The stored fractions were suspended in 70 μL
of water before analyses.MF 874.3547 (9 μL) or standards
(60 ng) of 4-nitrophenyl O-(N-acetyl-α-neuraminosyl)-(2-6)-β-d-galactopyranosyl-(1-4)-2-acetamido-2-deoxy-β-d-glucopyranoside (EN4614, Carbosynth Ltd., San Diego, CA), glycan-F58
(ULM-10078-CA, Cambridge Isotope Laboratories, Inc., Tewksbury, MA),
and Sialo Anti-Proliferative Factor (GP131025, Sussex Research Laboratories,
Ottawa, ON) were treated with α2-3 neuraminidase S (PO743S),
α2-3,6,8 neuraminidase (#P0720S, New England Biolabs Inc., Ipswich,
MA, USA), and O-glycosidase (#P0733S, New England
Biolabs Inc.). Specifically, enzymatic digests were performed with
α2-3,6,8 neuraminidase (0.05 units), α2-3 neuraminidase
S (0.008 units), or O-glycosidase (80 units) in 15
μL (final vol) of 1× GlycoBuffer 2 (#B3704, New England
Biolabs Inc.) for 1 h at 37 °C. Reactions were stopped with 70%
methanol (final concentration) and incubated at −80 °C
for 1 h. Precipitated material was removed by centrifugation (18 000g). The methanol extracts were collected, dried by SpeedVac,
and resuspended in 20 μL of water prior to LC-MS and LC-MS/MS
analyses.
LC-MS and LC-MS/MS Analyses
LC was based on a previously
described method using either an Agilent 1200 or 1290 series high-performance
LC system (Agilent Technologies, Palo Alto, CA, USA) with an Atlantis
T3 reverse-phase C18 3.5 μm column (2.1 × 150 mm; Waters
Corp., Milford, MA).[5] Injections (5 μL)
of each diluted urine sample, enriched MF 875.3547, or standard were
made.MS analyses of Catalysis Study samples and MS/MS analyses
for MF 874.3547 and seryl-leucine structural characterization were
performed using an Agilent 6520 quadrupole time-of-flight (Q-TOF)
instrument equipped with an electrospray ionization source operated
in positive ionization mode. The operating conditions were: gas temperature,
300 °C; drying gas, 8 L/min; nebulizer, 45 lb/in2;
capillary voltage, 2000 V; fragmentation energy, 120 V; skimmer, 60
V; octapole RF setting, 750 V. Specified ions were isolated for MS/MS
fragmentation using a mass window of 1.3 Da. Collision induced dissociation
was performed with collision energies of 5 to 40 V. Data were collected
in the profile and centroid modes at a scan rate of 1.2 spectra/s
and a scan range of m/z 100 to 1700
for MS and a scan rate of 2.0 spectra/s and a scan range of m/z 100 to 1700 for MS/MS using the Agilent
MassHunter Data Acquisition software.MS analyses of KCHS samples,
enriched MF 874.3547, and Sialo Anti-Proliferative Factor glycopeptide
were performed using an Agilent 6230 time-of-flight (TOF) instrument
with an electrospray ionization source operated in positive ionization
mode. The operating conditions for the mass spectrometer were: gas
temperature, 300 °C; drying gas, 11 L/min; nebulizer, 40 lb/in2; capillary voltage, 2000 V; fragmentation energy, 120 V;
skimmer, 60 V; octapole RF setting, 750 V.MS analyses of Glycan
F58 and 4-nitrophenyl O-(N-acetyl-α-neuraminosyl)-(2-6)-β-d-galactopyranosyl-(1-4)-2-acetamido-2-deoxy-β-d-glucopyranoside were performed using an Agilent 6224 TOF instrument
with an electrospray ionization source operated in negative ionization
mode. The operating conditions for the mass spectrometer were: gas
temperature, 325 °C; drying gas, 7 L/min; nebulizer, 40 lb/in2; capillary voltage, 3500 V; fragmentation energy, 160 V;
skimmer, 60 V; octapole RF setting, 750 V. Data were collected on
TOF instruments in the profile and centroid modes at a scan rate of
1.0 spectra/s and a scan range of m/z 100 to 1700.All patient samples were randomly assigned to
batches and analyzed by LC-MS in random order to avoid a patient group
bias based on SLC1G stability.
Analyses of MS/MS Spectra
and LC-MS Data
The MS/MS spectra were manually interrogated
against the METLIN database, the Human Metabolome Database (HMDB),
and the NIST/EPA/NIH Mass Spectral Library.[43−45] Theoretical
fragment ions of MF 874.3547 were obtained using an ACD/MS Fragmenter
(Advanced Chemistry Development, Inc., Toronto, ON).The LC-MS
peak area of MF 874.3547 was used as the SLC1G abundance in individual
samples. The LC-MS peak area of MF 874.3547 was normalized to the
osmolality of the corresponding sample as this is described as the
gold standard for estimating urinary concentration.[46] Normalized abundance values were log2 transformed
and used to interrogate significant differences in SLC1G levels between
patient groups using statistical analyses in GraphPad Prism version
6.04 for Windows (GraphPad Software, La Jolla, California, USA). Linear
mixed models were performed using R version 3.4.2[47] with the lme4 package and a random subject effect to account
for repeated measures.[48] Data used for
the linear mixed models were from TB patients of the Catalysis Study
and included combined total glycolytic activity (COM TGAI) from PET-CT
measurements and GeneXpert MTB/RIF Ct values as the response variable
and the normalized log2 SLC1G abundance value from the
same patient as the independent variable. The COM TGAI is an aggregate
value consisting of the TGAI and cavity volume and was calculated
using the formula COM TGAI = [metabolic lesion volume (PET) ×
mean metabolic lesion intensity (PET)] + [cavity volume (CT) ×
mean metabolic lesion intensity (PET)]. PET-CT data are available
through the Catalysis Biomarker Consortium Data Repository (https://codr.c-path.org/main/home.html).
Data Repositories
Whole blood RNaseq data for the Catalysis
Study was obtained from the NCBI GEO database, accession GSE89403.
LC-MS data of SLC1G in patient urine and MS/MS data for SLC1G structural
elucidation have been deposited in Metabolomics Workbench, Study IDs:
ST001104 and ST001069. The SLC1G structure has been deposited in the
PubChem database, Pub Chem CID: 134687025.
Authors: Tiziana Pacchiarotta; Paul J Hensbergen; Manfred Wuhrer; Cees van Nieuwkoop; Ekaterina Nevedomskaya; Rico J E Derks; Bart Schoenmaker; Carolien A M Koeleman; Jaap van Dissel; André M Deelder; Oleg A Mayboroda Journal: J Proteomics Date: 2011-10-31 Impact factor: 4.044
Authors: Stephen T Reece; Christoph Loddenkemper; David J Askew; Ulrike Zedler; Sandra Schommer-Leitner; Maik Stein; Fayaz Ahmad Mir; Anca Dorhoi; Hans-Joachim Mollenkopf; Gary A Silverman; Stefan H E Kaufmann Journal: J Clin Invest Date: 2010-08-02 Impact factor: 14.808
Authors: Liesbeth M Kager; Dana C Blok; Ivar O Lede; Wahid Rahman; Rumana Afroz; Paul Bresser; Jaring S van der Zee; Aniruddha Ghose; Caroline E Visser; Menno D de Jong; Michael W Tanck; Abu Shahed M Zahed; Khan Mashrequl Alam; Mahtabuddin Hassan; Ahmed Hossain; Rene Lutter; Cornelis Van't Veer; Arjen M Dondorp; Joost C M Meijers; Tom van der Poll Journal: J Infect Date: 2014-10-29 Impact factor: 6.072
Authors: Constantin von Zur Mühlen; Thomas Koeck; Eric Schiffer; Christine Sackmann; Petra Zürbig; Ingo Hilgendorf; Jochen Reinöhl; Jennifer Rivera; Andreas Zirlik; Christoph Hehrlein; Harald Mischak; Christoph Bode; Karlheinz Peter Journal: Proteomics Clin Appl Date: 2016-04-04 Impact factor: 3.494
Authors: S C Bock; K Skriver; E Nielsen; H C Thøgersen; B Wiman; V H Donaldson; R L Eddy; J Marrinan; E Radziejewska; R Huber Journal: Biochemistry Date: 1986-07-29 Impact factor: 3.162
Authors: Basim Al Shammari; Takayuki Shiomi; Liku Tezera; Magdalena K Bielecka; Victoria Workman; Tarangini Sathyamoorthy; Francesco Mauri; Suwan N Jayasinghe; Brian D Robertson; Jeanine D'Armiento; Jon S Friedland; Paul T Elkington Journal: J Infect Dis Date: 2015-02-12 Impact factor: 5.226
Authors: Sebabrata Mahapatra; Ann M Hess; John L Johnson; Kathleen D Eisenach; Mary A DeGroote; Phineas Gitta; Moses L Joloba; Gilla Kaplan; Gerhard Walzl; W Henry Boom; John T Belisle Journal: BMC Infect Dis Date: 2014-01-31 Impact factor: 3.090
Authors: Bryna L Fitzgerald; Sebabrata Mahapatra; Delphine K Farmer; Michael R McNeil; Robert A Casero; John T Belisle Journal: ACS Omega Date: 2017-07-25
Authors: Ethan G Thompson; Ying Du; Stephanus T Malherbe; Smitha Shankar; Jackie Braun; Joe Valvo; Katharina Ronacher; Gerard Tromp; David L Tabb; David Alland; Shubhada Shenai; Laura E Via; James Warwick; Alan Aderem; Thomas J Scriba; Jill Winter; Gerhard Walzl; Daniel E Zak Journal: Tuberculosis (Edinb) Date: 2017-08-12 Impact factor: 3.131
Authors: January Weiner; Shreemanta K Parida; Jeroen Maertzdorf; Gillian F Black; Dirk Repsilber; Anna Telaar; Robert P Mohney; Cordelia Arndt-Sullivan; Christian A Ganoza; Kellen C Faé; Gerhard Walzl; Stefan H E Kaufmann Journal: PLoS One Date: 2012-07-23 Impact factor: 3.240
Authors: Stephanus T Malherbe; Ray Y Chen; Patrick Dupont; Ilse Kant; Magdalena Kriel; André G Loxton; Bronwyn Smith; Caroline G G Beltran; Susan van Zyl; Shirely McAnda; Charmaine Abrahams; Elizna Maasdorp; Alex Doruyter; Laura E Via; Clifton E Barry; David Alland; Stephanie Griffith- Richards; Annare Ellman; Thomas Peppard; John Belisle; Gerard Tromp; Katharina Ronacher; James M Warwick; Jill Winter; Gerhard Walzl Journal: EJNMMI Res Date: 2020-02-10 Impact factor: 3.138
Authors: Qianjing Xia; Myung Hee Lee; Kathleen F Walsh; Kathrine McAulay; James M Bean; Daniel W Fitzgerald; Kathryn M Dupnik; Warren D Johnson; Jean W Pape; Kyu Y Rhee; Flonza Isa Journal: JCI Insight Date: 2020-09-17