Thoroughbred horse racing began in the 17th century in England and is currently conducted in
over sixty countries. As a sport that is heavily gambled on, horse racing should be fairly
managed, and controlling doping is one of the primary concerns for horse racing authorities.
In addition, anti-doping in horse racing contributes to the selection of superior
individuals.At present, low molecular weight compounds, such as caffeine and furosemide, are the main
prohibited substances that are routinely checked in doping tests in horses. Mass spectrometry
(MS) directly detects these substances [5, 15] as well as peptides and heavy metals, such as cobalt
[4, 6, 7, 23]. However,
these methods can only detect compounds with known molecular weights and structures;
therefore, unknown designer drugs cannot be detected. In addition, with the rapid development
of medical technologies such as gene and cell therapies, illegal uses of new medical
technologies that are defined as gene and cell doping are becoming a major concern for the
horse racing industry. According to the agreement of the International Federation of
Horseracing Authorities (IFHA), the owner or trainer must maintain full and accurate records
of all such therapies. While direct detection methods based on the polymerase chain reaction
(PCR) amplification of exogenous genes have been developed [20, 21], they require certain genes to be
identified as markers against which primer and probe sequences have to be designed. In short,
mass spectrometry and PCR cannot directly detect doping substances without prior
information.Metabolomic analysis is a method that comprehensively detects and analyses metabolites in the
body; therefore, metabolomics approaches could potentially be used for indirectly detecting
some unknown doping substances. In general, metabolomics targets low molecular weight
biological compounds (<1,000 Da), such as amino acids, nucleotides, lipids, and sugar.
Metabolomics approaches are currently used in several medical [3, 9, 17] and food fields [8, 11, 12]. Several metabolomics
studies have been reported in humans and horses. Hypoxanthine was identified as a candidate
biomarker for detecting salbutamol in humans [22].
Similarly, 1-cyclopentenoylglycine may be a new biomarker of testosterone misuse [16]. In horses, several biomarkers have been identified to
detect the misuse of the erythropoiesis receptor activator [10].However, there are some concerns about using metabolomics approaches for the detection of
doping substances. When applying metabolomics approaches, animals need to be maintained under
controlled conditions because biological components often reflect their physiological states.
In order to identify biomarkers for detecting the use of prohibited substances, samples are
normally obtained under resting conditions. In Japan, samples (urine and/or blood) for testing
are taken within an hour of a race finishing. While one hour may be sufficient for the
racehorse to recover from extensive exercise, biological components metabolized as a result of
this exercise may not fully recover until a complete return to resting/normal conditions.
Because of this, experimentally discovered doping biomarkers cannot be used for routine doping
testing.We hypothesized that several biological components in plasma are quantitatively and
qualitatively different between the resting state and the stage after racing for doping test
(SAD). Therefore, we investigated the metabolic differences between these two states by
comprehensively comparing metabolites using liquid chromatography-high resolution mass
spectrometry (LC-HRMS) to explore metabolites which might or might not be affected by
exercise. LC-HRMS is a widely used technique in metabolomics. We used LC-HRMS method for
humans [2] because metabolites are common between animal
species.
Materials and Methods
Ethics and sample collection
All plasma samples used in this study were provided by the Japan Racing Association
(JRA). The JRA approved the use of all provided samples for this study. In this study,
blood samples were collected from thoroughbreds at SAD and from thoroughbreds under
resting conditions at JRA training centres. Blood samples were centrifuged to isolate the
plasma. Plasma samples were stored at −30°C. From the provided samples, 30 samples for SAD
and resting states were randomly selected for this study.
Sample preparation
Deproteinization was carried out according to the method described by Dunn et
al. [2]. Plasma (100
µl) was diluted with methanol (300 µl), briefly mixed,
and centrifuged at 15,800 × g at room temperature for 15 min. The
supernatant was dried in a centrifugal vacuum evaporator for 18 hr, dissolved in 100
µl ultrapure water, and centrifuged at 15,800 × g at
room temperature for 15 min. Finally, 90 µl of the dissolved sample was
transferred to a vial. A quality control (QC) sample and phosphate buffered saline were
prepared as control and blank samples, respectively, using the same protocol.
LC-HRMS analysis
For chromatographic separation, a Thermo Fisher Scientific Dionex Ultimate 3000 high
performance liquid chromatography system (Thermo Fisher Scientific, Waltham, MA, U.S.A.)
was used. Separation was carried out on an ACQUITY BEH C18 column (100 mm × 2.1 mm, 1.7
µm). The column oven was set at 50°C and a flow rate of 0.3
ml/min was used. Solvent A was composed of 0.1% formic acid in water,
and solvent B was composed of 0.1% formic acid in methanol. Metabolites were eluted using
the following gradient elution method: from 0% B at 1 min to 100% B in 15 min, held for 2
min, returned to the initial conditions in 2 min, and then held for 2 min. The total run
time was 22 min. The injection volume was 5 µl.Detection was performed using a Q Exactive HF Quadrupole-Orbitrap mass spectrometer
(Thermo Fisher Scientific) operated in a polarity switching mode. The ion source
conditions were as follows: sheath gas flow rate, auxiliary gas flow rate, and sweep gas
flow rate=50, 20, and 0 (arbitrary units), respectively; capillary temperature=250°C;
heater temperature=400°C; spray voltage=+3.00 kV (positive ionization mode) and −3.00 kV
(negative ionization mode); lock mass=off. Nitrogen was used as both the source and the
collision gas. Data acquisition was performed at a resolution of 60,000 (FWHM). Each
acquisition cycle included a full scan in positive ionization acquisition mode and a
full-scan in negative ionization acquisition mode. The automatic gain control (AGC) target
was set to 1e6, and the scan range was m/z 100–1,000 for all
acquisition events. For identification of metabolites, MS/MS spectra were obtained by
data-dependent MS/MS (ddMS2) and parallel reaction monitoring (PRM) mode, and
the following parameters were used in ddMS2 and PRM: resolution=15,000;
normalized collision energy=30, 80, or 100; isolation window=2.0 or 1.4; and AGC
target=1e5 or 2e5. LC-HRMS was run once per sample. QC samples
were measured after every five samples in order to normalize peak areas and pick stable
peaks.
Data processing and analysis
Almost all data (excluding one SAD sample) were processed with Compound Discoverer 2.1
(Thermo Fisher Scientific). For aligning retention times, the following parameters were
used: alignment model=adaptive curve; maximum shift [min]=0.2; mass tolerance [ppm]=5. For
detecting unknown compounds, the following parameters were used: mass tolerance [ppm]=5;
intensity tolerance [%]=30, minimum peak intensity=1,000,000. Peaks in which the QC sample
coverage was less than 50% and relative standard deviation of the areas under the peaks
was more than 20% were excluded.The area under each peak was corrected linearly using the area under the peak of the QC
samples to eliminate drift in the signal for each metabolic feature in the samples before
multivariate and statistical analysis. Principal component analysis (PCA) was performed
using normalized areas under the peaks with Compound Discoverer 2.1. To detect
differential metabolites between the resting state and SAD groups, volcano plot analysis
by Student’s t-test was performed with Compound Discoverer 2.1. Adjusted
P-values were calculated using the Benjamini–Hochberg algorithm, while
keeping in mind the false discovery rate. The X-axis represents log2 fold change
(SAD/resting state), and the Y-axis represents −log10 adjusted
P-value.
Peak identification
KEGG numbers for each peak were obtained from the KEGG COMPOUND Database (https://www.genome.jp/kegg/compound/) using in-house software written in
Python. This software sent accurate masses of each peak to the KEGG database and in turn
received KEGG numbers for compounds with exactly the same masses. Mass tolerance was 5
ppm. MS/MS spectra of these metabolites were compared with those in the mzCloud database
or in published papers in order to conclusively identify them.
Metabolite Set Enrichment Analysis (MSEA) and pathway analysis
MSEA was performed with MetaboAnalyst (http://www.metaboanalyst.ca) using
the KEGG number of each peak. Pathway analysis was performed on KEGG metabolic pathways in
Equus Caballus with the user data mapping tool in the KEGG
database.
Results
Peaks detected by MS
One SAD sample was excluded due to poor measurement data. After data processing, 5,010
peaks were detected in horse plasma samples by LC-HRMS. Based on their accurately
determined molecular masses, these peaks were searched in the KEGG metabolite database. As
a result, 1,256 of 5,010 peaks (approximately 25%) were annotated with KEGG numbers (i.e.,
the IDs of the compounds); however, there were some peaks that were annotated with
multiple KEGG numbers. The remaining peaks (approximately 75%) were not annotated with
KEGG numbers because their masses were not contained in the KEGG database.
PCA
PCA was carried out using all detected peaks (5,010). As shown in Fig. 1, 30 resting state and 29 SAD samples were clearly divided into two different
groups. The first principal component (X-axis) divided the samples into the two groups,
and the second principal component (Y-axis) explained the individual variations of samples
in the resting state group.
Fig. 1.
Principal component analysis using all detected peaks. Orange and blue dots
indicate resting state and stage after racing for doping test (SAD) samples,
respectively. The X-axis represents the first principal component, and the Y-axis
represents the second principal component.
Principal component analysis using all detected peaks. Orange and blue dots
indicate resting state and stage after racing for doping test (SAD) samples,
respectively. The X-axis represents the first principal component, and the Y-axis
represents the second principal component.
Volcano plot analysis
Volcano plot analysis was performed for the 1,256 KEGG annotated metabolites (Fig. 2). This analysis revealed that the levels of 247 metabolites (red circles in Fig. 2) were significantly increased in the SAD
group (adjusted P-value <0.05, fold change >2 times), while the
levels of 125 metabolites (blue circles in Fig.
2) were significantly decreased in the SAD group (adjusted
P-value <0.05, fold change >2 times). In particular, 49 of the 247
metabolites in the SAD group (dotted square in Fig.
2) showed a significant increase in levels (adjusted P-value
<1.0 × 10−20, fold change >4 times); except for three metabolites, a
similar increase in metabolite levels was not observed in the resting state group. In
addition, the remaining 884 metabolites were not significantly different between the two
groups.
Fig. 2.
Volcano plot analysis for identifying metabolites differentially detected in horse
plasma between the resting state and stage after racing for doping test (SAD)
groups. Red and blue dots indicate characteristic metabolites in the SAD and resting
state groups (adjusted P-value <0.05, fold change >2 times),
respectively. Grey dots indicate metabolites that are not different between the two
groups. Red dots present within the square represent metabolites that are the most
characteristic of SAD samples (adjusted P-value <1.0 ×
10−20, fold change >4 times).
Volcano plot analysis for identifying metabolites differentially detected in horse
plasma between the resting state and stage after racing for doping test (SAD)
groups. Red and blue dots indicate characteristic metabolites in the SAD and resting
state groups (adjusted P-value <0.05, fold change >2 times),
respectively. Grey dots indicate metabolites that are not different between the two
groups. Red dots present within the square represent metabolites that are the most
characteristic of SAD samples (adjusted P-value <1.0 ×
10−20, fold change >4 times).
MSEA and pathway analysis
MSEA was performed on metabolites whose levels were altered in the SAD group. However,
significantly different pathways (P-value <0.05) between the two
groups were not identified. Seven of the 49 metabolites observed in the SAD group were
intensively mapped to inosine, xanthosine, uric acid, allantoin, hypoxanthine, xanthine,
and deoxyinosine positions in part of the purine metabolism pathway (Fig. 3). Inosine, xanthosine, uric acid, and allantoin were conclusively identified based
on the MS/MS data, which were compared with the mzCloud database or published papers
[1]. However, hypoxanthine and xanthine were not
identified because they were produced by in-source fragmentation of inosine and
xanthosine. Deoxyinosine was also not identified because the MS/MS data were different
from those in mzCloud.
Fig. 3.
KEGG pathway analysis focused on purine metabolism pathway. Black dots indicate
metabolites that are observed in stage after racing for doping test (SAD) group.
Seven metabolites, inosine, xanthosine, uric acid, allantoin, hypoxanthine,
xanthine, and deoxyinosine, were intensively mapped to the purine metabolism
pathway.
KEGG pathway analysis focused on purine metabolism pathway. Black dots indicate
metabolites that are observed in stage after racing for doping test (SAD) group.
Seven metabolites, inosine, xanthosine, uric acid, allantoin, hypoxanthine,
xanthine, and deoxyinosine, were intensively mapped to the purine metabolism
pathway.Areas under the peaks of the four identified metabolites were compared between the SAD
and resting state groups (Fig. 4). The levels of all four metabolites significantly increased in the resting state
group. The fold changes of the peak areas of xanthosine, inosine, uric acid, and allantoin
were 127, 35, 6.4, and 7.6, respectively.
Fig. 4.
Quantitative differences in the metabolites identified in the purine metabolism
pathway between the resting state and stage after racing for doping test (SAD)
groups. A) Xanthosine, B) uric acid, C) inosine, D) allantoin. *Adjusted
P-value <0.05.
Quantitative differences in the metabolites identified in the purine metabolism
pathway between the resting state and stage after racing for doping test (SAD)
groups. A) Xanthosine, B) uric acid, C) inosine, D) allantoin. *Adjusted
P-value <0.05.
Discussion
In this study, 5,010 peaks were detected from horse plasma, and approximately 25% of these
were annotated by KEGG analysis. The remaining peaks (approximately 75%) may not have been
annotated due to fragmented metabolites or due to metabolites having complex structures
[13, 14].
Although there were many peaks that were not annotated by KEGG, PCA of the 5,010 peaks
revealed that all samples were clearly divided into two groups—SAD and resting state. This
suggested that detected peaks, including non-annotated ones, reflect the metabolic
differences between SAD and the resting state.Peak identification was performed using only the accurate mass of each peak; therefore,
some peaks had multiple KEGG numbers. This may be the reason why we could not identify any
pathway that was significantly different between SAD and the resting state.Interestingly, a metabolic increase (adjusted P-value <1.0 ×
10−20, fold change >4 times) was observed in the SAD group. The identified
metabolites were specifically mapped to the purine metabolism pathway. These finding
indicated that a lot of adenosine triphosphate (ATP) was consumed during extensive exercise
[18, 19].
Therefore, these metabolites are not suitable as doping biomarkers but may be used as
biomarkers for extensive exercise.Via the purine pathway, ATP is metabolized to allantoin which is then excreted in urine. In
this study, the fold changes of metabolites (i.e., inosine and xanthosine) that were located
upstream of the purine pathway were much higher when compared with those (i.e., uric acid
and allantoin) that were located downstream. These differences may be due to the degradation
of ATP at several stages.When using metabolomics approaches to identify biomarkers for the detecting the use of
prohibited substances, animals need to be maintained under controlled conditions, usually
resting conditions, to eliminate other factors. As our results have revealed that multiple
metabolites were altered in the SAD group, metabolites that are not altered in the SAD group
should be selected as biomarkers for detecting doping to improve specificity.In addition, the samples from SAD were tightly clustered, while samples from the resting
state were broadly distributed. Metabolites under resting conditions were diverse and
altered with each horse. Therefore, metabolites used as biomarkers should be first
quantified and evaluated using multiple samples from different horses.This was a preliminary study that comprehensively compared metabolites in horse plasma
samples taken under resting conditions and after racing, and its aim was to identify
biomarkers for doping in horses. We expect that metabolomics approaches will contribute to
the detection of prohibited substances which cannot be detected by conventional methods,
such as unexpected genes and unknown designer drugs, ultimately ensuring fairness in horse
racing.
Authors: Warwick B Dunn; David Broadhurst; Paul Begley; Eva Zelena; Sue Francis-McIntyre; Nadine Anderson; Marie Brown; Joshau D Knowles; Antony Halsall; John N Haselden; Andrew W Nicholls; Ian D Wilson; Douglas B Kell; Royston Goodacre Journal: Nat Protoc Date: 2011-06-30 Impact factor: 13.491
Authors: Emmie N M Ho; George H M Chan; Terence S M Wan; Peter Curl; Christopher M Riggs; Michael J Hurley; David Sykes Journal: Drug Test Anal Date: 2014-09-25 Impact factor: 3.345
Authors: Patricia Lopez-Sanchez; R C H de Vos; H H Jonker; R Mumm; R D Hall; L Bialek; R Leenman; K Strassburg; R Vreeken; T Hankemeier; S Schumm; J van Duynhoven Journal: Food Chem Date: 2014-07-24 Impact factor: 7.514
Authors: Anna Halama; Joao M Oliveira; Silvio A Filho; Muhammad Qasim; Iman W Achkar; Sarah Johnson; Karsten Suhre; Tatiana Vinardell Journal: Metabolites Date: 2021-01-31