Literature DB >> 32302343

'In search of lost time': Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model.

Abstract

Injury prevention is critical to the achievement of peak performance in elite sport. For professional tennis players, the topic of injury prevention has gained even greater importance in recent years as multiple of the best male players have been sidelined owing to injury. Identifying potential causative factors of injury is essential for the development of effective prevention strategies, yet such research is hampered by incomplete data, the complexity of injury etiology, and observational study biases. The present study attempts to address these challenges by focusing on competition load and time-loss to competition-a completely observable risk factor and outcome-and using a structural nested mean model (SNMM) to identify the potential causal role of cumulative competition load on the risk of time-loss. Using inverse probability of treatment weights to balance exposure histories with respect to player ability, past injury, and consecutive competition weeks at each time point; the SNMM analysis of 389 professional male players and 55,773 weeks of competition found that total load significantly increases the risk of time-loss (HR = 1.05 per 1,000 games of additional load 95% CI 1.01-1.10) and this effect becomes magnified with age. Standard regression showed a protective effect of load, highlighting the value of more robust causal methods in the study of dynamic exposures and injury in sport and the need for further applications of these methods for understanding how time-loss and injuries of elite athletes might be prevented in the future.

Entities: Chemical Disease Gene Species

Year: 2020 PMID： 32302343 PMCID： PMC7164605 DOI： 10.1371/journal.pone.0231568

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Injury is one of the most significant threats to the longevity of elite athletes and, when injury ends the careers of the industry’s stars prematurely, can pose a significant threat to the business of sports [1]. Continued growth of the sports market has resulted in increasing commercial opportunities and, inevitably, greater physical demands on athletes to play harder, faster, and longer [2]. The growing pressure to stay competitive has made injury prevention a top priority of high-performance sport [3]. The demands of play have reached a crisis point for professional tennis in recent years. At the end of 2018, a spate of injuries at the top of the men’s sport saw multiple of the highest ranked players end their seasons early, several spending more than six months away from the sport [4]. It remains unclear whether these prominent cases are indicative of a broader rise in injury risk, yet several characteristics about the structure of the sport make it vulnerable to systemic fluctuations in its risk profile [5]. Firstly, tennis events are spread throughout the globe and top players are often travelling long distances between events. Secondly, matches themselves have no theoretical limit and the actual duration of a match can change dramatically depending on the match format, surface, and other tournament factors. Thirdly, the calendar of professional events is constantly in flux, with most changes resulting in a more congested season where there are fewer opportunities for recovery [6]. The ‘grind’ [7] that the season schedule imposes is most pronounced for the best players as they are the ones expected to last through the most rounds at each tournament. Etiological models of injury in sport describe a multifactorial process characterized by multiple intrinsic and extrinsic variables [8]. The complexity of the mechanisms of injury makes it clear that no single causative factor can explain all injuries. However, there is consensus among governing bodies of sport that the volume and intensity of physical activity, referred to as ‘load’, is a major risk factor [9]. This consensus is equally prevalent in tennis where load management is a central tenet of injury prevention [10]. Despite general agreement about the importance of load, there is little agreement on how load is best distributed over time to protect against serious injury [11]. Research into the causal relationship between load and injury in tennis faces many challenges. The individualized nature of the sport makes consistent collection of injury events rare. And injuries that may be documented by physicians or athletic trainers at specific events do not follow a standard protocol, making the comparability of injury data between events questionable [5]. Gathering high-quality data about load is also a challenge. One reason for this is the lack of agreement on how ‘load’ is defined. Load can take different meanings depending on the experts who are using it [12]: biomechanists use load to focus on the frequency and force of stress to joints, physiologists use load to refer to the respiratory demands on the cardiovascular system, while sports scientists use load to refer to total accelerations performed. Under any definition, a complete picture of the load an athlete may experience over time is rarely available owing to the difficulties of collecting data during the training periods of top athletes [13, 14]. Even addressing the incompleteness and inconsistency of data on injury and its risk factors would not eliminate the hurdles to studying the causes of injury in tennis. Because of the observational nature of tennis data, any analysis would be vulnerable to multiple biases. Owing to single-elimination tournament designs, for instance, more talented players ‘select’ into higher levels of load but the same players could have better ways to protect against high load (through better movement or recovery practices, for example) such that naive association studies could find load to be associated with fewer injuries. It is well-established that such scenarios generally do not allow the identification of potential causal effects from observational data with standard techniques, like regression analysis [15]. Proper accounting of observational bias is especially difficult in the load-injury setting because of the complexity of load—an exposure of variable intensity that is constantly changing over time and where future exposure may depend on prior exposure [16]. The analytical challenges inherent in modelling a dynamic, continuous exposure is further compounded by the potential for multiple other time-varying factors—player ability, past injury, age, etc.—to moderate or confound the direct effect of load on future injury risk. The present paper takes several steps to address the above challenges. First, we focus on a well-defined subset of experienced load and outcomes that are completely observable for top professional players; namely, competition load and competition time-loss. Competition load, a type of external load, measures the volume and intensity of professional play throughout a player’s entire professional career, allowing the examination of both acute and chronic cumulative effects of load. Time-loss, an extended break from competitive play suggestive of an unintended absence, will be the outcome of focus because, poor health is the primary cause of missed competition at the elite level [17, 18]. Moreover, like injury itself, time-loss is an outcome that top tennis players want to minimize. The second major contribution of this paper is the use of a more principled approach to studying the effects of cumulative load on the risk of time-loss. Specifically, we propose a structural nested mean model (SNMM) to estimate the potential causal effect of load. Like marginal structural models (MSMs), the SNMM when combined with inverse probability of treatment weights, can help to address selection and confounding biases when evaluating the effects of time-varying exposures [19]. SNMM are particularly useful when the primary exposure of interest may be moderated by another time-dependent variable [20]. This is relevant in the present study where age is expected to moderate the risk of a given level of accumulated load. In what follows, we develop an SNMM for the potential causal effect of cumulative load on time-loss risk in the presence of age modification using regression with residuals and an inverse probability of treatment weighting strategy.

Methods and materials

Sample

Competitive results of men’s professional tennis players from 1990 to the present were obtained from the OnCourt database (www.oncourt.info). These data include unique identifiers for the winners and losers of matches, the date of each competition, and the score, which includes the total games and sets played. Player ratings were derived from the first recorded match results in this database using a previously described Elo-based rating system, a statistical algorithm for rating the latent ability of tennis players that accounts for surface and margin of victory [21, 22]. Because the main risk factor of interest in this study was cumulative professional play, it was important to have complete competitive history for players included in the analysis. Results for the lowest-level of competition where money can still be earned professionally, ITF Futures events, are not represented in the database until 2004, only years from 2005 and onward were considered and only for players whose first professional match was 2005 or later. An early step in the data preparation was defining a sample of players who are regular competitors. Here ‘regular’ means, players who, if fit, are expected to play throughout the season. Player schedules are expected to vary considerably with the level of their ability, as this dictates the number of events where they are eligible throughout the year. The regularity in play was explored by grouping player-seasons according to the player’s rating at the start of the season, using rating groups of 100 points over the range of 1900 to 3000, forming 11 total groups (Note that the average rating of a professional player is 1500, while players who compete in the main draw of Grand Slams are usually rated 2000 or higher). Given that official rankings (See https://en.wikipedia.org/wiki/ATP_Rankings) are based on a player’s best 18 tournament results and that no ATP event outside of the World Tour Finals takes place in the months of November and December, it is reasonable to use 3 weeks as a threshold for the upper bound of between-event gaps of a typical top player. Looking over the seasons of the players in the different ratings groups, it was found that only players rated 2300 or higher in the study’s player ratings had the majority of gaps (judged by the 90th percentiles, which include 90% of all observed gap days) less than 3 weeks for at least 9 months of the year (see S1 File). Given this observation, the sample of players were all of those who attained a player rating of 2300 or higher during the observation period. There were 389 players who met this criterion. The basic time unit for the sample was the competition week, as most professional events last one week at most. With a traditional survival analysis, comparing the 25% of players with the highest game load against the rest of the player sample would have a power of 80% to detect a 20% risk increase for competition time loss over the base rate of 3%.

Outcome

The primary outcome of the study was competition time-loss: an extended absence from competition. Definitions for time-loss from competition were individualized to each player using a linear mixed model of the maximum gap during a calendar month (the largest number of consecutive days a player was not competing in a given month) with player random effects and an unstructured covariance-variance [23]. Since a period between competition could extend over one or more months, the period was assigned to the month when the gap commenced and that month alone. From this regression model, we could obtain the expected value for the maximum number of consecutive days a player spends away from competition in a given month. The model was trained on data for players who had 3 or more seasons at a rating of 2300 or more. For players with fewer than 3 seasons at the minimum rating, the expected maximum gap days was set to the average. The above regression model provides a player and month specific estimate of the maximum between-event days (here on called ‘gap days’) under normal conditions. A time-loss event was defined as instances where the actual gap days in a month were 2 weeks longer than expected (4 weeks longer for the month of January, owing to the off-season). This rule was validated against a small sample of former World No. 1 players (n = 3) with well-documented injury histories and was found to identify all of their documented absences from competition due to injury. Full details of the outcome determination and validation are provided as S1 File. Fig 1 shows the 90th percentile range for the gap criteria. The months of March to November show the most consistency, with gaps of 25 to 40 days or fewer being the minimum threshold for an unexpected absence. For February, the range is slightly more, with losses of time greater than 40 days before their first match not being unusual for a subset of players. Inspection of players with longer time-loss in February suggests that this subset are players who did not tend to compete in the Australian events in January and also players who participated in first round Davis Cup events in early February, a non-tour event that, like exhibition events, is not included in the competition calendar of players in this study. January and December stand out clearly as periods following a player’s elected off-season, lasting no more than 60 to 100 days for the typical professional player.

Fig 1

The 90th percentile range for ‘gap days’ for each month, which depicts the range containing 90% of the longest periods outside of competition that were observed for each month.

Baseline for monitoring competition time-loss began after a player accumulated 8,000 cumulative games played. Thus all players entered the risk pool with an equal career load.

Censoring and competing risk

Because a player is only observed when they compete, the last observation is either the last observation in the study observation window, the last observation before an absence in play, or the last professional match. Retirement, a permanent break from professional play, is a competing risk for temporary absences from competition. Of the 1901 player seasons in the sample, there were 84 instances where a player’s last observed professional match was at least one year before the end of the study observation (March 2019). In 36 instances, the final recorded match was in 2018. A review of players whose last observed matches were in 2016 to 2018 revealed 20 players who were not officially retired and 2 who were serving an extended ban. The analysis considered the 20 cases to be instances of an unintended absence, while the remaining 64 cases were treated as censored observations.

Exposure and moderator

In the absence of confounding, the basic effect modification relationship can be depicted with the directed acyclic graph (DAG) shown in Fig 2 [24]. The target outcome of interest Y, which has direct causal effects from X. The factor M moderates the process between X and Y, as denoted by an edge directed to the edge between X and Y. Changes in X are not expected to change the level of the moderator M and M does not have direct influence on Y in the absence of X, but the level of M influences the effect of X on the outcome [25, 26].

Fig 2

Point treatment directed acyclic graph for exposure with effect modification.

In the context of competition time lost in professional tennis, the primary causal factor is load (Fig 3). The term ‘load’ has taken various definitions in the sport injury literature [9]. The present work load will be used to refer to the intensity of competition, a type of external load, which will be measured by the games played during a match.

Fig 3

Point treatment directed acyclic graph for causal effect of load on time-loss with age effect modification.

The effect of load is presumed to be moderated by an athlete’s age, as an equal level of load may become more damaging owing to increase physiological susceptibility associated with age. A more accurate model for the load-age relationship on competition time-loss accounts for the fact that both factors are changing over time and can be influenced by measured and unmeasured confounders. Let be the potential outcome of a time-loss from play given age and game load history after the tth week of professional competition. The model below is a time-dependent DAG describing the exposure-moderator history in the presence of confounders, assuming that all edges capture the factors that have a causal influence on Y. Fig 4 shows the exposure having direct effects on the target outcome, the moderator having an influence on these effects as well as the values of the primary exposure. The exposure is not presumed to have any direct effects on the moderator. However, both the exposure and mediator could be influenced by observed confounders and unmeasured confounders , which could both have direct effects on the future risk of time-loss from competition.

Fig 4

Directed acyclic graph of presumed causal model for time-varying exposure with effect modification and time-varying measured and unmeasured confounders.

Marginal structural model

Our primary interest is in the role that accumulated load has on the risk of time-loss from competition. A player’s historical load and age could influence their risk of time-loss in a variety of ways. This paper will consider the role of simple cumulative load. Under this model, the potential outcome of the hazard of a time-loss at time t, if load were set to history and age were set to the aging history , has the following log-linear dose-response relationship where is the total game load through to the tth competition week and A is the calendar age at the tth competition week. There are many possible ways a player could get to the tth week with total load and age A, but the MSM says that all the relevant information for the present hazard is captured by the total load and current age. Standard MSMs cannot be used to model the effect modification of time-varying covariates [15]. The reason for this is two-fold. First, when a time-varying moderator is correlated with previous levels of treatment, adjusting for the moderator can lead to a biased estimate of the direct and interactive effects of treatment. Second, bias of the direct and interactive effects of treatment could also arise when conditioning on the time-varying moderator owing to unknown causes of the moderator, what is sometimes called a ‘collider bias’ [27]. An approach for dealing with these sources of potential bias is the structural nested mean model (SNMM) [20]. The SNMM specifies the moderated time-varying causal effects of interest in a conditional mean model for a continuous response given time-varying treatments and candidate moderators. The specific form of the model in the present case can be derived by decomposing Eq (1) into its conditional components. Suppose that the actual level of load received by time t is g* and the counterfactual age that would be reached by this load is A(g*). The potential hazard can be expressed as, a sum of the direct and interactive effects of changing g* to g when age is fixed and the direct effect of changing A(g*) to A(g) when load is fixed. Both of these components are conditional means that, for simplicity, we can assign to the functions μ(.) and ϵ(.), The function μ(A(g), g*) captures the causal effect of changing load at time t for a player of a fixed age. The function ϵ(A(g*), g) represents the causal and non-causal relationship between the moderator and response, and is considered a ‘nuisance function’ in the SNMM framework. In relation to the dose-response model in Eq (1), we model the conditional mean function μ as a linear function of the cumulative load, which states that, conditional on a player’s age, a unit increase in load has the same change on the log-hazard no matter the time t or the load prior to t. The nuisance function ϵ has to have the property that E[ϵ(A(g), g)] is zero with respect to the random variable A(g). We can guarantee this property by specifying the following residualized form of ϵ. Namely, which says that the conditional mean of age is linearly related to the cumulative load g, such that a unit increase in load has the same expected association with age no matter the specific history it took to get to load g. This conditional expectation is subtracted from the realization of A(g), giving the residual between the observed and expected age, conditional on all prior history of load. Combining the above, we get the complete log-linear SNMM, Eq (6) looks much like a standard log-linear model with interactions. However, the SNMM is based on a conditional mean of the supposed moderator. It also lacks direct adjustment for time-dependent confounders. Imbalance in these factors are instead handled through the use of inverse probability of treatment weights. The ‘treatment’, a general term the causal inference literature uses to refer to the main explanatory variable of interest, in this case is the cumulative competition load. Weighting observations by the inverse probability of the observed dose of treatment received is well known to be a more effective strategy for protecting against confounder bias than regression [28].

Estimation

The estimation of the SNMM begins by preparing the outcomes and covariates of the observed data. Baseline for the player sample began when all players reached 8,000 cumulative games. From baseline, every competition week was collected and summarized until a player’s last professional event or the end of data collection (March 2019). For each week, we computed the total game load, player age in years, player rating, whether the player competed in the previous week, and whether they had a time-loss of more than 180 days at any time in their past 30 events played. The outcome of time-loss was determined at the end of each competition week according to the player-specific criteria described above. The first step of the SNMM estimation is the derivation of inverse-probability of treatment weights (IPTW). The purpose of these weights is to create a pseudo population that is balanced with respect to confounding variables, like player ability or competitive play, for all time t. Let g be the observed game load for the ith subject in the tth competition week. In the absence of censoring, the stabilized weights are given by, Here, f(.) is a likelihood function for some parametric family appropriate for a continuous treatment variable. The denominator’s role is to identify individuals who would be unlikely to have received a given level of load, g, given their covariate history, and to upweight them accordingly. This is how the weights function to balance the sample with respect to time-varying confounders. The numerator does not include any confounding factors and its purpose is solely to provide stability to the overall weights by providing a number on the scale of the denominator. Let C(t) be the indicator of a player who is censored in the tth competition, because of administrative censoring or retirement. Censoring weights take a similar form as in Eq 8, with the probability of having made it to the tth competition week without having been censored given load and moderator history, in the numerator, and additional confounder history, in the denominator. The final weight assigned at time t is w = sw × cw. The treatment denominator weights were obtained from the following linear regression model, where x1 is the indicator of back-to-back competition weeks, x2 is the indicator of a 180 day time loss in the past 30 competition weeks, and x3 is a player’s rating. The function s(.) is a smoothing cubic spline. The treatment numerator weights were similarly obtained using, Estimates for the expected means given in Eqs (9) and (10) were fitted using a GEE model with player as clusters and independent covariance structure between players. Let . The denominator is then calculated as , where ϕ(.) is the density function for the standard normal and is the residual standard deviation of the fitted model. Numerator estimates are obtained with the same methodology but without the conditioning on time-dependent covariates. Given numerator expectation, , with dispersion , the stable weight is calculated as, The identical right-hand side models in Eqs (9) and (10) were used for the models of the censoring weights. However, these were fit in a logistic regression with the outcome being the binary indicator of censoring at the end of the tth competition. The stability of the weights were evaluated graphically by plotting box-plots against the weeks from baseline. Balance was evaluated by calculating the population standard bias (PSB) at each time t for all time-varying covariates. PSB ≤ 0.25 was set as the criteria for good balance. A weighted generalized linear model was used to fit the conditional mean model of the moderator at time t given weights . The age residuals were obtained and served as covariates for the model to estimate the SNMM outcome model. For this model, a weighted pooled logistic regression was used [16]. Effects of load and its moderation by age were illustrated by comparing the estimated hazard ratio at the lower and upper 25th percentiles of cumulative load observed in the sample for players of age 25, 27 and 29. The upper 25th percentile was approximately 3,000 games more for each of these age groups and this was the increased in load used for these comparisons. For all hazard ratios shown, the reference player was a 25 year-old with a total competition load of 10,000 games. Because the conventional standard errors of the logistic model fail to account for the estimation of the residuals and stabilized weights, bootstrap standard errors and confidence intervals were obtained by repeating the weight, residual, and outcome model estimation for 1,000 bootstrap resamples. SNMM estimates were compared to the standard association models using an unweighted pooled logistic regression of load and age effect modification with and without adjustment for other measured time-varying covariates in this study. An additional analysis, included the same time-varying covariates in the SNMM outcome model, a so-called ‘doubly robust estimate’ of the potential causal effect of load [29]. Summaries of the absolute risk from the SNMM model were also estimated for a range of ages and game loads, using the baseline rate for players of 25 years and a 10,000 game load. To understand the potential reduction in absolute risk for top players with some of the highest cumulative loads on tour, the absolute risk reduction for a decrease of 1,000 and 2,000 games played from actual games played was calculated for several prominent players. All data analysis and modeling was performed in the R statistical programming language [30].

Identifiability

The ability to identify the causal effects of load using the framework presented in this study rests on several assumptions. The first concerns the correct specification of the dose-response model. A central assumption to the simple cumulative load model is the premise that load and aging history is independent of the actual sequence load was received with age conditional on the current total load and age. In mathematical terms, Related to the above, is the consistency assumption [31]. This states that a player who has the same treatment and aging history must have the same potential outcome, The next assumption concerns the positivity of treatment. In the case of the cumulative load, it requires that all possible treatment levels can be observed for any at-risk person at a given time over all time points. Finally, in the setting of time-varying treatments with observational data, we have to have sequential ignorability of treatment to be able to identify causal effects [19]. This states that, the particular treatment received at time t must be ignorbale given the measured confounders, pre-treatment moderator history, and pre-treatment treatment history. Because sequential ignorability can be violated if any unmeasured confounders are present, it is an assumption that is not verifiable.

Results

The data sample included 55,773 competition weeks for 389 professional male tennis players (Table 1). Three percent of the weeks in the study sample were followed by a time-loss. Cumulative game loads had an average of 13,615 games and a maximum of 39,303 across the sampled competition weeks. By design, players had a mean rating over 2300 and the majority of players were rated between 2100 and 2500 in any given week. One of every three competitive events were played on the back of another competition, indicative of the congestion of the tennis calendar. Only a small fraction of players were competing when having had a more than 6 month break from competition in the past 30 events played.

Table 1

Summary of overall sample characteristics.

Unless otherwise noted, the summary is the mean (standard deviation) for all competition weeks.

Characteristic	Value
Competition Weeks, n	55,773
Players, n	389
Time-Loss %	3 (17)
Game load	13,615 (4,208)
Age	28 (3.24)
Competed in previous week %	33 (47)
Player rating	2325 (199)
30-event ≥180 day time-loss %	3 (17)

Summary of overall sample characteristics.

Unless otherwise noted, the summary is the mean (standard deviation) for all competition weeks. Stratifying the sample by the quintiles of cumulative load shows how the time-dependent covariates vary with increasing games played. As expected, age increases steadily with increasing load with most players being age 29 years or older by the time they have accumulated 15,000 games played (Table 2). Positive trends with load were also observed for the 30-event 180 day or more time-loss, which had its highest observed rates of 5% for the two highest quintiles of load, and player rating, which had a greater average with each higher strata of load. No pronounced trend with back-to-back competition was observed.

Table 2

Summary of outcome and covariates (mean, SD) by quintiles of cumulative game load.

Load Quintile	Load	Time-Loss	Age	Competed in Previous Week	Elo Rating	30-event Time-Loss
Q1	8,851 (499)	3 (16)	24.8 (1.77)	34 (47)	2232 (176)	0 (0)
Q2	10,673 (558)	2 (16)	26.1 (1.85)	34 (47)	2266 (173)	2 (15)
Q3	12,821 (689)	2 (15)	27.5 (1.96)	34 (47)	2318 (175)	3 (16)
Q4	15,526 (889)	3 (17)	29.3 (2.12)	33 (47)	2358 (180)	5 (23)
Q5	20,205 (2,767)	4 (19)	32.0 (2.39)	31 (46)	2452 (213)	5 (22)

The combined stabilized weights show good stability over the competition weeks (Fig 5). On the log-scale, the average weight across the time periods was 1.07 and the interquartile-range was an average of 0.5, a moderate amount of variation over the competition weeks. More instability was observable in the latest weeks, where the sample size was smallest. By the 350th competition week, the average weight dropped to 0.2.

Fig 5

Log-scale of combined IPTW and censor weights by competition week.

In terms of balance, the player rating showed the greatest potential for confounding bias (Fig 6). The unweighted PSB for the player rating had an average of 0.33, a maximum of 1.15 and exceeded the threshold of 0.25 in 63% of the competition weeks. With weighting, the PSB for the rating reduced to an average of 0.08 and never exceeded 1 in any competition week. For 20% of weeks, the PSB was greater than 0.25 even with weighting. However, this rate was just 4% for the first 200 competition weeks. All other covariates were well-balanced across all competition weeks.

Fig 6

Population standard bias for time-varying covariates by competition week with and without weighting.

The size of points is scaled to reflect the number of observations at each time point.

Population standard bias for time-varying covariates by competition week with and without weighting.

The size of points is scaled to reflect the number of observations at each time point. Unweighted regression analysis without adjustment for time-varying covariates showed a protective effect with increasing game load (Table 3). Counterintuitively, the estimates from this regression suggested that an increase of 1,000 accumulated games was associated with a decrease of 3% risk in time-loss for a player of the same age. Adjusting for time-dependent covariates removed any statistically significant association, supporting the conclusion that, for players of the same age and skill level, game load has no causal influence on risk of time-loss.

Table 3

Hazard ratio (95% CI) change with load and the effect modification of age for the observed lower and upper 25th percentiles of load for each age.

Age	Game Load	Unweighted Unadjusted	Unweighted Adjusted^a	SNMM Unadjusted	SNMM Adjusted^a
Same Age	1,000 more	0.97 (0.94-0.99)	1.00 (0.97-1.03)	1.04 (1.00-1.09)	1.05 (1.01-1.10)
25	10,000	1.00 (Ref.)	1.00 (Ref.)	1.00 (Ref.)	1.00 (Ref.)
25	13,000	0.90 (0.83 -0.97)	0.99 (0.90 -1.08)	1.14 (1.00-1.30)	1.17 (1.02-1.35)
27	11,000	0.97 (0.94 -1.00)	1.00 (0.96 -1.03)	1.06 (1.02 -1.09)	1.06 (1.03-1.10)
27	14,000	0.89 (0.79-1.01)	1.00 (0.86-1.15)	1.24 (1.10-1.43)	1.28 (1.12-1.47)
29	14,000	0.92 (0.79 -1.07)	1.01 (0.86 -1.19)	1.31 (1.20 -1.43)	1.33 (1.20-1.47)
29	17,000	0.87 (0.66-1.12)	1.02 (0.76-1.36)	1.61 (1.37-1.88)	1.65 (1.38-1.95)

a, Adjusted for player rating, back-to-back competition, and 30-event 180-day time-loss or longer

a, Adjusted for player rating, back-to-back competition, and 30-event 180-day time-loss or longer In contrast to the standard regression analysis, the SNMM found a significant positive relationship between increased load and risk of time-loss (Table 3). The direct effect of an increase of 1,000 games, for example, is estimated to increase the risk of time-loss by 4% (95% CI 0-9%) in the unadjusted and 5% (95% CI 1-10%) for the doubly-robust analysis. Comparisons between the lowest 25th and highest 25th percentiles of empirical load observed at ages 25, 27 and 29 showed even starker effects. Based on the doubly-robust estimates, the hazard ratios of players in the top 25% of load were consistently greater than those of players of the same age but with the lowest 25% of experienced load. At age 25, the hazard ratio of 1.17 (95% CI 1.02-1.35) corresponds to a risk increase of 17%; at age 27, the hazard ratios of 1.28 (95% CI 1.12-1.47) and 1.06 (95% CI 1.03-1.10) correspond to an increase of 21%; and at age 29, the hazard ratios of 1.65 (95% CI 1.38-1.95) and 1.33 (95% CI 1.20-1.47) correspond to an increase of 24%. Though each of these comparisons correspond to a fixed increase of 3,000 games, we see that the risk associated with that same change in load is increasing, indicating the positive effect modification due to age. Confidence bands for the effect of load over a broader age range show the results of the SNMM in greater detail (Fig 7). The grey regions highlight the 90th interquartile range of observed load for the specific age group in each panel. In the observed load range for ages 25 to 28 years old, the absolute risk ranged from 2 to 4%. For ages 29 to 32, the absolute risk increased to 6 to 8% for players in the highest load range. Over the age of 32, even players in the lowest observed range had an absolute risk over 4%. Among the highest loads, the risk of time-loss was as high as 18% by age 35.

Fig 7

Absolute risk time-loss 90% confidence bands associated with changing load and the modification effect of age according to the SNMM.

The shaded grey regions show the empirical 90th percentile range of actual game load observed for each age group.

Absolute risk time-loss 90% confidence bands associated with changing load and the modification effect of age according to the SNMM.

The shaded grey regions show the empirical 90th percentile range of actual game load observed for each age group. Since 2016, several of the most successful male players in tennis have simultaneously suffered long breaks from their usual competition schedule owing to injury. Andy Murray is one among these and is especially notable for having announced his intended retirement at the beginning of 2019. The nature of the sport means that players who consistently perform well tend to accumulate greater load at a similar age. In Fig 8 we see the estimated risk of time-loss for three of the most successful contemporaries—Rafael Nadal, Novak Djokovic, and Andy Murray—given their observed cumulative load at ages 23 to 30 years old. By age 30, Murray had accumulated 24,000 games, Nadal 26,000 games, and Djokovic 27,000 games; levels of load that put them each in the top 10% of load acquired by age 30. At those levels, each of these players would have an expected risk of time-loss for 1 in every 25 weeks of competition. Reductions of 1,000 games would be expected to provide a negligible change in that risk, while 5,000 games fewer of load would be expected to have a more appreciable reduction in the risk of time loss, decreasing the risk to 1 in every 30 weeks of competition.

Fig 8

Absolute risk (± SD) of time-loss for actual age-specific total games played and counterfactual reductions of 1,000 and 5,000 games for Andy Murray, Rafael Nadal and Novak Djokovic.

Discussion

Despite general agreement about the etiological importance of load for injuries in sport, few studies have examined the longitudinal effects of load [9]. To our knowledge, the present study is the first to investigate the risk of competition load for professional tennis players [32]. Our analysis of tens of thousands of competition weeks over the complete professional careers of top male tennis players found significant increases in the risk of time-loss from competition with greater total competition load. We also demonstrated that the risk for the same increase in load increased with a player’s biological age, indicating that the harmful effects of load are magnified for older players compared to younger players. One of the strengths of the present study was its use of a structural nested mean model to identify the potential causal role of load in the presence of moderation by age. The SNMM is a recently developed technique that provides a more principled way to address observational study biases compared to standard regression techniques, the mainstay of epidemiological studies of injury in sport [32]. The SNMM is particularly necessary when the exposure and moderators of interest are varying in time, as is likely the case for any study of load and injury, owing to the dynamic nature of load and other putative factors involved in the mechanisms of injury. The main reason for this is that past exposure could influence levels of future exposure, moderators, and confounders. Careful balancing and conditioning at each time point of the analysis is required to remove the bias induced by the interplay of these factors over time [16]. Indeed, the present study found that standard methods, which do not protect against such biases, would lead to the conclusion that cumulative competition load was protective against absences from competition. One contributor to this discrepancy is age, which both theory and prior empirical evidence suggests is likely to modify the effect of load [33]. When player age is adjusted for with standard regression, it cuts off the effect of load because it is a correlate of future levels of load and a moderator of load’s effect. Another contributor is the confounding of player rating, as greater ability predicts higher levels of future load but is also an independent predictor of a lower risk of time-loss. The counterintuitive protective effect of load has been frequently reported in studies of load and injury in sport [6]. The fact that we, in the context of time-loss in tennis, observe a reversal in effect when using causal inference methods raises the possibility that conclusions about the protective effects of load have been grounded in questionable methodology that has not adequately addressed effect modifiers or confounders. A recent critique of methods behind the literature on acute-chronic load ratios in the management of load, reinforces the dangers of observational study biases when researching the effects of dynamic exposures in sport [34]. Today’s top men’s tennis players compete in an average of 25 events per year and play an average of 50 games per event. Given current playing schedules, a reduction of 1,000 games of load is approximately equal to skipping an entire season of competition. Since the present study found only modest reductions in risk with 1,000 fewer games played, practically meaningful reductions in risk would appear to require early-career implementation of long-term load management strategies. Although reductions in load could be aided by structural changes in the sport, via a reduced calendar or shortened match format, for example, the commercial drives to see the best players playing longer and more often would suggest that immediate reductions in load will rest on the ability of players to sacrifice play opportunities and adopt more selective schedules. Although the present study provides some broad guidance for load management in tennis, many questions remain to be addressed in order to develop more practically useful, individualized strategies for specific players. How effects of competition load vary by gender, playing surface, upper-body versus lower-body, or player anthropometry are all relevant questions for future research. Because all of these could be regarded as types of stratification analysis, the SNMM framework we have presented could be readily applied to address these questions. It would also be possible to explore the severity of time-loss by considering the number of days between competitive play as the primary outcome of analysis or to link outcomes directly to injuries, as more consistent injury documentation becomes available. Other major remaining questions would be difficult to address without resolving the limitations of present data and methodology. Our approach has presumed that cumulative external load is the primary mechanism by which load effects risk of injury. While total absolute load may be a reasonable measure of the cumulative stress on tissue, other factors about how this load is distributed over time may be important for further elucidation of the effects of load. The variability in load over time and the level of acute load relative to long-term load have both been stressed as important factors to the risk of tissue damage in athletes [35, 36]. Study into the casual effects of varying short- and long-term temporal patterns of load will require extending current causal methods for time-varying exposures and effect modifications that allow for greater flexibility in the exposure mechanism. In focusing on load information that could be consistently observed for a large group of players over time, we have used game load as a proxy for the intensity of biomechanical stress player’s are exposed to in a given match. Owing to differences in player movement and playing strategy, games played could mask significant differences in the duration and severity of experienced biomechanical stress. Of even greater concern than the coarseness of available measures of competition load is the scarcity of training load. Without information about training load, the present study’s ability to identify the potential causal effects of competition load rests on the assumption that competition load is the primary causative factor of time-loss and that, after accounting for observed covariates, the level of competition load is ignorable regardless of past training load. These are both strong assumptions that require further research to verify. Although the incompleteness of training load data is unlikely to allow for large-scale study, smaller scale investigations may be possible and still informative. Indeed, for epidemiological work of tennis injuries to have a meaningful impact on clinical practice, combining more principled statistical methods with a more complete picture of player load in training and competition will be a crucial next step.

Conclusions

This study provides valuable new evidence about the potential causal role of cumulative competition load and time-loss events in professional tennis. Both the use of causal methods that are appropriate for dynamic dose-response mechanisms and the application of these methods to complete competition histories of hundreds of players makes this study’s evidence for the harmful effects of load some of the strongest yet reported. We hope that our findings will highlight the need for casual methods in observational studies of injury in sport and will spark continued development and application of these techniques to further understand the causative role of load and guide future load management strategies.

A document with a discussion of the incentives for competing in professional tennis, more detail on the definitions of a regular schedule and time-loss from competition, and some validatory analysis of the time-loss definition.

(PDF) Click here for additional data file. 14 Jan 2020 PONE-D-19-16725 `In Search of Lost Time': Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model PLOS ONE Dear Dr. Kovalchik, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Along with the revised version, we ask that you provide point-to-point responses to each of the comments raised by the reviewers. Please explain what revisions have been made within the manuscript to address each of the points raised. If a comment is not addressed, please justify accordingly in your response to the reviewer's comment. In addition to the reviewers' comments, I would like to ask that any claims of causation are relaxed. You can state that you are studying the potential causes of some injury, but be careful not to state that your work identifies causes of injury. Further, Fig 4 presents a DAG and states that it is a causal model. The correct way to express this is to say that the DAG models the variables of interest under the assumption that the arcs represent causal influence; this is an important distinction (even though the DAG appears to follow some sort of a termporal order of events, rather than causation). Though I`m unsure why you would like to state this; a DAG under causal assumptions is generally used for simulating interventions, something which is not present in this study and hence, no point in making this assumption. Moreover, Figures 2 and 3 state that they present a DAG, but those graphs are not DAGs (i.e., nodes M and Age are parents of which node?). We would appreciate receiving your revised manuscript by Feb 28 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Anthony C Constantinou Academic Editor PLOS ONE Journal Requirements: When submitting your revision, we need you to address these additional requirements. 1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at http://www.plosone.org/attachments/PLOSOne_formatting_sample_main_body.pdf and http://www.plosone.org/attachments/PLOSOne_formatting_sample_title_authors_affiliations.pdf 2. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ 3. Thank you for stating the following in the Competing Interests section: "The authors has declared that no competing interests exist." We note that one or more of the authors are employed by a commercial company: Game Insight Group, Tennis Australia. a. Please provide an amended Funding Statement declaring this commercial affiliation, as well as a statement regarding the Role of Funders in your study. If the funding organization did not play a role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript and only provided financial support in the form of authors' salaries and/or research materials, please review your statements relating to the author contributions, and ensure you have specifically and accurately indicated the role(s) that these authors had in your study. You can update author roles in the Author Contributions section of the online submission form. Please also include the following statement within your amended Funding Statement. “The funder provided support in the form of salaries for authors [insert relevant initials], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section.” If your commercial affiliation did play a role in your study, please state and explain this role within your updated Funding Statement. b. Please also provide an updated Competing Interests Statement declaring this commercial affiliation along with any other relevant declarations relating to employment, consultancy, patents, products in development, or marketed products, etc. Within your Competing Interests Statement, please confirm that this commercial affiliation does not alter your adherence to all PLOS ONE policies on sharing data and materials by including the following statement: "This does not alter our adherence to PLOS ONE policies on sharing data and materials.” (as detailed online in our guide for authors http://journals.plos.org/plosone/s/competing-interests) . If this adherence statement is not accurate and there are restrictions on sharing of data and/or materials, please state these. Please note that we cannot proceed with consideration of your article until this information has been declared. c. Please include both an updated Funding Statement and Competing Interests Statement in your cover letter. We will change the online submission form on your behalf. Please know it is PLOS ONE policy for corresponding authors to declare, on behalf of all authors, all potential competing interests for the purposes of transparency. PLOS defines a competing interest as anything that interferes with, or could reasonably be perceived as interfering with, the full and objective presentation, peer review, editorial decision-making, or publication of research or non-research articles submitted to one of the journals. Competing interests can be financial or non-financial, professional, or personal. Competing interests can arise in relationship to an organization or another person. Please follow this link to our website for more details on competing interests: http://journals.plos.org/plosone/s/competing-interests 4. Please ensure that you refer to Figures 2-4 in your text as, if accepted, production will need this reference to link the reader to the figure. [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes ********** 2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes ********** 3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes ********** 4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes ********** 5. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Specific questions, suggestions, and comments referenced to manuscript line number(s): 3: How exactly does injury threaten the sustainability of the sports industry? 12: The possessive pronoun “its” does not contain an apostrophe. 18: The pronoun “this” apparently refers to the “congested season” mentioned in the previous sentence. If this is the case, the “congested season” may “impose” stresses (the grind) on the best players, but the “congested season” would not “incur” them. 22: The phrase “the injury mechanism” implies that there is only one mechanism. 25: A better explanation of the “load” concept is needed. Most experts in the area would agree that “mechanical load” represents a combination of force (mass X acceleration) imposed on body tissues and the frequency of exposure within a specific time period. Others may define “load” in terms of physiological demand on the cardiorespiratory system in relation to some measure of exposure duration and/or frequency. The term “player load” is often used to refer to measurements derived from wearable technology that quantifies instantaneous changes in whole-body inertia over a defined amount of time. 31: The term “trainers” lacks specificity. Are you referring to coaches who guide strengthening and conditioning activities or “athletic trainers” who are charged with injury prevention and treatment? 66: Define “treatment weights” and explain how they address selection and confounding biases. What is meant by “treatment” and how are “weights” applied. 79-80: What is an “Elo-based” rating system? No reference is cited. 92-93: No information has been provided for the reader to have any understanding of the 1900 to 3000 range of points. Does rating groups of 100 points mean that there were 11 groups? 93-95: Does “a player rating of 2300 or higher” mean that the number of groups was reduced from 11 to 6? Explain why less than or equal to 3 weeks was chosen as a standard. The reference to “absences by month” (line 93) and the phrase “for at least 9 months of the year” (line 95) makes this content extremely hard to understand. Please be more explicit in explaining the basis for your operational definition of time loss. 98: The word “criterion” refers to a single standard. The word “criteria” is plural. 104: Please explain what is meant by “maximum gap” during a calendar month. Is this an alternate term for “absence from competition” mentioned in the first line sentence of the paragraph? 104-110: Does the reference to “linear mixed model” mean that you used a linear regression equation to estimate absence from competition for each player on a monthly basis? Line 108 refers to “a gap that was 2 weeks longer than expected,” but the Fig 1 legend refers to “maximum gap (in days)” between competitive events. This content does not clearly convey your definition of time loss from competition. 114-115: How can “gaps” defined as “25 to 40 days or fewer” represent a threshold for months that have only 30 or 31 days each? Does this mean the number of days of absence prior to the first competition during a given month? Surely, there is a way to explain your procedure in a manner that is more clearly understandable. A clearer distinction needs to be made between “elected” absence from competition and “unintended” absence attributed to injury. Figure 1 needs better explanation: Does 90th Percentile Range mean the range of 90th Percentile values for days of absence among the 389 players? 142: The referenced Fig is not designated by number (Fig 2?). 147: The referenced Fig is not designated by number (Fig 3?). 147-149: This clarification of the meaning of “load” should appear earlier in the manuscript (see the previous comment referenced to line 25). 158: The abbreviation “DAG” (directed acyclic graph?) is not defined in the text. 160: The referenced Fig is not designated by number (Fig 4?). 176: The abbreviation “MSM” (marginal structural model?) is not defined in the text. 180-182: The meaning of the phrase “previous levels of treatment” is not clear (see the previous comment referenced to line 66). Most readers are likely to interpret the word “treatment” as having something to do with therapeutic interventions following an injury. 185-216: I certainly do not possess the requisite level of knowledge about advanced mathematical modeling to appraise the quality of the content pertaining to the SNMM method. 297-309: I can follow the reasoning for the model specification, but I remain confused about the meaning of the term “treatment” in this section of the text. 330-332: This portion of the text refers to “competition week” (as well as the Fig 5 legend), but “Competition Age” is the label on the x-axis of the Fig. The latter term has not been introduced anywhere in the manuscript text. The y-axis “log(w)” label apparently refers to the log of inverse-probability of combined treatment weights and censor weights (lines 227-228 and the Fig 5 legend). Inconsistency in the use of terms and labels further confuses reader interpretation of the graph’s meaning. 333-335: Again, the term “competition weeks” appears in the text, but “Competition Age” is the label on the x-axis of the Fig. 353-354: The content in lines 350-351 connects the term “doubly-robust analysis” with the hazard ratio reported in the “SNMM Adjusted” column of Table 3 (5% increased in risk; 1.05). The “doubly robust estimates” of increased risk of time-loss for ages 25, 27, and 29 for an increase in game load of 1000 or more reported in line 354 apparently do not have corresponding hazard ratio values in Table 3, which complicates the reader’s understanding of the correspondence between information presented in the text with that presented in the Table. 401-409: This portion of the text provides the clearest explanation of the connection between the risk modeling methods and its results. After reading it, I finally figured out that “treatment weights” related to “player ability” and “past injury” as time-varying covariates. I strongly recommend making this connection much more explicit throughout the manuscript. 409-412: I suggest that content be added to the end of this sentence: “…questionable methodology that has not adequately addressed effect modifiers or confounders.” Reviewer #2: Is the manuscript technically sound, and do the data support the conclusions? Yes, but again – some of the statistics are beyond my understanding so very key to have a biostatistician review the paper to ensure this aspect. PLOS Data policy: I believe they have used only commercially available data from the tennis world – they have quoted the site – they have used several player names, again with only public data and also mention of an injury that is very public so it does not appear that any GDPR or HIPAA violations on the use of their data would be applicable. Line 78 – please provide greater information for the reader on ‘player ratings’ – this can easily be confused with straight player rankings – ie number 1, 2, 4, in the world etc….. the ratings are important to the paper and many will not understand how this is calculated and how it is applicable. Would add this early in the manuscript. Line 100 – Except for Grand Slams (4) and Indian Wells and Miami – which are essentially 10 day events and often could appear to have a gap in player competition days with early loss in IW, followed by no events to compete in until the next tournament. Also players with lower rankings often have very few competition opportunities during the month of March / if their ranking does not allow access to IW or Miami….. Line 106 – so no direct injury illness reports were accessed, just player competition data….. you did a good job later in the paper stating this could be a limitation and that access to the player injury data could provide additional insight beyond what you have reported…. This is very good and true. Line 110 – you mention a small sample to test this – was it like 8 players, or 90 players ? would be good to list the number so the reader knows – will add credibility to the data sample here. Line 149 – Games…. Excellent – several prior epidemiological studies have found number of games to more closely represent player volume / load etc, compared to sets or matches which can be very misleading. Just as an aside, did you also look at points played ? or any other volume metrics, this would add additional information to the paper if you did study this but did not report it. Line 152 – great point – Kibler et al, 1996 showed decreases in shoulder IR and total rotation ROM based on years of player and numbers of tournaments. This would parallel the statements you find and are reporting here that there is an effect of cumulative loading and age and that this does ultimately affect injury risk and time loss. Line 274 - consider rewording sentence here ? Line 418 – good point about limiting games, but likely as you state, 1000 or 5000 not practical due to exposure needed for ranking and success in the sport. Line 454 – the the ? Lines 450 – 460 – good discussion. For sure you bring up that training load in this study only represents competition load. There is limited ability to measure training load…. Which in many ways can be more repetitive and lead to injury away from competition with year round play inherent in the sport. With the advent of wearable technology, it may become more common to measure this parameter for researchers in the future, but at this time unlike other team sports with dedicated and consistent medical teams who measure this (training or practice) we may not have this aspect known in tennis for some time. As a general rule, if you can increase the clinical application aspect of the paper, it would strengthen it for many readers of the journal. Several take a way points, what are the bottom lines from your amazing work ? ********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Gary B. Wilkerson Reviewer #2: Yes: Todd Ellenbecker [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. 27 Jan 2020 PONE-D-19-16725 `In Search of Lost Time': Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model’ I am thankful to the Editor and the two Referees for their constructive feedback. I am pleased that there was general agreement about the merits of the work and suggestions on areas that the manuscript could be improved. I am thankful for the opportunity to respond to those suggestions in a revision. Below is a point-by-point response to all of the feedback that was received. I believe the changes made in response to these suggestions have strengthened the paper, and I hope resolved any remaining concerns. Please note that in what follows the original comments are proceeded with ‘COMMENT’. Editor Comments COMMENT. In addition to the reviewers' comments, I would like to ask that any claims of causation are relaxed. You can state that you are studying the potential causes of some injury, but be careful not to state that your work identifies causes of injury.] RESPONSE. The Editor makes an excellent point that any study based on observational evidence can attempt to estimate potential causes of outcomes. I’ve accordingly reviewed all of the instances were ‘causal’ claims are made in the paper and, where appropriate, replaced with ‘potential causes’. COMMENT. Further, Fig 4 presents a DAG and states that it is a causal model. The correct way to express this is to say that the DAG models the variables of interest under the assumption that the arcs represent causal influence; this is an important distinction (even though the DAG appears to follow some sort of a termporal order of events, rather than causation). RESPONSE. The Editor is correct that more precision was needed in the terminology around the DAG. I’ve added text in the description of Figure 4 in the main text, “...assuming that all edges capture the factors that have a causal influence on Y” and in the Figure caption as well (using ‘presumed causal model’). COMMENT. Though I`m unsure why you would like to state this; a DAG under causal assumptions is generally used for simulating interventions, something which is not present in this study and hence, no point in making this assumption. Moreover, Figures 2 and 3 state that they present a DAG, but those graphs are not DAGs (i.e., nodes M and Age are parents of which node?). RESPONSE. In this case, the arrow between ‘M’ and the directed edge between X and Y in Figure 2 is denoting modification on the causal process between X and Y. This is distinct from direct or indirect modification. I believe this is an established way for a DAG to denote this type of effect modification (see for example, Weinberg, C. R. (2007). Can DAGs clarify effect modification?. Epidemiology (Cambridge, Mass.), 18(5), 569.). But I recognize that it may not be familiar to all readers so I have added text to explain what the “edge-to-edge” represents. Reviewer 1 Comments: COMMENT. L3: How exactly does injury threaten the sustainability of the sports industry? RESPONSE. After considering the Reviewer’s question, it was clear that ‘sustainability’ was not the appropriate word choice. I have revised the sentence to instead point to the potential economic costs to the sports industry when top athletes are unable to play due to injury with this rephrasing: ‘Injury is one of the most significant threats to the longevity of elite athletes and, when injury ends the careers of the industry's stars prematurely, can pose a significant threat to the business of sports’ (L2-3). COMMENT. L12: The possessive pronoun “its” does not contain an apostrophe. RESPONSE. The Reviewer is correct that there were three instances in the paper were the possessive pronoun incorrectly included an apostrophe. These have all been corrected in the revision. COMMENT. L18. The pronoun “this” apparently refers to the “congested season” mentioned in the previous sentence. If this is the case, the “congested season” may “impose” stresses (the grind) on the best players, but the “congested season” would not “incur” them. RESPONSE. I thank the Reviewer for pointing out the need for rephrasing here. I have replace ‘this incurs’ with ‘season schedule imposes’ to be more clear. COMMENT. L22. The phrase “the injury mechanism” implies that there is only one mechanism. RESPONSE. I agree that this implies a single mechanism, which oversimplifies the causes of injury. I’ve replaced all instances of ‘injury mechanism’ with ‘mechanisms of injury’. COMMENT. L25: A better explanation of the “load” concept is needed. Most experts in the area would agree that “mechanical load” represents a combination of force (mass X acceleration) imposed on body tissues and the frequency of exposure within a specific time period. Others may define “load” in terms of physiological demand on the cardiorespiratory system in relation to some measure of exposure duration and/or frequency. The term “player load” is often used to refer to measurements derived from wearable technology that quantifies instantaneous changes in whole-body inertia over a defined amount of time. RESPONSE. This is an excellent point. The wording in the paper implied a single definition of load which is far from the case. I’ve now added the following text to clarify that there are many definitions of load, which is one of the challenges to this area of research: ‘Gathering high-quality data about load is also a challenge. One reason for this is the lack of agreement on how `load` is defined. Load can take different meanings depending on the experts who are using it [10]: biomechanists use load to focus on the frequency and force of stress to joints, physiologists use load to refer to the respiratory demands on the cardiovascular system, while sports scientists use load to refer to total accelerations performed. Under any definition, a complete picture of the load an athlete may experience over time is rarely available owing to the difficulties of collecting data during the training periods of top athletes [11,12].’ (L34-42). COMMENT. L31. The term “trainers” lacks specificity. Are you referring to coaches who guide strengthening and conditioning activities or “athletic trainers” who are charged with injury prevention and treatment? RESPONSE. ‘Athletic trainers’ was the group referred to and this is what is used in the revision. COMMENT. L66: Define “treatment weights” and explain how they address selection and confounding biases. What is meant by “treatment” and how are “weights” applied. RESPONSE. The term “inverse probability of treatment weights” is a well-established technical term in causal inference. But I understand that some definition in the paper would help make this more approachable for a more general audience. I have added this in the Methods section with the lines: ‘...Imbalance in these factors are instead handled through the use of inverse probability of treatment weights. The `treatment', a general term the causal inference literature uses to refer to the main explanatory variable of interest, in this case is the cumulative competition load. Weighting observations by the inverse probability of the observed dose of treatment received is well known to be a more effective strategy for protecting against confounder bias than regression’. (L237-243) COMMENT. L79-80: What is an “Elo-based” rating system? No reference is cited. RESPONSE. I am grateful to the Reviewer for pointing out this oversight. This paper was still in press at the time of review. I’ve now added the paper’s citation as well as a reference to Arpad Elo’s original text on the system. I’ve also noted that the system is a ‘statistical algorithm for rating the latent ability of tennis players’ for further clarification of the basic goal of the system and how it differs from official rankings. COMMENT. L92-93: No information has been provided for the reader to have any understanding of the 1900 to 3000 range of points. Does rating groups of 100 points mean that there were 11 groups? RESPONSE. The Reviewer is correct, there were 11 groups used in this descriptive analysis. I have also added a footnote to explain that a 1900 to 3000 would be a range in ratings that would capture players who are competing in Grand Slams, and what many in the sport would consider a minimal criteria for a ‘top player’. COMMENT. L93-95: Does “a player rating of 2300 or higher” mean that the number of groups was reduced from 11 to 6? Explain why less than or equal to 3 weeks was chosen as a standard. The reference to “absences by month” (line 93) and the phrase “for at least 9 months of the year” (line 95) makes this content extremely hard to understand. Please be more explicit in explaining the basis for your operational definition of time loss. RESPONSE. The Reviewer is correct that 2300 was a threshold for selecting the sample, though the players were not grouped any further in the actual study analysis. Those groups were only used in the exploration of a time-loss definition as it was anticipated that tournament entry would vary over the range of the ratings distribution. We agree that more justification for the time-loss definition was needed. The following text was added to the description of the reasons for the rule we applied: ‘Given that official rankings are based on a player's best 18 tournament results and that no ATP event outside of the World Tour Finals takes place in the months of November and December, it is reasonable to use 3 weeks as a threshold for the upper bound of between-event gaps of a typical top player’. (L101-105) COMMENT. L98: The word “criterion” refers to a single standard. The word “criteria” is plural. RESPONSE. I thank the Reviewer for catching this. The use of ‘criteria’ has been replaced with ‘criterion’. COMMENT. L104: Please explain what is meant by “maximum gap” during a calendar month. Is this an alternate term for “absence from competition” mentioned in the first line sentence of the paragraph? RESPONSE. For further clarification, I’ve added the following parenthetical statement after the introduction of ‘maximum gap’: ‘the largest number of consecutive days a player was not competing in a given month’. (L117) COMMENT. L104-110: Does the reference to “linear mixed model” mean that you used a linear regression equation to estimate absence from competition for each player on a monthly basis? Line 108 refers to “a gap that was 2 weeks longer than expected,” but the Fig 1 legend refers to “maximum gap (in days)” between competitive events. This content does not clearly convey your definition of time loss from competition. RESPONSE. The Reviewer is correct that a regression model was used as part of the definition of a time-loss event. The model was used to estimate what we would expect the max consecutive days between events (“gap days”) to be when a player is healthy and playing regularly. We then look for instances when the number of gap days in a month is 2 weeks longer or more than what the model would suggest is normal. I hope this is made more clear by the addition of the following explanation in the ‘Outcome’ section (L116-133): “Definitions for time-loss from competition were individualized to each player using a linear mixed model of the maximum gap during a calendar month (the largest number of consecutive days a player was not competing in a given month) with player random effects. From this regression model, we could obtain the expected value for the maximum number of consecutive days a player spends away from competition in a given month. The model was trained on data for players who had 3 or more seasons at a rating of 2300 or more. For players with fewer than 3 seasons at the minimum rating, the expected maximum gap days was set to the average. The above regression model provides a player and month specific estimate of the maximum between-event days (here on called `gap days') under normal conditions. A time-loss event was defined as instances where the actual gap days in a month were 2 weeks longer than expected (4 weeks longer for the month of January, owing to the off-season).” For the Figure 1 caption, it now reads: “The 90th percentile range for ‘gap days’ for each month, which depicts the range containing 90% of the longest periods outside of competition that were observed for each month.” COMMENT. L114-115: How can “gaps” defined as “25 to 40 days or fewer” represent a threshold for months that have only 30 or 31 days each? Does this mean the number of days of absence prior to the first competition during a given month? Surely, there is a way to explain your procedure in a manner that is more clearly understandable. RESPONSE. There did need to be an explanation of how we handled cases where the time between competition extended beyond one month. This is now addressed with the following addition to the revision: “Since a period between competition could extend over one or more months, the period was assigned to the month when the gap commenced and that month alone.” (L119-121) COMMENT. A clearer distinction needs to be made between “elected” absence from competition and “unintended” absence attributed to injury. RESPONSE. The Reviewer makes an important point. We cannot know a player’s true intentions, so we have removed the word “unintended” and instead defined time-loss as an “extended break suggestive of an unintended absence” (L64-65). COMMENT. Figure 1 needs better explanation: Does 90th Percentile Range mean the range of 90th Percentile values for days of absence among the 389 players? RESPONSE. The Reviewer’s interpretation is exactly right. To make this clearer, the caption of Figure 1 has been rephrased as follows: “The 90th percentile range for `gap days' for each month, which depicts the range containing 90% of the longest periods outside of competition that were observed for each month.” COMMENT. L142: The referenced Fig is not designated by number (Fig 2?). RESPONSE. The reference was an issue with the placement of the figure labels in LaTeX. I apologize for not having noticed this before the original submission. This has been corrected in the revision. COMMENT. L147: The referenced Fig is not designated by number (Fig 3?). RESPONSE. See comment on Fig 2 reference above. COMMENT. L147-149: This clarification of the meaning of “load” should appear earlier in the manuscript (see the previous comment referenced to line 25). RESPONSE. I completely agree. Please see my response to the comment on line 25. COMMENT. L158: The abbreviation “DAG” (directed acyclic graph?) is not defined in the text. RESPONSE. The abbreviation is now introduced in the first line of section ‘Exposure and moderator’. COMMENT. L160: The referenced Fig is not designated by number (Fig 4?). RESPONSE. See comment on Fig 2 reference above. COMMENT. L176: The abbreviation “MSM” (marginal structural model?) is not defined in the text. RESPONSE. The abbreviation ‘MSM’ is now introduced in the last paragraph of the Introduction. COMMENT. L180-182: The meaning of the phrase “previous levels of treatment” is not clear (see the previous comment referenced to line 66). Most readers are likely to interpret the word “treatment” as having something to do with therapeutic interventions following an injury. RESPONSE. “Treatment” is used as a general reference to the explanatory variable of interest in the causal inference literature. I know this might look strange to readers unfamiliar with this researchers but, on the other hand, it would be completely normal and clear for researchers who are. To try to strike a balance, I’ve added an explanation of this particular use of “treatment” in causal modelling at lines 239-241. COMMENT. L297-309: I can follow the reasoning for the model specification, but I remain confused about the meaning of the term “treatment” in this section of the text. RESPONSE. Please see the response to the previous comment. COMMENT. L330-332: This portion of the text refers to “competition week” (as well as the Fig 5 legend), but “Competition Age” is the label on the x-axis of the Fig. The latter term has not been introduced anywhere in the manuscript text. RESPONSE. I agree with the Reviewer that it is important to have consistency in terminology. The revision has now replaced ‘competition age’ in Figures 5 and 6 with ‘competition weeks’. COMMENT. The y-axis “log(w)” label apparently refers to the log of inverse-probability of combined treatment weights and censor weights (lines 227-228 and the Fig 5 legend). Inconsistency in the use of terms and labels further confuses reader interpretation of the graph’s meaning. RESPONSE. The y-axis title in Figure 5 has been changed to “log(combined weight)” to be consistent with the text and caption. COMMENT. L333-335: Again, the term “competition weeks” appears in the text, but “Competition Age” is the label on the x-axis of the Fig. RESPONSE. Please see the comment in response to the note on 330-332. COMMENT. L353-354: The content in lines 350-351 connects the term “doubly-robust analysis” with the hazard ratio reported in the “SNMM Adjusted” column of Table 3 (5% increased in risk; 1.05). The “doubly robust estimates” of increased risk of time-loss for ages 25, 27, and 29 for an increase in game load of 1000 or more reported in line 354 apparently do not have corresponding hazard ratio values in Table 3, which complicates the reader’s understanding of the correspondence between information presented in the text with that presented in the Table. RESPONSE. I agree with the Reviewer that the description needed to be clearer here, especially in relating the summary in the text to the numbers in Table 3. The following edits have been made to that description in order to improve its clarity: “Comparisons between the lowest 25th and highest 25th percentiles of empirical load observed at ages 25, 27 and 29 showed even starker effects. Based on the doubly-robust estimates, the hazard ratios of players in the top 25% of load were consistently greater than those of players of the same age but with the lowest 25% of experienced load. At age 25, the hazard ratio of 1.17 (95% CI 1.02-1.35) corresponds to a risk increase of 17%; at age 27, the hazard ratios of 1.28 (95% CI 1.12-1.47) and 1.06 (95% CI 1.03-1.10) correspond to an increase of 21%; and at age 29, the hazard ratios of 1.65 (95% CI 1.38-1.95) and 1.33 (95% CI 1.20-1.47) correspond to an increase of 24%. Though each of these comparisons correspond to a fixed increase of 3,000 games, we see that the risk associated with that same change in load is increasing, indicating the positive effect modification due to age.” (L381-391) COMMENT. L401-409: This portion of the text provides the clearest explanation of the connection between the risk modeling methods and its results. After reading it, I finally figured out that “treatment weights” related to “player ability” and “past injury” as time-varying covariates. I strongly recommend making this connection much more explicit throughout the manuscript. RESPONSE. To help link the ‘treatment weights’ to these covariates, we’ve revised the following sentence that appears early in the ‘Estimation’ section: “The purpose of these weights is to create a pseudo population that is balanced with respect to confounding variables, like player ability or competitive play, for all time t.” (L256-258) COMMENT. L409-412: I suggest that content be added to the end of this sentence: “…questionable methodology that has not adequately addressed effect modifiers or confounders.” RESPONSE. Thank you for this suggestion. The additional text has been added to this statement in the revised ‘Discussion’. Reviewer 2 Comments: COMMENT. Line 78 – please provide greater information for the reader on ‘player ratings’ – this can easily be confused with straight player rankings – ie number 1, 2, 4, in the world etc….. the ratings are important to the paper and many will not understand how this is calculated and how it is applicable. Would add this early in the manuscript. RESPONSE. The Reviewer is correct that a better explanation of the player ratings was needed. The following has been added to this section to explain that the ratings are a ‘statistical algorithm for rating the latent ability of tennis players’. In addition, two references have been added that describe the Elo system and the specific version we are using in the paper. I hope this clarifies the basic goal of the system and how it differs from official rankings. COMMENT. Line 100 – Except for Grand Slams (4) and Indian Wells and Miami – which are essentially 10 day events and often could appear to have a gap in player competition days with early loss in IW, followed by no events to compete in until the next tournament. Also players with lower rankings often have very few competition opportunities during the month of March / if their ranking does not allow access to IW or Miami….. RESPONSE. These are all excellent points. In fact, we do observe some of the top players who have some of their longest breaks (in the absence of injury) in February. These player- and month-specific effects are exactly why we needed to use a model approach with these exact variables in order to identify potential time-loss events for any individual player, as, for example, a 4-week gap going into March might be entirely normal for some players but unusual for others. COMMENT. Line 106 – so no direct injury illness reports were accessed, just player competition data….. you did a good job later in the paper stating this could be a limitation and that access to the player injury data could provide additional insight beyond what you have reported…. This is very good and true. RESPONSE. I am pleased that this crucial point came across. Yes, we have focused on when a player is and isn’t competing, which has the advantage that it can be completely observed for all professional players throughout their career. But the Reviewer rightly points out that, by not using direct injury data, we are at best only getting indirectly at a subset of injury events. COMMENT. Line 110 – you mention a small sample to test this – was it like 8 players, or 90 players ? would be good to list the number so the reader knows – will add credibility to the data sample here. RESPONSE. The Reviewer makes an excellent point. We have added the sample size of this validation, which was for three players, to the revision. Unfortunately, the injury history on public sources, like Wikipedia, are only well documented for a handful of players, otherwise studying injury in tennis would be much more straightforward! COMMENT. Line 149 – Games…. Excellent – several prior epidemiological studies have found number of games to more closely represent player volume / load etc, compared to sets or matches which can be very misleading. Just as an aside, did you also look at points played ? or any other volume metrics, this would add additional information to the paper if you did study this but did not report it. RESPONSE. This is a great question. While match statistics, like service points played or total points played, have quite a long history for Grand Slams, they are not as well documented for other events. Game and Set scores, however, have been recorded for all professional matches for decades. So we focused exclusively on total games. As statistical documentation for matches continues to improve over time, I suspect in a few years we may have at least a decade of more detailed match statistics for all professional matches and the question of these alternative measures of load could be investigated. COMMENT. Line 152 – great point – Kibler et al, 1996 showed decreases in shoulder IR and total rotation ROM based on years of player and numbers of tournaments. This would parallel the statements you find and are reporting here that there is an effect of cumulative loading and age and that this does ultimately affect injury risk and time loss. RESPONSE. I am grateful to the Reviewer for bringing this study to my attention. I have now added this reference in the Discussion as additional support for the effect modification of age. “One contributor to this discrepancy is age, which both theory and prior empirical evidence suggests is likely to modify the effect of load (Kibler et al. 1996)”. COMMENT. Line 274 - consider rewording sentence here ? RESPONSE. The Reviewer is correct that this sentence was not clearly worded. The following has been used in its place in the revision: “For all hazard ratios shown, the reference player was a 25 year-old with a total competition load of 10,000 games.” COMMENT. Line 418 – good point about limiting games, but likely as you state, 1000 or 5000 not practical due to exposure needed for ranking and success in the sport. RESPONSE. Yes, one implication of these findings is that, without the structures of tennis making an effort to change the current scheduling demands, the practical barriers to making meaningful reductions in competition load are considerable. I am glad that these comes across in the paper’s Discussion. COMMENT. Line 454 – the the ? RESPONSE. Thank you for pointing out this type. This has been corrected in the revision. COMMENT. Lines 450 – 460 – good discussion. For sure you bring up that training load in this study only represents competition load. There is limited ability to measure training load…. Which in many ways can be more repetitive and lead to injury away from competition with year round play inherent in the sport. With the advent of wearable technology, it may become more common to measure this parameter for researchers in the future, but at this time unlike other team sports with dedicated and consistent medical teams who measure this (training or practice) we may not have this aspect known in tennis for some time. As a general rule, if you can increase the clinical application aspect of the paper, it would strengthen it for many readers of the journal. Several take a way points, what are the bottom lines from your amazing work ? RESPONSE. I agree with the Reviewer that the study would be strengthened if there was a more direct link to clinical applications. However, given the preliminary nature of this study, the first true causal inference approach to tennis competition loss, and the lack of training data in the measure of load, I would caution against drawing too many implications for clinical decision-making at this stage. But I do see this as an important step towards such work and I hope the paper better points to this with the addition of the following statement in the ‘Discussion’: “Indeed, for epidemiological work of tennis injuries to have a meaningful impact on clinical practice, combining more principled statistical methods with a more complete picture of player load in training and competition will be a crucial next step.” (L495-L598). 18 Mar 2020 PONE-D-19-16725R1 `In Search of Lost Time': Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model PLOS ONE Dear Dr. Kovalchik, Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. As you will see, the reviewers are generally happy with the revised version. Reviewer #3 has identified a few more minor revisions, which need to be addressed before the manuscript is accepted for publication. We would appreciate receiving your revised manuscript by May 02 2020 11:59PM. When you are ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). This letter should be uploaded as separate file and labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. This file should be uploaded as separate file and labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. This file should be uploaded as separate file and labeled 'Manuscript'. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. We look forward to receiving your revised manuscript. Kind regards, Anthony C Constantinou Academic Editor PLOS ONE [Note: HTML markup is below. Please do not edit.] Reviewers' comments: Reviewer's Responses to Questions Comments to the Author 1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation. Reviewer #1: All comments have been addressed Reviewer #2: All comments have been addressed Reviewer #3: (No Response) ********** 2. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 3. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 4. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 5. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes ********** 6. Review Comments to the Author Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters) Reviewer #1: Responses to the specific review comments were very thorough and the related manuscript revisions have been done well. Reviewer #2: Thank you for the opportunity to re-review this fine work. The authors have provided a thoughtful and effective response to all queries imposed after the initial review process in my opinion. I do not have any additional comments or requirements for this paper prior to acceptance for publication. I feel this paper will be an excellent addition to the literature in this area for tennis. The authors should be congratulated for a fine work as well as the other reviewer who was able to review and comment on very technical statistical aspects of the paper which provided exceptional rigor to this review process. Reviewer #3: The study sought to identify potential causative factors associated with injury in male tennis players (n=389) by focusing on competition load and time loss to competition using a structural nested mean model. The total load significantly increased the risk of time-loss with a hazard rate of 1.05 per 1,000 games. Minor revisions: 1- Line 117: State the covariance structure used in the linear mixed model and the criteria for selecting it. 2- Line 131: Indicate how the small sample of three players was identified. 3- State and justify the study’s target sample size with a pre-study statistical power calculation. The power calculation should include: sample size, alpha level (indicating one or two-sided), minimal detectable difference and statistical testing method. 4- Cite the statistical software used for the analysis. ********** 7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files. If you choose “no”, your identity will remain anonymous but your review may still be made public. Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy. Reviewer #1: Yes: Gary B. Wilkerson, EdD, ATC Reviewer #2: Yes: Todd S. Ellenbecker, DPT, MS, SCS, OCS, CSCS Reviewer #3: No [NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files to be viewed.] While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org. Please note that Supporting Information files do not need this step. 20 Mar 2020 I want to thank the Referees for their latest set of comments. I recognize that there were some minor revisions that were still needed. Below I have detailed each suggestion and how I've responded to it in the latest version (responses given in []). I hope these have addressed all remaining concerns. 1- Line 117: State the covariance structure used in the linear mixed model and the criteria for selecting it. [I’ve now added that an unstructured covariance-variance structure was used, which is the default for the lme4 package in R and has the advantage that it makes no assumptions about the within-player covariance-variance.] 2- Line 131: Indicate how the small sample of three players was identified. [The sample was chosen by identifying former World No. 1 players who have been competitive within the past decade. Within this group, the Wikipedia pages were reviewed and the subset with detailed notes on their play activity in each season were selected for the validation sample. ] 3- State and justify the study’s target sample size with a pre-study statistical power calculation. The power calculation should include: sample size, alpha level (indicating one or two-sided), minimal detectable difference and statistical testing method. [The goal of the study was to have a census of recent top players who we could consider as “regular” tour players. Thus, this was the primary goal in deciding on the exclusion criteria and therefore sample size for the study. However, we agree with the reviewer that even if power was not the primary driver for the study sample, it is still important to consider when interpreting the study results. If we were to use a traditional survival analysis to compare the 25% of players with the greatest game load to the rest of the sample, we estimate we would have 80% power to detect a 20% increase or more in the risk of competition time loss compared to a base rate of 3%. This analysis has been added to the data description in the revision (L114-116).] 4- Cite the statistical software used for the analysis. [All the analysis was conducted in the R programming language, which I have indicated in the revision along with a citation for the language (L 323-324).] 27 Mar 2020 `In Search of Lost Time': Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model PONE-D-19-16725R2 Dear Dr. Kovalchik, We are pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it complies with all outstanding technical requirements. Within one week, you will receive an e-mail containing information on the amendments required prior to publication. When all required modifications have been addressed, you will receive a formal acceptance letter and your manuscript will proceed to our production department and be scheduled for publication. Shortly after the formal acceptance letter is sent, an invoice for payment will follow. To ensure an efficient production and billing process, please log into Editorial Manager at https://www.editorialmanager.com/pone/, click the "Update My Information" link at the top of the page, and update your user information. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org. If your institution or institutions have a press office, please notify them about your upcoming paper to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, you must inform our press team as soon as possible and no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org. With kind regards, Anthony C Constantinou Academic Editor PLOS ONE Additional Editor Comments (optional): Reviewers' comments: 31 Mar 2020 PONE-D-19-16725R2 ‘In Search of Lost Time’: Identifying the causative role of cumulative competition load and competition time-loss in professional tennis using a structural nested mean model Dear Dr. Kovalchik: I am pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If they will be preparing press materials for this manuscript, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org. For any other questions or concerns, please email plosone@plos.org. Thank you for submitting your work to PLOS ONE. With kind regards, PLOS ONE Editorial Office Staff on behalf of Dr. Anthony C Constantinou Academic Editor PLOS ONE

24 in total

1. Quantifying biases in causal models: classical confounding vs collider-stratification bias.

Authors: Sander Greenland
Journal: Epidemiology Date: 2003-05 Impact factor: 4.822

2. Matching methods for causal inference: A review and a look forward.

Authors: Elizabeth A Stuart
Journal: Stat Sci Date: 2010-02-01 Impact factor: 2.901

3. Doubly robust estimation in missing data and causal inference models.

Authors: Heejung Bang; James M Robins
Journal: Biometrics Date: 2005-12 Impact factor: 2.571

4. A dynamic model of etiology in sport injury: the recursive nature of risk and causation.

Authors: Willem H Meeuwisse; Hugh Tyreman; Brent Hagel; Carolyn Emery
Journal: Clin J Sport Med Date: 2007-05 Impact factor: 3.638

5. Can DAGs clarify effect modification?

Authors: Clarice R Weinberg
Journal: Epidemiology Date: 2007-09 Impact factor: 4.822

Review 6. Debunking the myths about training load, injury and performance: empirical evidence, hot topics and recommendations for practitioners.

Authors: Tim J Gabbett
Journal: Br J Sports Med Date: 2018-10-26 Impact factor: 13.800

7. How much is too much? (Part 1) International Olympic Committee consensus statement on load in sport and risk of injury.

Authors: Torbjørn Soligard; Martin Schwellnus; Juan-Manuel Alonso; Roald Bahr; Ben Clarsen; H Paul Dijkstra; Tim Gabbett; Michael Gleeson; Martin Hägglund; Mark R Hutchinson; Christa Janse van Rensburg; Karim M Khan; Romain Meeusen; John W Orchard; Babette M Pluim; Martin Raftery; Richard Budgett; Lars Engebretsen
Journal: Br J Sports Med Date: 2016-09 Impact factor: 13.800

8. Structural nested mean models for assessing time-varying effect moderation.

Authors: Daniel Almirall; Thomas Ten Have; Susan A Murphy
Journal: Biometrics Date: 2009-04-13 Impact factor: 2.571

Review 9. The Relationship Between Training Load and Injury, Illness and Soreness: A Systematic and Literature Review.

Authors: Michael K Drew; Caroline F Finch
Journal: Sports Med Date: 2016-06 Impact factor: 11.136

10. Collecting Health and Exposure Data in Australian Olympic Combat Sports: Feasibility Study Utilizing an Electronic System.

Authors: Sally Bromley; Michael Drew; Scott Talpey; Andrew McIntosh; Caroline Finch
Journal: JMIR Hum Factors Date: 2018-10-09