| Literature DB >> 31270636 |
João Gustavo Claudino1,2, Daniel de Oliveira Capanema3, Thiago Vieira de Souza4, Julio Cerca Serrão5, Adriano C Machado Pereira6, George P Nassis7,8.
Abstract
BACKGROUND: The application of artificial intelligence (AI) opens an interesting perspective for predicting injury risk and performance in team sports. A better understanding of the techniques of AI employed and of the sports that are using AI is clearly warranted. The purpose of this study is to identify which AI approaches have been applied to investigate sport performance and injury risk and to find out which AI techniques each sport has been using.Entities:
Keywords: Analytics; Artificial neural networks; Basketball; Handball; Innovation; Machine learning; Soccer; SportsTech; Technology; Volleyball
Year: 2019 PMID: 31270636 PMCID: PMC6609928 DOI: 10.1186/s40798-019-0202-3
Source DB: PubMed Journal: Sports Med Open ISSN: 2198-9761
Risk of bias assessment criteria
| Criteria | Definition | Scoring | |||
|---|---|---|---|---|---|
| 0 | 1 | 2 | |||
| A | Peer-reviewed | Study published in peer-reviewed journal | No | Yes | – |
| B | Real-world approach | The approach was performed with real results/data of the athletes | No | Yes | – |
| C | Population defined | Age, gender, sport, and level was described | No | Partly | Yes |
| D | Experimental design | Experimental design of the study period was described and replicable | No | Partly | Yes |
| E | Artificial intelligence | The artificial intelligence approaches/techniques were described | No | Yes | – |
Fig. 1Study selection PRISMA flow diagram
AI techniques or methods descriptions
| AI | Description |
|---|---|
| Absolute shrinkage and selection operator | In statistics and machine learning, least absolute shrinkage and selection operator (LASSO) is a regression analysis method that performs both variable selection and regularization in order to enhance the prediction accuracy and interpretability of the statistical model it produces. It was originally introduced in the geophysics literature in 1986 but was independently rediscovered and popularized by Robert Tibshirani in 1996; he coined the term and provided further insights into the observed performance. LASSO was originally formulated for least squares models. This simple case provides a vast scenario regarding the behavior of the estimator, including its relationship to ridge regression, best subset selection, besides the connections between LASSO coefficient estimates and the so-called soft thresholding. It also reveals that (as standard linear regression) the coefficient estimates need not be unique if covariates are collinear. Although originally defined for least squares, LASSO regularization is easily extended to a wide variety of statistical models including generalized linear models, generalized estimating equations, proportional hazards models, and M-estimators, in a straightforward fashion. |
| Artificial neural network | An artificial neural network (ANN) is a computational model based on the structure and functions of biological neural networks. Information that flows through the network affects the structure of the ANN because a neural network changes—or learns, in a sense—based on inputs and outputs. ANNs are considered nonlinear statistical data modeling tools whereby the complex relationships between inputs and outputs are modeled or patterns are found. |
| Bayesian logistic | In statistics, the logistic model (or logit model) is a widely used statistical model that, in its basic form, uses a logistic function to model a binary dependent variable; many more complex extensions exist. In regression analysis, logistic regression (or logit regression) means estimating the parameters of a logistic model; it is a form of binomial regression. Logistic regression is a type of regression analysis used for predicting the outcome of a categorical (a variable that can take on a limited number of categories) dependent variable based on one or more predictor variables. The probabilities describing the possible outcome of a single trial are modeled, as a function of explanatory variables, using a logistic function. Logistic regression measures the relationship between a categorical dependent variable and usually a continuous independent variable (or several), by converting the dependent variable to probability scores. Logistic regression can be binomial or multinomial. In the multinomial type, there exist different models to perform the regression, such as the Bayesian method. |
| Bayesian networks | Bayesian networks are a type of probabilistic graphical model that uses Bayesian inference for probability computations. Bayesian networks aim to model conditional dependence and, therefore, causation, by representing conditional dependence by edges in a directed graph. Through these relationships, one can efficiently conduct the inference of the random variables in the graph by using factors. |
| Decision tree classifier | A decision tree classifier (DTC) is a decision support tool that uses a tree-like graph or model of decisions and their possible consequences, including chance event outcomes, resource costs, and utility. It is one way to display an algorithm that only contains conditional control statements. Decision trees are commonly used in operations research, specifically in decision analysis, to help identify a strategy most likely to reach a goal, but are also a popular tool in machine learning. Decision tree learning uses a decision tree (as a predictive model) to go from observations about an item (represented in the branches) to conclusions about the item target value (represented in the leaves). It is one of the predictive modeling approaches used in statistics, data mining, and machine learning. Tree models whereby the target variable can take a discrete set of values are called classification trees; in these tree structures, leaves represent class labels and branches represent conjunctions of features that lead to those class labels. |
| Fuzzy clustering | Fuzzy clustering is an alternative method to conventional or hard clustering algorithms, which partitions data containing similar subjects. Fuzzy clustering contrasts with hard clustering by its nonlinear nature and discipline of flexibility in grouping massive data. It provides more accurate and close-to-nature solutions for partitions and herein implies more possibilities of solutions for decision-making. In the specific matter of computation, fuzzy clustering has its roots in fuzzy logic and indicates the likelihood or degrees of one data point belonging to more than one group. |
| Markov process | A Markov process is a random process indexed by time, and with the property that the future isindependent of the past. The present. Markov processes, named for Andrei Markov, are among the most important of all random processes. In a sense, they are the stochastic analogs of differential equations and recurrence relations, which are certainly among the most important deterministic processes. |
| Support vector machine | Support vector machine (SVM) is a discriminative classifier formally defined by a separating hyperplane. In other words, given the labeled training data (supervised learning), the algorithm outputs an optimal hyperplane which categorizes new examples. In a two-dimensional space, this hyperplane is a line dividing a plane into two parts in which each class lies on either side. The learning of the hyperplane in linear SVM is done by transforming the problem using some linear algebra. This is where the kernel plays its role. |
| Support vector machine + decision tree classifier | The idea of combining SVM and DTC is to provide a hybrid approach, which attempts to embed SVM within a decision tree algorithm as a decision tree pre-pruning method and resulting into a more accurate and efficient hybrid classifier. |
Fig. 2AI for predicting injury risk in various sports
Fig. 3AI for predicting sporting performance in various sports