| Literature DB >> 27595001 |
Hendrik Edelhoff1, Johannes Signer1, Niko Balkenhol1.
Abstract
Increased availability of high-resolution movement data has led to the development of numerous methods for studying changes in animal movement behavior. Path segmentation methods provide basics for detecting movement changes and the behavioral mechanisms driving them. However, available path segmentation methods differ vastly with respect to underlying statistical assumptions and output produced. Consequently, it is currently difficult for researchers new to path segmentation to gain an overview of the different methods, and choose one that is appropriate for their data and research questions. Here, we provide an overview of different methods for segmenting movement paths according to potential changes in underlying behavior. To structure our overview, we outline three broad types of research questions that are commonly addressed through path segmentation: 1) the quantitative description of movement patterns, 2) the detection of significant change-points, and 3) the identification of underlying processes or 'hidden states'. We discuss advantages and limitations of different approaches for addressing these research questions using path-level movement data, and present general guidelines for choosing methods based on data characteristics and questions. Our overview illustrates the large diversity of available path segmentation approaches, highlights the need for studies that compare the utility of different methods, and identifies opportunities for future developments in path-level data analysis.Entities:
Keywords: Animal behavior; Bio-logging; GPS; Path segmentation; Path topology; Path-level analyses; State-space models; Telemetry
Year: 2016 PMID: 27595001 PMCID: PMC5010771 DOI: 10.1186/s40462-016-0086-5
Source DB: PubMed Journal: Mov Ecol ISSN: 2051-3933 Impact factor: 3.600
Fig. 1Overview of important steps throughout a segmentation analysis. In general, the actual continuous movement path of an organism is sampled as a set of consecutive relocations (Step 1; e.g., field work). Step 2: exploratory and descriptive analyses of path characteristics exploring and visualizing of the data structure. Step 3: applying one or several path segmentation method(s) to objectively distinguish different movement states. Step 4: Some methods require the use of clustering and summary statistics to quantify differences in distinguished movement states, and to facilitate biological interpretation in terms of behavioral modes
Currently applied path characteristics. Different signals or parameters can be calculated either based on consecutive relocations within a trajectory (“stepwise”) or for multiple relocations such as identified path-segments (“across multiple steps”)
| Characteristic | Description | Type | Calculation | Reference |
|---|---|---|---|---|
| Displacement | Increment of the X and Y values between two consecutive relocations, change in absolute spatial position | primary | stepwise | [ |
| Time lag | Duration / increment in time between consecutive relocations (usually determined by sampling regime) | primary | stepwise | [ |
| Turning angles / heading | Relative and absolute turning angles between consecutive relocations, change in direction | primary | stepwise | [ |
| Step length | Euclidean distance between two consecutive relocations | primary | stepwise | [ |
| Velocity / speed | Distance traveled in a given time interval between two relocations; less sensitive to missing data than step length | primary | stepwise | [ |
| Persistence / turning velocity | Transformations of speed and turning angle: persistence velocity represents the tendency and degree of a movement to persist in a certain direction. Turning velocity shows the tendency of a movement to turn in a perpendicular/opposite direction | secondary | stepwise | [ |
| Net / mean squared displacement | Squared displacement between the first and current relocation of the trajectory; applied to characterize diffusion behavior or migration patterns | secondary | stepwise | [ |
| First passage time | Time required for crossing a predefined endpoint based on a circle (radius) around a starting relocation. Sums the times of all forward and backwards relocations within the radius; index of area-restricted search behavior | secondary | stepwise | [ |
| Residence time | Extension of the first passage time accounting for returns of the animal in a given area. Sums the times of all relocations (backwards and forwards) of a trajectory within a given vicinity around a relocation. | secondary | stepwise | [ |
| Pseudo-Azimuth | Recalculates the basic azimuth value at the midpoint between two consecutive steps to range within 0 and 360. Can be used as indicators for movements with same or parallel directions. | primary | stepwise | [ |
| Straightness index | Ratio of Euclidean distance between the beginning and end of a trajectory and the total path length (sum of all step lengths) | secondary | across multiple steps | [ |
| Sinuosity / Tortuosity | Adaptions of the straightness index analyzing the probabilistic distributions of the changes in the turning angles and the beeline distance between the start and end points of the trajectory; index of path orientation | secondary | across multiple steps | [ |
| Fractal dimension | Measure of path tortuosity; non-Euclidean dimension of the trajectory varying between one (completely straight) and two (tortuous, completely spanning two-dimensional space); different implementations exist | secondary | across multiple steps | [ |
| Multi-scale straightness index | Repeated calculation of the straightness index of a trajectory over a range of different temporal scales | secondary | across multiple steps | [ |
| Area interest index | Repeated calculation of the straightness index for a limited size of a sliding window along the trajectory. With each repetition, the number of relocations within the trajectory is reduced | secondary | across multiple steps | [ |
Fig. 2The main study aims of path segmentation and types of methods to address them. a Pattern description: Topology-based analyses rely directly on signals calculated from the movement trajectory (e.g. step length and bearing). They combine movement steps into groups based on similarity in the considered path-signals, for example by applying clustering algorithms. b Change-point detection: Time-series analyses assess a path-signal (y-axis) along its time-axis. For example, a moving window (rectangle) can be used to search for points along the time-series where local parameters (e.g. the mean) of the path-signal are significantly different from the global averages of these parameters. Significant change-points are assumed to indicate switches in underlying movement modes or behavioral states, and are used to separate the trajectory into segments (dashed lines). c Process identification: The majority of the presented state-space models link two stochastic models describing the state process and its observation. For example, the state process could consist of two discrete behavioral states (red and blue). The process model describes how the hidden state (x) emerges based on a Markov process. Therefore, it accounts for the conditional probability of a future state depending on the one of the current relocation. The observation model links the actual observed data (y) at given points in time to the hidden state. As a result, the most probable state of each observation, the switching probabilities between the states, as well as the distributions of the measured path-signals within each state are provided.
Characteristics of the methodological approaches for the three different categories of research questions. Different methods for answering the three type of broad research questions (study aims) are listed together with the analytical category they stem from, a short description of each method as well as the considered categories of input path-signals and important references
| Study aim | Method | Analytical category | Description | Input signal | References |
|---|---|---|---|---|---|
| Movement pattern description | Thresholding | Topology-based | Applies thresholding schemes (cut-off values) to separate relocations into different groups based on single or multiple path parameters (e.g., short- vs. Long-range movements) | Primary and secondary signals | [ |
| Supervised Classification | Topology-based | Relocations (steps) of a trajectory are assigned to certain classes of movement behavior based on a classification scheme fitted with a training dataset | Primary and secondary signals, additional information like activity data | [ | |
| Clustering | Topology-based | Unsupervised classification for identifying distinctive groups within a multivariate set of path-signals | Primary and secondary signals, additional information like activity data | [ | |
| Bayesian Partitioning of Markov Models (BPMM) | Topology- and time- series based | Classification algorithm for determining the number and sequence of homogenous classes within a sequential path-signal (time series) | Primary and secondary signals | [ | |
| Change-point detection | Line Simplification | Topology- or time-series based | Tests whether reducing the number of vertices in a trajecotry significantly impacts path topology to determine change points (can also be applied with graphs of sequential path-signals) | Primitive signals (spatial position) | [ |
| Change Point Test | Topology-based | Detects significant changes in the observed movement direction (orientation) between the starting point and an attraction point of a trajectory | Primitive signals (spatial position) | [ | |
| Spatio-Temporal Criteria Segmentation | Topology-based | Special type of thresholding seeking optimal segmentation of a trajectory based on monotone criteria: relocations are included in a segment as long as they fullfill certain predefined requirements | Primitive, primary and secondary signals | [ | |
| Piecewise Regression | Time-series analysis | Splits time-series model into representative segments based on a signficant change-point (fits a polynomial model for each segment) | Primary and secondary signals | [ | |
| Penalized Contrast Method (PCM) | Time-series analysis | Non-parametric segmentation of a path-signal: the unknown number of segments is estimated by minimizing a penalized contrast function | Mostly secondary signals | [ | |
| Behavioral Change Point Analysis (BCPA) | Time-series analysis | Likelihood-based method for detecting significant change points; applies moving window over continuous autocorrelated time series of a path-signal | Mostly secondary signals | [ | |
| Pruned Exact Linear Time (PELT) Algorithm | Time-series analysis | Search method for detecting optimal number and locations of change points minimizing different cost and penalty functions | primary and secondary signals | [ | |
| Behavioral Movement Segmentation (BMS) | Time-series analysis | Combined search algorithm which optimizes segmentation based on parsimony and subsequent clustering for assigning segments to similar behaviors | primary and secondary signals, additional information like activity data | [ | |
| Process identification | Hidden-Markov Models (HMM) | State-space models | Estimate the sequence and composition of a predifined number of discrete states (e.g., movement behaviors) as well as the switching-probabilities between these states | Primary signals, additional information like activity data | [ |
| State-Space Models with Location Filtering | State-space models | More complex models which can model hidden movement states and also correct for errors in the observation process (e.g., GPS errors) | Primitive (spatial position) and primary signals, additional information like activity data | [ | |
| Hierarchical State-Space Models | State-space models | Hierarchical models accounting for variability of number and composition of movement states between individuals (further making inferences at population level) | Primary signals | [ | |
| Bayesian Partitioning of Markov Models (BPMM) | Topology- and time- series based | Can also be used as partitioning algorithm determining the number and sequence of homogenous models (“states”) within a sequential path-signal | primary and secondary signals | [ |
Fig. 3Decision guidelines for choosing appropriate segmentation methods. The process should begin with preliminary analyses of the trajectory data and derived path-parameters (1). Choosing among methods is then first directed by the data structure and sampling regime (2). Capability of the methods to account for temporal autocorrelation further determines the decision process. In the end, study aims and objectives guide the final decision on a given segmentation method (3)
Fig. 4Simulated trajectory and results of preliminary analyses. a overview of the simulated movement path and habitat configuration. b distributions of observed step lengths within and outside the habitat (matrix) of the tracked animal. Results of preliminary analyses for the net-squared displacement signal including the distribution (c) and the time-series across the entire tracking period (d). Distributions of observed step lengths at different hours of the day (e)
Fig. 5Results of three different segmentation methods using the simulated movement data. a The left panel shows the distribution of the observed step lengths as well as the applied cut-off value (threshold = 2 units). The proportions of the resulting behavioral states (short- and long-range movements) within and outside of the habitat are shown in the right panel. b Results from the behavioral change point analyses applied with the net-squared displacement signal. The observed time-series was segmented at significant change-points (vertical lines) to distinguish movements within the main ranges of the animal and two migratory periods. The color of the estimated parameter ρ^ indicates the level of temporal autocorrelation. c Change in switching probabilities between the two states (resting vs. active) dependent on the different hours of the day. Switching probabilities also differed with regard to whether the animal was in its habitat or not. Black lines indicate the switches from the resting state to the active state. Red lines are showing the switching probabilities from active to resting state