| Literature DB >> 30252874 |
Ana Rita Marques1, Henny Forde2, Crawford W Revie1.
Abstract
Sea lice Lepeophtheirus salmonis (Krøyer) are a major ectoparasite affecting farmed Atlantic salmon in most major salmon producing regions. Substantial resources are applied to sea lice control and the development of new technologies towards this end. Identifying and understanding how sea lice population patterns vary among cages on a salmon farm can be an important step in the design and analysis of any sea lice control strategy. Norway's intense monitoring efforts have provided salmon farmers and researchers with a wealth of sea lice infestation data. A frequently registered parameter is the number of adult female sea lice per cage. These time-series data can be analysed descriptively, the similarity between time-series quantified, so that groups and patterns can be identified among cages, using clustering algorithms capable of handling such dynamic data. We apply such algorithms to investigate the pattern of female sea lice counts among cages for three Atlantic salmon farms in Norway. A series of strategies involving a combination of distance measures and prototypes were explored and cluster evaluation was performed using cluster validity indices. Repeated agreement on cluster membership for different combinations of distance and centroids was taken to be a strong indicator of clustering while the stability of these results reinforced this likelihood. Though drivers behind clustering are not thoroughly investigated here, it appeared that fish weight at time of stocking and other management practices were strongly related to cluster membership. In addition to these internally driven factors it is also possible that external sources of infestation may drive patterns of sea lice infestation in groups of cages; for example, those most proximal to an external source. This exploratory method proved useful as a pattern discovery tool for cages in salmon farms.Entities:
Mesh:
Year: 2018 PMID: 30252874 PMCID: PMC6156026 DOI: 10.1371/journal.pone.0204319
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Clustering validity indices (CVI), the distances that are used as a measure of intra-cluster cohesion or inter-cluster separation and whether their value should be maximized or minimized when choosing the most robust set of clusters.
| CVI | Cohesion | Separation | Goal |
|---|---|---|---|
| Calinski-Harabasz | Ponts to centroid | Cluster centroid to global centroid | Maximized |
| Davies-Bouldin and Davies Bouldin Modified | Points to centroid | Between centroids | Minimized |
| Silhouette | Between all points | Nearest-neighbor distance | Maximized |
| Dunn | Nearest-neighbor distance | Maximum cluster diameter | Maximized |
| COP | Points to centroid | Furthest neighbour distance | Minimized |
| Score function | Points to centroid | Cluster centroid to global centroid | Maximized |
Fig 1Farm I CVI plots.
Cluster validity index plots for the most stable algorithms applied to the time-series data for Farm I in 2012-2013. (A) individual CVI plots; (B) average values for each group of CVIs.
Clustering results for Farm I, with cluster membership as shown in the plots in Figs 2, 3 and 4.
The most stable clusters are identified per cycle, alphabetically.
| Production cycle from 2012-03-23 to 2013-10-03 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 3 | A{2,3,5};B{4};C{1} | all;all |
| DBA | 3 | A{2,3,5};B{4};C{1} | all;all | |
| MEAN | 3 | A{2,3,5};B{4};C{1} | all;all | |
| DTW | PAM | 2 | C{1};D{2,3,4,5} | all;all |
| DBA | 3 | C{1};E{3};F{2,4,5} | nn;nn | |
| MEAN | 2 | C{1};D{2,3,4,5} | all;all | |
| Euclidean | PAM | 3 | A{2,3,5};B{4};C{1} | all;ptt |
| DBA | 3 | A{2,3,5};B{4};C{1} | all;ptt | |
| MEAN | 3 | A{2,3,5};B{4};C{1} | all;ptt | |
| Production cycle from 2014-05-18 to 2015-10-17 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 2 | A{1,3,4,5,6};B{2,7,8} | all;all |
| DBA | 2 | A{1,3,4,5,6};B{2,7,8} | all;all | |
| MEAN | 3 | A{1,3,4,5,6};C{2,8};D{7} | all;all | |
| DTW | PAM | 4 | E{4};F{7,8};G{1,2};H{3,5,6} | nn;nn |
| DBA | 2 | I{1,2,3,4};J{5,6,7,8} | ptt;ptt | |
| MEAN | 2 | F{7,8};K{1,2,3,4,5,6} | all;all | |
| Euclidean | PAM | 3 | C{2,8};L{5,6,7};M{1,3,4} | all;all |
| DBA | 2 | M{1,3,4};N{2,5,6,7,8} | ptt;ptt | |
| MEAN | 3 | A{1,3,4,5,6};F{7,8};O{2} | ptt;all | |
| Production cycle from 2016-03-03 to 2017-06-29 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 2 | A{4,7,8};B{1,2,3,5,6} | all;all |
| DBA | 2 | A{4,7,8};B{1,2,3,5,6} | all;all | |
| MEAN | 2 | A{4,7,8};B{1,2,3,5,6} | all;all | |
| DTW | PAM | 2 | A{4,7,8};B{1,2,3,5,6} | all;all |
| DBA | 2 | A{4,7,8};B{1,2,3,5,6} | all;all | |
| MEAN | 2 | A{4,7,8};B{1,2,3,5,6} | all;all | |
| Euclidean | PAM | 2 | A{4,7,8};B{1,2,3,5,6} | all;all |
| DBA | 2 | C{4};D{1,2,3,5,6,7,8} | ptt;ptt | |
| MEAN | 2 | E{4,8};F{1,2,3,5,6,7} | ptt;ptt | |
*ptt: partial; nn: none; GAK: Global Alignemnt Kernels; DTW: Dynamic Time Warping; PAM: Partition Around Medoids; DBA: Dynamic Barycenter Averaging.
Fig 2Farm I clustering for 2012-2013.
Time-series plots for the most stable clustering algorithms for Farm I, production cycle of 2012-2013.
Fig 3Farm I clustering for 2014-2015.
Time-series plots for the most stable clustering algorithms for Farm I, production cycle of 2014-2015.
Fig 4Farm I clustering for 2016-2017.
Time-series plots for the most stable clustering algorithm for Farm I, production cycle of 2016-2017.
Clustering results for Farm II.
The most stable clusters are identified per cycle, alphabetically.
| Production cycle from 2012-04-20 to 2013-05-25 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 4 | A{5};B{3,4,8,9};C{1,2,6,7};D{10} | ptt;ptt |
| DBA | 4 | A{5};B{3,4,8,9};C{1,2,6,7};D{10} | ptt;ptt | |
| MEAN | 2 | D{10};E{1,2,3,4,5,6,7,8,9} | all;all | |
| DTW | PAM | 4 | D{10};F{3,5};G{4,7,8,9};H{1,2,6} | ptt;ptt |
| DBA | 4 | D{10};F{3,5};G{4,7,8,9};H{1,2,6} | ptt;ptt | |
| MEAN | 4 | D{10};F{3,5};G{4,7,8,9};H{1,2,6} | all;ptt | |
| Euclidean | PAM | 4 | D{10};I{3,4,5};J{2,6,7,8,9};K{1} | all;all |
| DBA | 4 | D{10};F{3,5};L{6,7,8,9};M{1,2,4} | ptt;ptt | |
| MEAN | 4 | D{10};F{3,5};L{6,7,8,9};M{1,2,4} | ptt;ptt | |
| Production cycle from 2013-08-30 to 2015-06-5 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 2 | A{4};B{1,2,3,5,6,7,8,9,10,11,12} | ptt;all |
| DBA | 2 | C{4,5,6};D{1,2,3,7,8,9,10,11,12} | ptt;ptt | |
| MEAN | 2 | E{4,6};F{1,2,3,5,7,8,9,10,11,12} | ptt;ptt | |
| DTW | PAM | 2 | G{6};H{1,2,3,4,5,7,8,9,10,11,12} | ptt;ptt |
| DBA | 2 | G{6};H{1,2,3,4,5,7,8,9,10,11,12} | ptt;ptt | |
| MEAN | 2 | G{6};H{1,2,3,4,5,7,8,9,10,11,12} | ptt;ptt | |
| Euclidean | PAM | 2 | G{6};H{1,3,4,5,6,7,8,9,10,11,12} | ptt;ptt |
| DBA | 2 | G{6};H{1,2,3,4,5,7,8,9,10,11,12} | all;ptt | |
| MEAN | 3 | A{4};F{1,2,3,5,7,8,9,10,11,12};G{6} | ptt;all | |
| Production cycle from 2015-08-09 to 2017-04-07 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 3 | A{1,3,4,5};B{8};C{2,6,7,9,10 | ptt;all |
| DBA | 3 | A{1,3,4,5};B{8};C{2,6,7,9,10} | nn;all | |
| MEAN | 4 | B{8};C{2,6,7,9,10};D{1,3,5};E{4} | all;ptt | |
| DTW | PAM | 3 | F{4,5};G{1,3};H{2,6,7,8,9,10} | all;all |
| DBA | 2 | I{3,4,5};J{1,2,6,7,8,9,10} | ptt;ptt | |
| MEAN | 2 | I{3,4,5};J{1,2,6,7,8,9,10} | ptt;ptt | |
| Euclidean | PAM | 2 | B{8};K{1,2,3,4,5,6,7,9,10} | ptt;ptt |
| DBA | 3 | A{1,3,4,5};B{8};C{2,6,7,9,10} | all;all | |
| MEAN | 3 | B{8};I{3,4,5};L{1,2,6,7,9,10} | all;ptt | |
*ptt: partial; nn: none; GAK: Global Alignemnt Kernels; DTW: Dynamic Time Warping; PAM: Partition Around Medoids; DBA: Dynamic Barycenter Averaging.
Fig 5Farm II clustering per production cycle.
Time series plots for the most stable clustering algorithm for Farm II, one from each production cycle.
Clustering results for Farm III.
The most stable clusters are identified per cycle, alphabetically.
| Production cycle from 2013-03-24 to 2014-11-13 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all |
| DBA | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all | |
| MEAN | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all | |
| DTW | PAM | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all |
| DBA | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all | |
| MEAN | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | ptt;ptt | |
| Euclidean | PAM | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all |
| DBA | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all | |
| MEAN | 3 | A{3,5,6,7,11};B{1,4,9,10,12};C{2,8} | all;all | |
| Production cycle from 2015-05-04 to 2016-09-05 | ||||
| Distance metric | Centroid | No. clusters | Cluster members | Stability 20;100 |
| GAK | PAM | 3 | A{4,6};B{2,3,5,9,11,12};C{1,7,8,10} | all;all |
| DBA | 2 | A{4,6};D{1,2,3,5,7,8,9,10,11,12} | all;all | |
| MEAN | 2 | A{4,6};D{1,2,3,5,7,8,9,10,11,12} | all;all | |
| DTW | PAM | 2 | E{1,4,6,7};F{2,3,5,8,9,10,11,12} | ptt;ptt |
| DBA | 2 | E{1,4,6,7};F{2,3,5,8,9,10,11,12} | ptt;ptt | |
| MEAN | 2 | G{1,4,6,7,8};H{2,3,5,9,10,11,12} | all;all | |
| Euclidean | PAM | 3 | A{4,6};B{2,3,5,9,11,12};C{1,7,8,10} | all;all |
| DBA | 2 | A{4,6};D{1,2,3,5,7,8,9,10,11,12} | all;all | |
| MEAN | 4 | A{4,6};B{2,3,5,9,11,12};I{8};J{1,7,10} | all;all | |
*ptt: partial; nn: none; GAK: Global Alignemnt Kernels; DTW: Dynamic Time Warping; PAM: Partition Around Medoids; DBA: Dynamic Barycenter Averaging.
Fig 6Farm III clustering per production cycle.
Time series plots for the most stable clustering algorithm for Farm III, one from each production cycle.