| Literature DB >> 33931681 |
Yersultan Mirasbekov1, Adina Zhumakhanova1, Almira Zhantuyakova1,2, Kuanysh Sarkytbayev1,3, Dmitry V Malashenkov4, Assel Baishulakova1, Veronika Dashkova1,5, Thomas A Davidson6, Ivan A Vorobjev1,3, Erik Jeppesen6,7,8,9, Natasha S Barteneva10,11.
Abstract
A machine learning approach was employed to detect and quantify Microcystis colonial morphospecies using FlowCAM-based imaging flow cytometry. The system was trained and tested using samples from a long-term mesocosm experiment (LMWE, Central Jutland, Denmark). The statistical validation of the classification approaches was performed using Hellinger distances, Bray-Curtis dissimilarity, and Kullback-Leibler divergence. The semi-automatic classification based on well-balanced training sets from Microcystis seasonal bloom provided a high level of intergeneric accuracy (96-100%) but relatively low intrageneric accuracy (67-78%). Our results provide a proof-of-concept of how machine learning approaches can be applied to analyze the colonial microalgae. This approach allowed to evaluate Microcystis seasonal bloom in individual mesocosms with high level of temporal and spatial resolution. The observation that some Microcystis morphotypes completely disappeared and re-appeared along the mesocosm experiment timeline supports the hypothesis of the main transition pathways of colonial Microcystis morphoforms. We demonstrated that significant changes in the training sets with colonial images required for accurate classification of Microcystis spp. from time points differed by only two weeks due to Microcystis high phenotypic heterogeneity during the bloom. We conclude that automatic methods not only allow a performance level of human taxonomist, and thus be a valuable time-saving tool in the routine-like identification of colonial phytoplankton taxa, but also can be applied to increase temporal and spatial resolution of the study.Entities:
Year: 2021 PMID: 33931681 PMCID: PMC8087837 DOI: 10.1038/s41598-021-88661-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1An image of one of the 24 flow-through tanks in the LMWE experiment run at the experimental facility belonging to Aarhus University, Denmark (modified from[42]). The collection tank was placed at right side (not included). This image was created with BioRender (https://biorender.com/).
Figure 2FlowCAM imaging flow cytometry of phytoplankton from the LMWE 2019 experiment. (A) Cryptomonas sp.; (B) Micractinium; (C) Microcystis aeruginosa; (D) M. ichthyoblabe; (E) M. novacekii; (F) M. smithii; (G) M. wesenbergii.
List of particle properties with corresponding type, description and possible value range.
(modified from FlowCAM user manual).
| Particle properties | Type | Descriptions | Value range for X |
|---|---|---|---|
| Average Blue | Color | Average pixel value for blue color plane | X ∈ [0, 255]; |
| Diameter (ABD) | Size | Circle-based diameter that equal to ABD Area | X > 0; |
| Edge gradient | Texture | Average pixels intensity of outside border of a particle after an application of Sobel Edge Detect convolution filter | X ∈ [0, 255]; |
| Intensity | Grayscale | Average grayscale value of pixels of a particle (grayscale sum / number of particle pixels) | X ∈ [0, 255]; |
| Length | Size | Maximum value of 36 feret measurements | X > 0; |
| Perimeter | Size | Total length of edges including edges of any hole | X > 0; |
| Ratio red/blue | Color | Ratio between Average Red and Average Blue | X ≥ 0; |
| Ratio red/green | Color | Ratio between Average Red and Average Green | X ≥ 0; |
| Roughness | Shape | Unevenness/irregularity of a particle's surface, defined as the ratio between perimeter and convex perimeter. Larger values have a non-convex perimeter and/or interior holes | X ≥ 1; X = 1 for a filled shape with convex perimeter; |
| Sigma intensity | Grayscale | Standard deviation of particle’s grayscale values | X ≥ 0 |
Set of particle properties within each filter set for intergeneric classification (A) and intrageneric classification (B).
| A | ||||||
|---|---|---|---|---|---|---|
| Particle property | 25 selected | 50 selected | Intersection | |||
| Min | Max | Min | Max | Min | Max | |
| Diameter (ABD) (μm) | 8.23 | 11.53 | 8.22 | 12.72 | 8.23 | 11.53 |
| Diameter (ABD) (μm) | 16.19 | 40.52 | 15.85 | 40.52 | 16.19 | 40.52 |
| Intensity | 79.20 | 105.85 | 73.64 | 105.85 | 79.20 | 105.85 |
| Diameter (ABD) (μm) | 45.35 | 85.69 | 33.48 | 83.89 | 45.35 | 83.89 |
| Intensity | 49.12 | 92.47 | 40.55 | 97.59 | 49.12 | 92.47 |
The results of classification using intersecting ranges for the intergeneric classification of three classes (upper row) and the intrageneric classification of five Microcystis morphospecies (lower row).
(A, B) Numbers represent results when corresponding classifier was applied. Confusion matrices for used filter sets with precision values for correct predictions (highlighted in green) and false discovery rate for misclassifications (highlighted in orange). The coloring was assigned depending on relative frequency in each column. (C, D) Percentage values for accuracy and reliability (precision) of used methods. Overall performance for intergeneric and intrageneric classification in percentage is highlighted in blue.
Summary of the evaluation of filter sets using balanced accuracy, Hellinger distance and SMAPE, Bray–Curtis dissimilarity, and Kullback–Leibler Divergence.
| Filter set used for classification | Hellinger distance | Balanced accuracy (%) | SMAPE (%) | Bray–Curtis dissimilarity | Kullback–Leibler Divergence |
|---|---|---|---|---|---|
| 0.13 | 90.13 | 3.60 | 0.128 | 0.0006 | |
| 0.10 | 87.26 | 5.04 | |||
| 0.07 | 91.32 | 3.17 | |||
| 0.19 | 73.2 | 17.24 | 0.145 | 0.0281 | |
| 0.15 | 71.3 | 16.44 | |||
| 0.24 | 90.7 | 2.75 | |||
| 0.17 | 77.7 | 11.05 | |||
| 0.16 | 74.3 | 9.68 | |||
Figure 3The distribution of Microcystis morphospecies in samples from the LMWE experiment in 2019. Grouping is based on type of tank and the date when the samples were taken.
Figure 4Seasonal changes of abundances of Microcystis spp. colonial morphoforms and water temperature from May to September 2019 in mesocosm tanks D1 (A) and G1 (B).