| Literature DB >> 35664223 |
Martin Žofka1, Linh Thuy Nguyen1, Eva Mašátová1, Petra Matoušková1.
Abstract
Poor efficacy of some anthelmintics and rising concerns about the widespread drug resistance have highlighted the need for new drug discovery. The parasitic nematode Haemonchus contortus is an important model organism widely used for studies of drug resistance and drug screening with the current gold standard being the motility assay. We applied a deep learning approach Mask R-CNN for analysing motility videos containing varying rates of motile worms and compared it to other commonly used algorithms with different levels of complexity, namely the Wiggle Index and the Wide Field-of-View Nematode Tracking Platform. Mask R-CNN consistently outperformed the other algorithms in terms of the detection of worms as well as the precision of motility forecasts, having a mean absolute percentage error of 7.6% and a mean absolute error of 5.6% for the detection and motility forecasts, respectively. Using Mask R-CNN for motility assays confirmed the common problem with algorithms that use non-maximum suppression in detecting overlapping objects, which negatively impacts the overall precision. The use of intersect over union as a measure of the classification of motile / non-motile instances had an overall accuracy of 89%, indicating that it is a viable alternative to previously used methods based on movement characteristics, such as body bends. In comparison to the existing methods evaluated here, Mask R-CNN performed better and we anticipate that this method will broaden the number of possible approaches to video analysis of worm motility.Entities:
Keywords: CNN, convolutional neural network; GPU, graphics processing unit; Instance segmentation; IoU, intersection over union; L3, third-stage larva; MAE, mean absolute error; MAPE, mean absolute percentage error; ME, mean error; MPE, mean percentage error; Mask R-CNN; Mask R-CNN, region based convolutional neural network; NMS, non-maximum suppression; Nematode; Object detection; Parasite; ROI, region of interest; RPN, regional proposal network; WF-NTP, Wide Field-of-View Nematode Tracking Platform; WI, Wiggle Index; fps, frames per second; mAP, mean average precision
Year: 2022 PMID: 35664223 PMCID: PMC9127531 DOI: 10.1016/j.csbj.2022.05.014
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Fig. 1Mask R-CNN training and detection process (a) annotated images used for training the algorithm, (b) regions of interest (ROI) detected in the first step by the regional proposal network (solid boxes) and the refined regions (dashed boxes), (c) Mask R-CNN predictions containing the object boundaries, predicted object classes (i.e., L3 – third-stage larva) with an associated confidence level and the mask of the object (coloured contour).
Fig. 2Sample frames of common detection scenarios in motility videos. The blue outlines are the masks detected by Mask R-CNN. Each detected worm has an associated identification number displayed in white based on the tracking across the frames. (a) common scenario with low or no overlaps of individual worms, (b) example of a motile worm (id 32) colliding with a non-motile worm (id 26) causing the later to be perceived as motile, (c) complex detection scenario with a high number of worms with significant overlaps. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Motility error metrics for algorithms.
| Wiggle Index | 14.2 | −0.71 |
| WF-NTP | 8.76 | 6.52 |
| Mask R-CNN | 5.61 | 1.95 |
The differences between the manual processing and the respective algorithm were expressed as the mean absolute error (MAE) and mean error (ME).
Fig. 3Motility rates comparisons for individual motility groups per algorithm. The number of the motility group indicates the percentage of live larvae. The motility rate is visualized as box plots.
Worm count error metrics for algorithms.
| Algorithm / Metric | MAPE, % | MPE, % |
|---|---|---|
| WF-NTP | 40.23 | 40.23 |
| Mask R-CNN | 7.6 | 5.61 |
The differences between the manual counts and the respective algorithm were expressed as the mean absolute percentage error (MAPE) and mean percentage error (MPE).
Fig. 4Worm count comparisons for individual motility groups per algorithm. The number of the motility group indicates the percentage of live larvae. The number of worms is visualized as box plots.
Fig. 5The distribution for different intersection over union (IoU). Both (a) a histogram of IoU and the associated number of worms per bin classified into motile / non-motile and (b) a histogram of IoU and the associated probability per class (motile / non-motile) show that the majority of misclassified cases were located around the 0.8 IoU.
Classification performance metrics.
| Classification | Precision, % | Recall, % |
|---|---|---|
| Motile | 94 | 88 |
| Non-Motile | 84 | 92 |
Precision and recall metrics for classification performance.
Normality tests for the motility rate error terms between Mask R-CNN and the manual processing.
| Normality test | Statistic | p-value |
|---|---|---|
| D’Agostino-Pearson | 0.32 | 0.85 |
| Shapiro-Wilk | 0.97 | 0.35 |
For both normality tests we cannot reject the null hypothesis that the data is from a normal distribution.
Correlation matrix for the prediction errors of the individual algorithms.
| 1 | −0.04 | 0.08 | |
| −0.04 | 1 | 0.65 | |
| 0.08 | 0.65 | 1 |