| Literature DB >> 36060761 |
Weimin Yu1, Qingqing Xiang2, Yingchao Hu3, Yukun Du4, Xiaodong Kang5, Dongyun Zheng5, He Shi5, Quyi Xu5, Zhigang Li5, Yong Niu6, Chao Liu5, Jian Zhao5.
Abstract
The diatom test is a forensic technique that can provide supportive evidence in the diagnosis of drowning but requires the laborious observation and counting of diatoms using a microscopy with too much effort, and therefore it is promising to introduce artificial intelligence (AI) to make the test process automatic. In this article, we propose an artificial intelligence solution based on the YOLOv5 framework for the automatic detection and recognition of the diatom genera. To evaluate the performance of this AI solution in different scenarios, we collected five lab-grown diatom genera and samples of some organic tissues from drowning cases to investigate the potential upper/lower limits of the capability in detecting the diatoms and recognizing their genera. Based on the study of the article, a recall score of 0.95 together with the corresponding precision score of 0.9 were achieved on the samples of the five lab-grown diatom genera via cross-validation, and the accuracy of the evaluation in the cases of kidney and liver is above 0.85 based on the precision and recall scores, which demonstrate the effectiveness of the AI solution to be used in drowning forensic routine.Entities:
Keywords: YOLOv5 framework; artificial intelligence; diatom test; drowning; forensic science; microwave digestion-vacuum filtration-automated scanning electron microscopy
Year: 2022 PMID: 36060761 PMCID: PMC9437702 DOI: 10.3389/fmicb.2022.963059
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Figure 1A fully connected network with 3 FC layers (Left); An illustration of how a convolutional layer works (Right).
Figure 2The workflow of the proposed diatom detection and recognition solution.
Summary of the SEM images scanned from the standard samples.
|
|
|
|
|
|---|---|---|---|
|
| 2,018 | 630 | 812 |
|
| 2,084 | 672 | 921 |
|
| 1,966 | 930 | 1,356 |
|
| 1,999 | 1,476 | 6,515 |
|
| 1,875 | 1,622 | 5,741 |
| Total | 9,942 | 5,330 | 15,345 |
Summary of the SEM images scanned from the liver and kidney samples.
|
|
|
|
|---|---|---|
| #01 | 904 | 2,789 |
| #02 | 938 | 1,168 |
| #03 | 8 | 8 |
| #04 | 509 | 597 |
| #05 | 108 | 113 |
| #06 | 3 | 3 |
| #07 | 1,687 | 2,125 |
| #08 | 54 | 56 |
| #09 | 69 | 72 |
| #10 | 58 | 60 |
| #11 | 35 | 39 |
| Total | 4,373 | 7,030 |
Figure 3The number distribution of different diatom genus in the lung (a), liver and kidney samples (b).
Figure 4The YOLOv5m architecture.
The configuration of hardware and software environment for evaluation.
| Hardware | CPU | Intel Xeon CPU E5-1620 v2 @ 3.70GHz |
| RAM | 24GB | |
| GPU | NVIDIA GeForce RTX 2080 Ti (×1) | |
| Video Memory | 12GB | |
| Hard Disk | 500GB | |
| Software | OS | Windows 10 |
| Programming Toolkit | Python 3.9 + PyTorch 1.9 + CUDA 11.1 | |
| IDE | PyCharm Professional |
Figure 5(a) The precision-recall curves of the single-class diatom detection under the confidence threshold 0.5 and the IoU threshold 0.5. (b) The precisions and recalls at the confidence threshold 0.5 achieved by different models. (c–g) The qualitative demonstration of the detection cases of the five test genus.
Figure 6The precision-recall curves of all the diatom genus achieved by the Last-640, Last-800, Best-640, and Best-800 models.
The confusion matrices derived from the multi-class recognition of the lab-grown diatoms with the Last-640 and Best-640 models.
|
|
| ||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||
| Actual |
| 0.94 | 0 | 0 | 0 | 0 | 0.06 |
|
| 0 | 1 | 0 | 0 | 0 | 0 | |
|
| 0 | 0 | 0.94 | 0 | 0.01 | 0.05 | |
|
| 0 | 0 | 0 | 0.51 | 0.45 | 0.04 | |
|
| 0 | 0 | 0 | 0 | 0.71 | 0.29 | |
| Background | 0.01 | 0.01 | 0.02 | 0.81 | 0.15 | 0 | |
|
|
| ||||||
|
|
|
|
|
|
| ||
| Actual |
| 0.94 | 0 | 0 | 0 | 0 | 0.06 |
|
| 0 | 1 | 0 | 0 | 0 | 0 | |
|
| 0 | 0 | 0.94 | 0 | 0 | 0.06 | |
|
| 0 | 0 | 0 | 0.88 | 0 | 0.12 | |
|
| 0 | 0 | 0 | 0.05 | 0.88 | 0.07 | |
| Background | 0.01 | 0.02 | 0.01 | 0.31 | 0.66 | 0 | |
The boxed values indicate high mis-recognition cases among some diatom genera and background.
Figure 7The demonstration of the quantitative results of the lung samples (a,b). Several qualitative cases achieved by the Best-800 model (c–f).
Figure 8(a–c) Three annotated images acquired from the liver and kidney samples with different situations on background.
Figure 9The evaluation result summary of the liver and kidney samples (a,b) and two drowning cases qualitatively compared between the YOLOv5m-Last-800 model and the RetinaNet-101-Last-800 model (c–f).
The precision and recall scores when changing the IoU confidence threshold from 0.1 to 0.5 and the confidence score is always 0.5.
|
|
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
| 0.1 | 0.843 | 0.858 | 0.905 | 0.705 |
| 0.2 | 0.843 | 0.859 | 0.905 | 0.705 |
| 0.3 | 0.843 | 0.860 | 0.905 | 0.705 |
| 0.4 | 0.843 | 0.860 | 0.905 | 0.706 |
| 0.5 | 0.843 | 0.860 | 0.902 | 0.706 |
The precision and recall scores by changing the confidence threshold from 0.1 to 0.5 while the IoU score is fixed at 0.5.
|
|
|
| ||
|---|---|---|---|---|
|
|
|
|
| |
| 0.1 | 0.707 | 0.914 | 0.778 | 0.800 |
| 0.2 | 0.764 | 0.899 | 0.847 | 0.764 |
| 0.3 | 0.796 | 0.887 | 0.874 | 0.737 |
| 0.4 | 0.819 | 0.875 | 0.891 | 0.723 |
| 0.5 | 0.843 | 0.860 | 0.902 | 0.706 |