| Literature DB >> 32884097 |
Jhih-Rong Liao1, Hsiao-Chin Lee1, Ming-Chih Chiu2, Chiun-Cheng Ko3.
Abstract
The accurate identification of biological control agents is necessary for monitoring and preventing contamination in integrated pest management (IPM); however, this is difficult for non-taxonomists to achieve in the field. Many machine learning techniques have been developed for multiple applications (e.g., identification of biological organisms). Some phytoseiids are biological control agents for small pests, such as Neoseiulus barkeri Hughes. To identify a precise biological control agent, a boosting machine learning classification, namely eXtreme Gradient Boosting (XGBoost), was introduced in this study for the semi-automated identification of phytoseiid mites. XGBoost analyses were based on 22 quantitative morphological features among 512 specimens of N. barkeri and related phytoseiid species. These features were extracted manually from photomicrograph of mites and included dorsal and ventrianal shield lengths, setal lengths, and length and width of spermatheca. The results revealed 100% accuracy rating, and seta j4 achieved significant discrimination among specimens. The present study provides a path through which skills and experiences can be transferred between experts and non-experts. This can serve as a foundation for future studies on the automated identification of biological control agents for IPM.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32884097 PMCID: PMC7471324 DOI: 10.1038/s41598-020-71798-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Measured variables of female phytoseiid mites: (a) dorsal shield, (b) ventral view, (c) spermatheca, (d) leg IV. Dorsal aspect: DSL, dorsal shield length; DSWj6, dorsal shield width at seta j6 level. Ventral aspect: VASL, ventrianal shield length; VASWZV2, ventrianal shield width at seta ZV2 level. Spermatheca aspect: Calyx L, spermatheca calyx length; Calyx W, spermatheca calyx width. Leg aspect: St IV, macroseta on barsitarsus IV.
List and abbreviations of morphological features (μm).
| Abbreviation | Morphological features | |
|---|---|---|
| 1 | DSL | Dorsal shield length |
| 2 | DSW | Dorsal shield width at |
| 3 | Length of the seta | |
| 4 | Length of the seta | |
| 5 | Length of the seta | |
| 6 | Length of the seta | |
| 7 | Length of the seta | |
| 8 | Length of the seta | |
| 9 | Length of the seta | |
| 10 | Length of the seta | |
| 11 | Length of the seta | |
| 12 | Length of the seta | |
| 13 | Length of the seta | |
| 14 | Length of the seta | |
| 15 | Length of the seta | |
| 16 | Length of the seta | |
| 17 | VSL | Length of the ventrianal shield |
| 18 | VSW | Width of the ventrianal shield (at |
| 19 | Length of the seta | |
| 20 | Length of the macroseta | |
| 21 | Calyx L | Length of the calyx of spermatheca (without atrium) |
| 22 | Calyx W | Width of the calyx of spermatheca (at widest level) |
Figure 2Classification error of modelling with training data (blue line) and cross-validation test (red line) along with the number of trees (iter). The binary classification error rate was calculated as the number of incorrect case divided by the total number of all cases.
Figure 3Relative importance of features, with automatically divided clusters in the eXtreme Gradient Boosting model.
Figure 4Centred individual conditional expectation plot for predicted probability of being the target species (Neoseiulus barkeri) based on 21 key features (μm). The lines reveal the difference in prediction from that with the respective feature value at its observed minimum.