| Literature DB >> 34940222 |
Mark T Fowler1, Rosemary S Lees1, Josias Fagbohoun2, Nancy S Matowo3,4, Corine Ngufor3, Natacha Protopopoff3, Angus Spiers1.
Abstract
Pyriproxyfen (PPF) may become an alternative insecticide for areas where pyrethroid-resistant vectors are prevalent. The efficacy of PPF can be assessed through the dissection and assessment of vector ovaries. However, this reliance on expertise is subject to limitations. We show here that these limitations can be overcome using a convolutional neural network (CNN) to automate the classification of egg development and thus fertility status. Using TensorFlow, a resnet-50 CNN was pretrained with the ImageNet dataset. This CNN architecture was then retrained using a novel dataset of 524 dissected ovary images from An. gambiae s.l. An. gambiae Akron, and An. funestus s.l., whose fertility status and PPF exposure were known. Data augmentation increased the training set to 6973 images. A test set of 157 images was used to measure accuracy. This CNN model achieved an accuracy score of 94%, and application took a mean time of 38.5 s. Such a CNN can achieve an acceptable level of precision in a quick, robust format and can be distributed in a practical, accessible, and free manner. Furthermore, this approach is useful for measuring the efficacy and durability of PPF treated bednets, and it is applicable to any PPF-treated tool or similarly acting insecticide.Entities:
Keywords: Anopheles mosquito; automated identification; convolutional neural network; fertility; image classification; machine learning; ovary development; pyriproxyfen (PPF); side-effects
Year: 2021 PMID: 34940222 PMCID: PMC8703609 DOI: 10.3390/insects12121134
Source DB: PubMed Journal: Insects ISSN: 2075-4450 Impact factor: 2.769
Image dataset summary.
| Dataset | Source | Strain | Image Count | Fertile | Infertile |
|---|---|---|---|---|---|
| 1 | Cove, Benin | 124 | 79 | 45 | |
| 2 | Insectary colony | 187 | 43 | 144 | |
| 3 | Mwanza, Tanzania | 125 | 67 | 58 | |
| 4 | Mwanza, Tanzania | 88 | 38 | 50 | |
| Total | 524 | 227 | 297 | ||
Figure 1Christopher stages of egg development. Mosquitos whose eggs have fully developed to stage V (normal elongated, boat/sausage-shaped eggs with lateral floats) are classified as ‘fecund’ or ‘fertile’. If eggs have not fully developed and remain in stages I–IV (less elongated, round shape, lacking floats), the mosquito is classified as ‘non-fecund’ or ‘infertile’.
Figure 2Summary of analysis workflow. (A) Data are pre-processed as described in Section 2.2. Images and labels are loaded before the images are resized and undergo a random 70%/30% split into training and test sets. (B) The training set undergoes data augmentation as described in Section 2.3. Each original image produces 18 variations based on a random rotation around 360°, a random horizontal flip, a random vertical flip, and a random brightness shift between 60% and 140%. Each variation retains the same classification as its original. (C) The training set is fitted to a range of CNNs, and classifiers are built and tested as described in Section 2.4.
Performance of transfer learning architectures against the test set.
| Architecture | Accuracy 1 | Recall (Fer) 2 | Recall (Inf) 3 | Precision (Fer) 4 | Precision (Inf) 5 | Speed 6 |
|---|---|---|---|---|---|---|
| Bespoke CNN | 0.777 | 0.951 | 0.592 | 0.918 | 0.713 | 28.1 s |
| CNN + data Augmentation | 0.815 | 0.901 | 0.724 | 0.873 | 0.777 | 28.7 s |
| VGG-16 | 0.885 | 0.901 | 0.868 | 0.892 | 0.880 | 41.7 s |
| ResNet-50 | 0.943 | 0.951 | 0.934 | 0.947 | 0.939 | 38.5 s |
| InceptionV3 | 0.803 | 0.716 | 0.895 | 0.747 | 0.879 | 36.5 s |
1 Accuracy—correct predictions divided by total number of predictions; 2 Recall (Fer)—fraction of fertile observations successfully retrieved; 3 Recall (Inf)—fraction of infertile observations successfully retrieved; 4 Precision (Fer)—true fertile predictions divided by total fertile predictions; 5 Precision (Inf)—true infertile predictions divided by total infertile predictions; 6 Speed—mean time (in seconds) over five repetitions for the model to load and classify 157 images.
Confusion matrix for ResNet-50.
| Predicted Infertile | Predicted Fertile | ||
|---|---|---|---|
|
| 77 | 4 | 81 |
|
| 5 | 71 | 76 |
| 82 | 75 |
1 Confusion matrix—comparison of actual values with those predicted by the model and giving details on true positive (top-left), true negative (bottom-right), false positive (top-right), and false negative (bottom-left) rates.