| Literature DB >> 35840589 |
Song-Quan Ong1, Hamdan Ahmad2.
Abstract
This paper introduces a new mosquito images dataset that is suitable for training and evaluating a recognition system on mosquitoes in normal or smashed conditions. The images dataset served mainly for the development a machine learning model that can recognize the mosquito in the public community, which commonly found in the smashed/damaged form by human. Especially the images of mosquito in hashed condition, which to the best of our knowledge, a dataset that fulfilled such condition is not available. There are three mosquito species in the dataset, which are Aedes aegypti, Aedes albopictus and Culex quinquefasciatus, and the images were annotated until species level due to the specimen was purely bred in a WHO accredited breeding laboratory. The dataset consists of seven root files, six root files that composed of six classes (each species with either normal landing, or random damaged conditions) with a total of 1500 images, and one pre-processed file which consists of a train, test and prediction set, respectively for model construction.Entities:
Mesh:
Year: 2022 PMID: 35840589 PMCID: PMC9287291 DOI: 10.1038/s41597-022-01541-w
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Outline of mosquito preparation and image collection.
Fig. 2(a) Mosquitos’ colonies and culture from VCRU USM. Mosquito was released one by one for image acquisition, (b) The process Image acquisition is carried out within a 30 × 30 × 30 cm netted cage with 36 W LED Ring Light white colored illumination (5500 K).
Fig. 3All the images were resized into 224 × 224 pixels from the original dimension.
Description, Labels, and Example of images for the dataset: Six root files that represent six classes of mosquitoes, and one pre-processed file.
| Six root files of raw image data | ||||
|---|---|---|---|---|
| Sample of images | Labels | Species | Conditions on human skin | Number of images |
| Normal landed | 250 | |||
| Smashed or damaged | 250 | |||
| Normal landed | 250 | |||
| Smashed or damaged | 250 | |||
| Normal landed | 250 | |||
| Smashed or damaged | 250 | |||
| data_splitting | Train | 4200 | ||
| Test | 1800 | |||
| Prediction | 3600 | |||
*Pre-processed the image data with augmentation and data splitting.
Fig. 4Confusion matrix, accuracy, and error loss of the pilot test of a deep learning model by using the dataset at three learning rates.
| Measurement(s) | recognition system with machine/deep learning |
| Technology Type(s) | Camera Device |
| Sample Characteristic - Organism | Aedes aegypti • Aedes albopictus |