Literature DB >> 30825372

Automated Taxonomic Identification of Insects with Expert-Level Accuracy Using Effective Feature Transfer from Convolutional Networks.

Miroslav Valan1,2,3, Karoly Makonyi1,4, Atsuto Maki5, Dominik Vondráček6,7, Fredrik Ronquist2.   

Abstract

Rapid and reliable identification of insects is important in many contexts, from the detection of disease vectors and invasive species to the sorting of material from biodiversity inventories. Because of the shortage of adequate expertise, there has long been an interest in developing automated systems for this task. Previous attempts have been based on laborious and complex handcrafted extraction of image features, but in recent years it has been shown that sophisticated convolutional neural networks (CNNs) can learn to extract relevant features automatically, without human intervention. Unfortunately, reaching expert-level accuracy in CNN identifications requires substantial computational power and huge training data sets, which are often not available for taxonomic tasks. This can be addressed using feature transfer: a CNN that has been pretrained on a generic image classification task is exposed to the taxonomic images of interest, and information about its perception of those images is used in training a simpler, dedicated identification system. Here, we develop an effective method of CNN feature transfer, which achieves expert-level accuracy in taxonomic identification of insects with training sets of 100 images or less per category, depending on the nature of data set. Specifically, we extract rich representations of intermediate to high-level image features from the CNN architecture VGG16 pretrained on the ImageNet data set. This information is submitted to a linear support vector machine classifier, which is trained on the target problem. We tested the performance of our approach on two types of challenging taxonomic tasks: 1) identifying insects to higher groups when they are likely to belong to subgroups that have not been seen previously and 2) identifying visually similar species that are difficult to separate even for experts. For the first task, our approach reached $CDATA[$CDATA[$>$$92% accuracy on one data set (884 face images of 11 families of Diptera, all specimens representing unique species), and $CDATA[$CDATA[$>$$96% accuracy on another (2936 dorsal habitus images of 14 families of Coleoptera, over 90% of specimens belonging to unique species). For the second task, our approach outperformed a leading taxonomic expert on one data set (339 images of three species of the Coleoptera genus Oxythyrea; 97% accuracy), and both humans and traditional automated identification systems on another data set (3845 images of nine species of Plecoptera larvae; 98.6 % accuracy). Reanalyzing several biological image identification tasks studied in the recent literature, we show that our approach is broadly applicable and provides significant improvements over previous methods, whether based on dedicated CNNs, CNN feature transfer, or more traditional techniques. Thus, our method, which is easy to apply, can be highly successful in developing automated taxonomic identification systems even when training data sets are small and computational budgets limited. We conclude by briefly discussing some promising CNN-based research directions in morphological systematics opened up by the success of these techniques in providing accurate diagnostic tools.
© The Author(s) 2019. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.

Entities:  

Mesh:

Year:  2019        PMID: 30825372      PMCID: PMC6802574          DOI: 10.1093/sysbio/syz014

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  17 in total

1.  Bounds on error expectation for support vector machines.

Authors:  V Vapnik; O Chapelle
Journal:  Neural Comput       Date:  2000-09       Impact factor: 2.026

2.  Time to automate identification.

Authors:  Norman MacLeod; Mark Benfield; Phil Culverhouse
Journal:  Nature       Date:  2010-09-09       Impact factor: 49.962

3.  Factors of Transferability for a Generic ConvNet Representation.

Authors:  Hossein Azizpour; Ali Sharif Razavian; Josephine Sullivan; Atsuto Maki; Stefan Carlsson
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-11-12       Impact factor: 6.226

4.  An overview of statistical learning theory.

Authors:  V N Vapnik
Journal:  IEEE Trans Neural Netw       Date:  1999

Review 5.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

6.  Selective Convolutional Descriptor Aggregation for Fine-Grained Image Retrieval.

Authors:  Xiu-Shen Wei; Jian-Hao Luo; Jianxin Wu; Zhi-Hua Zhou
Journal:  IEEE Trans Image Process       Date:  2017-03-27       Impact factor: 10.856

7.  DrawWing, a program for numerical description of insect wings.

Authors:  Adam Tofilski
Journal:  J Insect Sci       Date:  2004-05-21       Impact factor: 1.857

8.  Feature Extraction and Machine Learning for the Classification of Brazilian Savannah Pollen Grains.

Authors:  Ariadne Barbosa Gonçalves; Junior Silva Souza; Gercina Gonçalves da Silva; Marney Pascoli Cereda; Arnildo Pott; Marco Hiroshi Naka; Hemerson Pistori
Journal:  PLoS One       Date:  2016-06-08       Impact factor: 3.240

9.  Going deeper in the automated identification of Herbarium specimens.

Authors:  Jose Carranza-Rojas; Herve Goeau; Pierre Bonnet; Erick Mata-Montero; Alexis Joly
Journal:  BMC Evol Biol       Date:  2017-08-11       Impact factor: 3.260

10.  Deep Learning for Plant Identification in Natural Environment.

Authors:  Yu Sun; Yuan Liu; Guan Wang; Haiyan Zhang
Journal:  Comput Intell Neurosci       Date:  2017-05-22
View more
  15 in total

1.  Deep learning and computer vision will transform entomology.

Authors:  Toke T Høye; Johanna Ärje; Kim Bjerge; Oskar L P Hansen; Alexandros Iosifidis; Florian Leese; Hjalte M R Mann; Kristian Meissner; Claus Melvad; Jenni Raitoharju
Journal:  Proc Natl Acad Sci U S A       Date:  2021-01-12       Impact factor: 11.205

2.  A demonstration of unsupervised machine learning in species delimitation.

Authors:  Shahan Derkarabetian; Stephanie Castillo; Peter K Koo; Sergey Ovchinnikov; Marshal Hedin
Journal:  Mol Phylogenet Evol       Date:  2019-07-16       Impact factor: 4.286

3.  Toward global integration of biodiversity big data: a harmonized metabarcode data generation module for terrestrial arthropods.

Authors:  Paula Arribas; Carmelo Andújar; Kristine Bohmann; Jeremy R deWaard; Evan P Economo; Vasco Elbrecht; Stefan Geisen; Marta Goberna; Henrik Krehenwinkel; Vojtech Novotny; Lucie Zinger; Thomas J Creedy; Emmanouil Meramveliotakis; Víctor Noguerales; Isaac Overcast; Hélène Morlon; Anna Papadopoulou; Alfried P Vogler; Brent C Emerson
Journal:  Gigascience       Date:  2022-07-19       Impact factor: 7.658

4.  A ResNet attention model for classifying mosquitoes from wing-beating sounds.

Authors:  Xutong Wei; Md Zakir Hossain; Khandaker Asif Ahmed
Journal:  Sci Rep       Date:  2022-06-20       Impact factor: 4.996

5.  A Computer Vision Approach to Identifying Ticks Related to Lyme Disease.

Authors:  Sina Akbarian; Mark P Nelder; Curtis B Russell; Tania Cawston; Laurent Moreno; Samir N Patel; Vanessa G Allen; Elham Dolatabadi
Journal:  IEEE J Transl Eng Health Med       Date:  2021-12-30

6.  Getting science priorities straight: how to increase the reliability of specimen identification?

Authors:  Filipe Michels Bianchi; Leonardo Tresoldi Gonçalves
Journal:  Biol Lett       Date:  2021-04-28       Impact factor: 3.703

7.  Technological Advances to Address Current Issues in Entomology: 2020 Student Debates.

Authors:  Lina Bernaola; Molly Darlington; Kadie Britt; Patricia Prade; Morgan Roth; Adrian Pekarcik; Michelle Boone; Dylan Ricke; Anh Tran; Joanie King; Kelly Carruthers; Morgan Thompson; John J Ternest; Sarah E Anderson; Scott W Gula; Kayleigh C Hauri; Jacob R Pecenka; Sajjan Grover; Heena Puri; Surabhi Gupta Vakil
Journal:  J Insect Sci       Date:  2021-03-01       Impact factor: 1.857

8.  Flora Capture: a citizen science application for collecting structured plant observations.

Authors:  David Boho; Michael Rzanny; Jana Wäldchen; Fabian Nitsche; Alice Deggelmann; Hans Christian Wittich; Marco Seeland; Patrick Mäder
Journal:  BMC Bioinformatics       Date:  2020-12-14       Impact factor: 3.169

9.  The application of rapid evaporative ionization mass spectrometry in the analysis of Drosophila species-a potential new tool in entomology.

Authors:  Iris Wagner; Natalie I Koch; Joscelyn Sarsby; Nicola White; Tom A R Price; Sam Jones; Jane L Hurst; Robert J Beynon
Journal:  Open Biol       Date:  2020-11-25       Impact factor: 6.411

10.  ClassifyMe: A Field-Scouting Software for the Identification of Wildlife in Camera Trap Images.

Authors:  Greg Falzon; Christopher Lawson; Ka-Wai Cheung; Karl Vernes; Guy A Ballard; Peter J S Fleming; Alistair S Glen; Heath Milne; Atalya Mather-Zardain; Paul D Meek
Journal:  Animals (Basel)       Date:  2019-12-27       Impact factor: 2.752

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.