Literature DB >> 22885252

Non-destructive species identification of Drosophila obscura and D. subobscura (Diptera) using near-infrared spectroscopy.

Stefanie Fischnaller1, Floyd E Dowell, Alexandra Lusser, Birgit C Schlick-Steiner, Florian M Steiner.   

Abstract

The vinegar flies Drosophila subobscura and D. obscura frequently serve as study organisms for evolutionary biology. Their high morphological similarity renders traditional species determination difficult, especially when living specimens for setting up laboratory populations need to be identified. Here we test the usefulness of cuticular chemical profiles collected via the non-invasive method near-infrared spectroscopy for discriminating live individuals of the two species. We find a classification success for wild-caught specimens of 85%. The species specificity of the chemical profiles persists in laboratory offspring (87-92% success). Thus, we conclude that the cuticular chemistry is genetically determined, despite changes in the cuticular fingerprints, which we interpret as due to laboratory adaptation, genetic drift and/or diet changes. However, because of these changes, laboratory-reared specimens should not be used to predict the species-membership of wild-caught individuals, and vice versa. Finally, we demonstrate that by applying an appropriate cut-off value for interpreting the prediction values, the classification success can be immensely improved (to up to 99%), albeit at the cost of excluding a considerable portion of specimens from identification.

Entities:  

Mesh:

Year:  2012        PMID: 22885252      PMCID: PMC3519663          DOI: 10.4161/fly.21535

Source DB:  PubMed          Journal:  Fly (Austin)        ISSN: 1933-6934            Impact factor:   2.160


Introduction

Drosophila obscura and D. subobscura (Diptera: Drosophilidae) are closely related species of the D. obscura group, with a wide distribution in the Palaearctic. Both are generalists and co-occur broadly in the colline and alpine zone. They are frequently used species in evolutionary-biological studies (for review see refs. 3–7). Accurate species identification of living specimens of both sexes is difficult, as the two species are morphologically highly similar, with considerable intraspecific variation in the diagnostic characters. The problem is aggravated by the need to keep to a minimum the anesthesia by CO2, to avoid reduced longevity and fecundity., For introducing wild-caught individuals to the laboratory with the aim to retain genetic variation, a rapid and non-destructive method for species identification with the potential for high throughput would thus be desirable as an alternative to morphology-based methods. Insect cuticular layers contain complex mixtures of hydrocarbons (CHCs), many of which are synthesized by the insect itself, i.e., supposedly genetically determined. In addition to their central role in the prevention of desiccation, CHCs are important for chemical communication, for example in mate choice, and social behavior. The idea that the bouquet of CHCs will thus be species specific led researchers to enquire into their usefulness in species identification, and various examples of success exist. We decided to test the usefulness of near-infrared spectroscopy (NIRS) to discriminate between D. subobscura and D. obscura: Previous studies suggested differences in desiccation resistance between the two species, and, possibly due to a lack of interspecific mate recognition, no hybridization between them has been reported as yet, both of which may involve CHC species specificity. NIRS characterizes chemical patterns qualitatively and quantitatively based primarily on C-H, N-H, O-H stretching vibrations. It is thus a useful tool for the characterization of biological material, and is, also owing to its non-destructive and non-invasive nature, becoming common practice in ecology and entomology., Of relevance here, it was successfully used in the identification of (non-drosophilid) dipterans.- The objectives of this study were to determine if NIRS (1) can be used to discriminate living D. obscura and D. subobscura specimens by using multivariate chemometrics, and (2) whether calibration models elaborated for wild-caught specimens and for specimens from different laboratory-reared generations can be cross-applied. Cross-applicability would reduce significantly the effort needed for establishing the identification of specimens with such differing backgrounds. However, it needs to be kept in mind that genetic and phenotypic changes can arise from evolution in a novel environment. The CHC bouquet, in particular, can evolve due to changes in the ambient thermal regime and in diet composition but can also change due to acquisition of hydrocarbons from food.

Results

Statistical parameters of the partial least squares (PLS) calibration models (number of PLS factors used, coefficient of determination (r2) and standard error of cross validation (SECV)) and the classification results for the validation sets are listed in Table 1. The calibration models had r2 values between 0.43 and 0.63, and SECV values between 0.33 and 0.40. The correct classification for the validation sets ranged between 85% and 92%; the best prediction results were achieved for the eighth lab-reared generation (F8), with 90% for F8 males (F8 min) and 92% for F8 females (F8f).

Table 1. Classification results of Drosophila subobscura and D. obscura based on PLS regression models developed from near-infrared spectra (500–2200 nm).

 
Males
Females
 
Wild
F1
F8
F1
F8
PLS factors
7
11
9
12
13
r2
0.57
0.63
0.55
0.49
0.43
SECV
0.33
0.34
0.34
0.38
0.40
n in the validation sets: D. subobscura / D. obscura
252 / 15
19 / 24
90 / 64
27 / 41
60 / 67
Cross-validation results: % correctly classified
90%
94%
84%
87%
83%
n correctly classified (%) / n total
226 (85%) / 267
38 (88%) / 43
138 (90%) / 154
59 (87%) / 68
117 (92%) / 127
After exclusion of class values 1.4–1.6: n (%) correctly classified / n total
213 (90%) / 237
36 (90%) / 40
127 (91%) / 139
56 (90%) / 62
111 (96%) / 116
After exclusion of class values 1.3–1.7: n (%) correctly classified / n total
182 (95%) / 191
26 (90%) / 29
106 (94%) / 113
47 (92%) / 51
96 (97%) / 99
After exclusion of class values 1.2–1.8: n (%) correctly classified / n total145 (97%) / 14818 (90%) / 2086 (92%) / 9344 (94%) /4780 (99%) / 82

n = number of individuals

n = number of individuals We then explored how the exclusion of prediction values around 1.5 influenced classification rate and loss of specimens, by symmetrically excluding values below and above 1.5, decreasing and increasing in steps of 0.02 to the extremes of 1.0 and 2.0, respectively. Exclusion of values between 1.4 and 1.6 resulted in, for example, the F8 females in an increase of the correct classification from 92% to 96% and in the exclusion of 10% of specimens (Fig. 1). For the other models and validation sets, the corresponding values were similar at increases of correct classification to 90–91% and 9–11% of individuals excluded (Table 1); for our data set sizes, we found this to represent an acceptable compromise across models between accuracy and number of specimens excluded. When values 1.30–1.70 and 1.20–1.80 were excluded, the success rate for F8f increased to 97% and 99%, but the portion of specimens identified dropped to 78% and 65%, respectively (Fig. 1).

Figure 1. The portion of correctly classified specimens depends on the exclusion of spectra with ambiguous prediction values, exemplified by the F8 females. The thin line shows the increasing loss of individuals with increasing range of excluded prediction values, the bold line shows the corresponding increase in correct classification. Exclusion ranges in boxes are discussed in the text.

Figure 1. The portion of correctly classified specimens depends on the exclusion of spectra with ambiguous prediction values, exemplified by the F8 females. The thin line shows the increasing loss of individuals with increasing range of excluded prediction values, the bold line shows the corresponding increase in correct classification. Exclusion ranges in boxes are discussed in the text. Wavelengths important for the identification of D. subobscura and D. obscura were identified from the PLS regression coefficients, with wavelengths showing very high or very low coefficients being more important. There were peaks occurring in all of the five calibration models and peaks that were important only in single models. Figure 2 shows the regression-coefficient plot for F8f.

Figure 2. PLS regression coefficient used for identifying important wavelengths for classification of Drosophila subobscura and D. obscura females from the F8.

Figure 2. PLS regression coefficient used for identifying important wavelengths for classification of Drosophila subobscura and D. obscura females from the F8. When calibration models created for one group were used to predict validation sets from the other groups, the classification success ranged from 56% to 83% (Table 2; prediction values between 1.4 and 1.6 excluded).

Table 2. Correct classification rate (%) for validation sets performed on the different calibration models to test their cross-applicability (classification values 1.4–1.6 excluded).

 
Calibration model
Validation set
Wild
F1
F8
Wm
90
83
75
F1m
65
90
77
F8m
67
77
91
F1f
n.a.
90
56
F8fn.a.5796

n.a. = not applicable

n.a. = not applicable

Discussion

Here we show that NIRS can be used to distinguish between Drosophila subobscura and D. obscura with an accuracy of 85% to 92% using PLS analysis, when using the full range of prediction values. This indicates that the composition of CHCs may differ between the two species. We cannot directly relate NIR-spectral differences to CHCs, and also the visible spectral range was relevant to successful PLS models (see further down), but we assume that CHC composition contributed significantly to species differences (compare refs. 14–19). The prediction results for the wild-caught flies were comparable to those obtained for laboratory-reared specimens, in line with the notion that hydrocarbon profiles are more genetically than environmentally determined – although the two species were reared under the same conditions, differences in the cuticular profiles persisted and were detectable by NIRS. These findings contrast the NIRS study by Mayagaya et al. who predicted two Anopheles species reared in the laboratory with an accuracy of almost 100%, and field-collected specimens with 80% accuracy. Including both wild-caught flies and laboratory-reared flies (from all generations) in the same model did not improve our prediction results, the best models resulting in 82% and 79% prediction success for females and males, respectively (S. Fischnaller, unpubl.). However, from the practical point of view of setting up breeding lines based on identification via NIRS, our error rates are not fatal, given that Drosophila obscura and D. subobscura do not hybridize. Hence, no interspecific gene flow is expected for unintentionally heterospecific cultures, and the identification procedure can be repeated in consecutive generations. The lower rate of correct classification in our study as compared with the work by Rodriguez-Fernandez et al., who used nine Diptera species, could be caused by a closer phylogenetic relatedness of our species as well as by our including multiple populations in the sample – genetic diversity was found to be very high across other wild populations of D. subobscura. Furthermore, we included individuals of all ages, and thus likely both unmated and mated individuals, in our calibration and validation sets. NIRS is sensitive to the age of individuals, and thus used for age-grading of various insects,,, and Everaerts et al. showed that in Drosophilidae, in both females and males, CHC changes occur during mating. The variation introduced by either or both of these effects may possibly have impeded greater success of our calibration procedures. One way to improve classification is the exclusion of specimens with prediction values around 1.5 (Table 1, Fig. 1). This procedure was suggested by Sikulu et al. in general, but to our knowledge the trade-off between increase of classification success and loss of specimens has not yet been explored in a quantitative manner. We suggest that such exploration be adopted as a standard procedure in NIRS species-identification studies. Depending on the demands for the specific project, researchers could thus prioritise either classification success or number of specimens identified in a controlled manner. Another way of improving accuracy with our species could be to scan just wings. Using the pulled-out right wings of thawed males in NIRS analysis enabled us to distinguish D. subobscura from D. obscura with 100% accuracy (n = 50 males per species; data not shown). This is in line with the findings from Shevtsova et al. who found high interspecific variation in the wing interference patterns of Drosophilidae. Scanning just wings of live specimens is very difficult to put into praxis, however, due to the need for standardised positioning of wings on the one hand and minimum CO2 exposure of specimens at the other (S. Fischnaller, unpubl.). Exploring this possibility in depth remains subject to future exploration. Examination and comparison of the regression coefficient plots indicated that there are peaks important to species discrimination that are common to all five calibration models. The region around 510, 540 and 610 nm indicates that there are differences between the two species in the visible region, possibly caused by variation in cuticle thickness, bristles and/or pigmentation. The region of 1050–1070 nm indicates vibration of water molecules at the third overtone, as well as occurrence of molecules containing N-H functional groups (ref. 36, also used for the interpretation of the subsequently listed wavelengths). Peaks at 1370–1390 nm (CH2 second overtone, and water), 1720–1730 nm (CH3 first overtone), 1810–1840 nm and 1870 nm (C-H first overtone, water), and 2140–2180 nm (N-H and O-H combination bands) also contributed to all calibration models. Our study suggests that wild-caught specimens of our species should not be used to identify laboratory-reared specimens, and vice versa, due to excessive failure rates (Table 2). This contrasts the findings of Mayagaya et al. of 79% correctly classified wild-caught Anopheles when using models based on laboratory-reared individuals. Our low success rate is supported by absorption peaks in the regression coefficients exclusive to just one of the calibration models (e.g., 1025 nm, 1460 nm in Wm; 1500 nm, 2050 nm in F1 min; 1770 nm in F1f; 2000 nm in F8f). In other words, chemical differences led to the observed generation specificity of the models. Toolson and Kuper-Simbron reported for Drosophila pseudoobscura that maintenance in the laboratory leads to physiological and biochemical changes. They reported a shift in the cuticular composition even for the first generation of large populations reared in the laboratory, and explained it by changes of selective pressure and fitness advantages under novel environmental factors. Especially in small populations genetic drift can additionally increase the genetic differentiation across populations (D. subobscura: see refs. 28, 37). Also, hydrocarbon profiles can change in a non-inherited manner due to acquisition of food-derived hydrocarbons (ant example: ref. 30). Thus, changes in the metabolic profiles – either due to genetic or environmental changes – may have altered the recorded NIRS data across generations, impeding the use of calibration models generated for one generation in the others. Future research should aim to pinpoint potential non-inherited contributions as well as assess if this problem ceases in later generations which would indicate that it is due to rapid laboratory adaptation, or whether larger population sizes diminish it which would indicate that it is due to genetic drift (but note that our population sizes were in line with general practice, e.g., Fry). In conclusion, there are three main findings to our study: First, near-infrared spectroscopy proved a useful tool for the identification of living Drosophila flies. Second, we could not cross-apply models and validation sets among field-caught and lab-reared individuals and across generations, indicating changes due to laboratory adaptation, genetic drift and/or diet changes. Third, classification rates could be considerably improved by excluding prediction values around 1.5, suggesting that researchers should consider excluding a particular range of prediction values depending on their research question. Our study thus underscores the enormous potential of the NIRS technique to species identification (e.g., refs. 24, 25, 26, 40, 41), and indicates that it could become an important tool also for the delimitation of species in integrative taxonomy, as well as in other biological fields.

Materials and Methods

Insects

Specimens were collected from six different locations in North Tyrol (Austria) during August and September 2010. To represent a wide range of habitats, the collection sites were chosen from various altitudes between 570 and 2000 min above sea level (Table 3). The minimum and maximum distances between populations were 2 and 60 km, respectively. Collecting was done by net sweeping over baits of fermented banana in the evening hours from 5 to 7 p.m. The field-caught flies were transported alive to the laboratory and anaesthetised with CO2 for morphology-based species identification. CO2 exposure length for species identification, as well as for spectra collection (see below), was kept to a minimum and never exceeded four minutes per specimen. Flies that were identified as D. subobscura or D. obscura according to Bächli and Burla were used to set up breeding lines for each location sampled. All lines were kept at a minimum census size of 60 individuals on an artificial diet (corn-meal, sugar, agar, yeast, Tegosept) and at a photoperiod of 12/12 h (light/dark) at 19°C.

Table 3. Sampling data for field-collected Drosophila obscura and D. subobscura.

 
 
 
Number of specimens collected
 
 
 
Drosophila obscura
Drosophila subobscura
Location
Geographic coordinates
Altitude (m a.s.l.)
females
males
females
males
Kaserstattalm
47°07′34.86”N 11°17′30.83”E
2,029
4
27
3
11
Hahntennjoch
47°17′24.07”N 10°39′19.97”E
1,973
3
0
6
10
Buzihütte
47°16′20.99”N 11°21′23.27”E
711
0
0
32
45
Mentlberg
47°14′55.34”N 11°21′56.31”E
616
0
11
59
133
Arzl
47°17′11.22”N 11°25′09.80”E
707
0
3
22
26
Innsbruck city47°15′53.43”N 11°20′34.59”E579292462

a.s.l. = above sea level

a.s.l. = above sea level

Data collection

Spectra were collected from anaesthetised flies using a Labspec® 5000 Portable Vis/NIR Spectrometer (350–2,500 nm; ASD Inc.) by placing flies individually on their backs on a 9 cm diameter Spectralon plate. The 3 mm diameter bifurcated fiber-optic probe was positioned about 2 mm above the specimen, focusing on the abdomen. The spectrometer automatically calculated and saved the average spectrum of 30 collected spectra of each individual. Background reference (the baseline) was measured using a separate 3 cm diameter Spectralon plate to avoid contamination. All field-caught individuals as well as 251 randomly chosen individuals of the F1 and 421 of the F8 of the breeding lines were sexed and scanned. We thus included a wide range of individual ages in our sample.

Data analysis

All recorded spectra were converted into Galactic spectrum file format using ASD ViewSpecPro. Spectra used for the calibration sets were pre-processed by mean-centring and analyzed using PLS regression and leave-one-out cross validation, implemented in GRAMS software PLS/IQ. Spectra were generally very noisy below 500 nm and above 2200 nm and these regions were excluded from further analysis. Calibration models were elaborated separately for males (m) and females (f), because females can be easily distinguished from males and because Drosophila sexes differ in their CHC-profiles. We performed models for the following five groups: (1) the wild, field-collected males, referred to as “Wm” (due to the low number of field-caught D. obscura females, no model could be created for this group), (2) the first lab-reared generation, referred to as “F1m” and (3) “F1f,” and (4) the eighth lab-reared generation, referred to as “F8m” and (5) “F8f.” The training sets for each calibration model contained 70 spectra (35 of each species). A two-way comparison in PLS analysis was made by assigning integer values 1 and 2 to D. subobscura and D. obscura, respectively. Independent validation sets, treated as “unknown” specimens, were then classified on the basis of the calibration model in each group. Spectra predicted to have a class value of ≤ 1.5 were considered to belong to D. subobscura, those with a predicted value of ≥ 1.5 to D. obscura. The numbers of PLS regression factors to be used in the prediction models were determined by examining the values of the predicted residual sum of squares and the classification results of the independent validation sets. Accuracy of the calibration models was examined by checking the r2 indicating the closeness of fit between NIRS and reference data, the SECV of the leave-one-out procedure, and by calculating the prediction results using the validation sets—the most rigorous indicator of model quality. Spectral residuals, which were possibly due to technical problems such as movement of insufficiently anaesthetised specimens, were discarded from the sample. Such outliers were detected by visual examination of the spectra using spekwin32 (F. Menges “Spekwin32—free software for optical spectroscopy”- Vers.1.71.5, 2010, http://www.effemm2.de/spekwin/) and by examination of the leverage and studentised residuals plots generated in GRAMS (compare ref. 48).
  25 in total

1.  Chronological age-grading of house flies by using near-infrared spectroscopy.

Authors:  Joel Perez-Mendoza; Floyd E Dowell; Alberto B Broce; James E Throne; Robert A Wirtz; Feng Xie; Jeffrey A Fabrick; James E Baker
Journal:  J Med Entomol       Date:  2002-05       Impact factor: 2.278

Review 2.  Courtship, aggression and avoidance: pheromones, receptors and neurons for social behaviors in Drosophila.

Authors:  Anupama Dahanukar; Anandasankar Ray
Journal:  Fly (Austin)       Date:  2011-01-01       Impact factor: 2.160

3.  Stable structural color patterns displayed on transparent insect wings.

Authors:  Ekaterina Shevtsova; Christer Hansson; Daniel H Janzen; Jostein Kjærandsen
Journal:  Proc Natl Acad Sci U S A       Date:  2011-01-03       Impact factor: 11.205

4.  Age-grading the biting midge Culicoides sonorensis using near-infrared spectroscopy.

Authors:  W K Reeves; K H S Peiris; E-J Scholte; R A Wirtz; F E Dowell
Journal:  Med Vet Entomol       Date:  2010-03       Impact factor: 2.739

5.  Near-infrared imaging spectroscopy as a tool to discriminate two cryptic Tetramorium ant species.

Authors:  Jasmin Klarica; Lukas Bittner; Johannes Pallua; Christine Pezzei; Verena Huck-Pezzei; Floyd Dowell; Johannes Schied; Günther K Bonn; Christian Huck; Birgit C Schlick-Steiner; Florian M Steiner
Journal:  J Chem Ecol       Date:  2011-05-03       Impact factor: 2.626

6.  Differentiation between species of the Anopheles gambiae Giles complex (Diptera: Culicidae) by analysis of cuticular hydrocarbons.

Authors:  D A Carlson; M W Service
Journal:  Ann Trop Med Parasitol       Date:  1979-12

7.  Swift laboratory thermal evolution of wing shape (but not size) in Drosophila subobscura and its relationship with chromosomal inversion polymorphism.

Authors:  M Santos; P F Iriarte; W Céspedes; J Balanyà; A Fontdevila; L Serra
Journal:  J Evol Biol       Date:  2004-07       Impact factor: 2.411

8.  Non-destructive determination of age and species of Anopheles gambiae s.l. using near-infrared spectroscopy.

Authors:  Valeliana S Mayagaya; Kristin Michel; Mark Q Benedict; Gerry F Killeen; Robert A Wirtz; Heather M Ferguson; Floyd E Dowell
Journal:  Am J Trop Med Hyg       Date:  2009-10       Impact factor: 2.345

9.  Evolutionary dynamics of molecular markers during local adaptation: a case study in Drosophila subobscura.

Authors:  Pedro Simões; Marta Pascual; Josiane Santos; Michael R Rose; Margarida Matos
Journal:  BMC Evol Biol       Date:  2008-02-26       Impact factor: 3.260

10.  Molecular phylogeny of the Drosophila obscura species group, with emphasis on the Old World species.

Authors:  Jian-jun Gao; Hide-aki Watabe; Tadashi Aotsuka; Jun-feng Pang; Ya-ping Zhang
Journal:  BMC Evol Biol       Date:  2007-06-07       Impact factor: 3.260

View more
  4 in total

1.  The Application of DNA Barcodes for the Identification of Marine Crustaceans from the North Sea and Adjacent Regions.

Authors:  Michael J Raupach; Andrea Barco; Dirk Steinke; Jan Beermann; Silke Laakmann; Inga Mohrbeck; Hermann Neumann; Terue C Kihara; Karin Pointner; Adriana Radulovici; Alexandra Segelken-Voigt; Christina Wesse; Thomas Knebelsberger
Journal:  PLoS One       Date:  2015-09-29       Impact factor: 3.240

2.  Highly Efficient Use of Infrared Spectroscopy (ATR-FTIR) to Identify Aphid Species.

Authors:  Roma Durak; Beata Ciak; Tomasz Durak
Journal:  Biology (Basel)       Date:  2022-08-18

3.  A near-infrared spectroscopy routine for unambiguous identification of cryptic ant species.

Authors:  Birgit C Schlick-Steiner; Florian M Steiner; Martin-Carl Kinzner; Herbert C Wagner; Andrea Peskoller; Karl Moder; Floyd E Dowell; Wolfgang Arthofer
Journal:  PeerJ       Date:  2015-09-15       Impact factor: 2.984

4.  Phenomic Selection Is a Low-Cost and High-Throughput Method Based on Indirect Predictions: Proof of Concept on Wheat and Poplar.

Authors:  Renaud Rincent; Jean-Paul Charpentier; Patricia Faivre-Rampant; Etienne Paux; Jacques Le Gouis; Catherine Bastien; Vincent Segura
Journal:  G3 (Bethesda)       Date:  2018-12-10       Impact factor: 3.154

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.