| Literature DB >> 24086666 |
Ranjana Jaiswara1, Diptarup Nandi, Rohini Balakrishnan.
Abstract
Traditional taxonomy based on morphology has often failed in accurate species identification owing to the occurrence of cryptic species, which are reproductively isolated but morphologically identical. Molecular data have thus been used to complement morphology in species identification. The sexual advertisement calls in several groups of acoustically communicating animals are species-specific and can thus complement molecular data as non-invasive tools for identification. Several statistical tools and automated identifier algorithms have been used to investigate the efficiency of acoustic signals in species identification. Despite a plethora of such methods, there is a general lack of knowledge regarding the appropriate usage of these methods in specific taxa. In this study, we investigated the performance of two commonly used statistical methods, discriminant function analysis (DFA) and cluster analysis, in identification and classification based on acoustic signals of field cricket species belonging to the subfamily Gryllinae. Using a comparative approach we evaluated the optimal number of species and calling song characteristics for both the methods that lead to most accurate classification and identification. The accuracy of classification using DFA was high and was not affected by the number of taxa used. However, a constraint in using discriminant function analysis is the need for a priori classification of songs. Accuracy of classification using cluster analysis, which does not require a priori knowledge, was maximum for 6-7 taxa and decreased significantly when more than ten taxa were analysed together. We also investigated the efficacy of two novel derived acoustic features in improving the accuracy of identification. Our results show that DFA is a reliable statistical tool for species identification using acoustic signals. Our results also show that cluster analysis of acoustic signals in crickets works effectively for species classification and identification.Entities:
Mesh:
Year: 2013 PMID: 24086666 PMCID: PMC3783383 DOI: 10.1371/journal.pone.0075930
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The sampling sites of field crickets in Southern India.
Mean (standard error) values of song features of the 14 species of field crickets used for the study.
| Genus | Collection site | Call period (ms) | Call duration (ms) | Syllable period (ms) | Syllable duration (ms) | Dominant frequency (kHz) | Relative variance (ms) | Constancy factor (ms) | No. of individuals |
|
| Coimbatore | 93.2 (1.9) | 37.3 (1.4) | 23.8 (1.3) | 14.1 (0.9) | 5.3 (0.1) | 943.2 (22.4) | 843.9 (29.9) | 5 |
|
| Kuppam | 130.1 (6.4) | 46.8 (1.6) | 16.3 (0.7) | 20.6 (0.6) | 5.9 (0.1) | 59.7 (8) | 5.6 (0.9) | 6 |
|
| Bangalore | 345.5 (11.9) | 166.9 (4.3) | 36.8 (1.3) | 20.6 (0.6) | 5 (0.1) | 57 (9.4) | 16.9 (2.6) | 6 |
|
| Bangalore | 11342.9 (915.5) | 8343.1 (983.2) | 99.7 (2.8) | 40 (1.1) | 6.6 (0.2) | 5.5 (1.4) | 7.5 (1) | 6 |
|
| Kadari | 1155.6 (85) | 420.6 (76.8) | 42.5 (2.3) | 22.3 (1.2) | 7.5 (0.1) | 67.7 (10.4) | 67.4 (8.8) | 5 |
|
| Valparai | 1504.4 (122.2) | 296.5 (16.8) | 54.9 (1.3) | 26.4 (0.8) | 6.5 (0.1) | 125.1 (18.7) | 122.5 (17.2) | 7 |
|
| Kadari | 467 (14.6) | 312.1 (22.2) | 35.8 (1.8) | 12.8 (0.9) | 6.9 (0.1) | 124.7 (25.6) | 35.5 (5.1) | 6 |
|
| Bangalore | 68.9 (1) | 36.2 (0.6) | 34.5 (0.5) | 13.8 (0.2) | 6.6 (0.1) | 30.9 (2.8) | 2.3 (0.2) | 6 |
|
| Mudumalai | 667.5 (43.2) | 317.5 (15.6) | 26.6 (0.5) | 16.3 (0.4) | 4.4 (0.1) | 118.8 (14.2) | 64.5 (6.7) | 8 |
|
| Ullodu | 217.8 (7.7) | 122.7 (6.4) | 31 (0.6) | 17.7 (0.4) | 6 (0.1) | 224.8 (36.7) | 57.4 (13.3) | 7 |
|
| Kadari | 305.9 (13.7) | 115.7 (12.2) | 54.5 (1.2) | 41.2 (0.9) | 3.1 (0.1) | 211.5 (17) | 64.2 (5.8) | 6 |
|
| Mudumalai | 863.2 (29.7) | 600.6 (34.2) | 8.9 (0.2) | 5.6 (0.3) | 6.2 (0.1) | 167.2 (16.5) | 136.2 (18.7) | 8 |
|
| Kuppam | 9271 (1445.2) | 1090.4 (136.8) | 42.3 (1.8) | 24 (0.8) | 4.4 (0.1) | 541.5 (268.4) | 1692.7 (330.9) | 5 |
|
| Kadari | 318.6 (26.4) | 144.4 (17.6) | 38.6 (1.4) | 24.2 (0.8) | 4.2 (0.0) | 251.2 (42.9) | 1474.4 (101.3) | 4 |
Figure 2Oscillogram of Gryllus bimaculatus illustrating measured temporal features. Scale bar represents 0.5 second.
Percentage of correctly allocated individuals by discriminant function analysis (DFA).
| Number of taxa and characters | 1st randomization | 2nd randomization | 3rd randomization | 4th randomization | 5th randomization | 6th randomization | 7th randomization | 8th randomization | 9th randomization | 10th randomization | Average of correct classification |
| 5 T:7 C | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 5 T:5 C | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 6 T:7 C | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 6 T:5 C | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 7 T:7 C | 100 | 100 | 100 | 97.8 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 7 T:5 C | 100 | 100 | 100 | 91.1 | 100 | 100 | 100 | 100 | 100 | 100 | 99 |
| 8 T:7 C | 100 | 97.8 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 8 T:5 C | 100 | 91.8 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 99 |
| 9 T:7 C | 94.5 | 100 | 100 | 100 | 98.2 | 94.7 | 100 | 100 | 100 | 100 | 99 |
| 9 T:5 C | 92.7 | 100 | 100 | 100 | 92.7 | 92.9 | 100 | 100 | 100 | 100 | 98 |
| 10 T:7 C | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| 10 T:5 C | 100 | 100 | 100 | 93.1 | 100 | 100 | 100 | 93.3 | 100 | 100 | 99 |
| 11 T:7 C | 100 | 98.4 | 100 | 98.4 | 98.3 | 100 | 96.8 | 98.4 | 98.5 | 98.4 | 99 |
| 11 T:5 C | 100 | 93.7 | 100 | 93.7 | 93.5 | 100 | 93.7 | 93.7 | 89.5 | 93.7 | 95 |
| 12 T:7 C | 100 | 95.9 | 95.8 | 95.7 | 100 | 98.6 | 100 | 91.1 | 100 | 95.4 | 97 |
| 12 T:5 C | 100 | 94.6 | 94.4 | 94.4 | 100 | 94.6 | 100 | 91.2 | 100 | 93.8 | 96 |
| 13 T:7 C | 96.2 | 96.2 | 94.9 | 96.2 | 96.1 | 98.7 | 96.1 | 96.1 | 100 | 100 | 97 |
| 13 T:5 C | 94.9 | 94.9 | 92.4 | 95 | 94.9 | 95 | 94.8 | 94.8 | 100 | 100 | 96 |
xT:yC in the row headers refer to ‘x’ taxa with ‘y’ characters used for DFA.
Percentage of correctly allocated individuals by cluster analysis.
| Number of taxa and characters | 1st randomization | 2nd randomization | 3rd randomization | 4th randomization | 5th randomization | 6th randomization | 7th randomization | 8th randomization | 9th randomization | 10th randomization | Average of correct classification |
| 5T:5C | 100 | 60 | 60 | 60 | 60 | 100 | 60 | 60 | 100 | 100 | 76 |
| 5T:7C | 100 | 100 | 60 | 80 | 60 | 100 | 80 | 100 | 80 | 100 | 86 |
| 6T:5C | 33 | 100 | 100 | 100 | 67 | 100 | 100 | 100 | 83 | 100 | 88 |
| 6T:7C | 50 | 100 | 100 | 100 | 67 | 100 | 100 | 100 | 83 | 100 | 90 |
| 7T:5C | 100 | 86 | 100 | 71 | 100 | 100 | 100 | 57 | 57 | 86 | 86 |
| 7T:7C | 100 | 86 | 100 | 86 | 100 | 100 | 100 | 71 | 71 | 86 | 90 |
| 8T:5C | 75 | 75 | 75 | 75 | 88 | 100 | 88 | 100 | 75 | 75 | 83 |
| 8T:7C | 75 | 75 | 88 | 88 | 88 | 100 | 63 | 88 | 50 | 100 | 81 |
| 9T:5C | 78 | 56 | 78 | 78 | 56 | 78 | 100 | 67 | 89 | 78 | 76 |
| 9T:7C | 78 | 67 | 100 | 78 | 67 | 78 | 100 | 89 | 89 | 89 | 83 |
| 10T:5C | 90 | 90 | 60 | 60 | 70 | 60 | 90 | 60 | 50 | 80 | 71 |
| 10T:7C | 90 | 90 | 100 | 60 | 100 | 100 | 90 | 60 | 60 | 100 | 85 |
| 11T:5C | 64 | 55 | 64 | 55 | 64 | 100 | 55 | 55 | 64 | 64 | 64 |
| 11T:7C | 45 | 55 | 64 | 36 | 64 | 82 | 55 | 55 | 45 | 55 | 55 |
| 12T:5C | 67 | 75 | 67 | 50 | 75 | 67 | 92 | 67 | 83 | 75 | 72 |
| 12:7C | 67 | 58 | 58 | 58 | 67 | 83 | 92 | 83 | 75 | 58 | 70 |
| 13T:5C | 62 | 62 | 69 | 62 | 62 | 62 | 62 | 62 | 69 | 69 | 64 |
| 13T:7C | 62 | 62 | 62 | 46 | 62 | 85 | 62 | 62 | 69 | 69 | 64 |
xT:yC in the row headers refer to ‘x’ taxa with ‘y’ characters used for cluster analysis.
Figure 3Mean proportion of correct classification as derived from bootstrapping with 100 iterations.
Error bars represent 95% confidence interval. (A) Clusters with 5 acoustic characters. (B) Clusters with 7 acoustic characters.
Results of the Generalised Linear Model analysis with all the pairwise comparisons between taxa5 and the other eight taxa groups and their interactions with character sets.
| Categories | z Value | P Value |
| taxa5 (intercept) | 4.454 | 0.000008 |
| taxa6 | 0.940 | 0.347313 |
| taxa7 | 0.670 | 0.502782 |
| taxa8 | −0.700 | 0.483626 |
| taxa9 | −0.415 | 0.678184 |
| taxa10 | −0.163 | 0.870386 |
| taxa11 | −3.544 |
|
| taxa12 | −2.134 |
|
| taxa13 | −2.791 |
|
| character2 | −1.262 | 0.207020 |
| taxa6:character2 | 0.357 | 0.721072 |
| taxa7:character2 | 0.346 | 0.729065 |
| taxa8:character2 | 1.120 | 0.262556 |
| taxa9:character2 | 0.282 | 0.778217 |
| taxa10:character2 | −0.278 | 0.780797 |
| taxa11:character2 | 1.691 | 0.090797 |
| taxa12:character2 | 1.245 | 0.213159 |
| taxa13:character2 | 1.132 | 0.257484 |
| Residual deviance: 215.35 on 162 degrees of freedom | ||
| AIC: 585.65 | ||
Generalised Linear Model with all the pairwise comparisons between taxa8 and the other eight taxa groups and their interactions with character sets.
| Categories | z Value | P value |
| taxa8 (Intercept) | 5.119 |
|
| taxa5 | 0.700 | 0.483626 |
| taxa6 | 1.700 | 0.089108 |
| taxa7 | 1.489 | 0.136357 |
| taxa9 | 0.355 | 0.722230 |
| taxa10 | 0.670 | 0.503080 |
| taxa11 | −3.618 |
|
| taxa12 | −1.774 | 0.076023 |
| taxa13 | −2.643 |
|
| character2 | 0.205 | 0.837424 |
| taxa5:character2 | −1.120 | 0.262556 |
| taxa6:character2 | −0.618 | 0.536520 |
| taxa7:character2 | −0.735 | 0.462381 |
| taxa9:character2 | −1.017 | 0.309071 |
| taxa10:character2 | −1.698 | 0.089423 |
| taxa11:character2 | 0.518 | 0.604385 |
| taxa12:character2 | −0.007 | 0.994293 |
| taxa13:character2 | −0.174 | 0.862092 |
| Residual deviance: 215.35 on 162 degrees of freedom | ||
| AIC: 585.65 | ||
Binomial Test results for proportions of correct classification of Velarifictorus sp.2 and Coiblemmus sp.
|
|
| |||
| Chi-squared Estimate | P value | Chi-squared Estimate | P value | |
| 6 taxa | 0.31 | 0.58 |
|
|
| 7 taxa | 0.21 | 0.65 |
|
|
| 10 taxa | 0.81 | 0.37 |
|
|