| Literature DB >> 22574186 |
Violaine Nicolas1, Brigitte Schaeffer, Alain Didier Missoup, Jan Kennis, Marc Colyn, Christiane Denys, Caroline Tatard, Corinne Cruaud, Catherine Laredo.
Abstract
The Praomyini tribe is one of the most diverse and abundant groups of Old World rodents. Several species are known to be involved in crop damage and in the epidemiology of several human and cattle diseases. Due to the existence of sibling species their identification is often problematic. Thus an easy, fast and accurate species identification tool is needed for non-systematicians to correctly identify Praomyini species. In this study we compare the usefulness of three genes (16S, Cytb, CO1) for identifying species of this tribe. A total of 426 specimens representing 40 species (sampled across their geographical range) were sequenced for the three genes. Nearly all of the species included in our study are monophyletic in the neighbour joining trees. The degree of intra-specific variability tends to be lower than the divergence between species, but no barcoding gap is detected. The success rate of the statistical methods of species identification is excellent (up to 99% or 100% for statistical supervised classification methods as the k-Nearest Neighbour or Random Forest). The 16S gene is 2.5 less variable than the Cytb and CO1 genes. As a result its discriminatory power is smaller. To sum up, our results suggest that using DNA markers for identifying species in the Praomyini tribe is a largely valid approach, and that the CO1 and Cytb genes are better DNA markers than the 16S gene. Our results confirm the usefulness of statistical methods such as the Random Forest and the 1-NN methods to assign a sequence to a species, even when the number of species is relatively large. Based on our NJ trees and the distribution of all intraspecific and interspecific pairwise nucleotide distances, we highlight the presence of several potentially new species within the Praomyini tribe that should be subject to corroboration assessments.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22574186 PMCID: PMC3344912 DOI: 10.1371/journal.pone.0036586
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Mean, minimum and maximum distances observed between individuals of the same (intraspecific) or distinct species (interspecific) for each gene.
| Intraspecific | Interspecific | |||||
| mean | min | max | mean | min | Max | |
| P distance | ||||||
| 16S | 0.77 | 0.00 | 4.41 | 5.24 | 0.00 | 9.31 |
| Cytb | 2.92 | 0.00 | 14.42 | 13.56 | 1.36 | 25.12 |
| Cytb part 1 | 2.49 | 0.00 | 10.00 | 12.27 | 1.04 | 18.98 |
| CO1 | 2.89 | 0.00 | 14.29 | 12.00 | 1.00 | 16.90 |
| K2P distance | ||||||
| 16S | 0.78 | 0.00 | 4.56 | 5.46 | 0.00 | 10.01 |
| Cytb | 3.07 | 0.00 | 16.44 | 15.27 | 1.38 | 31.13 |
| Cytb part 1 | 2.60 | 0.00 | 10.90 | 13.66 | 1.36 | 22.25 |
| CO1 | 2.03 | 0.00 | 11.73 | 13.32 | 1.01 | 19.83 |
Figure 1Distribution of intraspecific (white bars) and interspecific (black bars) divergences estimated from the K2P distance for the genes 16S, Cytb and CO1 and for the first part of the Cytb gene.
In several cases a non-null number of occurrences was observed, but this is not apparent on the histograms because of the scale. The symbol “*” indicates a non-null number of occurrences within species, and “+”a non-null number of occurrences between species.
Figure 216S neighbour-joining tree of Praomyini (K2P distance), with bootstrap support (500 replicates).
To improve clarity, bootstrap support of each species is not indicated on the tree but is reported in Table 1. For species codes, see Table S1.
Figure 3CO1 neighbour-joining tree of Praomyini (K2P distance), with bootstrap support (500 replicates).
To improve clarity bootstrap support of each species is not indicated on the tree but is reported in Table 1. For species codes (see Table S1).
Success rates (%) obtained by performing the two assignment methods (RF and 1-NN) with the three genes (16S, Cytb, CO1) and the first part of the Cytb gene.
| Gene | 16S | Cytb | CO1 | Cytb-part1 |
|
| 97.87 [96.79–98.95] | 99.53 [98.91–100] | 100 | 99.52 [98.90–100] |
|
| rand | 99.29 [98.29–100] | 100 | 100 |
| next | 99.29 [98.57–100] | 100 | 100 | |
|
| rand | 99.29 [98.29–100] | 100 | 100 |
| next | 99.29 [98.57–100] | 100 | 100 |
Confidence intervals (5%) are given in brackets.
Figure 4Distributions of the cumulative errors among the 40 species of Praomyini tribe, calculated from: (a) the CO1 gene; (b) the Cytb-part1 gene.
Proportion (%) of the pairwise distances belonging to the 0.90th and 0.95th quantile of the distribution of the intraspecific pairwise distances.
| 0.90th quantile | 0.95th quantile | ||||||||
| Species | Nb of pairwise distances | CO1 | Cytb | Cytb-part1 | 16S | CO1 | Cytb | Cytb-part1 | 16S |
| HEF | 66 | 0.00 | 0.00 | 0.00 | 0.06 | ||||
| HS1 | 36 | 0.39 | 0.39 | 0.25 | 0.22 | 0.06 | 0.00 | 0.22 | 0.00 |
| HSI | 78 | 0.41 | 0.40 | 0.46 | 0.12 | 0.01 | 0.00 | 0.06 | 0.00 |
| HYP | 231 | 0.65 | 0.68 | 0.68 | 0.70 | 0.40 | 0.66 | 0.43 | 0.56 |
| HYW | 231 | 0.00 | 0.00 | 0.00 | 0.01 | ||||
| MAE | 153 | 0.08 | 0.00 | 0.00 | 0.12 | 0.02 | 0.00 | 0.00 | 0.00 |
| MAH | 55 | 0.00 | 0.00 | 0.00 | 0.09 | ||||
| PDA | 253 | 0.38 | 0.44 | 0.44 | 0.44 | 0.26 | 0.05 | 0.16 | 0.14 |
| PRL | 55 | 0.45 | 0.33 | 0.33 | 0.00 | 0.02 | 0.00 | 0.00 | 0.00 |
Only lines with non-zero elements are listed.