| Literature DB >> 29084279 |
Joanne L Birch1, Neville G Walsh1, David J Cantrill1, Gareth D Holmes1, Daniel J Murphy1.
Abstract
In Australia, Poaceae tribe Poeae are represented by 19 genera and 99 species, including economically and environmentally important native and introduced pasture grasses [e.g. Poa (Tussock-grasses) and Lolium (Ryegrasses)]. We used this tribe, which are well characterised in regards to morphological diversity and evolutionary relationships, to test the efficacy of DNA barcoding methods. A reference library was generated that included 93.9% of species in Australia (408 individuals, [Formula: see text] = 3.7 individuals per species). Molecular data were generated for official plant barcoding markers (rbcL, matK) and the nuclear ribosomal internal transcribed spacer (ITS) region. We investigated accuracy of specimen identifications using distance- (nearest neighbour, best-close match, and threshold identification) and tree-based (maximum likelihood, Bayesian inference) methods and applied species discovery methods (automatic barcode gap discovery, Poisson tree processes) based on molecular data to assess congruence with recognised species. Across all methods, success rate for specimen identification of genera was high (87.5-99.5%) and of species was low (25.6-44.6%). Distance- and tree-based methods were equally ineffective in providing accurate identifications for specimens to species rank (26.1-44.6% and 25.6-31.3%, respectively). The ITS marker achieved the highest success rate for specimen identification at both generic and species ranks across the majority of methods. For distance-based analyses the best-close match method provided the greatest accuracy for identification of individuals with a high percentage of "correct" (97.6%) and a low percentage of "incorrect" (0.3%) generic identifications, based on the ITS marker. For tribe Poeae, and likely for other grass lineages, sequence data in the standard DNA barcode markers are not variable enough for accurate identification of specimens to species rank. For recently diverged grass species similar challenges are encountered in the application of genetic and morphological data to species delimitations, with taxonomic signal limited by extensive infra-specific variation and shared polymorphisms among species in both data types.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29084279 PMCID: PMC5662090 DOI: 10.1371/journal.pone.0186259
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Australian tribe Poeae diversity for individual and concatenated DNA barcode markers for distance- (dataset A) and tree- (dataset B) based DNA barcode methods included in this study.
| Taxon or clade | Species | Number of individuals for individual and concatenated markers |
|---|---|---|
| Dataset A: 391/354/383/395/399 | ||
| 11/7/11/11/11 | ||
| Dataset A: 7/4/7/7/7 | ||
| 6/5/5/6/6 | ||
| Dataset A: 4/3/4/4/4 | ||
| 6/5/5/6/6 | ||
| 3/3/3/3/3 | ||
| 5/5/4/5/5 | ||
| Dataset A: 26/23/25/26/26 | ||
| 3/3/3/3/3 | ||
| 5/5/7/6/7 | ||
| 3/3/3/3/3 | ||
| Dataset A: 17/17/16/18/19 | ||
| 9/8/8/9/9 | ||
| Dataset A: 254/240/252/256/257 | ||
| 3/3/3/3/3 | ||
| Dataset A: 16/13/15/16/16 | ||
| 3/3/3/3/3 | ||
| 5/2/5/5/5 | ||
| 3/2/3/3/3 | ||
| 2/2/2/2/2 | ||
| Dataset A: 11/5/10/11/12 | ||
| Dataset A: 28/22/26/28/29 | ||
| Dataset A: 27/26/25/28/28 | ||
NA, Not applicable.
* Species is exotic in Australia.
a The number of individuals in datasets A and B for each marker differed for some genera as singletons (taxa that were represented by single individuals) were removed prior to distance-based analyses (as outlined in the text).
b Alternative generic circumscriptions for Briza s.s., Chascolytrum, and of Festuca, Lolium, Psilurus, and Vulpia into fine-, and broad-leaved clades were included for distance-based analyses (as outlined in the text). Specimen numbers for these genera and clades are provided for dataset A only and are NA for dataset B.
Fig 1Intra- and inter-specific pairwise genetic distances (K80 model) for the Internal Transcribed Spacer (ITS) marker.
Data for all Australian tribe Poeae genera with more than two species represented in this study are shown. Inter, Inter-specific; Intra, Intra-specific.
Fig 2Success rates (percentages) for specimen identification of tribe Poeae taxa using distance-based methods.
Success rates were calculated for all genera (A.) and species (B.) collectively and for species within single genus datasets (C.–E.) using nearest neighbour (A., B., C.), best close match (A., B., D.), and threshold ID (A., B., E.) methods based on individual (rbcL, matK, ITS) and concatenated (rbcL+matK, rbcL+matK+ITS) DNA barcode markers. BCM, Best close match; Chlor, Chloroplast dataset (rbcL+matK); Comb, Combined dataset (rbcL+matK+ITS); ITS, Internal transcribed spacer; NN, Nearest neighbour; TID, Threshold Identification.
Fig 3The Bayesian inference of phylogenetic relationships among Australian tribe Poeae based on the ITS marker.
Support values are provided above the branches including bootstrap (maximum likelihood) and posterior probabilities (Bayesian inference) before and after the forward slash, respectively.
Success rates (percentages) for specimen identification based on individual (ITS) and concatenated (rbcL+matK+ITS) DNA barcode markers using distance-based (nearest neighbour and best-close match) and tree-based (maximum likelihood and Bayesian inference) methods.
| Taxon | Number of taxa | Specimen identification (%) | Specimen identification (% | Specimen identification (%) | Specimen identification (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| (morphology) | (ITS, N = 18 genera, 81 species) | ( | (ITS, N = 19 genera, 96 species) | ( | |||||
| NN | BCM | NN | BCM | ML | BA | ML | BA | ||
| (true/false) | (correct/incorrect/ambiguous/no identification) | (true/false) | (correct/incorrect/ambiguous/no identification) | correct/incorrect/ambiguous | correct/incorrect/ambiguous | correct/incorrect/ambiguous | correct/incorrect/ambiguous | ||
| Genera | 19 | ||||||||
| Species | 96 | ||||||||
BA, Bayesian inference; BCM, Best close match; ITS, Internal transcribed spacer; ML, Maximum likelihood; N = Sample size; NN, Nearest neighbour.
a The number of genera and species for each marker differed for distance- and tree-based analyses as alternative generic circumscriptions for Briza s.s., Chascolytrum, and of Festuca, Lolium, Psilurus, and Vulpia into fine-, and broad-leaved clades were applied and singletons (taxa that were represented by single individuals) were removed for distance-based analyses (as outlined in the text).
b Percentages of “true” and “correct” specimen identifications are indicated in bold.
c In phylogenetic analyses, clades receiving 65% bootstrap support in maximum likelihood searches or 0.95 posterior probabilities in Bayesian inference searches were recognised.
Fig 4Molecular entities distinguished based on genetic sequence data from the ITS marker estimated by ABGD and PTP methods.
Analyses were completed for Australian tribe Poeae genera with more than two native species. The maximum likelihood phylogeny for Australian tribe Poeae based on the ITS dataset is inset in the top left corner with genera analysed indicated with uppercase letters and shown in detail. Genetic entities are indicated by a black bar to the right of the individuals contained within that entity for ABGD (left) and PTP (right) analyses. ABGD, Automated barcode gap discovery; ITS, Internal transcribed spacer; PTP, Poisson tree processes.
Species delimitations for Australian tribe Poeae genera containing greater than two native species using morphological and genetic sequence data from the ITS marker estimated by ABGD and PTP methods.
| Taxon | Number of species | Number of individuals | ABGD | PTP |
|---|---|---|---|---|
| 4 | L1 | L1 | ||
| 3 | L2 | L2 | ||
| 4 | + | L2 | ||
| 1 | + | + | ||
| 3 | S3,4 | S3,4 | ||
| 1 | + | L2 | ||
| 4 | L/S2,5 | L2 | ||
| 3 | L1 | L1 | ||
| 3 | L2 | L2 | ||
| 3 | + | + | ||
| 4 | + | + | ||
| 4 | L/S6,7 | L/S5,9 | ||
| 8 | L/S7,8 | L/S6,9 | ||
| 216 | L/S7,9 | L/S7,8,9 | ||
| 4 | + | + | ||
| 6 | S12, 13,14 | + | ||
| 1 | + | + | ||
| 4 | S10, 11 | + | ||
| 5 | L7 | L/S8,9 | ||
| 4 | + | + | ||
| 3 | L15 | S10,11 | ||
| 2 | L15 | S12,13 | ||
| 3 | + | + | ||
| 1 | L16 | L14 | ||
| 4 | L16 | L14 | ||
| 3 | L/S16,17 | L/S14,15 | ||
| 1 | L16 | L14 |
ABGD, Automated barcode gap discovery; ITS, Internal transcribed spacer; PTP, Poisson tree processes.
* Species is exotic in Australia.
a L refers to a genetic entity that included more than one species; S refers to a single species that was split into more than one genetic entity; + refers to a genetic entity that was congruent with the species.
b Within a column, species that share a superscript letter were delimited as a single genetic entity.
c N (#) refers to the number of genetic entities delimited.