| Literature DB >> 30235400 |
Renato A Ferreira de Lima1, Alexandre Adalardo de Oliveira1, Gabriel Dalla Colletta2,3, Thiago Bevilacqua Flores2,3, Rubens L Gayoso Coelho4, Pedro Dias4, Gabriel Ponzoni Frey1, Amaia Iribar5, Ricardo Ribeiro Rodrigues2, Vinícius Castro Souza2, Jérôme Chave5.
Abstract
DNA barcoding helps to identify species, especially when identification is based on parts of organisms or life stages such as seeds, pollen, wood, roots or juveniles. However, the implementation of this approach strongly depends on the existence of complete reference libraries of DNA sequences. If such a library is incomplete, DNA-based identification will be inefficient. Here, we assess if DNA barcoding can already be implemented in species-rich tropical regions. We focus on the tree flora of São Paulo state, Brazil, which contains more than 2000 tree species. Using new DNA sequence data and carefully assembled GenBank accessions, we assembled 12,113 sequences from ten different regions. The ITS, rbcL, psbA-trnH, matK and trnL regions were better represented within the available sequences for São Paulo tree flora. Currently, only 58% of the São Paulo tree flora currently have at least one barcoding sequence available. However, these species represent on average 89% of the trees in São Paulo state forests. Therefore, conservation-oriented and ecological studies can already benefit from DNA barcoding to obtain more accurate species identifications. We present which taxa remain underrepresented for the São Paulo tree flora and discuss the implications of this result for other species-rich tropical regions.Entities:
Year: 2018 PMID: 30235400 PMCID: PMC6136365 DOI: 10.1590/1678-4685-GMB-2017-0282
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Number of sequences and taxa by source of information for the São Paulo tree flora, in respect to the ten DNA barcodes considered in this study. In parentheses, the proportion of each taxon with sequences available for any of these ten barcodes.
| Source of sequences | Accessions for SP trees | Families | Genera | Species |
|---|---|---|---|---|
| (n=107) | (n= 508) | (n= 2097) | ||
| GenBank | 10,493 | 92 | 401 | 1,059 |
| This study | 1,611 | 89 | 302 | 609 |
| Total | 12,104 | 106 (99%) | 456 (90%) | 1,214 (58%) |
Number of sequences, species and the evenness index of the ten DNA markers considered in this study regarding the São Paulo tree flora. In parentheses, the proportion of species sequenced with respect to the total of tree species occurring in São Paulo.
| DNA marker | Sequences | Species (%) | Evenness (Pielou | ||
|---|---|---|---|---|---|
| Family | Genus | Species | |||
|
| 74 | 35 (1.7%) | 0.270 | 0.247 | 0.414 |
| ITS region | 3,380 | 805 (35.5%) | 0.658 | 0.683 | 0.673 |
|
| 2,047 | 855 (37.1%) | 0.779 | 0.867 | 0.820 |
|
| 2,290 | 448 (19.7%) | 0.605 | 0.598 | 0.545 |
|
| 67 | 42 (2.0%) | 0.283 | 0.326 | 0.435 |
|
| 2,403 | 886 (39.9%) | 0.820 | 0.871 | 0.818 |
|
| 123 | 76 (3.6%) | 0.586 | 0.551 | 0.523 |
|
| 171 | 89 (4.1%) | 0.572 | 0.531 | 0.528 |
|
| 1,544 | 560 (24.9%) | 0.698 | 0.763 | 0.694 |
|
| 24 | 17 (0.8%) | 0.463 | 0.359 | 0.352 |
Figure 1Venn diagrams of the number of species with sequences for the combination of (a) rbcL + matK + ITS region and (b) rbcL + matK + psbA-trnH.
Figure 2The sampling priority level in future sequencing for each family of the SP tree flora, plotted against its phylogenetic hypothesis. The priority level was defined based on the representativeness of DNA barcodes within the family and the proportion of threatened and endemic species without barcodes.
Figure 3Increase of the number of (a) species, (b) genera and (c) families across time with respect to the ten DNA markers considered in this study. Bold lines are the best fit of the logistic function to the accumulated number of sequences or taxa. Dashed lines are the total of taxa in the SP tree flora. Dates are aggregated in trimesters and the fit of functions to the accumulated number of taxa was carried out only from mid 1995 on.