| Literature DB >> 35990900 |
Sonia Trujillo-Argueta1, Rafael F Del Castillo1, Abril Velasco-Murguía1.
Abstract
DNA-barcoding is a species identification tool that uses a short section of the genome that provides a genetic signature of the species. The main advantage of this novel technique is that it requires a small sample of tissue from the tested organism. In most animal groups, this technique is very effective. However, in plants, the recommended standard markers, such as rbcLa, may not always work, and their efficacy remains to be tested in many plant groups, particularly from the Neotropical region. We examined the discriminating power of rbcLa in 55 tropical cloud forest vascular plant species from 38 families (Oaxaca, Mexico). We followed the CBOL criteria using BLASTn, genetic distance, and monophyly tree-based analyses (neighbor-joining, NJ, maximum likelihood, ML, and Bayesian inference, BI). rbcLa universal primers amplified 69.0% of the samples and yielded 91.3% bi-directional sequences. Sixty-three new rbcLa sequences were established. BLAST discriminates 80.8% of the genus but only 15.4% of the species. There was nil minimum interspecific genetic distances in Quercus, Oreopanax, and Daphnopsis. Contrastingly, Ericaceae (5.6%), Euphorbiaceae (4.6%), and Asteraceae (3.3%) species displayed the highest within-family genetic distances. According to the most recent angiosperm classification, NJ and ML trees successfully resolved (100%) monophyletic species. ML trees showed the highest mean branch support value (87.3%). Only NJ and ML trees could successfully discriminate Quercus species belonging to different subsections: Quercus martinezii (white oaks) from Q. callophylla and Q. laurina (red oaks). The ML topology could distinguish species in the Solanaceae clade with similar BLAST matches. Also, the BI topology showed a polytomy in this clade, and the NJ tree displayed low-support values. We do not recommend genetic-distance approaches for species discrimination. Severe shortages of rbcLa sequences in public databases of neotropical species hindered effective BLAST comparisons. Instead, ML tree-based analysis displays the highest species discrimination among the tree-based analyses. With the ML topology in selected genera, rbcLa helped distinguish infra-generic taxonomic categories, such as subsections, grouping affine species within the same genus, and discriminating species. Since the ML phylogenetic tree could discriminate 48 species out of our 55 studied species, we recommend this approach to resolve tropical montane cloud forest species using rbcLa, as an initial step and improve DNA amplification methods.Entities:
Keywords: Bayesian inference phylogenetic tree; CBOL; DNA-barcoding; GenBank; Maximum likelihood phylogenetic tree; Neighbor joining phylogenetic tree; Neotropics; Quercus; Solanaceae; Vascular plants
Year: 2022 PMID: 35990900 PMCID: PMC9390329 DOI: 10.7717/peerj.13771
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 3.061
Species of the tropical montane cloud forest of San Miguel Cuevas, Santiago Juxtlahuaca Municipality, Oaxaca, Mexico, used in this study.
| Sample ID | Family | Morphological species | IUCN | bp lenght | BOLD | GeneBank |
|---|---|---|---|---|---|---|
| status | Process ID | Accession No. | ||||
| SMC7 | Rubiaceae | LC | 553 | TFOAX001-19 |
| |
| SMC10 | Styracaceae | DD | 535 | TFOAX002-19 |
| |
| SMC19 | Thymelaeaceae | LC | 535 | TFOAX003-19 |
| |
| SMC28 | Solanaceae | LC | 553 | TFOAX004-19 |
| |
| SMC29 | Polygalaceae | LC | 553 | TFOAX005-19 |
| |
| SMC37 | Ericaceae | DD | 540 | TFOAX006-19 |
| |
| SMC41 | Rubiaceae | LC | 553 | TFOAX007-19 |
| |
| SMC61 | Euphorbiaceae | DD | 541 | TFOAX008-19 |
| |
| SMC70 | Rubiaceae | DD | 553 | TFOAX009-19 |
| |
| SMC75 | Olaceae | DD | 553 | TFOAX010-19 |
| |
| SMC76 | Commelinaceae | DD | 553 | TFOAX011-19 |
| |
| SMC94 | Thymelaeaceae | NT | 540 | TFOAX012-19 |
| |
| SMC97 | Rubiaceae | DD | 536 | TFOAX013-19 |
| |
| SMC99 | Solanaceae | LC | 535 | TFOAX014-19 |
| |
| SMC104 | Apocinaceae | DD | 553 | TFOAX015-19 |
| |
| AVM2 | Betulaceae | LC | 560 | DVHTF001-19 |
| |
| AVM12 | Berberidaceae | DD | 557 | DVHTF002-19 |
| |
| AVM13 | Araliaceae | VU | 557 | DVHTF003-19 |
| |
| AVM14 | Scrophulariaceae | LC | 557 | DVHTF004-19 |
| |
| AVM15 | Convolvulaceae | DD | 557 | DVHTF005-19 |
| |
| AVM16 | Solanaceae | DD | 553 | DVHTF006-19 |
| |
| AVM17 | Solanaceae | DD | 558 | DVHTF007-19 |
| |
| AVM27 | Commelinaceae | DD | 557 | DVHTF008-19 |
| |
| AVM30 | Lycopodeaceae | LC | 560 | DVHTF009-19 |
| |
| AVM32 | Primulaceae | LC | 557 | DVHTF010-19 |
| |
| AVM33 | Ericaceae | DD | 558 | DVHTF011-19 |
| |
| AVM34 | Fagaceae | LC | 560 | DVHTF012-19 |
| |
| AVM35 | Fagaceae | LC | 558 | DVHTF013-19 |
| |
| AVM36 | Melastomataceae | LC | 545 | DVHTF014-19 |
| |
| AVM40 | Pteridaceae | DD | 557 | DVHTF015-19 |
| |
| AVM43 | Rosaceae | DD | 557 | DVHTF016-19 |
| |
| AVM45 | Araliaceae | LC | 556 | DVHTF017-19 |
| |
| AVM46 | Piperaceae | DD | 557 | DVHTF018-19 |
| |
| AVM47 | Lauraceae | DD | 557 | DVHTF019-19 |
| |
| AVM48 | Fabaceae | LC | 558 | DVHTF020-19 |
| |
| AVM50 | Lauraceae | DD | 553 | DVHTF021-20 |
| |
| AVM52 | Primulaceae | LC | 553 | DVHTF022-20 |
| |
| AVM53 | Fagaceae | LC | 553 | DVHTF023-20 |
| |
| AVM54 | Euphorbiaceae | LC | 553 | DVHTF024-20 |
| |
| AVM59 | Amaranthaceae | DD | 553 | DVHTF025-20 |
| |
| AVM62 | Solanaceae | LC | 553 | DVHTF026-20 |
| |
| AVM64 | Rubiaceae | LC | 553 | DVHTF027-20 |
| |
| AVM65 | Passifloraceae | DD | 553 | DVHTF028-20 |
| |
| AVM67 | Urticaceae | LC | 553 | DVHTF029-20 |
| |
| AVM69 | Rubiaceae | DD | 553 | DVHTF030-20 |
| |
| AVM71 | Gesneriaceae | DD | 553 | DVHTF031-20 |
| |
| AVM73 | Pinaceae | LC | 553 | DVHTF032-20 |
| |
| AVM78 | Dicksoniaceae | DD | 553 | DVHTF033-20 |
| |
| AVM79 | Passifloraceae | DD | 553 | DVHTF034-20 |
| |
| AVM80 | Cyperaceae | DD | 553 | DVHTF035-20 |
| |
| AVM82 | Marattiaceae | DD | 553 | DVHTF036-20 |
| |
| AVM83 | Cupressaceae | LC | 553 | DVHTF037-20 |
| |
| AVM84 | Pinaceae | LC | 553 | DVHTF038-20 |
| |
| AVM87 | Asteraceae | DD | 553 | DVHTF039-20 |
| |
| AVM89 | Meliaceae | LC | 553 | DVHTF040-20 |
| |
| AVM90 | Meliaceae | LC | 553 | DVHTF041-20 |
| |
| AVM93 | Rubiaceae | LC | 553 | DVHTF042-20 |
| |
| AVM95 | Gesneriaceae | DD | 553 | DVHTF043-20 |
| |
| AVM98 | Solanaceae | DD | 553 | DVHTF044-20 |
| |
| AVM100 | Poaceae | DD | 553 | DVHTF045-20 |
| |
| AVM101 | Asteraceae | DD | 553 | DVHTF046-20 |
| |
| AVM103 | Verbenaceae | DD | 474 | DVHTF047-20 |
| |
| AVM106 | Lamiaceae | DD | 553 | DVHTF048-20 |
| |
Notes:
The IUCN Red List (2021) status, DNA length obtained with rbcLa barcode, BOLD Process ID, and GenBank Accession numbers are also shown.
LC, Least Concern; DD, Data Deficit; NT, Near Threatened; VU, Vulnerable.
Figure 1Graphic BLAST results.
Fraction of the new rbcL sequences published in the GenBank (left), and resolution at genus and species levels for those species with previous available sequences released in the GenBank (accessed in March 2022) using BLASTn (right).
BLAST results.
| Sample ID | Morphological species | Identity (%) | Best BLAST match | Accession no. |
|---|---|---|---|---|
| AVM2 |
| 100 | ∆ |
|
| SMC7 |
| 99.46 | ○ |
|
| AVM12 |
| 99.82 | ○ |
|
| AVM16 |
| 100 | ○ |
|
| SMC29 |
|
|
|
|
| AVM30 |
| 100 | ○ |
|
| AVM35 |
| 100 | ∆ |
|
| AVM40 |
| 99.82 | ○ |
|
| AVM46 |
| 100 | ○ |
|
| AVM52 |
| 100 | ○ |
|
| AVM53 |
| 100 | ∆ |
|
| AVM54 |
|
|
|
|
| AVM59 |
|
|
|
|
| AVM64 |
| 100 | ○ |
|
| AVM70 |
| 99.64 | ○ |
|
| AVM73 |
| 100 | ∆ |
|
| SMC75 |
| 99.81 | ∆ |
|
| AVM78 |
|
|
|
|
| AVM80 |
| 99.82 | ○ |
|
| AVM82 |
| 100 | ○ |
|
| AVM83 |
| 100 | ○ |
|
| AVM84 | 100 | ○ |
| |
| AVM87 | 100 | ○ |
| |
| AVM89 |
| 100 | ○ |
|
| AVM90 | 99.82 | Meliaceae |
| |
| SMC99 | 100 | ∆ |
|
Notes:
Best BLASTn match found on queries against rbcLa nucleotides sequences in the database of GenBank for those species with previously published sequences in the GenBank. E- value, in all cases 0.0.
In bold morphological species corresponding with only the studied species in GenBank database.
○ >2–<30 species with the same highest percent of identity.
∆ >30 species with the same highest identity percent.
T. havanensis showed the best match with another unidentified species of the Meliaceae in the GenBank published sequence.
Figure 2Genetic divergence.
Intra- and interspecific genetic distances obtained with barcode rbcLa in 55 plant species of tropical montane cloud forest in the Mixteca Alta, Oaxaca, Mexico.
Intergeneric rbcLa genetic distances of the tropical montane cloud forest species.
| Genus | Species comparision | Genetic divergence | |
|---|---|---|---|
|
| 0.0000 | ||
|
| 1.5076 | ||
|
| 1.5076 | ||
|
| 0.0000 | ||
|
| 2.0189 | ||
|
| 0.0000 | ||
|
| 0.0000 | ||
|
| 0.0000 | ||
|
| 0.7269 | ||
|
| 0.7269 | ||
Note:
Intergeneric rbcLa genetic distances of the tropical montane cloud forest species in the Mixteca Baja, Oaxaca, Mexico, found in multi-species genera. The Bold Process ID is below the scientific names.
Mean rbcLa intergeneric and interspecific genetic distances.
| Family | No. Genera | No. Species | Mean |
|---|---|---|---|
| divergence (%) | |||
| Asteraceae | 2 | 2 | 3.3344 |
| Ericaceae | 2 | 2 | 5.5723 |
| Euphorbiaceae | 2 | 2 | 4.5869 |
| Meliaceae | 2 | 2 | 0.7286 |
| Primulaceae | 2 | 2 | 1.2784 |
| Rubiaceae | 3 | 4 | 1.6207 |
| Solanaceae | 4 | 5 | 1.6558 |
Note:
Mean rbcLa intergeneric and interspecific genetic distances in the multigenera and multispecies families of this study.
Figure 3ML cladogram.
Maximum likelihood cladogram of plant core barcoding gene rbcLa for 55 sequences of plants from of Mixteca Baja, Oaxaca, México, tropical montane cloud forest. Bootstrap values based on 1,000 replications are listed as percentages at branching points.
Proportion of resolved monophyletic species and support value obtained (bootstrap/posterior probabilities) with different phylogenetic techniques.
| Neighbor joining tree | Maximum likelihood tree | Bayesian inference tree | |
|---|---|---|---|
| Percent of monophyletic species resolved | 100.00 | 100.00 | 85.45 |
| Percent species resolved with support value >70% | 70.91 | 87.27 | 92.73 |
Note:
Proportion of resolved monophyletic species and support value obtained (bootstrap/posterior probabilities) with different phylogenetic techniques using plant core barcoding gene rbcLa in 55 studied species of the tropical montane cloud forest, Mixteca Baja, Mexico.