| Literature DB >> 32298321 |
Emma K Timpano1, Melissa K R Scheible1, Kelly A Meiklejohn1.
Abstract
Molecular-based taxonomy, specifically DNA barcoding, has streamlined organism identification. For land plants, the recommended 2-locus barcode of rbcL and matK is not suitable for all groups, thus the second subunit of the nuclear internal transcribed spacer (ITS2) has received attention as a possible alternative. To date, evaluations of ITS2 have mostly been limited in scope to specific plant orders/families and single source material. Prior to using ITS2 to routinely characterize land plants present in environmental samples (i.e., DNA metabarcoding), a wet lab protocol optimized for bulk sample types is needed. To address this gap, in this study we determined the broad recoverability across land plants when using published ITS2 primer pairs, and subsequently optimized the PCR reaction constituents and cycling conditions for the best two performing primer pairs (ITS2F/ITSp4 and ITSp3/ITSu4). Using these conditions, both primer pairs were used to characterize land plants present in 17 diverse soils collected from across the US. The resulting PCR amplicons were prepared into libraries and pooled for sequencing on an Illumina® MiniSeq. Our existing bioinformatics workflow was used to process raw sequencing data and taxonomically assign unique ITS2 plant sequences by comparison to GenBank. Given strict quality criteria were imposed on sequences for inclusion in data analysis, only 43.6% and 7.5% of sequences from ITS2F/ITSp4 and ITSp3/ITSu4 respectively remained for taxonomic comparisons; ~7-11% of sequences originated from fungal co-amplification. The number of orders and families recovered did differ between primer pairs, with ITS2F/ITSp4 consistently outperforming ITSp3/ITSu4 by >15%. Primer pair bias was observed in the recovery of certain taxonomic groups; ITS2F/ITSp4 preferentially recovered flowering plants and grasses, whereas ITSp3/ITSu4 recovered more moss taxa. To maximize data recovery and reduce potential bias, we advocate that studies using ITS2 to characterize land plants from environmental samples such as soil use a multiple primer pair approach.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32298321 PMCID: PMC7162488 DOI: 10.1371/journal.pone.0231436
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic diagram of binding sites for second internal transcribed spacer (ITS2) primers used in this study.
Arrow depicts primer direction and flanking regions also shown. Primers shown in light gray indicate those used in screening only; primers shown in black indicate those used in optimization studies.
Comparison of the number of ITS2 barcode sequences remaining at different stages of data processing and analysis from the 17 soil samples.
| ITS2F/ITSp4 | ITSp3/ITSu4 | ||
|---|---|---|---|
| 1) Total unique sequences (reads) from DADA2 | 2,551 (1,278,943) | 5,268 (1,430,676) | |
| 2) Total unique sequences after | |||
| a) exclusion of sequences not containing the sequence of an | 1,623 (63.6%) | 4,782 (90.7%) | |
| b) exclusion of duplicates | 1,486 (58.3%) | 3,688 (70.0%) | |
| 3) Total unique sequences that met | |||
| a) matched to fungus | 101 (6.8%) | 404 (11.0%) | |
| b) matched to plant | 648 (43.6%) | 277 (7.5%) | |
| Total unique sequences used | 556 | 235 | |
| Total number of orders recovered | 29 | 25 | |
| Average (± SD) unique sequences/sample | 17 ± 24 | 15 ± 18 | |
| Average (± SD) reads/sample | 8326 ± 9653 | 5867 ± 6417 | |
| Total unique sequences used | 549 | 227 | |
| Total families recovered | 40 | 29 | |
| Average (± SD) unique sequences/sample | 17 ± 24 | 9 ± 12 | |
| Average (± SD) reads/sample | 8209 ± 9642 | 5499 ± 5963 | |
*sequences and reads present in the negative controls were excluded;
# percentage calculated based on values shown in 1;
+ sequences with >90% sequence coverage, >95% sequence identity, e-value <0.001 and matched to ITS2;
‡ percentage calculated based on values shown in 2b;
^ only sequences in which 100% congruence amongst the best top matches at either the order or family level were included.
Fig 2Difference in the taxonomic abundance recovered using ITS2F/ITSp4 (top panes) and ITSp3/ITSu4 (bottom panes) primer pairs.
(A) orders. (B) families. Plots generated using total number of recovered reads per surface soil sample.