| Literature DB >> 29938097 |
Elisabeth Fitzek1,2, Adline Delcamp3,4, Erwan Guichoux3,4, Marlene Hahn1, Matthew Lobdell1, Andrew L Hipp1,5.
Abstract
DNA barcoding has proved difficult in a number of woody plant genera, including the ecologically important oak genus Quercus. In this study, we utilized restrictionsite-associated DNA sequencing (RAD-seq) to develop an economical single nucleotide polymorphism (SNP) DNA barcoding system that suffices to distinguish eight common, sympatric eastern North American white oak species. Two de novo clustering pipelines, PyRAD and Stacks, were used in combination with postclustering bioinformatic tools to generate a list of 291 potential SNPs, 80 of which were included in a barcoding toolkit that is easily implemented using MassARRAY mass spectrometry technology. As a proof-of-concept, we used the genotyping toolkit to infer potential hybridization between North American white oaks transplanted outside of their native range (Q. michauxii, Q. montana, Q muehlenbergii/Q. prinoides, and Q. stellata) into a horticultural collection surrounded by natural forests of locally native trees (Q. alba and Q. macrocarpa) in the living collection at The Morton Arboretum (Lisle, IL, USA). Phylogenetic and clustering analyses suggested low rates of hybridization between cultivated and native species, with the exception of one Q. michauxii mother tree, the acorns of which exhibited high admixture from either Q. alba or Q. stellata and Q. macrocarpa, and a hybrid between Q. stellata that appears to have backcrossed almost exclusively to Q. alba. Together, RAD-seq and MassARRAY technologies allow for efficient development and implementation of a multispecies barcode for one of the more challenging forest tree genera.Entities:
Keywords: North American white oaks; RAD‐seq; SNP barcode; barcoding; hybridization
Year: 2018 PMID: 29938097 PMCID: PMC6010771 DOI: 10.1002/ece3.4122
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Flow diagram outlining analysis used in this study and future directions. RAD‐seq (orange arrows) data from previous project were analyzed using PyRAD/RADami and Stacks. A stringent approach (dotted purple arrows) and relaxed approach (solid blue arrows) were used to develop two lists of potential SNP (pSNPs). These were subjected to the primer design software Assay Design Suite and data analysis using TYPER 4.0 software, screened, and checked for SNP validation. A list of verified SNPs was re‐entered to the primer design software to select for 80 suitable SNP multiplexes, which comprise the barcode (turquoise arrows). Applications of the barcode include genotyping of North American white oaks and hybridization studies (yellow arrows)
Figure 2Fine‐scale map of the study site in the oak living collections at The Morton Arboretum. (a) Overview map of the mother and potential father trees genotyped in this study. Species are indicated with shapes (see legend). Mother trees are represented with black‐colored shapes. Distribution of nearby trees not genotyped is indicated with gray‐colored shapes. Potential fathers are indicated with white‐colored shapes. (b) Selection of eight mother trees. (c) Selection of potential father trees and nearby trees. (d) Selection of Q. stellata mother trees, potential fathers, and nearby trees. Legend: Q. stellata (cross), Q. alba (diamond), Q. michauxii (circle), Q. macrocaropa (pentagon), Q. muehlenbergii (square), Q. montana (triangle), Q. prinoides (arrow)
Success rate of SNPs
| Species |
| SNPs start | SNPs design | SNPs verified | SNPs barcode | SNPs STRUCTURE |
|---|---|---|---|---|---|---|
|
| 4 | 95 | 30 | 20 | 5 | 3 |
|
| 2 | 25 | 8 | 8 | 5 | 4 |
|
| 2 | 22 | 10 | 7 | 5 | 4 |
|
| 2 | 2 | 1 | 0 | 0 | 0 |
|
| 2 | 10 | 4 | 4 | 1 | 1 |
|
| 4 | 35 | 8 | 7 | 4 | 3 |
|
| 9 | 64 | 15 | 7 | 4 | 3 |
|
| 5 | 57 | 23 | 19 | 8 | 5 |
|
| 4 | 84 | 33 | 25 | 8 | 7 |
|
| 6 | 143 | 45 | 25 | 8 | 7 |
|
| 3 | 22 | 7 | 6 | 4 | 4 |
|
| 2 | 31 | 16 | 13 | 8 | 7 |
|
| 3 | 52 | 26 | 21 | 13 | 12 |
| Clade 1 | 14 | 7 | 6 | 7 | 2 | 1 |
| Clade 2 | 4 | 3 | 0 | 0 | 0 | 0 |
| East Asian white oaks | 4 | 98 | 29 | 10 | 4 | 4 |
| European white oaks | 11 | 94 | 30 | 8 | 1 | 1 |
| Total | 844 | 291 | 187 | 80 | 66 | |
| Mean (± | 49.6 (±41.1) | 17.1 (±13.2) | 11 (±8.1) | 4.7 (±3.5) | 3.9 (±3.1) |
Initial number of SNPs (RADami and Stacks), number of SNPs after passing primer design software with percentage in parentheses, and number of SNPs verified by MassARRAY. Clade 1 = Q. sinuata, Q. oglethorpensis, Q. boyntonii, Q. stellata, Q. chapmanii, Q. austirna. Clade 2 = Q. austrina and Q. chapmanii. SNPs start = number of SNPs initially selected for consideration. SNPs design = SNPs that passed assay design step. SNPs verified = SNPs that successfully amplified on SEQUENOM. SNPs barcode = SNPs included in the two‐multiplex barcoding set presented here. SNPs STRUCTURE = SNPs included in STRUCTURE analyses presented in this article (removing SNPs with substantial amounts of missing data).
Figure 3STRUCTURE analysis to assess rate of hybrid offspring by non‐native North American white oaks within the oak living collections at The Morton Arboretum. (a) Genetic assignment of 247 individuals at 66 loci according to MCMC clustering method implemented in STRUCTURE (Pritchard et al., 2000). DISTRUCT v1.1 was used for the visualization (Rosenberg, 2004). Vertical black lines represent grouping of individuals according to their population assignment. Each vertical line color‐codes the genotype probability of an individual according to population grouping. Black bars over individuals represent adult trees. Black dots over individuals represent mother trees producing acorns genotyped in this study. (b) Estimation of best cluster number (K) calculated by STRUCTURE with respect to mean log likelihood of the data over five iterations (left axis) and ΔK plotted as a function of K (K = 1–6) (right axis)
Assay design suite results
| Total | Passed | Multiplexed | |
|---|---|---|---|
| First SNP screen | 395 | 131 | 131 |
| Second SNP screen | 449 | 213 | 160 |
| Both screens | 844 | 344 | 291 |
Detailed breakdown of the SNP screens. Total corresponds to the established lists of potential SNPs and was considered for the primer development. “Passed” represents the number of SNPs meeting the criteria for primer development. “Multiplexed” represents the number of SNPs successfully multiplexed with the maximum of 40 SNP per multiplex and ordered for screening.
Figure 4Maximum‐likelihood tree of mother trees, potential father trees, and learning samples. Species are color‐coded (see legend). The mother trees genotyped in this study are marked with blue boxes. Learning samples are indicated with purple boxes. Q. aliena, Q. fabri, Q. robur, and Q. petraea are combined in this study and represented as Eurasian (Old World) white oaks. The tree is rooted by the Eurasian white oaks. Annotations were made in iTOL (Letunic & Bork, 2016)
Germination and MassARRAY success rate
| Individual ID | Species |
| Germinated | Amplification rate | Hybridization frequency (%) | |
|---|---|---|---|---|---|---|
| 100%–71% | 70%–0% | |||||
| 645‐48*2 |
| 100 | 70 (70%) | 63 (90%) | 7 | 34.9 |
| 602‐2000*1 sd |
| 50 | 45 (90%) | 32 (71%) | 13 | 3.1 |
| 210‐91*1 |
| 71 | 43 (60%) | 35 (81%) | 8 | 5.71 |
| 602‐2000*2 |
| 50 | 35 (70%) | 20 (57%) | 15 | 0 |
| 704‐46*2 sd T |
| 50 | 12 (24%) | 3 (25%) | 9 | 0 |
| 326‐99*2 |
| 29 | 9 (31%) | 6 (66%) | 3 | 0 |
| 742‐51*1 |
| 67 | 8 (12%) | 6 (75%) | 2 | 33.3 |
| 602‐2000*3 |
| 7 | 5 (70%) | 2 (40%) | 3 | 50 |
| 704‐46*3 sd T |
| 6 | 1 (16%) | 1 (100%) | 0 | 0 |
Listed are the individual IDs and species names of the corresponding nine mother trees. The amount of collected and planted acorns in Fall/Winter of 2014 per mother tree and their germination rate. The table is sorted by the number of germinated acorns. The amplification rate is divided into two categories: 71%–100% of markers amplified and 0%–70% of markers amplified. Hybridization frequency per mother tree is based only on individuals for whom 71%–100% of markers amplified.