Literature DB >> 26082877

Next-generation sampling: Pairing genomics with herbarium specimens provides species-level signal in Solidago (Asteraceae).

James B Beck1, John C Semple2.   

Abstract

PREMISE OF THE STUDY: The ability to conduct species delimitation and phylogeny reconstruction with genomic data sets obtained exclusively from herbarium specimens would rapidly enhance our knowledge of large, taxonomically contentious plant genera. In this study, the utility of genotyping by sequencing is assessed in the notoriously difficult genus Solidago (Asteraceae) by attempting to obtain an informative single-nucleotide polymorphism data set from a set of specimens collected between 1970 and 2010.
METHODS: Reduced representation libraries were prepared and Illumina-sequenced from 95 Solidago herbarium specimen DNAs, and resulting reads were processed with the nonreference Universal Network-Enabled Analysis Kit (UNEAK) pipeline. Multidimensional clustering was used to assess the correspondence between genetic groups and morphologically defined species.
RESULTS: Library construction and sequencing were successful in 93 of 95 samples. The UNEAK pipeline identified 8470 single-nucleotide polymorphisms, and a filtered data set was analyzed for each of three Solidago subsections. Although results varied, clustering identified genomic groups that often corresponded to currently recognized species or groups of closely related species. DISCUSSION: These results suggest that genotyping by sequencing is broadly applicable to DNAs obtained from herbarium specimens. The data obtained and their biological signal suggest that pairing genomics with large-scale herbarium sampling is a promising strategy in species-rich plant groups.

Entities:  

Keywords:  Solidago; genotyping by sequencing; herbarium specimens; next-generation sampling; species delimitation

Year:  2015        PMID: 26082877      PMCID: PMC4467758          DOI: 10.3732/apps.1500014

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Shallow genetic differentiation and sampling limitations combine to restrict our understanding of biodiversity and evolution in many species-rich plant groups. Although numerous strategies for obtaining powerful genomic data sets are emerging (reviewed in Lemmon and Lemmon, 2013; McCormack et al., 2013), we remain fundamentally restricted by our access to samples. Other than the adoption of silica gel as a tissue dessicant (Chase and Hills, 1991), samples needed for plant molecular systematics studies are obtained essentially as they were at the beginning of the DNA era (Palmer and Zamir, 1982; Doyle et al., 1985). Researchers still must field-collect the majority of material—a rewarding, but expensive and time-consuming task that often precludes taxonomically rigorous sampling of large groups (>100 species) during the course of a dissertation or 3-yr federally funded project. If we are serious about understanding biodiversity and evolution in species-rich clades, we therefore need a transformative approach to obtaining samples. Extracting DNA from herbarium specimen tissue is an obvious solution, an idea dating from the earliest days of plant molecular systematics (Rogers and Bendich, 1985). This type of sampling is, however, still viewed by most as a way to supplement an otherwise field-collected data set. Although studies utilizing genomic data sets obtained from herbarium specimens are emerging, most involve the recovery of high-copy organelle and/or rDNA cistron regions (Straub et al., 2012; Stull et al., 2013; Besnard et al., 2014; Ripma et al., 2014), or are focused on adaptation within a single species (Vandepitte et al., 2014) or genome assembly of a single individual (Staats et al., 2013). Indeed, we are unaware of a study that has performed species delimitation or phylogeny reconstruction using a genome-wide data set obtained exclusively (or even largely) from herbarium material. Sampling exclusively from herbarium material would allow robust taxonomic and geographic sampling to be achieved rapidly, and if this sampling were performed under the guidance of expert taxonomists it would also ensure the strongest link between taxonomy and DNA. Sample sets obtained through this strategy, what we term “next-generation sampling,” could then be subjected to next-generation genotyping and sequencing techniques, as these workflows are presumably applicable to the sheared DNAs obtained from museum specimens (Nachman, 2013; Stull et al., 2013; Burrell et al., 2015). These rich data sets would then allow for the biodiversity and phylogeny of species-rich groups to be rigorously established in a short time. In this study, we explore the compatibility of this sampling strategy with a genomic single-nucleotide polymorphism (SNP) protocol in the goldenrods (Solidago L., Asteraceae), a genus of ca. 150 currently recognized taxa (Semple and Cook, 2006). Taxonomic uncertainty in Solidago is widely recognized (Fernald, 1950; Nesom, 1993), a problem stemming from a combination of low interspecific genetic divergence (Kress et al., 2005; Fazekas et al., 2008; Schilling et al., 2008; Fazekas et al., 2009; Peirson et al., 2013), polyploidy (Semple, 1992), and species richness. In this study, we attempt to obtain genomic SNP information with a genotyping by sequencing (GBS) approach in a set of 95 herbarium specimens representing three Solidago subsections. These approaches identify SNPs at thousands of points throughout the genome by generating and sequencing a reduced representation library (Narum et al., 2013). Obtaining a genomic data set that carries species-level signal in this difficult genus, using only herbarium material, would be a powerful demonstration of the link between genomics and the expansive incorporation of herbarium material.

METHODS

Sampling and DNA extraction/assessment

Polyploidy adds additional complexity to GBS data collection and analysis, including reduced per-individual sequencing depth due to increased genome size, the complicating nature of additional gene copies for SNP identification, and the relative lack of sophisticated analytical tools for polyploid data sets. We therefore chose to include diploid samples only in this pilot study. Herbarium tissue was obtained from 95 specimens representing 23 species in three Solidago subsections: Junceae (Rydb.) G. L. Nesom, Squarrosae A. Gray, and Triplinerviae (Torr. & A. Gray) G. L. Nesom (Appendix 1). All material was sampled from specimens at the University of Waterloo Herbarium (WAT), now housed as a unit of the Université de Montréal Herbarium (MT). Diploid mitotic chromosome counts were available for 73 of the 95 specimens (Semple et al., 1981, 1984, 1993; Semple and Chmielewski, 1987; J. Semple, unpublished data), and all exhibited microsatellite profiles indicative of diploidy (i.e., no more than two alleles per locus [J. Beck, unpublished data]). These specimens represented both a wide age range (collected between 1970 and 2010) and a diverse array of drying regimes, from field-based forced air techniques (similar to Blanco et al., 2006) to standard drying cabinets utilizing light bulbs or heaters. Approximately 15 mg of tissue were subjected to a cetyltrimethylammonium bromide (CTAB) protocol modified for 96-well plates (Beck et al., 2012). This high-throughput protocol has a history of yielding DNA quantity/quality sufficient for sequencing and genotyping in both herbarium (Beck et al., 2012, 2014; Alexander et al., 2013) and silica-dried (Rothfels et al., 2013) tissue. Concentration was determined with a Qubit 2.0 fluorometer (Life Technologies, Carlsbad, California, USA), and fragment size distribution was visualized by running 100 ng of extract against a λ DNA-HindIII digest (New England Biolabs, Ipswich, Massachusetts, USA) on a 1% agarose gel.

Library preparation, sequencing, and SNP calling

GBS library preparation (Elshire et al., 2011), sequencing, and SNP calling were performed at the Genomic Diversity Facility (GDF) at Cornell University’s Biotechnology Resource Center. Trial libraries for one DNA were generated with three enzymes (ApeKI, EcoT221, PstI). Visual inspection of Experion (Bio-Rad, Hercules, California, USA) traces revealed that all exhibited fragment sizes generally between 150–300 bp. ApeKI was excluded due to the larger fragment pool, and thus lower read depth per fragment, that would result from this five-base recognition enzyme. Of the two six-base recognition enzymes, EcoT221 was then chosen because it exhibited a slightly smaller fragment pool. Libraries prepared from the 95 samples and one blank negative control were sequenced in one lane on an Illumina HiSeq 2500 (Illumina, San Diego, California, USA). Given that a reference genome was not available, the Universal Network-Enabled Analysis Kit (UNEAK) nonreference pipeline (Lu et al., 2013) implemented in TASSEL version 3.0.160 (Glaubitz et al., 2014) was used for tag alignment and subsequent SNP calling. The barcode/sample keyfile and all pipeline XML configuration files are archived at the Dryad Digital Repository (http://dx.doi.org/10.5061/dryad.16pj5; Beck and Semple, 2015).

Data filtering and multivariate clustering

TASSEL 4.3 was used to produce preliminary SNP data sets by implementing high and low levels of missing data filtering on the total SNP set identified by UNEAK. This filtering and all further analyses excluded four samples (noted in Appendix 1). Two subsection Triplinerviae individuals were placed in other subsections in preliminary analyses, which along with other unpublished results strongly suggests that these are mislabeled DNA samples. Also excluded were two subsection Squarrosae individuals exhibiting low sequence read levels (see below). High filtering recovered SNPs present in 70% of samples, whereas low filtering recovered SNPs present in 30% of samples. Both filtering levels enforced a >1% minor allele frequency. These preliminary data sets were subjected to the multidimensional clustering approach employed in the principal coordinates analysis with modal clustering (PCO-MC) workflow (Reeves and Richards, 2009). This approach identifies the most cohesive groups in a data set by simultaneously considering information on all informative axes of a principal coordinates analysis. These groups are ranked by a “stability value,” which ranges from 0–100 and quantifies the relative density of the group in multidimensional space (Reeves and Richards, 2009). Many clustering approaches are available for the analysis of SNP data (Lawson and Falush, 2012), and we employed PCO-MC based on its computational efficiency and ability to objectively identify and rank clusters. Unlike popular methods such as STRUCTURE (Pritchard et al., 2000) and STRUCTURAMA (Huelsenbeck et al., 2011), PCO-MC does not incorporate a model of within-group Hardy–Weinberg equilibrium, an assumption that is unrealistic for sets of individuals sampled at different times across the range of a species. Instead, PCO-MC identifies groups of individuals with similar genotypes, as genotypic similarity is but one of many secondary criteria that can be used to identify lineages (Mallet, 1995; Hausdorf and Hennig, 2010) under the general lineage concept (de Queiroz, 2007). The correspondence between clusters identified by PCO-MC and morphologically defined species (morphospecies) at both filtering levels was assessed. Cluster/morphospecies correspondence at high and low filtering levels was qualitatively similar in subsection Triplinerviae and generally lower at high filtering in subsections Squarrosae and Junceae. Low-filtered data sets were therefore chosen for subsequent PCO-MC clustering.

RESULTS

Sequencing success and SNP recovery

Extracted DNA concentrations ranged from 15–155 ng/μL (mean: 46.2 ± 23.6), and total DNA yield ranged from 1050–10,850 ng (mean: 3185.9 ± 1665.9) (Appendix 1). Only five samples exhibited DNA yields below the 1.5-μg minimum recommended by the GDF. Gel electrophoresis indicated that all extracts were at least partially sheared, exhibiting fragment sizes between >23 kb and <500 bp (Appendix S1). Each extract was given a qualitative score of DNA degradation (1 = mainly large fragments [>23 kb]; 2 = relatively even distribution of large to small fragments; 3 = mainly small fragments [<2 kb]) (Appendix 1, Appendix S1). These degradation scores were strongly related to specimen age, as all 21 group 1 DNAs (least degraded) were collected since 1992 (Appendix 1). Reduced representation library construction and Illumina sequencing yielded 230,232,173 (100 bp) reads. Of these, 197,917,774 were considered quality reads, exhibiting no N’s in the first 72 bases and including both a full barcode and the expected remnant of the restriction cut site (Elshire et al., 2011). These quality reads were then collapsed into 18,947,823 identical sequence tags. The blank sample returned 7604 quality reads, which was 0.003% of the total quality reads and 0.04% of the mean quality reads (2,076,237) per nonblank sample. Two samples were designated as failures by the GDF based on a quality read number <10% of this mean. Overall, quality read number per sample was significantly lower in older specimens (r2 = 0.27, P = 6.8 × 10−8; Fig. 1A). While still significant, the relationship between age and read number was less pronounced in specimens >10 yr old (r2 = 0.080, P = 0.011). There were significant differences between the three DNA degradation categories [one-way ANOVA: F (2,92) = 18.44, P < 0.0001], with category 1 exhibiting more quality reads than categories 2 and 3 (Tukey honestly significant difference [HSD] test). The UNEAK pipeline identified 8470 unfiltered SNPs that were present in at least 10 of the 96 samples (blank included). Missingness, or the percentage of these SNPs exhibiting missing data in a given sample, was significantly higher in older specimens (r2 = 0.22, P = 9.2 × 10−7) (Fig. 1B). There were again significant differences between the three DNA degradation categories [F (2,92) = 20.44, P < 0.0001], with category 1 DNAs exhibiting reduced missingness relative to category 2, which in turn exhibited reduced missingness relative to category 3 (Tukey HSD). Filtering to recover SNPs present in at least 70% of samples resulted in individual data sets of 547 (subsect. Junceae), 185 (subsect. Squarrosae), and 359 (subsect. Triplinerviae) SNPs. Filtering to recover SNPs present in at least 30% of samples resulted in individual data sets of 1633 (subsect. Junceae), 1447 (subsect. Squarrosae), and 2168 (subsect. Triplinerviae) SNPs. Original read data (FASTQ) have been deposited in the National Center for Biotechnology Information (NCBI) Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) under BioProject ID PRJNA284163, and filtered subsection-specific HapMap matrices are archived at the Dryad Digital Repository (http://dx.doi.org/10.5061/dryad.16pj5; Beck and Semple, 2015).
Fig. 1.

Effect of Solidago specimen age on data quantity/quality. (A) Relationship between specimen age and the number of quality reads obtained for the 95 analyzed samples (r2 = 0.27, P = 6.8 × 10−8). (B) Relationship between specimen age and the percentage of the 8470 unfiltered SNPs missing for the 95 analyzed samples (r2 = 0.22, P = 9.2 × 10−7).

Effect of Solidago specimen age on data quantity/quality. (A) Relationship between specimen age and the number of quality reads obtained for the 95 analyzed samples (r2 = 0.27, P = 6.8 × 10−8). (B) Relationship between specimen age and the percentage of the 8470 unfiltered SNPs missing for the 95 analyzed samples (r2 = 0.22, P = 9.2 × 10−7).

Multivariate clustering

Correspondence between genetic groups identified by PCO-MC multidimensional clustering and morphospecies was strong in subsection Junceae (Fig. 2A). The five most highly ranked, and thus most cohesive in multivariate space, genetic clusters corresponded either to single morphospecies or groups of morphospecies. This result is particularly striking for the widespread species S. missouriensis Nutt. and S. juncea Aiton. In each case, samples from disparate portions of the morphospecies’ range (S. missouriensis range shown in Fig. 2D) were identified as belonging to a significant genetic cluster. Also notable is the single incidence of a genetic cluster not corresponding to an entire morphospecies or group of morphospecies. PCO-MC identified a highly ranked cluster comprising all three TN specimens of the rare, strongly disjunct S. gattingeri Chapm. ex A. Gray, while the two samples from MO were not placed in this or any other cluster. This suggests that S. gattingeri comprises two morphologically cryptic species separated by the Mississippi Embayment (Fig. 2D), a hypothesis that is supported by multivariate morphological analyses (J. Semple, unpublished data). Correspondence between genetic clusters and morphospecies was also strong in subsection Triplinerviae (Fig. 2B). The six most cohesive clusters corresponded to single morphospecies (S. gigantea Aiton, S. tortifolia Elliott, and S. elongata Nutt.) or groups of morphospecies. While the four TX specimens of S. juliae G. L. Nesom composed a single cluster, the two AZ S. juliae specimens were not placed in this group. This again suggests the presence of two geographically disjunct species (Fig. 2D). The remaining samples, representing S. altissima L., S. canadensis L., S. lepida DC., and S. brendiae Semple, composed a single cluster. These species can at times be difficult to distinguish (Semple et al., 2013, 2015), and their lack of genetic distinctiveness is not unexpected. Although correspondence was not as strong in subsection Squarrosae, multiple highly ranked clusters corresponded to single morphospecies or putatively closely related morphospecies pairs (Fig. 2C). The most highly ranked cluster comprised all individuals of S. pallida (Porter) Rydb. and S. rigidiuscula (Torr. & A. Gray) Porter, two morphologically similar species that were until recently both part of the S. speciosa s.l. complex (Semple et al., 2012). All individuals of S. erecta Banks ex Pursh, another taxon historically placed in the S. speciosa complex, formed the next most highly ranked cluster with those of S. speciosa Nutt. itself. The third-ranked cluster comprised S. puberula Nutt. and two of three S. pulverulenta Nutt. individuals, two species that until recently were considered northern and southern subspecies of S. puberula s.l. (Semple and Cook, 2006; Fig. 2D). Finally, all three individuals of S. squarrosa Muhl. formed the fifth most highly ranked cluster.
Fig. 2.

Multidimensional clustering (PCO-MC) of GBS data for three Solidago subsections. (A) Graphical representation of the five most highly ranked, statistically significant clusters recovered for subsection Junceae. The rank of each cluster by stability (see Methods) and this value (in parentheses) appear at the bottom right of each cluster. Locality information for each specimen refers to the collection locality in Appendix 1. (B) Results for subsection Triplinerviae. (C) Results for subsection Squarrosae. (D) Range maps for select species (scale bars = 100 km).

Multidimensional clustering (PCO-MC) of GBS data for three Solidago subsections. (A) Graphical representation of the five most highly ranked, statistically significant clusters recovered for subsection Junceae. The rank of each cluster by stability (see Methods) and this value (in parentheses) appear at the bottom right of each cluster. Locality information for each specimen refers to the collection locality in Appendix 1. (B) Results for subsection Triplinerviae. (C) Results for subsection Squarrosae. (D) Range maps for select species (scale bars = 100 km).

DISCUSSION

We were able to routinely attain data at >1700 SNPs in a set of herbarium specimens representing 23 species obtained by numerous collectors over a 40-yr time span, and these data carried clear biological signal. Of the 20 strongest clusters identified by PCO-MC, seven comprised all individuals of a single species, two comprised clear geographic subsets of a single species, and three comprised all individuals of potentially sister species. This signal is particularly encouraging given the extremely low sequence divergence among goldenrod species. Schilling et al. (2008) observed <1% sequence divergence among Solidago species at the often highly variable internal transcribed spacer (ITS) region of the nuclear rDNA cistron. Among the eight groups examined in Kress et al. (2005), Solidago harbored the lowest level of diversity at 10 highly variable plastid loci, exhibiting no substitutions at the putatively universal barcoding region psbA-trnH. Fazekas et al. (2008, 2009) examined nine potential barcoding regions in 32 genera and commented that Solidago was one of the two most “intractable” genera. It should also be noted that the inability of these data to recover clusters corresponding to all morphospecies may simply reflect biological reality, as it is unlikely that all currently recognized goldenrod species correspond to genetically cohesive groups (Semple and Cook, 2006). Taken together, these results indicate that the pairing of GBS with next-generation sampling holds considerable promise for species delimitation in large groups.

Recommendations

We were able to consistently recover DNA of sufficient quantity/quality for library construction with a standard CTAB extraction protocol modified for 96-well plates, and the inexpensive and high-throughput nature of this approach pairs well with the large sample sizes we propose. Although specimen age did negatively affect both the number of quality reads and the amount of missing data per sample (Fig. 1), this effect was less pronounced for specimens >10 yr old. This suggests that much of this detrimental effect occurs at the time of collection (drying technique or length of time the sample was held before drying) or during the early years of curation, an insight consistent with studies that have explicitly evaluated the timing of DNA damage (Staats et al., 2011) and shearing (Adams and Sharma, 2010; Neubig et al., 2014) in herbarium material. Sampling could perhaps then be focused on relatively recent specimens if sufficient material is available. Specimen preparation practices and storage conditions have also been shown to exert a strong effect on DNA quality (Ribeiro and Lovato, 2007; Särkinen et al., 2012; Lander et al., 2013; Neubig et al., 2014), and sampling from air-dried material stored in humidity/temperature-controlled facilities should be favored. Following DNA extraction, our data suggest that a qualitative gel-based assessment of DNA degradation can be a strong predictor of downstream success. Regardless, future studies will need to evaluate the timing and degree of herbarium DNA degradation in a range of plant groups, as this process has been shown to proceed at varying rates in different taxa (Neubig et al., 2014). Future studies could greatly enhance SNP discovery by beginning with low-coverage sequencing of one target species. The reference-aided GBS Discovery pipeline is robust to higher levels of divergence during locus identification and often identifies more SNPs, particularly in diverse data sets. Even a highly fragmentary assembly greatly improves SNP discovery, because short (64 bp) GBS reads can be matched to very small contigs. Genome size should also be considered. A recently examined diploid Solidago species exhibited a 1C-value = 1.02 pg (Kubešová et al., 2010), which is considered a relatively small angiosperm holoploid genome size (Leitch and Leitch, 2013). Genome size estimates across the group of interest should be considered during project design, particularly in the choice of restriction enzyme (Elshire et al., 2011). If funds permit, additional sequencing can be performed to reduce missingness in large-genome taxa (Chen et al., 2013). We also recommend the inclusion of multiple replicate samples to assess the background error rate. This is expected to be particularly important at the low read depths likely to be encountered in studies incorporating large numbers of specimens with varying DNA quality. Regarding analysis, a clear limitation of the cluster analysis of GBS data are the inability to reconstruct the pattern/timing of divergence among inferred lineages (Carstens et al., 2013), and fully leveraging these data for species delimitation and phylogeny reconstruction will require analytical tools that allow species trees to be inferred with the short read data obtained with GBS methods (Cariou et al., 2013; Hipp et al., 2014). These tools will no doubt soon be available (Leaché et al., 2014), as will increasingly longer read lengths of reduced representation libraries. These considerations notwithstanding, we feel strongly that pairing herbarium collections with GBS and other increasingly accessible genomic workflows (Straub et al., 2012; Stull et al., 2013; Weitemier et al., 2014) should be a top priority in plant systematics. Besides allowing for rapid and economical sampling of large groups, next-generation sampling allows specimen selection to be performed in collaboration with group experts. Genomic data sets spanning both species’ ranges and intra/interspecific morphological variation can then be used to rigorously test a wide range of hypotheses, thanks to the synergy between big data and big sampling. Click here for additional data file.
Appendix 1.

Voucher information for Solidago individuals included in this study.

SpeciesVoucher specimen accession no.aCollection yearCollection localitybCountybDNA concentrationDNA yieldGel image well (score)Quality readscMissingnessd
S. canadensis L. var. canadensisCook and Faulkenham C-141999OntarioBruce Co.38.72709A01 (2)1,393,5990.717
S. canadensis var. canadensisSemple and Brouillet 36671978New YorkHamilton Co.55.93913C01 (2)744,6530.794
S. canadensis var. canadensisSemple and Brouillet 34461978VermontWashington Co.60.84256D01 (2)1,488,7110.712
S. canadensis var. canadensisSemple & K. Shea 24161976OntarioRussell Co.38.62702E01 (3)518,2180.858
S. elongata Nutt.Semple and Brouillet 71001983OregonHood Co.33.52345F01 (3)527,2800.854
S. elongataSemple and Brouillet 71701983OregonLane Co.41.32891G01 (3)1,301,9750.769
S. elongataSemple and Brouillet 7151A1983OregonDouglas Co.56.43948H01 (3)1,303,2400.759
S. elongataSemple and Heard 84601986CaliforniaSiskiyou Co.62.44368A02 (3)793,6430.815
S. elongataSemple and Heard 84161986CaliforniaPlumas Co.57.94053B02 (2)2,959,2970.659
S. elongataSemple and Heard 86601986CaliforniaTulare Co.89.16237C02 (2)1,532,1680.753
S. altissima L. var. gilvocanescens (Rydb.) SempleSemple and Brouillet 73671983IllinoisAdams Co.32.82296D02 (2)1,551,0300.708
S. altissima var. gilvocanescensSemple and Heard 83291985IllinoisJohnson Co.37.22604E02 (2)1,704,3600.719
S. gigantea AitonSemple and Keir 47211980Nova ScotiaCumberland Co.22.81596F02 (2)2,197,7250.754
S. giganteaSemple and Keir 49601980VermontWindham Co.42.22954G02 (3)2,113,3810.756
S. giganteaSemple and Suripto 101651991MississippiLowndes Co.41.62496H02 (2)883,3530.840
S. juliae G. L. NesomMorton and Venn NA163731985TexasKendall Co.47.83346A03 (2)1,952,1780.739
S. juliaeMorton and Venn NA163701985TexasKendall Co.39.22744B03 (2)1,674,8230.747
S. juliaeNesom 72191989TexasBlanco Co.64.44508C03 (2)1,313,6530.772
S. juliaeReeves R45211975ArizonaCochise Co.54.73829D03 (3)1,056,6580.829
S. juliaeKeil 189891985ArizonaSanta Cruz Co.71.85026E03 (2)3,845,5270.694
S. juliaeNesom 72131989TexasReal Co.39.92793F03 (2)2,986,4330.695
S. tortifolia ElliotteSemple 74221983FloridaJefferson Co.65.44578G03 (2)2,949,2670.820
S. tortifoliaSemple 75341983FloridaBrevard Co.47.63332H03 (2)1,755,9150.703
S. tortifoliaSemple and Godfrey 31751977FloridaHolmes Co.533710A04 (2)733,9050.814
S. tortifoliaKral 417221970AlabamaGeneva Co.58.64102B04 (3)746,8180.825
S. tortifoliaCook et al., C-6692001South CarolinaBerkeley Co.61.94333C04 (1)3,236,5000.657
S. tortifoliaeSemple 118332010GeorgiaBrooks Co.29.82086D04 (1)7,580,7520.788
S. lepida DC. var. salebrosa (Piper) SempleSemple and Brouillet 43811979IdahoBoundary Co.56.33941E04 (3)1,302,9780.737
S. lepida var. salebrosaSemple et al., 92091990WyomingCarbon Co.956650F04 (2)1,542,6910.709
S. lepida var. salebrosaSemple and Heard 77551985ColoradoGunnison Co.47.53325G04 (3)2,686,9670.633
S. lepida var. salebrosaSemple 111542003NW TerritoriesNahanni N.P.R.43.22592H04 (1)1,243,0520.710
S. brendiae SempleSemple and Semple 114322006QuebecGaspésie Co.151050B01 (1)3,085,0920.587
S. brendiaeSemple and Semple 114362006QuebecGaspésie Co.29.52065A05 (1)13,642,4780.499
S. chilensis MeyenLopez Laphitz and Becker 272007ArgentinaCatamarca Province25.61792B05 (1)1,610,3910.782
S. chilensisLopez Laphitz and Becker 122007ArgentinaChubut Province34.82436C05 (1)8,945,2420.671
S. chilensisLopez Laphitz and Becker 102007ChileRegion XI16.61162D05 (1)4,456,3760.709
S. microglossa DC.Lopez Laphitz and Becker 162007ArgentinaChaco Province22.41120E05 (1)5,802,0490.708
S. microglossaLopez Laphitz and Becker 422007ArgentinaChaco Province66.13966F05 (1)1,572,3440.783
S. microglossaLopez Laphitz and Becker 412007ArgentinaFormosa Province81.65712G05 (1)4,912,6120.712
S. squarrosa Muhl.Semple 24261976OntarioRenfrew Co.37.12597H05 (3)323,9610.918
S. squarrosaSemple 36921978OntarioDurham Co.352450A06 (3)980,0800.844
S. squarrosaCook & Seiden C-1252000QuebecLa Vallée-de-la-Gatineau Reg. Co. Mun.27.61932B06 (2)2,577,7560.736
S. bicolor L.Semple & Chmielewski 59271981VirginaNelson Co.38.12667C06 (3)491,7500.885
S. bicolorSemple & Suripto 94871991PennsylvaniaPerry Co.63.24424D06 (2)1,848,9490.734
S. bicolorSemple & Brouillet 36141978ConnecticutHartford Co.27.21904E06 (2)1,015,6720.784
S. bicolorSemple 47081980New BrunswickKent Co.35.42478F06 (3)414,1890.881
S. bicolorSemple & B. Semple 114722006Prince Edward IslandQueens Co.23.61652G06 (1)2,485,7190.722
S. hispida Muhl. ex Willd. var. hispidaSemple & Brouillet 36381978New YorkGreene Co.18.91323H06 (3)354,2520.890
S. hispida var. hispidaSemple & Keir 46341980MaineSomerset Co.37.82646A07 (3)929,6890.831
S. hispida × S. puberulaeSemple, Brammall & Hart 29891977KentuckyWhitley Co.261820B07 (3)196,0270.924
S. hispida var. hispidaSemple & B. Semple 110652001OntarioRenfrew Co.32.32261C07 (2)2,254,6140.716
S. hispida Muhl. ex Willd. var. arnoglossa FernaldMorton NA124741978NewfoundlandDivision No. 5543780D07 (3)986,4310.816
S. hispida var. hispidaSemple & Chmielewski 82981985ArkansasSearcy Co.352450E07 (3)593,0210.866
S. erecta Banks ex PurshSemple & Chmielewski 59841981VirginiaNorthumberland Co.21.11477F07 (3)508,7000.855
S. erectaSemple & B. Semple 111892003TennesseeCoffee Co.38.32681G07 (2)540,7480.843
S. erectaSemple & Suripto 95011991New JerseyAtlantic Co.21.51505H07 (2)865,1690.801
S. erectaSemple & Suripto 94541990KentuckyEstill Co.81.95733A08 (3)864,9350.802
S. erectaSemple & Chmielewski 60981981South CarolinaChester Co.29.52065B08 (2)673,1070.821
S. erectaSemple & Suripto 101751991MississippiItawamba Co.30.72149C08 (2)303,3750.893
S. pulverulenta Nutt.Semple 116352006North CarolinaPender Co.38.12286D08 (1)1,048,8590.811
S. pulverulentaKral 442761971AlabamaEscambia Co.45.93213E08 (3)239,7540.936
S. pulverulentaeSemple & Suripto 101371991FloridaWashington Co.29.42058F08 (2)126,0470.955
S. pulverulentaSemple & Suripto 98131991South CarolinaBarnwell Co.39.52765G08 (2)316,1660.924
S. puberula Nutt.Cook & Seiden C-1182000QuebecVallée-de-l’Or Reg. Co. Mun.644480H08 (2)675,1390.841
S. puberulaSemple & Ringius 76281984MarylandKent Co.43.13017A09 (3)2,287,4480.804
S. puberulaSemple 68671982MassachusettsWorchester Co.94.46608B09 (3)648,0930.859
S. puberulaSemple 108151999North CarolinaMitchell Co.69.54865C09 (2)1,101,6450.807
S. pallida (Porter) Rydb.Semple 113042004South DakotaPennington Co.47.53325D09 (2)1,005,8340.775
S. pallidaSemple 114012006WyomingCrook Co.33.52345E09 (1)13,577,4460.586
S. pallidaSemple & Heard 80821985New MexicoSan Miguel Co.44.63122F09 (3)639,4390.826
S. rigidiuscula (Torr. & A. Gray) PorterSemple & Zhang 106021997OntarioWapole Island28.31981G09 (2)8,082,8240.578
S. rigidiusculaSemple & Brouillet 45321979IndianaPorter Co.24.91743H09 (2)1,807,5600.706
S. rigidiusculaSemple & Chmielewski 91211986TennesseeMarshall Co.35.22464A10 (3)1,447,9220.766
S. rigidiusculaSemple & Chmielewski 50631980WisconsinJackson Co.49.53465B10 (3)1,166,0270.770
S. speciosa Nutt.Semple & Chmielewski 61801981South CarolinaGreenville Co.27.31911C10 (3)389,9970.880
S. speciosaSemple 116132006VirginiaMecklenburg Co.29.32051D10 (1)956,8520.789
S. gattingeri Chapm. ex A. GraySemple & Chmielewski 52881980MissouriCamden Co.80.25614E10 (2)2,735,6080.733
S. gattingeriDietrich & Jenkins 491994MissouriCamden Co.674690F10 (2)2,790,4390.732
S. gattingeriMcNeilus 93-14431993TennesseeWilson Co.34.42408G10 (1)1,413,3560.732
S. gattingeriNordman s.n.2000TennesseeRutherford Co.25.21764H10 (1)1,062,6910.757
S. gattingeriBaily s.n.2000TennesseeRutherford Co.37.62632A11 (2)3,257,0260.679
S. missouriensis Nutt.Semple & Heard 76991985ColoradoYuma Co.68.84816B11 (3)1,110,3800.768
S. missouriensisSemple, Suripto & Ahmed 91951990NebraskaLincoln Co.77.55425C11 (2)1,566,5910.732
S. missouriensisSemple, Suripto & Ahmed 92631990UtahCache Co.14410080D11 (2)2,083,9730.702
S. missouriensisSemple & Jeff Semple 88441987WisconsinAdams Co.38.72709E11 (2)2,286,5680.692
S. missouriensisSemple, Suripto & Ahmed 93811990New MexicoCibola Co.15510850F11 (2)1,071,3790.761
S. missouriensisSemple & Brammall 26691977ManitobaDivision No. 150.53535G11 (2)2,421,0260.659
S. pinetorum SmallSemple & B. Semple 112232003North CarolinaMoore Co.30.91854H11 (1)2,265,6480.668
S. pinetorumSemple 116252006North CarolinaHertford Co.351750A12 (1)7,173,0430.592
S. pinetorumSemple 115992006North CarolinaRowan Co.39.62772B12 (1)6,001,9660.615
S. pinetorumSemple & Suripto 97341991North CarolinaFranklin Co.22.21554C12 (2)1,021,4130.771
S. juncea AitonSemple 106771999PennsylvaniaGreen Co.32.21932D12 (1)540,7480.695
S. junceaSemple & Keir 48971980Nova ScotiaHants Co.26.31841E12 (2)699,2830.818
S. junceaSemple & Brammall 27571977MissouriMadison Co.582900F12 (2)478,4020.858
S. junceaSemple & Brammall 27591977MichiganBerrien Co.32.52275G12 (2)890,4740.789

Vouchers archived at the University of Waterloo Herbarium (WAT), now housed as a unit of the Université de Montréal Herbarium (MT).

State/province; county/administrative unit.

Number of reads containing a full barcode, cut site remnant, and insert sequence.

Percentage of the 8470 unfiltered SNPs missing in the sample.

Samples not analyzed (see text).

  36 in total

1.  Does hybridization drive the transition to asexuality in diploid Boechera?

Authors:  James B Beck; Patrick J Alexander; Loreen Allphin; Ihsan A Al-Shehbaz; Catherine Rushworth; C Donovan Bailey; Michael D Windham
Journal:  Evolution       Date:  2011-12-08       Impact factor: 3.694

Review 2.  Population identification using genetic data.

Authors:  Daniel John Lawson; Daniel Falush
Journal:  Annu Rev Genomics Hum Genet       Date:  2012-06-11       Impact factor: 8.929

3.  Species delimitation using dominant and codominant multilocus markers.

Authors:  Bernhard Hausdorf; Christian Hennig
Journal:  Syst Biol       Date:  2010-08-06       Impact factor: 15.683

4.  Comparative analysis of different DNA extraction protocols in fresh and herbarium specimens of the genus Dalbergia.

Authors:  R A Ribeiro; M B Lovato
Journal:  Genet Mol Res       Date:  2007-03-29

Review 5.  How to fail at species delimitation.

Authors:  Bryan C Carstens; Tara A Pelletier; Noah M Reid; Jordan D Satler
Journal:  Mol Ecol       Date:  2013-07-16       Impact factor: 6.185

6.  Extraction of DNA from milligram amounts of fresh, herbarium and mummified plant tissues.

Authors:  S O Rogers; A J Bendich
Journal:  Plant Mol Biol       Date:  1985-03       Impact factor: 4.076

7.  Chloroplast DNA evolution and phylogenetic relationships in Lycopersicon.

Authors:  J D Palmer; D Zamir
Journal:  Proc Natl Acad Sci U S A       Date:  1982-08       Impact factor: 11.205

8.  DNA damage in plant herbarium tissue.

Authors:  Martijn Staats; Argelia Cuenca; James E Richardson; Ria Vrielink-van Ginkel; Gitte Petersen; Ole Seberg; Freek T Bakker
Journal:  PLoS One       Date:  2011-12-05       Impact factor: 3.240

9.  Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics.

Authors:  Kevin Weitemier; Shannon C K Straub; Richard C Cronn; Mark Fishbein; Roswitha Schmickl; Angela McDonnell; Aaron Liston
Journal:  Appl Plant Sci       Date:  2014-08-29       Impact factor: 1.936

10.  A targeted enrichment strategy for massively parallel sequencing of angiosperm plastid genomes.

Authors:  Gregory W Stull; Michael J Moore; Venkata S Mandala; Norman A Douglas; Heather-Rose Kates; Xinshuai Qi; Samuel F Brockington; Pamela S Soltis; Douglas E Soltis; Matthew A Gitzendanner
Journal:  Appl Plant Sci       Date:  2013-01-31       Impact factor: 1.936

View more
  8 in total

1.  Next-generation sampling: Pairing genomics with herbarium specimens provides species-level signal in Solidago (Asteraceae).

Authors:  James B Beck; John C Semple
Journal:  Appl Plant Sci       Date:  2015-06-08       Impact factor: 1.936

2.  Multiplex PCR Targeted Amplicon Sequencing (MTA-Seq): Simple, Flexible, and Versatile SNP Genotyping by Highly Multiplexed PCR Amplicon Sequencing.

Authors:  Yoshihiko Onda; Kotaro Takahagi; Minami Shimizu; Komaki Inoue; Keiichi Mochida
Journal:  Front Plant Sci       Date:  2018-03-23       Impact factor: 5.753

3.  Population genetics and adaptation to climate along elevation gradients in invasive Solidago canadensis.

Authors:  Emily V Moran; Andrea Reid; Jonathan M Levine
Journal:  PLoS One       Date:  2017-09-28       Impact factor: 3.240

4.  A basic ddRADseq two-enzyme protocol performs well with herbarium and silica-dried tissues across four genera.

Authors:  Ingrid E Jordon-Thaden; James B Beck; Catherine A Rushworth; Michael D Windham; Nicolas Diaz; Jason T Cantley; Christopher T Martine; Carl J Rothfels
Journal:  Appl Plant Sci       Date:  2020-04-23       Impact factor: 1.936

5.  Parallel flowering time clines in native and introduced ragweed populations are likely due to adaptation.

Authors:  Brechann V McGoey; Kathryn A Hodgins; John R Stinchcombe
Journal:  Ecol Evol       Date:  2020-04-29       Impact factor: 2.912

6.  Utilizing field collected insects for next generation sequencing: Effects of sampling, storage, and DNA extraction methods.

Authors:  Kimberly M Ballare; Nathaniel S Pope; Antonio R Castilla; Sarah Cusser; Richard P Metz; Shalene Jha
Journal:  Ecol Evol       Date:  2019-12-03       Impact factor: 2.912

7.  High-throughput methods for efficiently building massive phylogenies from natural history collections.

Authors:  Ryan A Folk; Heather R Kates; Raphael LaFrance; Douglas E Soltis; Pamela S Soltis; Robert P Guralnick
Journal:  Appl Plant Sci       Date:  2021-02-27       Impact factor: 1.936

8.  The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century.

Authors:  Richard G J Hodel; M Claudia Segovia-Salcedo; Jacob B Landis; Andrew A Crowl; Miao Sun; Xiaoxian Liu; Matthew A Gitzendanner; Norman A Douglas; Charlotte C Germain-Aubrey; Shichao Chen; Douglas E Soltis; Pamela S Soltis
Journal:  Appl Plant Sci       Date:  2016-06-16       Impact factor: 1.936

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.