Literature DB >> 34336398

The best of both worlds: Combining lineage-specific and universal bait sets in target-enrichment hybridization reactions.

Kasper P Hendriks1,2, Terezie Mandáková3, Nikolai M Hay4, Elfy Ly1, Alex Hooft van Huysduynen1, Rubin Tamrakar5, Shawn K Thomas6, Oscar Toro-Núñez7, J Chris Pires6, Lachezar A Nikolov8, Marcus A Koch9, Michael D Windham4, Martin A Lysak3, Félix Forest10, Klaus Mummenhoff2, William J Baker10, Frederic Lens1,11, C Donovan Bailey5.   

Abstract

PREMISE: Researchers adopting target-enrichment approaches often struggle with the decision of whether to use universal or lineage-specific probe sets. To circumvent this quandary, we investigate the efficacy of a simultaneous enrichment by combining universal probes and lineage-specific probes in a single hybridization reaction, to benefit from the qualities of both probe sets with little added cost or effort. METHODS AND
RESULTS: Using 26 Brassicaceae libraries and standard enrichment protocols, we compare results from three independent data sets. A large average fraction of reads mapping to the Angiosperms353 (24-31%) and Brassicaceae (35-59%) targets resulted in a sizable reconstruction of loci for each target set (x̄ ≥ 70%).
CONCLUSIONS: High levels of enrichment and locus reconstruction for the two target sets demonstrate that the sampling of genomic regions can be easily extended through the combination of probe sets in single enrichment reactions. We hope that these findings will facilitate the production of expanded data sets that answer individual research questions and simultaneously allow wider applications by the research community as a whole.
© 2021 Hendriks et al. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America.

Entities:  

Keywords:  Brassicaceae; Hyb‐Seq; combining probes; enrichment; phylogenomics; phylogeny; population biology; target enrichment

Year:  2021        PMID: 34336398      PMCID: PMC8312739          DOI: 10.1002/aps3.11438

Source DB:  PubMed          Journal:  Appl Plant Sci        ISSN: 2168-0450            Impact factor:   1.936


Target capture approaches to DNA analyses (e.g., Mandel et al., 2014; Weitemier et al., 2014) are emerging as one of the most important tools in evolutionary biology, especially phylogenomics. Researchers adopting these methods are clear on the importance and utility of the data generated (e.g., Johnson et al., 2019), but often face a difficult decision during the early stages of project design. They must typically choose between the use of a universal probe set (e.g., Buddenhagen et al., 2016; Johnson et al., 2019) developed to work across larger taxonomic scales (e.g., the angiosperms), or a narrower lineage‐specific probe set designed for the group of interest (e.g., Mandel et al., 2014; Weitemier et al., 2014; Vatanparast et al., 2018; Gardiner et al., 2019; Koenen et al., 2020). When considering target enrichment options, the core exons of universal probe sets are perhaps viewed as best suited for higher‐level phylogenetic problems, where their conserved nature tends to have greatest utility (but see Mitchell et al., 2017; Wanke et al., 2017). Such probe sets, which have now been applied across nearly all angiosperm families (e.g., Baker et al., 2017; Dodsworth et al., 2019), produce data that can be easily integrated with studies from other labs focused on alternative samples or even different lineages including outgroup species (e.g., Buddenhagen et al., 2016; Johnson et al., 2019). The potential utility of these markers and their associated flanking regions are also being explored for the elucidation of species complexes (e.g., Larridon et al., 2020) and population‐level studies (e.g., Slimp et al., 2020). By contrast, well‐designed lineage‐specific probes, incorporating local information on single‐copy genes and greater fidelity between probe and target, can successfully select and recover a larger portion of orthologous gene space (e.g., Soto Gomez et al., 2019). They may also maximize the phylogenetic signal per region sequenced (e.g., Folk et al., 2015), generating data even more amenable to solving problems with both recalcitrant nodes in phylogenetic trees and questions in population biology. However, lineage‐specific data tend not to be readily combinable with data generated using other probe sets. The choice between universal and lineage‐specific probe sets can be further complicated when previously generated lineage‐specific data are available for some samples, resulting in a hesitancy to engage a universal set because of the inability to integrate existing data. The tradeoffs associated with these choices can have long‐term consequences, both for the source study and for the downstream utility of the data generated. In an ideal world, researchers would interrogate the same set of comprehensive loci, with targets able to address evolutionary questions ranging from the divergence of major clades to population‐level studies, or even “next generation barcoding” (Johnson et al., 2019). However, the molecular evolution of plant genomes largely dictates that no one set of sampled loci is likely to fit this ideal range of desired qualities for all scales and levels of investigation; thus, researchers continue to struggle with the decision associated with adopting universal probes or designing and applying a lineage‐specific set, leading to suggestions that both classes of probe sets might be engaged in some projects (e.g., Couvreur et al., 2019). As part of a collaboration between the Plant and Fungal Trees of Life project (PAFTOL; https://www.kew.org/science/our‐science/projects/plant‐and‐fungal‐trees‐of‐life) (Baker et al., 2021) and a group of Brassicaceae systematists, we faced this issue when selecting probes for target enrichment–based phylogenomic studies of the Brassicaceae. A confluence of several previously independent research projects has led us to envision performing target capture sequencing for all 4000 species in the family. In this context, a case can be made to favor the use of the universal Angiosperms353 probe set (Johnson et al., 2019), with obvious emphasis on the long‐term added value of sequencing loci that could be combined with data from similar ongoing studies across the angiosperms. However, it could also be argued that a recently published Brassicaceae‐specific probe set (Nikolov et al., 2019), targeting more variable loci and four‐fold greater base pair representation, is better suited to resolving the fine details of the family’s phylogenetic relationships. With the availability of both the Angiosperms353 and Brassicaceae probe sets, and the amount of existing data generated using the latter, our path forward was not entirely clear. We all agreed that one of the least desirable options was embarking on separate, partially overlapping projects applying different probe sets. Ultimately, we settled on a pilot study to investigate the feasibility of applying both probe sets by combining them in a single hybridization reaction and sequencing captured targets simultaneously. Ideally, this would facilitate the capture of universal and lineage‐specific loci with minimal extra effort and only a small additional cost per sample associated with the purchase of two probe sets. Here, we test the efficacy of combining two probe sets that share just 30 loci, the Angiosperms353 probes (353 loci, 260 kbp total length) and the Brassicaceae‐specific set (1827 exons [“Nikolov1827”] derived from 764 loci, 940 kbp total length), using three different sets of Brassicaceae gDNA samples and enriched libraries generated in two independent labs. Because neither lab had prior experience with these approaches, the study offers both an assessment of combining probe sets and the feasibility of doing so in a variety of labs with limited experience in the generation of target capture data.

METHODS AND RESULTS

DNA extraction and library preparation

The DNA samples (Appendix 1) used as part of our broader study were obtained from a combination of new extractions using a QIAGEN DNeasy PowerPlant Pro Kit (with subsequent purification of greenish extracts using the DNeasy PowerClean CleanUp Kit; QIAGEN, Hilden, Germany) and existing extractions from a prior project generated using the extraction protocol of Alexander et al. (2006). These extractions were used to develop three example target‐enrichment Brassicaceae data sets (Table 1) from two independent labs, the Bailey lab (New Mexico State University, Las Cruces, New Mexico, USA) and Naturalis Biodiversity Center (Leiden, The Netherlands; principal investigator: Frederic Lens). Example enrichment sets 1 (six libraries) and 2 (10 libraries) were generated in the Bailey lab, while set 3 (10 libraries) came from Naturalis. The Bailey lab samples were all representatives of the tribe Boechereae, while the Naturalis samples (obtained from collections at the University of Osnabrück, Osnabrück, Germany) represent a broader sampling across the Brassicaceae.
TABLE 1

Samples included in each set of example enrichments. Sample sets 1 and 2 were generated by the Bailey lab (New Mexico State University), while set 3 came from the Naturalis Biodiversity Center.

Sample setSpeciesDNA extraction label a NCBI SRA ID
1 Boechera sanluisensis P. J. AlexanderPJA296ASAMN17836232
Cusickiella douglasii (A. Gray) RollinsPJA370ASAMN17836233
Cusickiella douglasii PJA370BSAMN17836234
Cusickiella douglasii PJA370CSAMN17836235
Halimolobos jaegeri (Munz) RollinsPJA244SAMN17836236
Sandbergia whitedii GreenePJA248SAMN17836237
2 Boechera paupercula (Greene) Windham & Al‐ShehbazJB242SAMN17836238
Boechera pendulina (Greene) W. A. WeberJB152SAMN17836239
Boechera pendulina w4485SAMN17836246
Boechera platysperma (A. Gray) Al‐ShehbazFW443SAMN17836245
Boechera rectissima (Greene) Al‐ShehbazJB274SAMN17836240
Boechera retrofracta (Graham) Á. Löve & D. LöveFW562SAMN17836241
Boechera schistacea (Rollins) DornLA474SAMN17836242
Boechera shevockii Windham & Al‐ShehbazFW757SAMN17836243
Boechera suffrutescens (S. Watson) DornJB967SAMN17836244
Yosemitea repanda (S. Watson) P. J. Alexander & WindhamJB171SAMN17836247
3 Diptychocarpus strictus Trautv.S0673SAMN17103305
Draba nuda (Bél.) Al‐Shehbaz & M. KochS0658SAMN17103302
Heliophila diffusa DC.S0807SAMN17103309
Heliophila elata Sond.S0797SAMN17103308
Heliophila linearis DC.S0816SAMN17103310
Heliophila suavissima Burch. ex DC.S0775SAMN17103306
Morettia canescens Boiss.S0791SAMN17103307
Notoceras bicorne AmoS0642SAMN17103301
Rorippa sylvestris (L.) BesserS0672SAMN17103304
Rytidocarpus moricandioides Coss.S0668SAMN17103303

NCBI SRA ID = National Center for Biotechnology Information Sequence Read Archive identification number.

Abbreviations that link vials of gDNA to specific DNA samples and genomic libraries.

Samples included in each set of example enrichments. Sample sets 1 and 2 were generated by the Bailey lab (New Mexico State University), while set 3 came from the Naturalis Biodiversity Center. NCBI SRA ID = National Center for Biotechnology Information Sequence Read Archive identification number. Abbreviations that link vials of gDNA to specific DNA samples and genomic libraries. Initially, the Bailey lab generated libraries from six silica gel–dried DNA extractions (set 1) of Boechereae species (Table 1). This set derived from fresh silica gel–dried leaves and included four taxa, with three technical replicates of one taxon (PJA370) to investigate reproducibility. Later, the Bailey lab generated results from hybridization reactions including 23–26 herbarium sample–derived libraries per enrichment. Ten samples, with between 1.5 million and 4 million recovered reads, were randomly selected for evaluation and presented in set 2 (Table 1). Similarly, Naturalis generated larger data sets with 15 or 16 herbarium‐derived libraries per hybridization, with 10 samples randomly selected for set 3 (Table 1). In the Bailey lab, the genomic libraries were generated using the NEBNext Ultra II FS kit (New England Biolabs, Ipswich, Massachusetts, USA). All library steps followed the production manual (E7805L kit, version 5.0), with a fragmentation time of 5–10 min and six (set 1) or seven (set 2) cycles of PCR amplification. New England Biolabs single‐ and dual‐index adapters were applied to sets 1 and 2, respectively. Libraries generated at Naturalis (set 3) used the same library kit and protocol, but with a 1‐min fragmentation using sonication in an M220 Focused‐ultrasonicator (Covaris, Woburn, Massachusetts, USA), indexing with IDT 10 primers (Integrated DNA Technologies, Coralville, Iowa, USA), and nine cycles of PCR amplification.

Target enrichment and sequencing

We employed the Brassicaceae‐specific bait set developed by Nikolov et al. (2019), along with Angiosperms353 (Johnson et al., 2019), both of which are available as Arbor Biosciences “myBaits” kits (Arbor Biosciences, Ann Arbor, Michigan, USA; https://arborbiosci.com/genomics/targeted‐sequencing/mybaits/). These kits have just 30 loci in common. Staff at Arbor Biosciences (Brian Brunelle, personal communication) noted that combined bait‐set approaches had been successfully applied and that the logical starting point for exploring a mixture of baits was to maintain the relative representation of each set in the hybridization reaction. The Angiosperms353 and Nikolov1827 kits include 80,000 and 40,000 probes, respectively. To maintain twice as many Angiosperms353 probes, the standard 5.5 µL of a single bait set used in the myBaits hybridization protocol (“Hybridization Capture for Targeted NGS” protocol, version 4.01 [April 2018]) was replaced with a 2 : 1 (v/v) mixture of Angiosperms353 : Nikolov1827 baits. All other hybridization steps followed the myBaits protocol with the 0.2‐mL plate format and four washing steps. For the Bailey lab enrichments, sets 1 and 2 targeted the equal inclusion of libraries based on mass (Qubit dsDNA HS Assay Kit; Thermo Fisher Scientific, Waltham, Massachusetts, USA), with 100 ng and 20 ng DNA per library, respectively. For set 2, the libraries were combined based on similar size distributions (400–450 bp, 450–500 bp, 500–550 bp, or >600 bp), as determined using a 0.7% agarose gel. The post‐hybridization libraries were subjected to 19 cycles of PCR with the KAPA HiFi amplification kit (Roche Sequencing, Pleasanton, California, USA) and IDT xGen amplification primers. The final post‐amplification cleanups were performed using ABM beads (Applied Biological Materials, Richmond, British Columbia, Canada). Quality control checks, the combining of enriched pools (set 2 only), and sequencing were performed by Novogene (Beijing, China). Set 1 was sequenced using an Illumina 150‐bp paired‐end (PE) MiSeq Micro (Illumina, San Diego, California, USA; targeting approximately 2 million reads/sample), while set 2 ran with 96 multiplexed samples on a lane of an Illumina HiSeq4000 (150 bp PE, targeting approximately 3 million reads/sample). A protocol for the hybridization reactions is provided in Appendix 2. The Naturalis‐derived enrichments (set 3) included 15.6 ng (in hybridization reactions with a total of 250 ng) or 33.3 ng (reactions with 500 ng) of each library in the target mixture. The DNA concentrations from libraries included in this study ranged between 1.0 and 25.9 ng/µL. Libraries were pooled into reactions based on the similarity of the fragment length distributions, as measured on a Fragment Analyzer with an HS Small Fragment DNF‐477 kit (Agilent Technologies, Santa Clara, California, USA). The post‐hybridization library was subjected to 20 cycles (plus five additional cycles for library S0775) of PCR with a KAPA HiFi HotStart Library Amp Kit (Roche Sequencing) and the general amplification primers (matching IDT i7 and i5 index primers), followed by a bead cleanup (Macherey‐Nagel, Düren, Germany). The amplified libraries were sequenced as 150 bp PEs using an Illumina NovaSeq 6000 at BaseClear (Leiden, The Netherlands), with a targeted sequence coverage of 325×. All raw data were uploaded to the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA; BioProjects PRJNA678873 and PRJNA700668).

Data analysis

The raw reads were downloaded onto a Supermicro H8QG6 server with 64 AMD 6272 processors and 512 GB of RAM. Their analyses primarily employed SuperDeduper (version 1.3.0, https://github.com/s4hts/HTStream) for tests of PCR duplicate removal, Trimmomatic (version 0.39; Bolger et al., 2014) for adapter removal and quality trimming (with the arguments ILLUMINACLIP:../TruSeq3‐PE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:50), and HybPiper (version 1.3.1; installed from https://github.com/mossmatters/HybPiper.git) for locus mapping and reconstruction (applying the script “reads_first.py”) and the generation of comparative statistics (applying scripts “get_seq_lengths.py” and “hybpiper_stats.py”). HybPiper is a wrapper that utilizes a variety of publicly available tools. Our analyses utilized elements that applied BWA version 0.7.12‐r1039 (Li and Durbin, 2010) for mapping reads to the target sets, Biopython (Cock et al., 2009) for handling reads, SAMtools version 1.9 (Li et al., 2009) for sorting reads, SPAdes version 3.13.0 (Bankevich et al., 2012) for the de novo assembly of loci, and GNU Parallel (Tange, 2011) for multithreading on the server. The target locus files were the Angiosperms353 set (https://github.com/mossmatters/Angiosperms353/blob/master/Angiosperms353_targetSequences.fasta) and the Nikolov et al. (2019) set obtained directly from the author (L. A. Nikolov, personal communication). Scripts for the applied informatics are available from GitHub (https://github.com/cdb3ny/combined_enrichment_probes). In short, “reads_first.py” generated the de novo reconstruction of each locus while “get_seq_lengths.py” provided the sequence lengths for the downstream statistical summaries that were generated through “hybpiper_stats.py”. Default parameters were applied in all cases. The reported “percent enrichments” represent the number of reads from a sample mapping to the target sequences relative to the total number of reads for that sample ([no. of mapped reads] / [no. of total reads] × 100). Given that the target sequences represent less than 1% of the total genome size for these taxa, this simple measure denotes the relative enrichment in the raw reads while providing a fairly accurate (± <1%) representation of the target enrichment component in relation to the general genome representation in the recovered reads.

Results

Three pipelines were applied to each of the sequenced sets of enriched libraries and target locus sets. These included running all raw paired data through: (1) SuperDeduper, Trimmomatic, and recovered PE data only through HybPiper; (2) Trimmomatic and recovered PE data only through HybPiper; or (3) Trimmomatic and all recovered reads (PE and single‐end [SE]) through HybPiper. A summary of key results is presented in Table 2. We also report the percentage of cleaned reads mapping to the target set and the percentage of loci recovered with at least 75% sequence length as a primary measure of sequence enrichment and locus recovery for the samples within each data set (Appendix 3).
TABLE 2

Summary of the enrichment and locus reconstruction results for assemblies based on all (paired‐end and single‐end) trimmed reads without PCR deduplication. A locus was considered “recovered” from a sample when at least 75% of its read length was reconstructed.

StatisticData set
Set 1Set 2Set 3
No. of samples included61010
Samples in the hybridization reaction623–2615–16
Raw reads per sample (mean (range))2.1 M a (100,000–6.25 M)3 M (1.9 M–3.9 M)1.6 M b (512,000–4.3 M)
Mean % of Angiosperms353 enrichment3124.8024.50
Mean % of Angiosperms353 targets recovered708884
Mean Angiosperms353 theoretical read coverage338343219
Mean % of Nikolov1827 enrichment594335
Mean % of Nikolov1827 targets recovered759479
Mean Nikolov1827 theoretical read coverage18016788

M = million.

Two of the six samples had fewer than 500,000 reads.

Two of 10 samples had fewer than 1 M reads.

Summary of the enrichment and locus reconstruction results for assemblies based on all (paired‐end and single‐end) trimmed reads without PCR deduplication. A locus was considered “recovered” from a sample when at least 75% of its read length was reconstructed. M = million. Two of the six samples had fewer than 500,000 reads. Two of 10 samples had fewer than 1 M reads. Whenever PCR deduplication was applied as the first step in the pipeline, we observed a considerable loss of reads recovered and subsequently available for mapping to loci (Appendix 3). This was especially pronounced for samples with low levels of recovered raw reads (e.g., <1 million), highlighting problems with including a PCR deduplication step. This issue was noted by the author of HybPiper, resulting in his not recommending the use of deduplication when applying the pipeline (M. G. Johnson, Texas Tech University, personal communication). The PCR deduplication–derived results are not discussed or presented further. The two remaining implementations, both excluding deduplication, produced similar results. Unsurprisingly, the use of all reads (PE and SE) recovered a few additional loci (Appendix 3). The utility of adding SE data to the PE data was particularly pronounced with the MiSeq results, which are known to generate lower‐quality reverse reads under some circumstances (M. G. Johnson, personal communication). Thus, the MiSeq data retained more SE forward read–only sequences than SE reverse reads after quality trimming. Even so, the difference in the percentage of loci recovered was minimal (Appendix 3). The most important take home message from either the PE‐only or PE+SE results is the high degree of sequence enrichment achieved for both groups of target loci (Table 2, Appendix 3). From this point, we use the results from the PE+SE analyses (Table 2) to discuss the potential for mixing probe sets in one hybridization reaction. Considering each of the three example data sets, the average percent of cleaned reads mapping to the Angiosperms353 and Nikolov1827 targets were 24–31% and 35–59%, respectively. For some samples, 90% of cleaned reads mapped to the target sequences. These high levels of enrichment were most pronounced in set 1, which included just six libraries. A modest decrease in enrichment efficiency was observed for sets 2 and 3, which each included at least 15 samples per hybridization reaction (Table 2, Appendix 3). The Angiosperms353 and Nikolov1827 bait sets correspond to 260 kbp and 940 kbp of exon‐derived data, respectively; thus, an increased fraction of reads mapping to the Nikolov1827 targets (Table 2) is important for reconstructing a larger portion of genome space. Using the genomic portion represented by each probe set, the total number of reads mapped per sample, and an estimated 145 bp length for the average retained cleaned reads, we calculated an average theoretical coverage across loci ([no. of reads × 145 bp] / [bp of genomic space of each target file per sample]) (Table 2, Appendix 3). The theoretical coverage of Angiosperms353 loci was 1.8–2.5 times greater than that for the Nikolov1827 probe set (Table 2); nonetheless, the percentage recovery of loci was similar (differing by less than 5% within data sets). Hale et al. (2020) suggested that between 300,000 and 1 million reads represented a reasonable target for the 300 bp PE data generated by a MiSeq run for the high recovery of Angiosperms353 loci. Our data are 150 bp PE, making a corresponding estimate for our data of 600,000 to 2 million reads per sample, which fits well with the generally high recovery of loci from both probe sets (Appendix 3). Our results from the simultaneous hybridization of two different probe sets were supportive of the 2 : 1 Angiosperms353 : Nikolov1827 bait ratio, without requiring a greater sequencing depth than one might have applied for a single bait set. We feel that the simultaneous enrichment, using two different groups of probes, is strikingly balanced considering the mixture of up to 26 libraries in the enrichments and the fact that post‐enrichment libraries were subjected to ≥19 cycles of PCR. We consider the results presented here to be a promising outcome, one that is currently guiding the generation of new data for Brassicaceae. Thus far, the larger‐scale preliminary results from those data (Bailey et al. and Hendriks et al., unpublished data) are similar to those presented here. Nonetheless, when choosing bait by taxon combinations with lower hybridization efficiency, adjustments may be needed in both the bait ratio and the depth of sequencing required for the recovery of a high percentage of loci from each target set.

CONCLUSIONS

The high levels of enrichment and locus reconstruction for two different sets of loci, obtained through one enrichment step, demonstrate that target‐enrichment projects can be easily expanded to include a greater portion of genome space. Prior studies suggest that hybridization efficiency can range from around 15% to 80% (Hale et al., 2020). The high degree of hybridization efficiency observed here, ranging up to 90% of cleaned reads mapping to one target file or the other, are likely the outcome of the high sequence similarity between our Boechereae samples and other Brassicaceae samples and between the orthologs used in the design of both sets of probes, which drew heavily on the Arabidopsis thaliana (L.) Heynh. (Brassicaceae) genome. In the case of Angiosperms353, for which 15 or fewer target instances were selected from across the angiosperms for each of the target loci using k‐medoids clustering, a further three instances were added from the A. thaliana, Oryza sativa L., and Amborella trichopoda Baill. genomes, rendering the probe set especially effective in their respective families. This ensures a fair comparison of probe performance (in terms of reads on target) as presented here. When implementing a similar approach using probe mixes whose design lacked a closely matching genome for the study group, lower enrichment efficiencies are likely. It will be prudent to invest in similar preliminary studies early in the project. If an imbalance in recovered loci is detected, adjustments in the ratio of baits can easily be made. This study illustrates the potential ease with which new target capture data can be simultaneously generated for multiple probe sets, with relatively little extra cost or work per sample. Our robust results suggest that researchers interested in combining multiple probe sets (e.g., a universal plus lineage‐specific, multiple universal, or even multiple lineage‐specific sets) can achieve this in one step. The successful simultaneous application of bait sets will hopefully be adopted in other projects to maximize the generation of useful data for wide‐ranging investigations in evolutionary biology. As the availability of bait sets increases and the cost of sequencing continues to decline, there is no obvious reason to limit the combination of probes to just two sets. It should be possible to mix multiple bait sets (e.g., universal, lineage‐specific, or gene family [e.g., nodulation or others]), perhaps even including baits that target different taxa in shared tissues (e.g., endosymbionts and parasites). It is hoped that these practical findings will relieve researchers of some difficult decision‐making, ultimately leading to the generation of a broader spectrum of loci serving the interests of our research communities in terms of generating data with wider downstream applications.

AUTHOR CONTRIBUTIONS

All authors contributed to the design and writing and/or revision of the manuscript. K.P.H., T.M., and A.H.H. isolated the gDNA. C.D.B. and K.P.H. generated the libraries and enrichment data. C.D.B., K.P.H., N.M.H., and E.L. conducted analyses related to the project. C.D.B., W.J.B., F.L., and K.P.H. wrote the primary body of the manuscript. All authors agreed with the final version of the manuscript and its submission for publication.
SpeciesVoucher specimen, collection no., herbarium a Collection localityGeographic coordinates
Boechera sanluisensis P.J. Alexander 599B, NMCCarson National Forest, ±1.75 miles WNW of Tres Piedras, ±0.15 miles N of US Highway 64, Rio Ariba County, New Mexico, USA36.6544, –105.9974
Halimolobos jaegeri Erik Schranz, 1074, personal collectionUSANA
Sandbergia whitedii Erik Schranz, 1080, personal collectionUSANA
Boechera paupercula Alexander 1107, DUKETulare County, California, USA36.4003, –118.5727
Boechera platysperma s.l.Howden 12, UCAlpine County, California, USA38.4704, –119.9967
Boechera pendulina Windham et al. 3709a, DUKEClark County, Nevada, USA36.2609, –115.6086
Boechera rectissima Alexander 1026, DUKEFresno County, California, USA37.0542, –119.1551
Boechera retrofracta Soper 5470, CANBruce County, Ontario, Canada44.9323, –81.1343
Boechera schistacea Windham & Allphin 4307, DUKEUinta County, Wyoming, USA41.0756, –110.3806
Boechera shevockii Shevock 10098, GHTulare County, California, USA36.0210, –118.4167
Boechera suffrutescens Cusick s.n., OREBaker County, Oregon, USA44.9718, –116.862
Boechera platysperma Howden 12, UCAlpine County, California, USA38.4704, –119.9967
Boechera pendulina Windham et al. 4435, DUKEFremont County, Wyoming, USA42.4302, –109.0342
Cusickiella douglasii M.D. Windham & L. Allphin 3362, NMCBox Elder County, Utah, USA41.7675, –113.9419
Yosemitea repanda Alexander et al. 845f, DUKEInyo County, California, USA37.209, –118.6124
Diptychocarpus strictus TUH35369, TUH60 km away from Delijan from Esfahan, Esfahan Province, Iran33.017, –51.567
Draba nuda Solomon et al. 21443, Gomez‐Campo CollectionTajikistanNA
Heliophila diffusa NGS311, NBGClanwilliam, Cederberg, Western Cape, South Africa. Road to Pakhuis Pass, at Leipoldt’s Grave32.135, –18.989
Heliophila elata Mummenhoff & Ramdhani 65, personal collectionSouth Africa. Along road 364 from Butterkloof Pass to Clanwilliam, 200 m W of Elizabethsfontein junctionNA
Heliophila linearis Linder P14, personal collectionGeelbek Lagoon, Darling District, South AfricaNA
Heliophila suavissima Clark et al. 135, GRAFarm Puttersvlei 190, Karoo National Park (Beaufort West), Western Cape, South Africa32.264, –22.499
Morettia canescens Staudinger, 13669, OSBUJbel Sarho, Zagora, MoroccoNA
Notoceras bicorne Neuffer, 19678, OSBUFermes, Lanzarote, Canary Islands, Spain28.883, –13.750
Rorippa sylvestris Neuffer, Hurka, Friesen, 18646, OSBUBezirk Smolenskoje, Altaijski Kraij, Siberia, Russia. About 35 km south of Bijsk and 10 km south of Smolenskoje along the Pestschanaja river37.478, –71.603
Rytidocarpus moricandioides GCC0708‐67, Gomez‐Campo CollectionBotanical Garden Paris, FranceNA

Note: NA = not available.

Herbarium abbreviations are per Thiers et al. (2021).

ComponentAmount per reaction (μL)Amount for four reactions (μL)
Hyb N9.2537
Hyb D3.514
Hyb S0.52
Hyb R1.255
Baits5.522
TOTAL 20 80

Note: The introduction of Hyb S will cause cloudiness; the mixture will clarify after step 3.

ComponentAmount per reaction (μL)Amount for four reactions (μL)
Block A0.52
Block C2.510
Block O2.510
TOTAL 5.5 22
StepTemperatureTime
195°C5 min
2Hybridization temperature (65°C)5 min
3Hybridization temperature (65°C)
ReagentsAmount per reaction (μL)Amount for four reactions (μL)
Hyb S936
NF water9003600
Wash buffer227908
ComponentFinal concentrationAmount per reaction (μL)
Nuclease‐free water8.75
2× KAPA HiFi HotStart ReadyMix25
IDT xGEN amp primers (20 μM)500 nM1.25
Enriched library (pellet the beads before pulling off the 15‐µL aliquot)15*
Total 50

The remaining bead‐bound library can be stored at –20°C for several months.

StepTemperatureTime
198°C2 min
298°C20 s
360°C30 s
472°CLength‐dependent a
5Return to step 2 for appropriate number of cycles b
572°C5 min
68°C

Extension time can be library‐size dependent (when in doubt, a slightly longer time is acceptable). A mean length <500 bp requires 30 s, a mean of 500–700 bp requires 45 s, while a mean length >700 bp requires 1 min.

The number of cycles needs to be empirically determined. For this study, we used 17 cycles total.

Note

SetSampleSource and analysis pipelineTargetNo. of raw readsNo. of trimmed readsNo. of reads mappedFraction mapped to targetLoci with at least 75% of the target sequence length recoveredTheoretical coveragePercentage of loci recovered with 75%
1PJA244_S6Bailey, SDD+T+PEAngio353109,02423,76874150.3154.141.42
PJA248_S5Bailey, SDD+T+PEAngio353207,78447,49914,9480.32278.347.65
PJA296A_S4Bailey, SDD+T+PEAngio3536,255,1181,944,653570,8870.29323318.3891.50
PJA370‐A_S1Bailey, SDD+T+PEAngio3531,946,754664,426194,6660.29296108.5683.85
PJA370‐B_S2Bailey, SDD+T+PEAngio3532,031,530623,004185,5640.30291103.4982.44
PJA370‐C_S3Bailey, SDD+T+PEAngio3532,315,704684,511205,2670.30294114.4883.29
Averages 2,144,319.00 664,643.50 196,457.83 0.30 206.00 109.56 58.36
PJA244_S6Bailey, T+PEAngio353109,02464,58120,6710.325311.5315.01
PJA248_S5Bailey, T+PEAngio353207,784129,52541,6560.3212223.2334.56
PJA296A_S4Bailey, T+PEAngio3536,255,1185,304,1711,661,2750.31327926.4892.63
PJA370‐A_S1Bailey, T+PEAngio3531,946,7541,713,693514,1300.30317286.7389.80
PJA370‐B_S2Bailey, T+PEAngio3532,031,5301,657,000504,3470.30316281.2789.52
PJA370‐C_S3Bailey, T+PEAngio3532,315,7041,845,307564,8510.31318315.0190.08
Averages 2,144,319.00 1,785,712.83 551,155.00 0.31 242.17 307.37 68.60
PJA244_S6Bailey, T+PE+SEAngio353109,02486,53426,7340.316014.9117.00
PJA248_S5Bailey, T+PE+SEAngio353207,784168,75152,7430.3113729.4138.81
PJA296A_S4Bailey, T+PE+SEAngio3536,255,1185,783,8801,811,5390.313271010.2892.63
PJA370‐A_S1Bailey, T+PE+SEAngio3531,946,7541,839,458554,7730.30320309.3990.65
PJA370‐B_S2Bailey, T+PE+SEAngio3532,031,5301,850,926563,4950.30319314.2690.37
PJA370‐C_S3Bailey, T+PE+SEAngio3532,315,7042,086,092636,0190.31319354.7090.37
Averages 2,144,319.00 1,969,273.50 607,550.50 0.31 247.00 338.83 69.97
PJA244_S6Bailey, SDD+T+PENikolov1827109,02423,73414,8190.62182.290.99
PJA248_S5Bailey, SDD+T+PENikolov1827207,78447,34928,0810.591224.336.68
PJA296A_S4Bailey, SDD+T+PENikolov18276,255,1181,942,3091,123,8140.581782173.3597.54
PJA370‐A_S1Bailey, SDD+T+PENikolov18271,946,754661,998380,9370.58150058.7682.10
PJA370‐B_S2Bailey, SDD+T+PENikolov18272,031,530621,380362,9520.58150955.9982.59
PJA370‐C_S3Bailey, SDD+T+PENikolov18272,315,704683,370403,1430.59156862.1985.82
Averages 2,144,319.00 663,356.67 385,624.33 0.59 1083.17 59.48 59.29
PJA244_S6Bailey, T+PENikolov1827109,02464,49040,8870.633186.3117.41
PJA248_S5Bailey, T+PENikolov1827207,784129,10977,5040.6063611.9634.81
PJA296A_S4Bailey, T+PENikolov18276,255,1185,297,5523,250,6080.611813501.4299.23
PJA370‐A_S1Bailey, T+PENikolov18271,946,7541,706,716998,2140.591754153.9896.00
PJA370‐B_S2Bailey, T+PENikolov18272,031,5301,651,985978,4390.591741150.9395.29
PJA370‐C_S3Bailey, T+PENikolov18272,315,7041,841,6111,100,9630.601758169.8396.22
Averages 2,144,319.00 1,781,910.50 1,074,435.83 0.60 1336.67 165.74 73.16
PJA244_S6Bailey, T+PE+SENikolov1827109,02486,31052,4770.614078.0922.28
PJA248_S5Bailey, T+PE+SENikolov1827207,784167,81797,1360.5875814.9841.49
PJA296A_S4Bailey, T+PE+SENikolov18276,255,1185,774,1493,528,0330.611812544.2299.18
PJA370‐A_S1Bailey, T+PE+SENikolov18271,946,7541,826,5761,059,3100.581758163.4096.22
PJA370‐B_S2Bailey, T+PE+SENikolov18272,031,5301,840,7181,075,6800.581742165.9395.35
PJA370‐C_S3Bailey, T+PE+SENikolov18272,315,7042,077,9581,225,4450.591759189.0396.28
Averages 2,144,319.00 1,962,254.67 1,173,013.50 0.59 1372.67 180.94 75.13
2FW443Bailey, SDD+T+PEAngio3531,973,768173,17651,9230.310628.9630.03
FW562Bailey, SDD+T+PEAngio3533,873,0801,185,052117,7240.09927665.6578.19
FW757Bailey, SDD+T+PEAngio3531,916,118284,85073,2890.25721740.8761.47
JB152Bailey, SDD+T+PEAngio3532,225,116687,086119,1500.17328966.4581.87
JB171Bailey, SDD+T+PEAngio3532,786,144795,811125,3530.15827169.9176.77
JB242Bailey, SDD+T+PEAngio3532,875,092953,137205,3600.215312114.5388.39
JB274Bailey, SDD+T+PEAngio3533,896,5341,459,263195,1110.134306108.8186.69
JB967Bailey, SDD+T+PEAngio3533,486,402327,50194,9900.2925852.9873.09
LA474Bailey, SDD+T+PEAngio3533,933,178517,705138,4340.26730377.2085.84
W4485Bailey, SDD+T+PEAngio3533,744,160376,946110,8260.29428461.8180.45
Averages 3,070,959.2 676,052.7 123,216 0.2187 262.2 68.72 74.28
FW443Bailey, T+PEAngio3531,973,768999,416309,2780.309229172.4864.87
FW562Bailey, T+PEAngio3533,873,0803,113,683549,2610.176322306.3291.22
FW757Bailey, T+PEAngio3531,916,1181,194,875335,7280.281300187.2384.99
JB152Bailey, T+PEAngio3532,225,1161,690,398408,7540.242322227.9691.22
JB171Bailey, T+PEAngio3532,786,1441,797,747382,9920.213315213.5989.24
JB242Bailey, T+PEAngio3532,875,0922,367,833535,1580.226329298.4593.20
JB274Bailey, T+PEAngio3533,896,5343,072,867523,5380.17314291.9788.95
JB967Bailey, T+PEAngio3533,486,4021,669,004530,6020.318314295.9188.95
LA474Bailey, T+PEAngio3533,933,1782,715,849812,4590.299322453.1091.22
W4485Bailey, T+PEAngio3533,744,1602,031,630642,8250.316318358.5090.08
Averages 3,070,959.2 2,065,330.2 503,059.5 0.255 308.5 280.55 87.39
FW443Bailey, T+PE+SEAngio3531,973,7681,451,430414,4990.286244231.1669.12
FW562Bailey, T+PE+SEAngio3533,873,0803,476,610601,3700.173322335.3891.22
FW757Bailey, T+PE+SEAngio3531,916,1181,524,794417,4990.274300232.8484.99
JB152Bailey, T+PE+SEAngio3532,225,1161,938,915461,6530.238324257.4691.78
JB171Bailey, T+PE+SEAngio3532,786,1442,221,872462,0380.208321257.6890.93
JB242Bailey, T+PE+SEAngio3532,875,0922,622,550582,9960.222325325.1392.07
JB274Bailey, T+PE+SEAngio3533,896,5343,442,445574,9560.167317320.6589.80
JB967Bailey, T+PE+SEAngio3533,486,4022,525,497773,6990.306314431.4988.95
LA474Bailey, T+PE+SEAngio3533,933,1783,299,070982,3810.298325547.8792.07
W4485Bailey, T+PE+SEAngio3533,744,1602,843,615887,9270.312321495.1990.93
Averages 3,070,959.20 2,534,679.80 615,901.80 0.25 311.30 343.48 88.19
FW443Bailey, SDD+T+PENikolov18273,744,160171,16182,0230.47951512.6528.19
FW562Bailey, SDD+T+PENikolov18273,486,4021,185,811229,9650.194124435.4768.09
FW757Bailey, SDD+T+PENikolov18272,225,116284,213132,2670.46592120.4050.41
JB152Bailey, SDD+T+PENikolov18271,973,768686,733205,9850.3125531.7768.69
JB171Bailey, SDD+T+PENikolov18272,786,144795,638219,6960.276124333.8968.04
JB242Bailey, SDD+T+PENikolov18273,933,178955,986363,0960.38147256.0180.57
JB274Bailey, SDD+T+PENikolov18272,875,0921,467,415344,8300.235149353.1981.72
JB967Bailey, SDD+T+PENikolov18273,873,080326,346155,1010.475107623.9358.89
LA474Bailey, SDD+T+PENikolov18273,896,534517,028231,3710.448137635.6975.31
W4485Bailey, SDD+T+PENikolov18271,916,118375,095185,6120.495125528.6368.69
Averages 3,070,959.20 676,542.60 214,994.60 0.37 1185.00 33.16 64.86
FW443Bailey, T+PENikolov18273,744,160987,931484,5890.491137174.7575.04
FW562Bailey, T+PENikolov18273,486,4023,118,7791,075,2450.3451762165.8696.44
FW757Bailey, T+PENikolov18272,225,1161,190,913605,6660.509161193.4388.18
JB152Bailey, T+PENikolov18271,973,7681,688,260698,3110.4141736107.7295.02
JB171Bailey, T+PENikolov18272,786,1441,795,223664,2510.371687102.4692.34
JB242Bailey, T+PENikolov18273,933,1782,374,936936,3720.3941768144.4496.77
JB274Bailey, T+PENikolov18272,875,0923,098,713914,5470.2951736141.0795.02
JB967Bailey, T+PENikolov18273,873,0801,661,855843,3290.5071655130.0990.59
LA474Bailey, T+PENikolov18273,896,5342,710,9521,326,7740.4891766204.6696.66
W4485Bailey, T+PENikolov18271,916,1182,018,3571,056,2890.5231732162.9494.80
Averages 3,070,959.2 2,064,591.9 860,537.3 0.4337 1682.4 132.74 92.09
FW443Bailey, T+PE+SENikolov18271,973,7681,439,504663,5590.4611446102.3679.15
FW562Bailey, T+PE+SENikolov18273,873,0803,482,3421,183,6420.341768182.5896.77
JB152Bailey, T+PE+SENikolov18272,225,1161,936,510791,0470.4081751122.0295.84
JB171Bailey, T+PE+SENikolov18272,786,1442,218,044800,0400.3611715123.4193.87
JB242Bailey, T+PE+SENikolov18272,875,0922,629,9611,026,0620.391770158.2896.88
JB274Bailey, T+PE+SENikolov18273,896,5343,470,5091,007,9440.291749155.4895.73
JB967Bailey, T+PE+SENikolov18273,486,4022,518,1241,255,7480.4991715193.7193.87
LA474Bailey, T+PE+SENikolov18273,933,1783,294,2001,623,2010.4931777250.3997.26
W4485Bailey, T+PE+SENikolov18273,744,1602,820,4691,442,4580.5111756222.5196.11
Averages 3,070,959.20 2,535,691.99 1,064,593.39 0.43 1714.67 164.22 93.85
3S0642Naturalis, SDD+T+PEAngio3532,241,5581,678,382201,0540.12225112.1363.74
S0658Naturalis, SDD+T+PEAngio3532,341,6301,283,758298,2720.232284166.3480.45
S0668Naturalis, SDD+T+PEAngio3534,323,2243,010,397553,8990.184270308.9176.49
S0672Naturalis, SDD+T+PEAngio3531,005,866715,945181,4590.253267101.2075.64
S0673Naturalis, SDD+T+PEAngio353512,280375,30987,3790.23322248.7362.89
S0775Naturalis, SDD+T+PEAngio3531,855,98689,08918,0770.2032010.085.67
S0791Naturalis, SDD+T+PEAngio3531,403,254554,88472,1820.1318440.2652.12
S0797Naturalis, SDD+T+PEAngio3531,266,122497,66978,7790.15818943.9353.54
S0807Naturalis, SDD+T+PEAngio3531,060,492784,329169,3970.21618494.4752.12
S0816Naturalis, SDD+T+PEAngio353648,334469,121103,9430.22220257.9757.22
Averages 1,665,874.6 945,888.3 176,444.1 0.1951 204.7 98.40 57.99
S0642Naturalis, T+PEAngio3532,241,5582,196,887366,4300.167302204.3685.55
S0658Naturalis, T+PEAngio3532,341,6302,308,263630,1320.273333351.4294.33
S0668Naturalis, T+PEAngio3534,323,2244,282,247997,5570.233329556.3393.20
S0672Naturalis, T+PEAngio3531,005,866996,920295,0270.296324164.5391.78
S0673Naturalis, T+PEAngio353512,280504,950143,4060.28430779.9886.97
S0775Naturalis, T+PEAngio3531,855,9861,832,692513,3040.28201286.2756.94
S0791Naturalis, T+PEAngio3531,403,2541,381,003239,2060.173300133.4084.99
S0797Naturalis, T+PEAngio3531,266,1221,236,899266,0140.215293148.3583.00
S0807Naturalis, T+PEAngio3531,060,4921,047,471273,6890.261276152.6378.19
S0816Naturalis, T+PEAngio353648,334637,945170,0710.26729194.8582.44
Averages 1,665,874.6 1,642,527.7 389,483.6 0.2449 295.6 217.21 83.74
S0642Naturalis, T+PE+SEAngio3532,241,5582,219,047367,7680.166302205.1085.55
S0658Naturalis, T+PE+SEAngio3532,341,6302,325,982633,5750.272333353.3494.33
S0668Naturalis, T+PE+SEAngio3534,323,2244,330,6781,020,0200.236329568.8693.20
S0672Naturalis, T+PE+SEAngio3531,005,8661,002,483296,3530.296324165.2791.78
S0673Naturalis, T+PE+SEAngio353512,280508,547144,0780.28330780.3586.97
S0775Naturalis, T+PE+SEAngio3531,855,9861,846,386516,2460.28201287.9156.94
S0791Naturalis, T+PE+SEAngio3531,403,2541,391,718240,2000.173301133.9685.27
S0797Naturalis, T+PE+SEAngio3531,266,1221,251,466267,5430.214292149.2182.72
S0807Naturalis, T+PE+SEAngio3531,060,4921,054,416274,9410.261276153.3378.19
S0816Naturalis, T+PE+SEAngio353648,334643,131170,9420.26629195.3382.44
Averages 1,665,874.60 1,657,385.40 393,166.60 0.24 295.60 219.27 83.74
S0642Naturalis, SDD+T+PENikolov18272,241,5581,676,774340,1220.2031392189.6876.19
S0658Naturalis, SDD+T+PENikolov18272,341,6301,282,734443,6020.3461572247.3986.04
S0668Naturalis, SDD+T+PENikolov18274,323,2242,990,7301,024,9720.3431664571.6291.08
S0672Naturalis, SDD+T+PENikolov18271,005,866714,302289,2640.4051547161.3284.67
S0673Naturalis, SDD+T+PENikolov1827512,280375,424143,5520.382121480.0666.45
S0775Naturalis, SDD+T+PENikolov18271,855,98688,91826,4200.2979214.735.04
S0791Naturalis, SDD+T+PENikolov18271,403,254554,750114,6760.20796563.9552.82
S0797Naturalis, SDD+T+PENikolov18271,266,122497,342125,8280.25394570.1751.72
S0807Naturalis, SDD+T+PENikolov18271,060,492783,710289,9100.371324161.6872.47
S0816Naturalis, SDD+T+PENikolov1827648,334468,910167,8120.358114393.5962.56
Averages 1,665,874.60 943,359.40 296,615.80 0.32 1185.80 165.42 64.90
S0642Naturalis, T+PENikolov18272,241,5582,194,824528,8890.241152481.5883.42
S0658Naturalis, T+PENikolov18272,341,6302,306,466864,5000.3751701133.3593.10
S0668Naturalis, T+PENikolov18274,323,2244,249,9001,651,1360.3891758254.7096.22
S0672Naturalis, T+PENikolov18271,005,866994,699423,7270.426162565.3688.94
S0673Naturalis, T+PENikolov1827512,280505,046209,0180.414134132.2473.40
S0775Naturalis, T+PENikolov18271,855,9861,829,251640,2490.3584198.7646.03
S0791Naturalis, T+PENikolov18271,403,2541,380,874324,4240.235148150.0481.06
S0797Naturalis, T+PENikolov18271,266,1221,236,252363,7520.294145556.1179.64
S0807Naturalis, T+PENikolov18271,060,4921,046,600420,8740.402143764.9278.65
S0816Naturalis, T+PENikolov1827648,334637,645250,4490.393131338.6371.87
Averages 1,665,874.60 1,638,155.70 567,701.80 0.35 1447.60 87.57 79.23
S0642Naturalis, T+PE+SENikolov18272,241,5582,216,884530,6620.239152781.8683.58
S0658Naturalis, T+PE+SENikolov18272,341,6302,324,136869,1650.3741701134.0793.10
S0668Naturalis, T+PE+SENikolov18274,323,2244,285,6031,655,9190.3861760255.4396.33
S0672Naturalis, T+PE+SENikolov18271,005,8661,000,101425,4260.425162565.6288.94
S0673Naturalis, T+PE+SENikolov1827512,280508,642210,1460.413134232.4273.45
S0775Naturalis, T+PE+SENikolov18271,855,9861,842,191642,7610.34984999.1546.47
S0791Naturalis, T+PE+SENikolov18271,403,2541,391,540325,8980.234148150.2781.06
S0797Naturalis, T+PE+SENikolov18271,266,1221,250,748365,7760.292145456.4279.58
S0807Naturalis, T+PE+SENikolov18271,060,4921,053,332422,4000.401144165.1678.87
S0816Naturalis, T+PE+SENikolov1827648,334642,818251,7660.392131538.8471.98
Averages 1,665,874.60 1,651,599.50 569,991.90 0.35 1449.50 87.92 79.34

Note: PE = recovered paired‐end‐only data; SDD = SuperDeduper; SE = single end; T = Trimmomatic.

  21 in total

1.  A Universal Probe Set for Targeted Sequencing of 353 Nuclear Genes from Any Flowering Plant Designed Using k-Medoids Clustering.

Authors:  Matthew G Johnson; Lisa Pokorny; Steven Dodsworth; Laura R Botigué; Robyn S Cowan; Alison Devault; Wolf L Eiserhardt; Niroshini Epitawalage; Félix Forest; Jan T Kim; James H Leebens-Mack; Ilia J Leitch; Olivier Maurin; Douglas E Soltis; Pamela S Soltis; Gane Ka-Shu Wong; William J Baker; Norman J Wickett
Journal:  Syst Biol       Date:  2019-07-01       Impact factor: 15.683

2.  Anchored phylogenomics improves the resolution of evolutionary relationships in the rapid radiation of Protea L.

Authors:  Nora Mitchell; Paul O Lewis; Emily Moriarty Lemmon; Alan R Lemmon; Kent E Holsinger
Journal:  Am J Bot       Date:  2017-01-19       Impact factor: 3.844

3.  Recalcitrant deep and shallow nodes in Aristolochia (Aristolochiaceae) illuminated using anchored hybrid enrichment.

Authors:  Stefan Wanke; Carolina Granados Mendoza; Sebastian Müller; Anna Paizanni Guillén; Christoph Neinhuis; Alan R Lemmon; Emily Moriarty Lemmon; Marie-Stéphanie Samain
Journal:  Mol Phylogenet Evol       Date:  2017-05-20       Impact factor: 4.286

4.  A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life.

Authors:  William J Baker; Paul Bailey; Vanessa Barber; Abigail Barker; Sidonie Bellot; David Bishop; Laura R Botigué; Grace Brewer; Tom Carruthers; James J Clarkson; Jeffrey Cook; Robyn S Cowan; Steven Dodsworth; Niroshini Epitawalage; Elaine Françoso; Berta Gallego; Matthew G Johnson; Jan T Kim; Kevin Leempoel; Olivier Maurin; Catherine Mcginnie; Lisa Pokorny; Shyamali Roy; Malcolm Stone; Eduardo Toledo; Norman J Wickett; Alexandre R Zuntini; Wolf L Eiserhardt; Paul J Kersey; Ilia J Leitch; Félix Forest
Journal:  Syst Biol       Date:  2022-02-10       Impact factor: 15.683

5.  A protocol for targeted enrichment of intron-containing sequence markers for recent radiations: A phylogenomic example from Heuchera (Saxifragaceae).

Authors:  Ryan A Folk; Jennifer R Mandel; John V Freudenstein
Journal:  Appl Plant Sci       Date:  2015-08-14       Impact factor: 1.936

6.  Integrating genomic resources to present full gene and putative promoter capture probe sets for bread wheat.

Authors:  Laura-Jayne Gardiner; Thomas Brabbs; Alina Akhunov; Katherine Jordan; Hikmet Budak; Todd Richmond; Sukhwinder Singh; Leah Catchpole; Eduard Akhunov; Anthony Hall
Journal:  Gigascience       Date:  2019-04-01       Impact factor: 6.524

7.  Fast and accurate long-read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

8.  Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics.

Authors:  Kevin Weitemier; Shannon C K Straub; Richard C Cronn; Mark Fishbein; Roswitha Schmickl; Angela McDonnell; Aaron Liston
Journal:  Appl Plant Sci       Date:  2014-08-29       Impact factor: 1.936

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  Phylogenomics of the Major Tropical Plant Family Annonaceae Using Targeted Enrichment of Nuclear Genes.

Authors:  Thomas L P Couvreur; Andrew J Helmstetter; Erik J M Koenen; Kevin Bethune; Rita D Brandão; Stefan A Little; Hervé Sauquet; Roy H J Erkens
Journal:  Front Plant Sci       Date:  2019-01-09       Impact factor: 5.753

View more
  3 in total

1.  A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life.

Authors:  William J Baker; Paul Bailey; Vanessa Barber; Abigail Barker; Sidonie Bellot; David Bishop; Laura R Botigué; Grace Brewer; Tom Carruthers; James J Clarkson; Jeffrey Cook; Robyn S Cowan; Steven Dodsworth; Niroshini Epitawalage; Elaine Françoso; Berta Gallego; Matthew G Johnson; Jan T Kim; Kevin Leempoel; Olivier Maurin; Catherine Mcginnie; Lisa Pokorny; Shyamali Roy; Malcolm Stone; Eduardo Toledo; Norman J Wickett; Alexandre R Zuntini; Wolf L Eiserhardt; Paul J Kersey; Ilia J Leitch; Félix Forest
Journal:  Syst Biol       Date:  2022-02-10       Impact factor: 15.683

2.  A New Approach Using Targeted Sequence Capture for Phylogenomic Studies across Cactaceae.

Authors:  Serena Acha; Lucas C Majure
Journal:  Genes (Basel)       Date:  2022-02-15       Impact factor: 4.096

3.  A target Capture Probe Set Useful for Deep- and Shallow-Level Phylogenetic Studies in Cactaceae.

Authors:  Monique Romeiro-Brito; Milena Cardoso Telhe; Danilo Trabuco Amaral; Fernando Faria Franco; Evandro Marsola Moraes
Journal:  Genes (Basel)       Date:  2022-04-17       Impact factor: 4.141

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.