| Literature DB >> 35342556 |
Alex R Van Dam1, Javier O Covas Orizondo1, Athena W Lam2, Duane D McKenna3,4, Matthew H Van Dam2.
Abstract
Phylogenomics via ultraconserved elements (UCEs) has led to improved phylogenetic reconstructions across the tree of life. However, inadvertently incorporating non-targeted DNA into the UCE marker design will lead to misinformation being incorporated into subsequent analyses. To date, the effectiveness of basic metagenomic filtering strategies has not been assessed in arthropods. Designing markers from museum specimens requires careful consideration of methods due to the high levels of microbial contamination typically found in such specimens. We investigate if contaminant sequences are carried forward into a UCE marker set we developed from insect museum specimens using a standard bioinformatics pipeline. We find that the methods currently employed by most researchers do not exclude contamination from the final set of targets. Lastly, we highlight several paths forward for reducing contamination in UCE marker design.Entities:
Keywords: contamination; insects; museum specimens; ultraconserved elements
Year: 2022 PMID: 35342556 PMCID: PMC8932080 DOI: 10.1002/ece3.8625
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
FIGURE 2Blob plots of UCE‐bearing scaffolds from the master UCE probe set. The Y‐axis is read coverage via the bwa‐mem algorithm, and the X‐axis is the GC content of individual scaffolds. Color is coded by taxonomic class
FIGURE 1Blob plots of draft genome assembly scaffolds, Y‐axis is read coverage via the bwa‐mem algorithm, X‐axis is GC content of individual scaffolds. Color is coded by taxonomic class
FIGURE 3UCE bar chart. Y‐axis is the total number of UCEs, and the X‐axis is the method used to annotate associated species. Color is coded by taxonomic annotation
FIGURE 4Venn diagrams of UCE bearing scaffolds annotated by taxonomic groups. The groupings are the same ones used in the final data matrix
FIGURE 5Linear regression of log UCE loci length versus the log of coverage for Pachyrhynchini taxa in the final data matrix