| Literature DB >> 34551203 |
Kristine Bohmann1, Vasco Elbrecht2, Christian Carøe1, Iliana Bista3,4, Florian Leese5, Michael Bunce6, Douglas W Yu7,8,9, Mathew Seymour10, Alex J Dumbrell11, Simon Creer12.
Abstract
Metabarcoding of DNA extracted from environmental or bulk specimen samples is increasingly used to profile biota in basic and applied biodiversity research because of its targeted nature that allows sequencing of genetic markers from many samples in parallel. To achieve this, PCR amplification is carried out with primers designed to target a taxonomically informative marker within a taxonomic group, and sample-specific nucleotide identifiers are added to the amplicons prior to sequencing. The latter enables assignment of the sequences back to the samples they originated from. Nucleotide identifiers can be added during the metabarcoding PCR and during "library preparation", that is, when amplicons are prepared for sequencing. Different strategies to achieve this labelling exist. All have advantages, challenges and limitations, some of which can lead to misleading results, and in the worst case compromise the fidelity of the metabarcoding data. Given the range of questions addressed using metabarcoding, ensuring that data generation is robust and fit for the chosen purpose is critically important for practitioners seeking to employ metabarcoding for biodiversity assessments. Here, we present an overview of the three main workflows for sample-specific labelling and library preparation in metabarcoding studies on Illumina sequencing platforms; one-step PCR, two-step PCR, and tagged PCR. Further, we distill the key considerations for researchers seeking to select an appropriate metabarcoding strategy for their specific study. Ultimately, by gaining insights into the consequences of different metabarcoding workflows, we hope to further consolidate the power of metabarcoding as a tool to assess biodiversity across a range of applications.Entities:
Keywords: Illumina sequencing; amplicon sequencing; biodiversity assessment; eDNA; environmental DNA; high-throughput sequencing; library preparation
Mesh:
Substances:
Year: 2021 PMID: 34551203 PMCID: PMC9293284 DOI: 10.1111/1755-0998.13512
Source DB: PubMed Journal: Mol Ecol Resour ISSN: 1755-098X Impact factor: 8.678
FIGURE 1Simplified overview of a metabarcoding workflow. (a–b) DNA extracted from environmental samples such as soil, water, and faeces or from bulk specimen samples. The DNA extracts are typically a complex mix of DNA from target and nontarget organisms. (c) DNA extracts are PCR‐amplified with metabarcoding primers that target a taxonomically informative marker for a taxonomic group. Importantly, identifiers unique to each PCR product are added in the form of 5ʹ nucleotide tags on primers and/or as indices added to sequence libraries during library build. (d) The taxonomic markers of hundreds to thousands of samples are sequenced in parallel on a high‐throughput sequencing platform producing millions of sequence reads. (e) The sequences can be traced back to the samples they originated from through the nucleotide tags and/or library indices, and (f) can be further analysed. Images courtesy of the Integration and Application Network, University of Maryland Centre for Environmental Science (ian.umces.edu/symbols/) and Illumina.com
FIGURE 2Metabarcoding approaches can be divided into three overall strategies for adding nucleotide tags and library indices. (a) The composition of a dual‐tagged and dual‐indexed metabarcoding Illumina library sequence. Note that the metabarcoding marker, primers, and tags are sequenced as Illumina read 1 and read 2, while index reads are sequenced separately as i7 and i5 reads and used to multiplex sequencing libraries. (b–e) Strategies for adding nucleotide tags and indices to metabarcoding markers. The one‐step PCR (b) is depicted with the use of nucleotide tags, which eliminates the need for indices
Features of the three main metabarcoding strategies
| Feature | Metabarcoding strategy | ||||
|---|---|---|---|---|---|
| One‐step PCR | Two‐step PCR | Tagged PCR | |||
| With 5′ nucleotide tags, without i5 and i7 indices | Without 5′ nucleotide tags on metabarcoding primers | With 5′ nucleotide tags on metabarcoding primers | Library preparation with T4 DNA polymerase blunt‐ending and post‐ligation PCR | Library preparation without T4 DNA polymerase blunt‐ending and post‐ligation PCR | |
| Handling and workload |
|
|
|
|
|
| Risk of cross‐contamination between PCR products |
|
|
|
|
|
| Tag‐jumps | No | No | No | Yes | No |
| Potential for index misassignment/library bleeding on the flow cell | No (only if indices are used) | Yes | Yes (if indices are used) | Yes | Yes |
| Decrease in PCR efficiency due to nucleotide additions to metabarcoding primers | High | Potentially high | Potentially high | Low | Low |
| Cost of metabarcoding primers |
|
|
|
|
|
| Number PCR steps prior to sequencing | 1 | 2 | 2 | 2 | 1 |