| Literature DB >> 23767809 |
Matthieu Leray1, Joy Y Yang2, Christopher P Meyer3, Suzanne C Mills4, Natalia Agudelo3, Vincent Ranwez5, Joel T Boehm6, Ryuji J Machida7.
Abstract
INTRODUCTION: The PCR-based analysis of homologous genes has become one of the most powerful approaches for species detection and identification, particularly with the recent availability of Next Generation Sequencing platforms (NGS) making it possible to identify species composition from a broad range of environmental samples. Identifying species from these samples relies on the ability to match sequences with reference barcodes for taxonomic identification. Unfortunately, most studies of environmental samples have targeted ribosomal markers, despite the fact that the mitochondrial Cytochrome c Oxidase subunit I gene (COI) is by far the most widely available sequence region in public reference libraries. This is largely because the available versatile ("universal") COI primers target the 658 barcoding region, whose size is considered too large for many NGS applications. Moreover, traditional barcoding primers are known to be poorly conserved across some taxonomic groups.Entities:
Keywords: DNA barcoding; Food web; Mini-barcode; Mitochondrial marker; Second generation sequencing; Trophic interactions
Year: 2013 PMID: 23767809 PMCID: PMC3686579 DOI: 10.1186/1742-9994-10-34
Source DB: PubMed Journal: Front Zool ISSN: 1742-9994 Impact factor: 3.172
Figure 1Design of the “mlCOIint” forward and reverse complements within the highly variable COI fragment. A total of 6643 COI sequences, spanning 17 phyla (provided by the Moorea BIOCODE project) were aligned and the entropy h(x) plotted to visualize the level of variability at each position. h(x) = 0 when the site is conserved across all sequences (e.g., 100% A). h(x) is at a maximum when each nucleotide occurs at equal frequency. We also present the proportion of each nucleotide between sites 320 and 345, the region where the primers were designed.
COI primers used in this study
| LCO1490 | GGTCAACAAATCATAAAGATATTGG | [ |
| HCO2198 | TAAACTTCAGGGTGACCAAAAAATCA | [ |
| dgLCO1490 | GGTCAACAAATCATAAAGAYATYGG | [ |
| dgHCO2198 | TAAACTTCAGGGTGACCAAARAAYCA | [ |
| jgLCO1490 | TITCIACIAAYCAYAARGAYATTGG | [ |
| jgHCO2198 | TAIACYTCIGGRTGICCRAARAAYCA | [ |
| Uni-MinibarF1 | CAAAATCATAATGAAGGCATGAGC | [ |
| Uni-MinibarR1 | TCCACTAATCACAARGATATTGGTAC | [ |
| mlCOIintF | GGWACWGGWTGAACWGTWTAYCCYCC | herein |
| mlCOIintR | GGRGGRTASACSGTTCASCCSGTSCC | herein |
Figure 2Schematic representation of the bioinformatics pipeline used for analysis of COI sequence 454 dataset.
Figure 3Distribution of mismatches between the “mlCOIint” primer sequence and templates from the Moorea BIOCODE database.
Preliminary tests to determine the primer combination that performed best to amplify a short COI fragment
| | |||||||
|---|---|---|---|---|---|---|---|
| | |||||||
| Phylum | Cnidaria (6) | 6 | 6 | 6 | 2 | 2 | 2 |
| Arthropoda (18) | 16 | 15 | 16 | 12 | 11 | 11 | |
| Rotifera (1) | 1 | 1 | 1 | 0 | 0 | 0 | |
| Entoprocta (1) | 0 | 0 | 0 | 1 | 1 | 0 | |
| Annelida (4) | 4 | 4 | 4 | 3 | 4 | 4 | |
| Nemertea (2) | 2 | 2 | 2 | 0 | 0 | 1 | |
| Mollusca (9) | 7 | 7 | 8 | 7 | 7 | 7 | |
| Echiura (1) | 1 | 1 | 1 | 1 | 1 | 1 | |
| Chordata (2) | 2 | 2 | 2 | 1 | 2 | 1 | |
| Hemichordata (2) | 2 | 2 | 2 | 0 | 0 | 2 | |
| Echinodermata (1) | 1 | 1 | 1 | 0 | 0 | 1 | |
Columns show the number of taxa for which the target region was successfully amplified. The total number of taxa used for each phylum is displayed in parentheses.
Performance of universal primer sets for COI across phyla
| | ||||||
|---|---|---|---|---|---|---|
| | ||||||
| Phylum | Radiolaria (1) | 0 | 0 | 0 | 0 | 0 |
| Ciliophora (1) | 0 | 1 | 0 | 0 | 0 | |
| Sarcomastigophora (1) | 0 | 0 | 0 | 0 | 0 | |
| Amoebozoa (1) | 0 | 0 | 0 | 0 | 0 | |
| Placozoa (1) | 0 | 0 | 0 | 0 | 0 | |
| Porifera (4) | 4 | 3 | 3 | 2 | 2 | |
| Cnidaria (28) | 26 | 22 | 23 | 23 | 11 | |
| Ctenophora (2) | 1 | 0 | 0 | 0 | 1 | |
| Chaetognatha (2) | 2 | 1 | 2 | 2 | 0 | |
| Nematomorpha (1) | 0 | 0 | 0 | 0 | 0 | |
| Nematoda (2) | 1 | 1 | 0 | 0 | 0 | |
| Tardigrada (1) | 0 | 0 | 0 | 0 | 0 | |
| Arthropoda (99) | 87 | 84 | 80 | 82 | 30 | |
| Platyhelminthes (4) | 4 | 1 | 1 | 0 | 0 | |
| Gastrotricha (3) | 2 | 0 | 0 | 0 | 0 | |
| Gnathostomulida (3) | 2 | 1 | 0 | 0 | 0 | |
| Rotifera (1) | 1 | 1 | 1 | 0 | 0 | |
| Entoprocta (1) | 0 | 0 | 1 | 0 | 0 | |
| Bryozoa (9) | 9 | 9 | 8 | 7 | 5 | |
| Annelida (25) | 25 | 23 | 25 | 23 | 5 | |
| Nemertea (4) | 3 | 3 | 3 | 1 | 2 | |
| Sipuncula (5) | 5 | 5 | 5 | 5 | 1 | |
| Mollusca (52) | 47 | 45 | 49 | 48 | 11 | |
| Echiura (1) | 1 | 1 | 1 | 1 | 0 | |
| Phoronida (2) | 2 | 2 | 2 | 2 | 2 | |
| Brachiopoda (1) | 0 | 1 | 1 | 1 | 0 | |
| Chordata (18) | 15 | 9 | 12 | 12 | 4 | |
| Acoelomorpha (1) | 0 | 0 | 0 | 0 | 0 | |
| Hemichordata (2) | 2 | 0 | 1 | 2 | 1 | |
| Echinodermata (11) | 11 | 4 | 2 | 11 | 1 | |
Columns present the number of taxa for which the target region was successfully amplified. Amplification success was evaluated on agarose gels (pictures shown in Additional file 2). The total number of taxa used for each phylum is displayed in parentheses.
Figure 4Diversity, identity and sequence abundance of Operational Taxonomic Units (OTUs) recovered from fish gut contents. A) The number of OTUs per phylum is presented for all fish guts pooled together. OTUs were identified from BLASTn searches performed in the Moorea BIOCODE database and GENBANK. We considered a match to be at the species level when sequence similarity to a reference barcode was >98%. When sequence similarity was < 98%, we used the Bayesian assignment tool implemented in SAP to assign each OTU to a higher taxonomic group, accepting assignments at a significance level of 95% (posterior probability). B) The proportion of OTUs presented per abundance classes. Abundance corresponds to the number of sequences per OTUs.
Figure 5Intra- (A) and inter-specific (B) dietary content. The proportion of OTUs in each of the three most diverse phyla is presented. n = number of OTUs in each category. Note that the number of OTUs in A is larger than 334, the total number of OTUs found in fish gut contents, because some OTUs are shared between individuals of the three species. The list of phyla contained in the category “Others” is presented in Figure 4.