| Literature DB >> 32528502 |
Kairi Raime1, Kaarel Krjutškov2, Maido Remm1.
Abstract
Fast and reliable analytical methods for the identification of plants from metagenomic samples play an important role in identifying the components of complex mixtures of processed biological materials, including food, herbal products, gut contents or environmental samples. Different PCR-based methods that are commonly used for plant identification from metagenomic samples are often inapplicable due to DNA degradation, a low level of successful amplification or a lack of detection power. We introduce a method that combines metagenomic sequencing and an alignment-free k-mer based approach for the identification of plant DNA in processed metagenomic samples. Our method identifies plant DNA directly from metagenomic sequencing reads and does not require mapping or assembly of the reads. We identified more than 31,000 Lupinus-specific 32-mers from assembled chloroplast genome sequences. We demonstrate that lupin DNA can be detected from controlled mixtures of sequences from target species (different Lupinus species) and closely related non-target species (Arachis hypogaea, Glycine max, Pisum sativum, Vicia faba, Phaseolus vulgaris, Lens culinaris, and Cicer arietinum). Moreover, these 32-mers are detectable in the following processed samples: lupin flour, conserved seeds and baked cookies containing different amounts of lupin flour. Under controlled conditions, lupin-specific components are detectable in baked cookies containing a minimum of 0.05% of lupin flour in wheat flour.Entities:
Keywords: DNA sequencing reads; alignment-free analysis; chloroplast genome; k-mer; lupin; metagenomics; plant taxa identification
Year: 2020 PMID: 32528502 PMCID: PMC7253697 DOI: 10.3389/fpls.2020.00646
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
The analyzed seed and flour samples and their origins.
| Material | Origin |
| UK (from private seeds seller) | |
| Spain (from local store) | |
| Italy (Di Nunzio srl) | |
| Netherlands (Lupinfood) | |
| Lupin flour (from | Netherlands (Lupinfood) |
| Bolivia (from private seeds seller) | |
| Germany (Bohlsener Mühle) | |
| Chickpea ( | Germany (Müller’;s Mühle) |
Specific primers used to amplify genomic and plastid DNA from Lupinus species (Lup_2_F and Lup_2_R) or plants (CP 03_F and CP 03_R).
| Name | Sequences | Amplicon size (bp) | Region |
| Lup_2_F | ACGACAACAAGATGAGCAAGAAG | 145 | Nuclear, beta conglutin |
| Lup_2_R | GCCAAATCCAAGCAAGCGA | Nuclear, beta conglutin | |
| CP 03_F | CGGACGAGAATAAAGATAGAGT | 123 | Chloroplast |
| CP 03_R | TTTTGGGGATAGAGGGACTTGA | Chloroplast |
FIGURE 1The number of detected Lupinus spp. k-mers in the whole-genome sequencing data from multiple Lupinus species and phylogenetically close non-target species. The variable numbers of sequencing reads for the species are presented on the X-axis. The samples from non-target species are shown in red and Lupinus species are shown in other colors.
FIGURE 2The number of detected Lupinus albus k-mers in the whole-genome sequencing reads from multiple Lupinus species and phylogenetically close non-target species. The variable numbers of sequencing reads for the species are on the X-axis. Lupinus albus samples are shown in blue, the samples from other Lupinus species are shown in orange and other non-target samples are shown in red.
FIGURE 3The number of detected Lupinus spp. k-mers (Y-axis) in the whole-genome sequencing data from cookies containing various concentrations of lupin (L. angustifolius flour) (X-axis).
FIGURE 4The number of detected Lupinus spp. k-mers in the whole-genome sequencing data from cookies containing different amounts of lupin. The cookie samples with variable numbers of sequencing reads are shown on the X-axis. Different cookie samples with different amounts of lupin are indicated by different colors.