| Literature DB >> 29074984 |
Alexandra Bogožalec Košir1,2, Alfred J Arulandhu3,4, Marleen M Voorhuijzen3, Hongmei Xiao5, Rico Hagelaar3, Martijn Staats3, Adalberto Costessi6, Jana Žel1, Esther J Kok3, Jeroen P van Dijk7.
Abstract
The majority of feed products in industrialised countries contains materials derived from genetically modified organisms (GMOs). In parallel, the number of reports of unauthorised GMOs (UGMOs) is gradually increasing. There is a lack of specific detection methods for UGMOs, due to the absence of detailed sequence information and reference materials. In this research, an adapted genome walking approach was developed, called ALF: Amplification of Linearly-enriched Fragments. Coupling of ALF to NGS aims for simultaneous detection and identification of all GMOs, including UGMOs, in one sample, in a single analysis. The ALF approach was assessed on a mixture made of DNA extracts from four reference materials, in an uneven distribution, mimicking a real life situation. The complete insert and genomic flanking regions were known for three of the included GMO events, while for MON15985 only partial sequence information was available. Combined with a known organisation of elements, this GMO served as a model for a UGMO. We successfully identified sequences matching with this organisation of elements serving as proof of principle for ALF as new UGMO detection strategy. Additionally, this study provides a first outline of an automated, web-based analysis pipeline for identification of UGMOs containing known GM elements.Entities:
Mesh:
Year: 2017 PMID: 29074984 PMCID: PMC5658351 DOI: 10.1038/s41598-017-14469-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic overview of the ALF procedure. The procedure yields dsDNA molecules after initial primer elongation, of various length, the longest of which will pass a construct-genome boundary (star), leading to GMO identification.
Figure 2Distances of several qPCR targets from the enrichment starting points in the MON88107 GMO, used for evaluation of the ALF protocol. The light grey boxes indicate the different elements in the GMO. The upper and lower line indicate the enrichment for tNOS (upper) and p35S (lower), 0 indicates the starting points of enrichment and arrows indicate the direction of enrichment (upstream or downstream). The dark grey boxes indicate qPCR targets used for evaluation of the ALF protocol and distance of the qPCR targets from the starting points.
Figure 3Length-dependent increase of specific targets and loss of genomic background shown with qPCR. In all cases the amplicon closest to the element targeted for linear enrichment showed the highest increase in signal, expressed as the relative Cq value and calculated by subtracting the Cq after LE (dark bars) or snPCR (light bars) from the Cq of the starting material. This means the relative Cq value for each starting point is zero. Panel a: Enrichment for tNOS downstream. (b) Enrichment for tNOS upstream. (c) Enrichment for p35S downstream. (d) Enrichment for p35S upstream. In all panels, the dashed bars for HMG indicate a reduction beyond detection.
Details of reference materials[49].
| GMO | Content in mixture (%) | Estimated copy number | Supplier | Code | Description | Element order* | Donor organism |
|---|---|---|---|---|---|---|---|
| MON810 | 97 | ~40.000 | Field trial[ | In-house | 50% MON810, ground corn | P-e35S | Cauliflower mosaic virus |
| I-hsp70 |
| ||||||
| CS-cry1Ab |
| ||||||
| MON89034 | 1 | ~400 | AOCS | 0906-E | >99.42% MON89034, ground corn | V-LB |
|
| P-e35S | Cauliflower mosaic virus | ||||||
| L-cab |
| ||||||
| I-1_act1 |
| ||||||
| CS-cry1A_105 | Synthetic | ||||||
| T-hsp17_3 |
| ||||||
| P-FMV | Figwort mosaic virus | ||||||
| I-hsp70 |
| ||||||
| I-1_rbcS |
| ||||||
| CS-cry2Ab2 | |||||||
| T-nos |
| ||||||
| V-RB |
| ||||||
| MON88017 | 1 | ~400 | AOCS | 0406-D | >99.05% MON88017, ground corn | P-act1 |
|
| I-1_act1 |
| ||||||
| TP-ctp |
| ||||||
| CS-CP4epsps | |||||||
| T-nos |
| ||||||
| P-e35S | Cauliflower mosaic virus | ||||||
| L-cab |
| ||||||
| I-1_act1 |
| ||||||
| CS-cry3Bb1 | |||||||
| T-hsp17_3 |
| ||||||
| MON15985 | 1 | ~400 | AOCS | 0804-D | >98.45% Bollgard II cotton, ground cotton seed | P-e35S | Cauliflower mosaic virus |
| L-hsp70 | Petunia hybrida | ||||||
| TP-ctp |
| ||||||
| CS-cry2Ab2 | |||||||
| T-nos |
| ||||||
| P-e35S | Cauliflower mosaic virus | ||||||
| CS-cry1Ac | |||||||
| T-7Salpha |
| ||||||
| P-35S |
| ||||||
| CS-nptII |
| ||||||
| T-nos |
| ||||||
| P-e35S | Cauliflower mosaic virus | ||||||
| CS-uidA |
| ||||||
| T-nos |
| ||||||
| CS-aadA |
|
*Initial capital characters: P = promoter, I = intron, CS = coding sequence, V = vector, LB = left border, L = leader, T = terminator, RB = right border, TP = transit peptide.
Event database sequences and corresponding NCBI accession numbers.
| GM event | Event sequences | Best hit accession number |
|---|---|---|
| MON810 | TCGAAGGACGAAGGACTCTAACGT | AR490568 |
| MON89034 | TTCTCCATATTGACCATCATACTCATT | FV532179 |
| MON88017 | GAGCAGGACCTGCAGAAGCT | DJ058152/DJ058151 |
| MON15985 | GTTACTAGATCGGGGATATCC | EA135634 |
*Nucleotides in italics denote the string of unknown nucleotides between primer and probe sequences in the search template.
Figure 4Schematic overview of the data analysis workflow. CCS reads are first processed to reduce noise and redundancy. Representative reads are then grouped in relevant bins based on homology with sequences in different databases, using blast. In the results, reads likely to be related to MON15985 are marked in red as this authorised GMO with incomplete sequence information served as a model UGMO in this study.
Figure 5Two partial sequences of the MON531 and MON15985 inserts. Panel a shows the alignment of crCCS reads 133951 and 45207 to the insert and flanking region of MON531, with the position of enrichment primer (downstream NOS terminator). Panel b shows the alignment of and crCCS read 156962 to the insert and flanking region of MON15985, and the enrichment primer (upstream 35S promotor). Both sequences cover a previously unknown insert sequence.
Results of NCBI BLAST+ against the event database.
| crCCS read | Identified GMO | % of identical match | Alignment length | Length of db sequence | Alignment length in % |
|---|---|---|---|---|---|
| 33879 | MON810 | 100.00 | 92 | 92 | 100 |
| 40876 | MON810 | 95.65 | 92 | 92 | 100 |
| 78816 | MON810 | 98.92 | 93 | 92 | 101 |
| 119658 | MON810 | 96.81 | 94 | 92 | 102 |
| 128024 | MON810 | 100.00 | 92 | 92 | 100 |
| 132414 | MON810 | 96.74 | 92 | 92 | 100 |
| 158734* | MON810 | 100.00 | 92 | 92 | 100 |
| 158734* | MON810 | 100.00 | 92 | 92 | 100 |
*CCS read 158734 showed two perfect alignments with the database sequence.