| Literature DB >> 31666537 |
Frédéric Debode1, Julie Hulin2, Benoît Charloteaux3, Wouter Coppieters3, Marc Hanikenne4, Latifa Karim3, Gilbert Berben2.
Abstract
Next generation sequencing (NGS) is a promising tool for analysing the quality and safety of food and feed products. The detection and identification of genetically modified organisms (GMOs) is complex, as the diversity of transgenic events and types of structural elements introduced in plants continue to increase. In this paper, we show how a strategy that combines enrichment technologies with NGS can be used to detect a large panel of structural elements and partially or completely reconstruct the new sequence inserted into the plant genome in a single analysis, even at low GMO percentages. The strategy of enriching sequences of interest makes the approach applicable even to mixed products, which was not possible before due to insufficient coverage of the different genomes present. This approach is also the first step towards a more complete characterisation of agrifood products in a single analysis.Entities:
Mesh:
Year: 2019 PMID: 31666537 PMCID: PMC6821802 DOI: 10.1038/s41598-019-51668-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
List of the structural elements used for the enrichment step.
| Type of structural element | Name | Size (bp) | Sequence source |
|---|---|---|---|
| Promoters | p35S | 867 | NCBI KX880509 |
| pFMV | 981 | NCBI X06166 | |
| pUbi | 2018 | NCBI S94464 | |
| pNOS | 398 | Patent WO2006074956 | |
| pmas | 660 | NCBI DQ225747 | |
| Ps7s7 | 1046 | NCBI AY181091 | |
| pRice actin1 | 660 | NCBI S44221 | |
| pRice actin2 | 259 | NCBI EU161577 | |
| pSSuAra | 1727 | NCBI CP002684 | |
| pTA29 | 627 | NCBI X52283 | |
| pMTL | 2556 | NCBI S57628 | |
| Terminators | tOCS | 823 | NCBI LT727071 |
| tE9 | 648 | Patent W02007027777 | |
| tNOS | 300 | NCBI AB809952 | |
| tg7 | 203 | Patent WO2006074956 | |
| tpinII | 318 | NCBI KP784700 | |
| t35S | 211 | NCBI GU734649 | |
| Genes | gus | 576 | NCBI CP029981 |
| gox | 1296 | Patent US5463175 | |
| cryIAb 1 | 1854 | Patent US20030226171 | |
| cryIAb 2 | 3844 | NCBI AY326434 | |
| cryIAb 3 | 1947 | Patent US5625136 | |
| cry1B | 1950 | NCBI KC414884 | |
| cry1Aa | 1848 | NCBI GU583855 | |
| cry1Ab/c | 1923 | NCBI GU583854 | |
| cry1Ac | 1923 | NCBI KF630361 | |
| cry1A105 1 | 3537 | Patent WO200702777 | |
| cry1A105 2 | 3433 | NCBI DI362404 | |
| cry2Ab2 | 1900 | NCBI DI362404 | |
| cry3A055 | 1797 | Patent EP2289311 | |
| cry34Ab1 | 424 | Patent W02006039376 | |
| cry35Ab1 | 981 | Patent W02006039376 | |
| EPSPS 1 | 1415 | NCBI AB209952 | |
| EPSPS 2 | 1367 | Patent WO2004074492 | |
| 2mEPSPS | 1338 | Patent WO2011063411 | |
| bar | 835 | NCBI X05822 | |
| pat | 569 | NCBI GQ497217 | |
| Prsv-cp 1 | 1601 | NCBI F5490192 | |
| Prsv-cp 2 | 1070 | NCBI GZ450610 | |
| Miscellaneous | hsp70 | 804 | NCBI AY326434 |
| Total length of the database used for enrichment | 52534 | ||
When several variants of the same gene were present, they were identified with a different numeric index (e.g., EPSPS1, EPSPS2).
Figure 1Workflow of the enrichment technology prior to sequencing.
Figure 2Bioinformatic workflow developed for detecting and identifying GMOs. The bioinformatic packages used are indicated in grey.
Origin of the samples used for analysis by NGS.
| Material used | Reference | Provider |
|---|---|---|
| Rapeseed GT73 (100% GM) | AOCS 0304-B | AOCS |
| Rapeseed MS8 (100% GM) | AOCS 0306-F2 | AOCS |
| Maize 59122 (10% GM) | ERM-BF424d | IRMM |
| Cotton 281 × 3006 (10% GM) | ERM-BF422d | IRMM |
| Maize MON89034 (100% GM) | AOCS 0906-E | AOCS |
| Soybean A2704-12 (100% GM) | AOCS 0707-B4 | AOCS |
| Rice LL62 (100% GM) | AOCS 0306-I4 | AOCS |
| Soybean GTS 40-3-2 (0.1% GM) | ERM-BF410bk | IRMM |
| Soybean GTS 40-3-2 (1% GM) | ERM-BF410dk | IRMM |
| Soybean GTS 40-3-2 (10% GM) | ERM-BF410gk | IRMM |
| Maize (0% GM) | Commercial organic maize | (Ekibio, Peaugres, France) |
| Soybean (0% GM) | Commercial organic soybean | (Ekibio, Peaugres, France) |
Figure 3Detection and characterisation of GMOs by NGS. The structures of the inserts of seven GMOs are presented. The 281 × 3006 cotton has two GM inserts. The mixed sample contains 50% A2704 soybean and 50% LL62 rice. The structural elements in grey were present in the database used for enrichment and were detected by NGS. The reads associated with these structural elements were used to create contigs. Only larger contigs covering several structural elements are shown here. Larger structural elements not covered by the capture probes created gaps, making it impossible to reconstruct the entire sequence of the transgenic cassette. Junction regions covering the plant and transgenic insert were also obtained.
Figure 4Sequence of GTS 40-3-2 soybean and alignments of the contigs obtained in this research. The structural elements in grey shown in the database were used for enrichment and were detected by NGS. (A) Expected sequences of the GTS-40-3-2 soybean, as announced by Monsanto and as described by Windels et al.[28]. Additional sequence corresponds to a duplication of part of the EPSPS gene and an unknown rearranged sequence. (B) Positions of the contigs created for the samples containing GTS-40-3-2 soybean at 10%, 1% and 0.1%.
Reads obtained in soybean flour containing 0.1%, 1% and 10% GTS 40-3-2 soybean (results obtained with reads of 75 bp; the number of reads was not normalised to the length of the structural element considered here).
| Reads | Soybean | Soybean | Soybean | Soybean |
|---|---|---|---|---|
| Number of reads | 2103154 | 2098020 | 2226922 | 1961322 |
| Number of reads aligned | 211 | 1423 | 11023 | 74739 |
| % of reads aligned | 0.01 | 0.07 | 0.49 | 3.81 |
| Reads aligned with p35S | 0 | 128 | 1275 | 10504 |
| Reads aligned with tNOS | 4 | 142 | 1438 | 8743 |
| Reads aligned with EPSPS | 28 | 824 | 8107 | 55316 |
The values presented in the table are the results of a single analysis.