| Literature DB >> 26908260 |
Satish K Guttikonda1, Pradeep Marri1, Jafar Mammadov1, Liang Ye1, Khaing Soe1, Kimberly Richey1, James Cruse1, Meibao Zhuang1, Zhifang Gao1, Clive Evans1, Steve Rounsley1, Siva P Kumpatla1.
Abstract
Demand for the commercial use of genetically modified (GM) crops has been increasing in light of the projected growth of world population to nine billion by 2050. A prerequisite of paramount importance for regulatory submissions is the rigorous safety assessment of GM crops. One of the components of safety assessment is molecular characterization at DNA level which helps to determine the copy number, integrity and stability of a transgene; characterize the integration site within a host genome; and confirm the absence of vector DNA. Historically, molecular characterization has been carried out using Southern blot analysis coupled with Sanger sequencing. While this is a robust approach to characterize the transgenic crops, it is both time- and resource-consuming. The emergence of next-generation sequencing (NGS) technologies has provided highly sensitive and cost- and labor-effective alternative for molecular characterization compared to traditional Southern blot analysis. Herein, we have demonstrated the successful application of both whole genome sequencing and target capture sequencing approaches for the characterization of single and stacked transgenic events and compared the results and inferences with traditional method with respect to key criteria required for regulatory submissions.Entities:
Mesh:
Year: 2016 PMID: 26908260 PMCID: PMC4764375 DOI: 10.1371/journal.pone.0149515
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Molecular characterization of transgenic events by traditional and advanced approaches.
Insertion location, integration site in soybean genome and 5’ and 3’ Flanking Junction reads by whole genome sequencing of TE1 and TE2 single and TE1 x TE2 breeding stack events.
| Trait | Breeding Generation and Sample Number | Integration site in soybean genome | Total Reads | Reads mapped to genome (Coverage) | Reads mapped to Transgene (Coverage) | Number of Junction Pair End (PE) pairs | 5’ Junction Reads coverage | 3’ Junction Reads coverage |
|---|---|---|---|---|---|---|---|---|
| TE 1 | F2-1 | Gm06:18,552,249..18,556,606 | 98,725,411 | 93,762,230 (10X) | 863 (9X) | 72 | 6 | 5 |
| TE 1 | T3-3 | Gm06:18,552,249..18,556,606 | 119,386,991 | 110,949,911 (11X) | 938 (9X) | 72 | 8 | 6 |
| TE 2 | F2-10 | Gm02:10,027,285..10,027,342 | 108,314,430 | 102,156,087 (10X) | 517 (4X) | 29 | 1 | 2 |
| TE 2 | T3-1 | Gm02:10,027,285..10,027,342 | 95,660,998 | 91,367,370 (9X) | 952 (8X) | 47 | 4 | 5 |
| TE 1 in TE 1 x TE2 stack | Single generation | Gm06:18,552,249..18,556,606 | 146,746,063 | 140,544,330 (14X) | 1,128 (11X) | 38 | 8 | 3 |
| TE 2 in TE 1 x TE 2 stack | Single generation | Gm02:10,027,285..10,027,342 | 146,746,063 | 140,544,330 (14X) | 1,338 (11X) | 13 | 6 | 5 |
* The coverage of the genome was calculated by the formula [(number of reads)*length of a read]/genome size] with an average read length and the size of the soybean genome being equal to 100 bp and 975,000,000 bp, respectively.
**The coverage of the T-DNA was calculated by the formula [(number of reads)*length of a read]/length of T-DNA] with the insert size of ~10,000 bp in Trait 1 and ~12,500 bp in Trait 2.
***The coverage of the T-DNA in TE2 in F2 generation is half of that in T3 generation as the sample (F2-10) representing the former is in hemizygous state.
**** The coverage within the junctions was represented by number of reads spanning those regions.
Insertion location, integration site in soybean genome and 5’ and 3’ Flanking Junction reads by Target genome sequencing of TE1 and TE2 single and TE1 x TE2 breeding stack events.
| Trait | Breeding Generation and Sample Number | Integration site in soybean genome | Reads mapped to transgene (Avg. coverage) | Number of Junction Pair End (PE) pairs | 5’ Junction Reads coverage | 3’ Junction Reads coverage |
|---|---|---|---|---|---|---|
| TE 1 | F2-1 | Gm06:18,552,249..18,556,606 | 279797 (6053x) | 50970 | 4905 | 7013 |
| TE 1 | T3-3 | Gm06:18,552,249..18,556,606 | 267750 (5792x) | 49607 | 4692 | 6919 |
| TE 2 | F2-10 | Gm02:10,027,285..10,027,342 | 161770 (2821x) | 17722 | 1847 | 4029 |
| TE 2 | T3-1 | Gm02:10,027,285..10,027,342 | 265192 (4625x) | 28051 | 3001 | 6006 |
| TE 1 in TE 1 x TE2 stack | Single generation | Gm06:18,552,249..18,556,606 | 362253 (7837x) | 62534 | 9206 | 10587 |
| TE 2 in TE 1 x TE 2 stack | Single generation | Gm02:10,027,285..10,027,342 | 455842 (7950x) | 25019 | 6987 | 18792 |
Fig 2Molecular characterization of transgenic soybean Transgenic Event 1 (TE1) and Transgenic Event 2 (TE2) using whole genome sequencing.
Genomic DNA of TE1 and TE2 was randomly sheared and sequenced using Illumina's HiSeq2000 instrument. The genome coverage was ~ 10X, i.e. 10 copies of soybean haploid genome. Short HiSeq2000 reads (A) spanning entire T-DNA within TE1 and TE2 (B) were mapped back to transformation plasmid that contained intended T-DNA (C1) and backbone (C2). Uninterrupted blue bars aligned to the intended T-DNA (C1) of the transformation plasmid confirms the integrity of T-DNA within events. No blue bars over plasmid backbone (C2) confirms the absence of those sequences within the genome of TE1 and TE2 events. Twenty chromosomes (Gm1-20) of soybean reference genome (Williams 82 version X) are represented in circular fashion (D). Reads spanning junction regions were mapped back to soybean reference genome, which showed single insertion site on chromosome 6 (E) in TE1 and chromosome 2 (E) in TE2.
Fig 3Molecular characterization of soybean breeding stack Transgenic Event 1 x Transgenic Event 2 (TE1 x TE2) using whole genome sequencing.
Genomic DNA of TE1 x TE2 was randomly sheared and sequenced using Illumina's HiSeq2000 instrument. The genome coverage was ~ 14X, i.e. 14 copies of soybean haploid genome. Short HiSeq2000 reads (A) spanning entire T-DNA within TE1 and TE2 (B) were mapped back to transformation plasmid that contained intended T-DNA (C1) and backbone (C2). Uninterrupted blue bars aligned to the intended T-DNA (C1) of the transformation plasmid confirms the integrity of T-DNA within TE1 and TE2. No blue bars over plasmid backbone (C2) confirms the absence of those sequences within the genome of TE1 and TE2. Twenty chromosomes (Gm1-20) of soybean reference genome (Williams 82 version X) are represented in circular fashion (D). Reads spanning junction regions were mapped back to soybean reference genome, which showed single insertion site on chromosome 6 (E) in TE1 and on chromosome 2 (E) in TE2. T-DNA insert in both TE1 and TE2 share the same fragment at the 3'border region (F).
Molecular characterization of soybean single events, TE 1 and TE2, and their breeding stack TE1× TE2 using whole genome sequencing sequencing approaches.
Each element of the T-DNA is represented by the “X” amount of coverage depth. Both T-DNA inserts within TE1 and TE2 events share several identical elements, such as T1_E5 = T2_E1 (promoters), T1_E7 = T2_E3 = T2_E6 (terminators), T1_E8 = T2_E4 = T2_E7 (promoters), T1_E9 = T2_E8 (gene of interest, GOI), and T1_E10 = T2_E9 (terminators).
| TE1 | TE1 in TE1 × TE2 | ||
| Generations | F2_1 | T3_3 | Single generation |
| Elements(Donor Organism) | Coverage (nX) | ||
| T1_E1-expression enhancer | 8 | 8 | 14 |
| T1_E2 –terminator( | 8 | 9 | 12 |
| T1_E3 –GOI( | 6 | 7 | 5 |
| T1_E4 –promoter( | 8 | 8 | 15 |
| T1_E5 –promoter( | 8 | 8 | 15 |
| T1_E6 –GOI( | 8 | 7 | 6 |
| T1_E7 –terminator( | 9 | 7 | 7 |
| T1_E8 –promoter( | 7 | 9 | 8 |
| T1_E9 –GOI( | 7 | 10 | 7 |
| T1_E10 –terminator( | 8 | 12 | 9 |
| TE2 | TE2 in TE1 × TE2 | ||
| Generations | F2_10 | T3-1 | Single Generation |
| Elements(Donor Organism) | Coverage (nX) | ||
| T2_E1 –promoter( | 4 | 8 | 14 |
| T2_E2 –GOI( | 4 | 6 | 6 |
| T2_E3 –terminator( | 4 | 8 | 20 |
| T2_E4 –promoter( | 5 | 8 | 21 |
| T2_E5 –GOI( | 3 | 7 | 9 |
| T2_E6 –terminator( | 4 | 9 | 9 |
| T2_E7 –promoter( | 3 | 10 | 8 |
| T2_E8 –GOI( | 2 | 6 | 8 |
| T2_E9 –terminator( | 3 | 5 | 11 |
Molecular characterization of soybean single events, TE 1 and TE2, and their breeding stack TE1× TE2 using target capture sequencing approaches.
Each element of the T-DNA is represented by the “X” amount of coverage depth. Both T-DNA inserts within TE1 and TE2 events share several identical elements, such as T1_E5 = T2_E1 (promoters), T1_E7 = T2_E3 = T2_E6 (terminators), T1_E8 = T2_E4 = T2_E7 (promoters), T1_E9 = T2_E8 (gene of interest, GOI), and T1_E10 = T2_E9 (terminators).
| TE1 | TE1 in TE1 × TE2 | ||
| Generations | F2_1 | T3_3 | Single generation |
| Elements | Coverage (nX) | ||
| T1_E1-expression enhancer | 14874 | 15877 | 14871 |
| T1_E2 –terminator | 7433 | 7244 | 6711 |
| T1_E3 –GOI | 189 | 131 | 78 |
| T1_E4 –promoter | 2126 | 1743 | 1653 |
| T1_E5 –promoter | 2392 | 1883 | 1774 |
| T1_E6 –GOI | 5171 | 4476 | 4242 |
| T1_E7 –terminator | 12914 | 11770 | 11373 |
| T1_E8 –promoter | 15291 | 14219 | 13808 |
| T1_E9 –GOI | 12804 | 11951 | 11788 |
| T1_E10—terminator | 11033 | 9007 | 8800 |
| TE2 | TE2 in TE1 × TE2 | ||
| Generations | F2_10 | T3-1 | Single Generation |
| Elements | Coverage (nX) | ||
| T2_E1 –promoter | 1881 | 3049 | 4869 |
| T2_E2 –GOI | 2409 | 3738 | 5401 |
| T2_E3 –terminator | 5294 | 9086 | 13035 |
| T2_E4 –promoter | 5907 | 10313 | 14303 |
| T2_E5 –GOI | 1426 | 2358 | 3653 |
| T2_E6 –terminator | 5182 | 8776 | 12930 |
| T2_E7 –promoter | 6799 | 11757 | 17953 |
| T2_E8 –GOI | 5905 | 9692 | 16089 |
| T2_E9—terminator | 3984 | 6342 | 13483 |
Fig 4Sensitivity of paired end chemistry of NGS in detecting small DNA aberrations (insertion, deletion, and inversion) within T-DNA.
Comparison of concepts between Southern blot analysis, Target capture sequencing and Whole Genome Sequencing.
| Step | Southern Blot analysis | Target Capture Sequencing | Whole Genome Sequencing |
|---|---|---|---|
| 1 | ~2500 bp overlapping probes are designed to cover entire length of T-DNA. | ~120 bp capture probes (bates) are designed to cover entire T-DNA | No analogy |
| 2 | Genomic DNA of an event is digested by restriction enzymes | Genomic DNA of an event is randomly sheared and library prepared | Genomic DNA of an event is randomly sheared and library prepared |
| 3 | Genomic DNA transfer and immobilization on the matrix | No analogy | No analogy |
| 4 | Hybridization of a labeled probe with genomic DNA immobilized on the matrix | Hybridization of biotinylated probes with sheared genomic DNA. Add magnetic streptavidin beads to bind to biotinylated probe | No analogy |
| 6 | Wash away unhybridized probes and non-specific hybridizations | Capture target with magnet and wash away off-target sequences | No analogy |
| 7 | DNA band detection and visualization | Sequence captured DNA fragments | Sequencing of entire genome |
| 8 | Outcome: a band | Outcome: DNA sequence | Outcome: DNA sequence |