| Literature DB >> 25378689 |
Masaki Endo1, Masahiko Kumagai2, Ritsuko Motoyama1, Harumi Sasaki-Yamagata1, Satomi Mori-Hosokawa1, Masao Hamada1, Hiroyuki Kanamori1, Yoshiaki Nagamura1, Yuichi Katayose1, Takeshi Itoh1, Seiichi Toki3.
Abstract
Gene targeting (GT) is a technique used to modify endogenous genes in target genomes precisely via homologous recombination (HR). Although GT plants are produced using genetic transformation techniques, if the difference between the endogenous and the modified gene is limited to point mutations, GT crops can be considered equivalent to non-genetically modified mutant crops generated by conventional mutagenesis techniques. However, it is difficult to guarantee the non-incorporation of DNA fragments from Agrobacterium in GT plants created by Agrobacterium-mediated GT despite screening with conventional Southern blot and/or PCR techniques. Here, we report a comprehensive analysis of herbicide-tolerant rice plants generated by inducing point mutations in the rice ALS gene via Agrobacterium-mediated GT. We performed genome comparative genomic hybridization (CGH) array analysis and whole-genome sequencing to evaluate the molecular composition of GT rice plants. Thus far, no integration of Agrobacterium-derived DNA fragments has been detected in GT rice plants. However, >1,000 single nucleotide polymorphisms (SNPs) and insertion/deletion (InDels) were found in GT plants. Among these mutations, 20-100 variants might have some effect on expression levels and/or protein function. Information about additive mutations should be useful in clearing out unwanted mutations by backcrossing.Entities:
Keywords: Acetolactate synthase; Gene targeting; Herbicide tolerance; Rice
Mesh:
Substances:
Year: 2014 PMID: 25378689 PMCID: PMC4301741 DOI: 10.1093/pcp/pcu153
Source DB: PubMed Journal: Plant Cell Physiol ISSN: 0032-0781 Impact factor: 4.927
Fig. 1Strategy for T-DNA-mediated GT of the rice ALS locus and analysis of homologous recombination events in T0 plants. Schematic representation of GT events. The white boxes represent the coding region of the ALS gene. The thick black lines represent flanking rice genomic DNA. Left (LB) and right (RB) border sequences are represented by gray boxes. A sequence of 165 bp encoding 55 amino acids including the chloroplast-targeting signal is deleted in the T-DNA region of the GT vector, rendering the ALS gene non-functional. The two mutations (W548L and S627I) that confer BS tolerance exist on the GT vector. The W548L and S627I mutations create novel MfeI restriction sites (CAATTG). The positions of primers (F1 and R1) used for PCR, and the expected size of PCR-amplified fragments and their MfeI endonuclease digestion products are shown. E; EcoRI site. (B) PCR-MfeI CAPS analysis of BSR-9-9-8 and BSR-59-8-5. PCR was performed using primers F1 and R1. The 2,287 bp PCR product resolves to three fragments (1,751, 299 and 237 bp) by MfeI digestion when W548L and S627I mutations are induced in the endogenous ALS locus because W548L and S627I mutations create novel MfeI sites. (C) PCR-MfeI CAPS analysis of BSR-12-1 to BSR-12-8. *Non-digested PCR product of BSR-12-1; other lanes, MfeI-digested PCR products.
Fig. 2Detection of vector backbone sequence in siblings of BSR-12. (A) Southern blot analysis of T2 progeny of BSR-12 named BSR-12-1 to BSR-12-8. *GT vector was added to the WT. The position of the probe is shown in Fig. 1A. (B) PCR analysis using primers located near the LB and RB (Fig. 1A). Amplification of PCR products in BSR-12-1, 2 and 4–8 means that not only the T-DNA region but also the vector backbone sequence is integrated in these plants. The positions of the primers are shown in Fig. 1A.
Fig. 3Confirmation of vector backbone insertion and Tos17 transposon copy number. (A) Southern blot analysis of ApaI-digested genomic DNA of BSR-9-9-8, BSR-59-8-5, BSR-12-1 and BSR-12-2 using a probe located on the vector backbone (Fig. 1A). (B) Southern blot analysis of BSR-9-9-8, BSR-59-8-5, BSR-12-1 and BSR-12-2 using a probe located on the Tos17 transposon.
Fig. 4Array CGH ratio plots for the forward strand of binary vector pPZP2028. The log2 ratios of Cy5/Cy3 using a moving average of 20 adjacent probes (1.2 kb) were plotted against the cumulative kilobase pairs position. Locations of pBR322 ori, Spectinomycin resistance gene, LB and RB sequence existing on pPZP2028 are shown below the graph.
Number of SNPs and InDels
| Sample (generation) | Depth:quality | Homo SNPs | Hetero SNPs | Homo InDels | Hetero InDels | Total | T0 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Raw | Corrected | Raw | Corrected | Raw | Corrected | Raw | Corrected | Raw | Corrected | SNPs | InDels | Total | Average | ||
| BSR-12-1 (T2) | DP2Q10 | 59 | 84 | 894 | 1,277 | 63 | 90 | 320 | 457 | 1,336 | 1,887 | 225 | 240 | 465 | 501 |
| BSR-12-2 (T2) | DP2Q10 | 88 | 126 | 575 | 821 | 53 | 76 | 169 | 241 | 885 | 1250 | 335 | 202 | 537 | 501 |
| BSR-9-9-8 (T3) | DP2Q10 | 100 | 143 | 487 | 696 | 43 | 61 | 196 | 280 | 826 | 1167 | 327 | 140 | 467 | |
| BSR-59-8-5 (T3) | DP2Q10 | 117 | 167 | 352 | 503 | 29 | 41 | 132 | 189 | 630 | 890 | 382 | 95 | 477 | |
| BSR-12-1 (T2) | DP4Q20 | 34 | 74 | 156 | 339 | 17 | 37 | 81 | 176 | 288 | 626 | 197 | 99 | 296 | 307 |
| BSR-12-2 (T2) | DP4Q20 | 37 | 80 | 152 | 330 | 18 | 39 | 47 | 102 | 254 | 552 | 214 | 104 | 319 | 307 |
| BSR-9-9-8 (T3) | DP4Q20 | 45 | 98 | 121 | 263 | 15 | 33 | 64 | 139 | 245 | 533 | 224 | 75 | 298 | |
| BSR-59-8-5 (T3) | DP4Q20 | 49 | 107 | 90 | 196 | 13 | 28 | 31 | 67 | 183 | 398 | 243 | 65 | 308 | |
Number of sample-specific variants detected in the region called by all four samples.
Validated depth and quality. DP2Q10: reads filled by depth ≥2, quality ≥10 in SAMtools. DP4Q20: reads filled by depth ≥4, quality ≥20 in SAMtools
Number of variants, corrected by coverage of the genome (Supplementary Table S4).
Expected number of SNPs and InDels in T0 calli calculated from the corrected number of homo SNPs, homo InDels and generation.
Pattern of homozygous SNPs
| SNP pattern | BSR-12-1 (T2) | BSR-12-2 (T2) | BSR-9-9-8 (T3) | BSR-59-8-5 (T3) | ||||
|---|---|---|---|---|---|---|---|---|
| No. | % | No. | % | No. | % | No. | % | |
| Homozygous transition | ||||||||
| AT→GC | 6 | 10.2 | 10 | 11.4 | 22 | 22 | 18 | 15.4 |
| GC→AT | 24 | 40.7 | 36 | 40.9 | 37 | 37 | 37 | 31.6 |
| Homozygous transversion | ||||||||
| AT→CG | 2 | 3.4 | 8 | 9.1 | 4 | 4 | 5 | 4.3 |
| AT→TA | 9 | 15.3 | 12 | 13.6 | 11 | 11 | 10 | 8.5 |
| GC→CG | 6 | 10.2 | 8 | 9.1 | 8 | 8 | 7 | 6.0 |
| GC→TA | 12 | 20.3 | 14 | 15.9 | 18 | 18 | 40 | 34.2 |
| Total | 59 | 88 | 100 | 117 | ||||
| Ti | 30 | 50.8 | 46 | 52.3 | 59 | 59 | 55 | 47.0 |
| Tv | 29 | 49.2 | 42 | 47.7 | 41 | 41 | 62 | 53.0 |
| Ti/Tv | 1.03 | 1.1 | 1.44 | 0.89 | ||||
SNPs called by two or more reads with quality score 10 at SAMtools (DP2Q10).
Annotation of variants by snpEff
| Effect of SNPs and InDel | BSR-12-1 (T2) | BSR-12-2 (T2) | BSR-9-9-8 (T3) | BSR-59-8-5 (T3) | ||||
|---|---|---|---|---|---|---|---|---|
| Count | Percentage | Count | Percentage | Count | Percentage | Count | Percentage | |
| FRAME_SHIFT | 0.21% | 0.11% | 0% | 0% | ||||
| NON_SYNONYMOUS_CODING | 1.31% | 0.90% | 0.84% | 1.47% | ||||
| CODON_CHANGE_PLUS_CODON_DELETION | 0% | 0.05% | 0% | 0% | ||||
| START_GAINED | 0.11% | 0.16% | 0.12% | 0.07% | ||||
| START_LOST | 0% | 0% | 0.12% | 0% | ||||
| STOP_GAINED | 0.43% | 0.05% | 0% | 0.07% | ||||
| SPLICE_SITE_ACCEPTOR | 0.04% | 0.05% | 0.06% | 0% | ||||
| SPLICE_SITE_DONOR | 0.04% | 0% | 0% | 0% | ||||
| SYNONYMOUS_CODING | 13 | 0.46% | 7 | 0.37% | 5 | 0.30% | 3 | 0.22% |
| EXON | 11 | 0.39% | 2 | 0.11% | 5 | 0.30% | 2 | 0.15% |
| INTRON | 166 | 5.89% | 136 | 7.20% | 99 | 5.95% | 76 | 5.57% |
| UTR_3_PRIME | 40 | 1.42% | 32 | 1.70% | 22 | 1.32% | 14 | 1.03% |
| UTR_5_PRIME | 21 | 0.75% | 24 | 1.27% | 11 | 0.66% | 20 | 1.47% |
| UPSTREAM | 722 | 25.60% | 477 | 25.27% | 395 | 23.72% | 365 | 26.76% |
| DOWNSTREAM | 690 | 24.47% | 474 | 25.11% | 414 | 24.87% | 341 | 25% |
| INTERGENIC | 1,097 | 38.90% | 711 | 37.66% | 695 | 41.74% | 521 | 38.20% |
| Total | 2820 | 1888 | 1665 | 1364 | ||||
Variants called by two or more reads with quality score 10 at SAMtools (DP2Q10) were annotated.
Mutations predicted to affect the quantity and/or quality of proteins are shown in bold.