| Literature DB >> 34092948 |
Agnieszka Skarzyńska1, Magdalena Pawełkowicz1, Wojciech Pląder1.
Abstract
The development of new plant varieties by genetic modification aims at improving their features or introducing new qualities. However, concerns about the unintended effects of transgenes and negative environmental impact of genetically modified plants are an obstacle for the use of these plants in crops. To analyze the impact of transgenesis on plant genomes, we analyze three cucumber transgenic lines with an introduced thaumatin II gene. After genomes sequencing, we analyzed the transgene insertion site and performed variant prediction. As a result, we obtained similar number of variants for all analyzed lines (average of 4307 polymorphisms), with high abundance in one region of chromosome 4. According to SnpEff analysis, the presence of genomic variants generally does not influence the genome functionality, as less than 2% of polymorphisms have high impact. Moreover, analysis indicates that these changes were more likely induced by in vitro culture than by the transgenesis itself. The insertion site analysis shows that the region of transgene integration could cause changes in gene expression, by gene disruption or loss of promoter region continuity. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12298-021-00990-8.Entities:
Keywords: Comparative genomics; Cucumber; SNP; Small nucleotide polymorphism; Thaumatin; Transgenesis
Year: 2021 PMID: 34092948 PMCID: PMC8139995 DOI: 10.1007/s12298-021-00990-8
Source DB: PubMed Journal: Physiol Mol Biol Plants ISSN: 0974-0430
Information about thaumatin II transgene insertion site in the cucumber genome
| Line | Junction site | Deletion in the insertion site | Transgene insertion | Integration site |
|---|---|---|---|---|
| 212 | Chr 6; ctg1402 | 1304 nt | T-DNA region; 1 copy | Intergenic region |
| 224 | Chr 2; ctg1522 | 361 nt | T-DNA region; 1 copy | Promoter of |
| 225 | Chr 2; ctg2178 | 95 nt | T-DNA—vector backbone—T-DNA; 2 copies | Intergenic region |
Fig. 1a Scheme of the T-DNA used for transformation. b The exact sites in specific contigs, and the way of integration of the inserted T-DNA. The deletion within the insertion site is marked with a white bar; dotted lines mark the junction sites indicated by chimeric read pairs mapped to the reference genome on one side and to the T-DNA sequence on the other side
Number of SNVs predicted by Freebayes and DeepVariant, and after cross-checking. Variant rate is the number of variants per 1b of genomic sequence
| Freebayes | DeepVariant | Common | Variation rate | |
|---|---|---|---|---|
| 212 | 4652 | 80,246 | 4610 | 1.35 × 10−5 |
| 224 | 3975 | 79,076 | 3929 | 1.15 × 10−5 |
| 225 | 4426 | 81,108 | 4383 | 1.28 × 10−5 |
| Mean | 4351 | 80,143 | 4307 | 1.26 × 10−5 |
Fig. 2a Number of particular types of predicted variants in cucumber thaumatin lines. b Venn diagram of common polymorphisms in transgenic lines
Fig. 3a Distribution of predicted variants on the cucumber chromosomes. Histograms show the number of polymorphisms within regions of 10 000 nt. The region on chromosome 4 with the highest density of polymorphisms is marked with a box. The transgene insertion sites are marked with asterisks. b Distribution of predicted polymorphism within ctg1556, which possesses the highest number of variants. Positions of variants are marked with dots, and yellow bars indicate genes (protein coding, lincRNA, tRNA, rRNA, snRNA, and miRNA) on plus and minus strands
Types of HI variants according to SnpEff analysis
| 212 | 224 | 225 | ||||
|---|---|---|---|---|---|---|
| frameshift_variant | 108 | 95.58% | 92 | 94.85% | 87 | 96.67% |
| start_lost | 2 | 1.77% | 2 | 2.06% | 1 | 1.11% |
| stop_gained | 1 | 0.88% | 1 | 1.03% | 1 | 1.11% |
| stop_lost | 2 | 1.77% | 2 | 2.06% | 1 | 1.11% |
Fig. 4Functional KOG classification of genes localized on ctg1556