| Literature DB >> 22709793 |
Joung Sug Kim1, Jiye Kim1, Tae-Ho Lee2, Kyong Mi Jun2, Tea Hoon Kim2, Yul-Ho Kim3, Hyang-Mi Park3, Jong-Seong Jeon4, Gynheung An4, Ung-Han Yoon5, Baek Hie Nahm1,2, Yeon-Ki Kim2.
Abstract
BACKGROUND: Information about a transgene locus is one of the major concerns in transgenic research because expression of the transgene or a gene interrupted by the integration event could be affected. Thus, the flanking sequences obtained from transgenic plants need to be analyzed in terms of genomic context, such as genic and intergenic regions. This process may consist of several steps: 1) elimination of a vector sequence from the flanking sequence, 2) finding the locations in the target genome, and 3) statistics of the integration sites. These steps could be automated for flanking sequences from several dozens of transgenic plants generated in an ordinary targeted gene expression strategy. It would be indispensable in a genome-wide mutagenesis screen using T-DNA or transposons because these projects often generate several thousands of transgenic lines and just as many loci of the transgene among the transgenic plants.Entities:
Year: 2012 PMID: 22709793 PMCID: PMC3439307 DOI: 10.1186/1746-4811-8-19
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Figure 1Procedure in FSTVAL. FSTVAL involves two consecutive steps: 1.) validate FSTs and 2.) find the best mapping position to the genome. Users can input FSTs in FASTA, FASTQ, or PHD format. First, the regions of the border of T-DNA insertion, the adaptor for PCR amplification, and the binary vector are masked from FSTs, and the FSTs are validated. Next, FSTs are matched to the genome and categorized. The highest scoring region is selected as an FST integration site.
Summary characteristics of T-DNA flanking sequences
| Acceptable sequence (A) a | 706 | 63.4% | 3,921 | 97.3% |
| Not acceptable sequence (NA) b | 121 | 10.9% | 89 | 2.2% |
| vector c | 257 | 23.1% | 13 | 0.3% |
| Low quality sequence (Low) | 30 | 2.7% | 7 | 0.2% |
| Total | 1,117 | 100.0% | 4,030 | 100.0% |
a FSTs larger than 30 bp after removal of the RB and adaptor sequences.
b FSTs shorter than 30 bp after removal of the RB and adaptor sequences.
c Binary vector or T-DNA tandem repeats.
Frequencies of T-DNA insertions in the genic and intergenic regions compared with Tos17
| | |||||
|---|---|---|---|---|---|
| Genic | | 273 | 41.8% | 2,350 | 60.3% |
| | Exon | 53 | 8.1% | 1,029 | 26.4% |
| | intron | 56 | 8.9% | 714 | 18.4% |
| | 5'Upstream(−1,000 bp) | 138 | 21.1% | 429 | 11.0% |
| | 3'Downstream(+300 bp) | 26 | 4.0% | 178 | 4.5% |
| Intergenic | | 368 | 56.4% | 1,424 | 36.6% |
| Repeat (>1) | | 12 | 1.8% | 120 | 3.1% |
| Total | 653 | 100.0% | 3,895 | 100.0% | |
Distribution of T-DNA insertions over the 12 rice chromosomes compared with Tos17 insertions
| 1 | 43.6 | 92 | 14% | 0.5 | 485 | 13% | 11.1 |
| 2 | 35.9 | 78 | 12% | 0.5 | 486 | 12% | 13.0 |
| 3 | 36.3 | 85 | 13% | 0.4 | 455 | 12% | 12.5 |
| 4 | 35.2 | 64 | 10% | 0.6 | 355 | 9% | 10.1 |
| 5 | 29.9 | 27 | 4% | 1.1 | 331 | 9% | 11.1 |
| 6 | 31.2 | 40 | 6% | 0.8 | 309 | 8% | 9.9 |
| 7 | 29.7 | 49 | 8% | 0.6 | 278 | 7% | 9.4 |
| 8 | 28.3 | 51 | 8% | 0.6 | 283 | 7% | 10.0 |
| 9 | 23 | 42 | 7% | 0.5 | 205 | 5% | 8.9 |
| 10 | 22.9 | 39 | 6% | 0.6 | 157 | 4% | 6.9 |
| 11 | 28.5 | 43 | 7% | 0.7 | 244 | 6% | 8.6 |
| 12 | 27.5 | 31 | 5% | 0.9 | 204 | 5% | 7.4 |
| Total | 372.1 | 641 | 100% | 0.6 | 3774 | 100% | 10.1 |
a T-DNA inserts/total inserts.
b Number of T-DNA inserts/size of chromosome.
Figure 2Distribution maps of the T-DNA (A) and Tos17 (B) FSTs in the rice genome. Among 1,114 and 4,030 sequences from T-DNA and Tos17 insertion batches, 653 and 3,895 sequences passed through the filtration and chromosome mapping steps, respectively.
Figure 3Frequency graphs of Tos17 insertion on rice chromosomes. The 3,895 sequences from Tos17 insertion batches are presented as frequencies along the chromosomes.
Figure 4Identification of Tos17- insertion mutants. The genomic structures of insertion alleles were determined by FSTVAL analysis in which boxes, bold lines, asterisks, and triangles indicate exons, intron, start codons (ATG), and Tos17, respectively. Scale bar represents DNA length for each gene. The arrow and arrowhead indicate gene specific primer pairs from genomic DNA and 3Tos primers at 239 bp downstream from the 3’ end of Tos17, respectively. Genomic DNA was isolated from the leaf of Tos17 insertion lines for PCR analysis. The PCR product size from wild type was approximately 1 kb, which is 240 bp smaller than that from Tos17 insertion lines. W/W wild type, T/W heterozygous for Tos17 insertion line, T/T homozygous for Tos17 insertion line.