| Literature DB >> 32410353 |
Raja Sekhar Nandety1, Juan C Serrani-Yarce1, Upinder S Gill1, Sunhee Oh1, Hee-Kyung Lee1, Xinji Zhang1, Xinbin Dai1, Wenchao Zhang1, Nick Krom1, Jiangqi Wen1, Patrick X Zhao1, Kirankumar S Mysore1.
Abstract
Brachypodium distachyon is an annual C3 grass used as a monocot model system in functional genomics research. Insertional mutagenesis is a powerful tool for both forward and reverse genetics studies. In this study, we explored the possibility of using the tobacco retrotransposon Tnt1 to create a transposon-based insertion mutant population in B. distachyon. We developed transgenic B. distachyon plants expressing Tnt1 (R0) and in the subsequent regenerants (R1) we observed that Tnt1 actively transposed during somatic embryogenesis, generating an average of 6.37 insertions per line in a population of 19 independent R1 regenerant plants analyzed. In seed-derived progeny of R1 plants, Tnt1 segregated in a Mendelian ratio of 3:1 and no new Tnt1 transposition was observed. A total of 126 flanking sequence tags (FSTs) were recovered from the analyzed R0 and R1 lines. Analysis of the FSTs showed a uniform pattern of insertion in all the chromosomes (1-5) without any preference for a particular chromosome region. Considering the average length of a gene transcript to be 3.37 kb, we estimated that 29 613 lines are required to achieve a 90% possibility of tagging a given gene in the B. distachyon genome using the Tnt1-based mutagenesis approach. Our results show the possibility of using Tnt1 to achieve near-saturation mutagenesis in B. distachyon, which will aid in functional genomics studies of other C3 grasses.Entities:
Keywords: zzm321990Brachypodiumzzm321990; zzm321990Tnt1zzm321990; flanking sequence tag; insertional mutagenesis; retrotransposons; sequence capture; technical advance
Mesh:
Substances:
Year: 2020 PMID: 32410353 PMCID: PMC7496502 DOI: 10.1111/tpj.14813
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 6.417
Tnt1 insertion numbers in Brachypodium distachyon using different methods
genic association: Tnt1 insertions that are associated with B. distachyon genes.; nr, non‐redundant.
1The insertions identified from TAIL‐PCR and sequence capture are marked in green‐colored brackets; insertions common to TAIL‐PCR and WGS are marked in blue‐colored brackets and those in common between sequence capture and WGS are marked in red‐colored brackets.
Figure 1Nucleobase composition at the site of insertion of Tnt1.
The plot displays 100 base pairs flanking both sides of the Tnt1 insertion, labeled as [−100, (0), +100]. Bases are shown in colors: A + T in red and G + C in blue. A strong pattern of a low GC percentage along the insertion site is observed. One hundred base pairs on either side of the 5 bp target site duplication (TSD) and within the TSD itself showed G + C preference 100 bp immediately downstream of the TSD. The lowest of these was at positions 20–40, where the preference for G + C (30%) was greater by 10% than the expected average in the genome. The whole‐genome GC% is represented as a thin line.
Distribution of Tnt1 insertions in Brachypodium distachyon
| Line no. | Exon (CDS) | 5′‐UTR | 3′‐UTR | Intron | Genic | Intergenic |
|---|---|---|---|---|---|---|
| MP15 | 2 | 2 | 2 | |||
| MP2 | 1 | |||||
| nc15‐001 | 7 | 1 | 2 | 5 | 12 | 22 |
| nc15‐002 | 2 | |||||
| nc15‐003 | 4 | 4 | 4 | |||
| nc15‐004 | 3 | 1 | 1 | 3 | 7 | |
| nc15‐006 | 1 | 2 | 3 | 3 | ||
| nc15‐007 | 1 | 4 | 4 | 4 | ||
| nc15‐008 | 1 | 4 | 4 | 2 | ||
| nc15‐009 | 1 | 1 | 1 | |||
| nc15‐010 | 1 | 1 | 1 | 1 | ||
| nc15‐011 | 1 | 1 | ||||
| nc15‐013 | 1 | 2 | 2 | 6 | ||
| nc15‐014 | 1 | 1 | 1 | |||
| nc15‐015 | 1 | |||||
| nc2‐001 | 1 | 2 | 3 | 1 | ||
| nc2‐002 | 2 | |||||
| nc2‐003 | 1 | 1 | ||||
| nc2‐004 | 2 | 1 | 4 | 6 | 9 | |
| nc2‐005 | 2 | 1 | 1 | 3 | 4 | |
| nc2‐006 | 2 | |||||
| Total | 20 | 3 | 5 | 34 | 51 | 75 |
CDS, coding sequence; UTR, untranslated region; genic region, exon + 5′‐UTR + 3′‐UTR + intron.
Figure 2Circular genomic visualization of Tnt1 insertions in Brachypodium distachyon genome.
The figure was generated using the Circos package in the R statistical platform. The first band of the circle (outer circle) represents chromosomes (Chr 1–5 and 0). Centromeric positions are marked on each of the chromosomes with a red letter ‘c’. The second band is plotted with GC% of the chromosomes in 500 kb window bins. The third band is plotted with flanking sequence tags (FSTs; total length of the FST at the site of insertion) plotted on to B. distachyon genome. The fourth, innermost, band represents the gene lengths of B. distachyon (total length of the gene at that particular location, thus showing a genic location).
Figure 3Genome coverage estimate for Tnt1 inserts.
The estimation assumes that 96.71% of the Brachypodium distachyon genome is sequenced. The probability of insertion into a transcription region within the sequenced gene space is 42.15%. The probability of tagging any given gene with Tnt1 is a function of gene size and is derived from the equation (where p is the probability of finding one Tnt1 insert within a given gene, x is the average length of a gene transcript, y is the total length of transcribed genes, R tr is the probability of insertion into a transcription region within the sequenced gene space R gs and N is the total number of Tnt1 inserts required). The most likely estimate, which assumes 3.373 kb as the average gene length, is shown in red. The blue points on the red curve show the corresponding numbers of Tnt1 inserts required to achieve 50%, 65%, 75%, 85%, 90%, 95% or 99% genome coverage, respectively.
Estimation of insertion lines required to mutate the Brachypodium distachyon genome
| Percentage chance of mutating a particular gene (%) |
| Insertion lines required ( |
|---|---|---|
| 50 | 56 771 | 8915 |
| 65 | 85 984 | 13 502 |
| 75 | 113 543 | 17 829 |
| 85 | 155 381 | 24 399 |
| 90 | 188 590 | 29 613 |
| 95 | 245 362 | 38 528 |
| 99 | 377 181 | 59 227 |
Given an average gene length of 3.37 kb, the percentage chance of mutating a particular gene would require (N) Tnt1 inserts with a 99.39% chance of insertion into a sequenced gene space and a 42.15% chance of insertion into the gene transcript region. The number of insertion lines (IL) required is based on an insertion rate of 6.37 insertions per line (IL = N/6.37).