| Literature DB >> 29164313 |
Danielle Keidar1, Chen Doron1, Khalil Kashkush2.
Abstract
KEY MESSAGE: Here, we show that Au SINE elements have strong associations with protein-coding genes in wheat. Most importantly Au SINE insertion within introns causes allelic variation and might induce intron retention. The impact of transposable elements (TEs) on genome structure and function is intensively studied in eukaryotes, especially in plants where TEs can reach up to 90% of the genome in some cases, such as in wheat. Here, we have performed a genome-wide in-silico analysis using the updated publicly available genome draft of bread wheat (T. aestivum), in addition to the updated genome drafts of the diploid donor species, T. urartu and Ae. tauschii, to retrieve and analyze a non-LTR retrotransposon family, termed Au SINE, which was found to be widespread in plant species. Then, we have performed site-specific PCR and realtime RT-PCR analyses to assess the possible impact of Au SINE on gene structure and function. To this end, we retrieved 133, 180 and 1886 intact Au SINE insertions from T. urartu, Ae. tauschii and T. aestivum genome drafts, respectively. The 1886 Au SINE insertions were distributed in the seven homoeologous chromosomes of T. aestivum, while ~ 67% of the insertions were associated with genes. Detailed analysis of 40 genes harboring Au SINE revealed allelic variation of those genes in the Triticum-Aegilops genus. In addition, expression analysis revealed that both regular transcripts and alternative Au SINE-containing transcripts were simultaneously amplified in the same tissue, indicating retention of Au SINE-containing introns. Analysis of the wheat transcriptome revealed that hundreds of protein-coding genes harbor Au SINE in at least one of their mature splice variants. Au SINE might play a prominent role in speciation by creating transcriptome variation.Entities:
Keywords: Exonization; Genome evolution; SINE; Transposable elements; Wheat
Mesh:
Substances:
Year: 2017 PMID: 29164313 PMCID: PMC5787218 DOI: 10.1007/s00299-017-2213-1
Source DB: PubMed Journal: Plant Cell Rep ISSN: 0721-7714 Impact factor: 4.570
Fig. 1Copy number and distribution of Au SINE in T. aestivum (genome composition AABBDD) genome. Each chromosome (1–7) is defined by its genome composition (AA, BB and DD subgenomes). A total of 1886 Au SINE insertions were retrieved from the T. aestivum genome draft, while 753, 875, and 221 insertions were retrieved from A, B and D subgenomes, respectively. Note that 37 insertions were not mapped in the seven homeologous chromosomes and they are indicated as “unknown”
Fig. 2Site-specific PCR analysis using primers from Au SINE-flanking sequences. In each panel, the upper arrow represents a “full site” and the lower arrow represents an “empty site”. “M” represents the size marker in all the gels, “NC” represents for negative control, ddH20 was used as template in PCR reactions. The PCR analysis was performed in accessions of: BB1 = Ae. searsii, BB2 = Ae. speltoides, AA = T. urartu, DD = Ae. tauschii, AABB1 = T. durum, AABB2 = T. dicoccoides, AABBDD = T. aestivum. Note that for all polymorphic Au SINE insertions the difference between the “full site” fragment and the “empty site” is ~ 181 bp, the size of Au SINE. Numbers above each lane represent the genomic replicates. a Au SINE insertion in Putative Serine/threonine-protein kinase (case 8 in Table 1). The “full site” is 399 bp and the “empty site” is 218 bp. The insertion is unique to T. aestivum (amplified in two accessions 1 and 3). Note that the rest of the upper bands are non-specific PCR products as seen by sequence validation. b Au SINE insertion in an Predicted protein (case 22 in Table 1). The “full site” is 395 bp and the “empty site” is 214 bp. The insertion was seen in accessions of T. durum, T. dicoccoides and T. aestivum. c Au SINE insertion in Putative ATP-dependent RNA helicase DHX36 (cases 13 in Table 1). The “full site” is 387 bp and the “empty site” is 208 bp. The insertion was seen in T. durum, T. dicoccoides and T. aestivum. d Au SINE insertion in Inositol hexakisphosphate and diphosphoinositol-pentakisphosphate kinase (cases 27 in Table 1). The “full site” is 377 bp and the “empty site” is 196 bp. The insertion was seen in Ae. tauschii and T. aestivum. e Au SINE insertion in Chloride channel protein CLC-c (cases 25 in Table 1). The “full site” is 323 bp, and the “empty site” is140bp. The insertion was seen in Ae. tauschii and T. aestivum. f Au SINE insertion in an Predicted protein (case 31 in Table 1). The “full site” is 274 bp and the “empty site” is 97 bp. The insertion was seen in T. urartu, T. dicoccoides and T. aestivum. g Au SINE insertion in SIN3 transcription regulator family member B (cases 40 in Table 1). The “full site” is 342 bp and the “empty site” is 165. The insertion was seen in all tested species except for Ae. searsii accessions. h Au SINE insertion in Calcineurin-like metallo-phosphoesterase (cases 36 in Table 1). The “full site” is 352 bp and the “empty site” is 180 bp. The insertion was seen in Ae. speltoides, T. durum, T. dicoccoides and T. aestivum
Site-specific PCR analysis of Au SINE insertional polymorphism within genes in Triticum and Aegilops species
| No. | Gene accession numbera | Gene productb | Primer R sequencec | Primer L sequencec | Location within gened | Existence of | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| AA | BB* | BB# | DD | AABB | AABBDD | ||||||
| 1 | TRIAE_CS42_1DL_TGACv1_061151_AA0187150 | DnaJ homolog subfamily C member 13 | GAATTTAGTTTGCGGTTCCAAG | TGTGGTACATATCCATGCGTTT | Intron | √ | √ | √ | √ | √ | √ |
| 2 | TRIAE_CS42_2BL_TGACv1_129350_AA0379730 | CLPTM1-like membrane protein cnrB | ACGAAGAACTAAAAGCCGTGAA | CTGCTATACCATGCGATCGTT | Intron | √ | √ | √ | √ | √ | √ |
| 3 | TRIAE_CS42_7BL_TGACv1_577050_AA1863990 | Predicted protein | CGTATTCAAAGATGTTCCACGA | GGCTGGGTTAAGATTGTTTTTG | Exon | √ | √ | √ | √ | √ | √ |
| 4 | TRIAE_CS42_7BS_TGACv1_591977_AA1927210 | Predicted protein | ATGAAGACAACAAGTGCCACAC | GAATAAACATGCCATTCTGCAA | 100 bp downstream | √ | √ | √ | √ | √ | √ |
| 5 | TRIAE_CS42_4DS_TGACv1_363130_AA1183390 | Rht-D1b | TTGCATCAACTCACCATGAAAT | ATGTGTGAACCGACAACTGAAG | Intron | √ | √ | √ | √ | √ | √ |
| 6 | TRIAE_CS42_4DL_TGACv1_342699_AA1119920 | Superoxide dismutase [Cu–Zn] 3 | GCGACACACCAAAAATTTCATA | GCACCACCTGCTGATACACTTA | Intron | √ | √ | √ | √ | √ | √ |
| 7 | TRIAE_CS42_U_TGACv1_640941_AA2079970 | Disease resistance protein RPP13-like | TTACTGGGACCTTCCACACC | GCCATCCATTTCCATTTCAG | Exon | × | × | × | × | × | √ |
| 8 | TRIAE_CS42_1BS_TGACv1_050314_AA0170550 | Putative Serine/threonine-protein kinase CBK1 | ACATGGATGAGCAGGACTAGGT | TCAGAGGGGTCAGGAATAGAAA | Intron | × | × | × | × | × | √ |
| 9 | TRIAE_CS42_7BL_TGACv1_577920_AA1886220 | Zinc transporter ten-like | CCCAAAGATCGCTAGATCAGA | TGTTCAAACACGGGGATGTA | Intron | × | × | × | × | √ | √ |
| 10 | TRIAE_CS42_1BL_TGACv1_032021_AA0124300 | Putative E3 ubiquitin-protein ligase HERC1 | TCCTTCTCAGGGCGTAGAAA | TTGGTTTACATTCACAGGATCAA | Intron | × | × | × | × | √ | √ |
| 11 | TRIAE_CS42_1BS_TGACv1_049809_AA0161910 | Predicted protein | TGTCTGGTGCTTGTGAAGAAAC | ATCGAATCACATCCCTTTCAGT | Intron | × | × | × | × | √ | √ |
| 12 | TRIAE_CS42_6BL_TGACv1_499645_AA1588080 | Rho guanine nucleotide exchange factor 8 | TCCTTTCCTACCCACAGATCAT | CGCAGACTGATTCCCTGTCTAT | Intron | × | × | × | × | √ | √ |
| 13 | TRIAE_CS42_5AL_TGACv1_377324_AA1245930 | Putative ATP-dependent RNA helicase DHX36 | AGCATTGGGGAGTTTCTATCAG | TAAGAGCCCAACAAATGTCAAA | Intron | × | × | × | × | √ | √ |
| 14 | TRIAE_CS42_5BL_TGACv1_406235_AA1342580 | Predicted protein | TGAGTGGCAAAACTCTCAGATG | GCCTACATCGACCAAATTCTTC | Intron | × | × | × | × | √ | √ |
| 15 | TRIAE_CS42_2AL_TGACv1_093126_AA0272720 | Cell division protein FtsZ homolog 1, chloroplastic | AGTGCCTGACGTGGTAAGAAAT | GAATTTCTGTTTGCAGTGCTTG | Intron | × | × | × | × | √ | √ |
| 16 | TRIAE_CS42_2BL_TGACv1_130367_AA0409600 | Exocyst complex component 2 | GTGAGAACTGAGCATGAACTGG | ATCCATTAGGCCTTGGGTAACT | Intron | × | × | × | × | √ | √ |
| 17 | TRIAE_CS42_3B_TGACv1_220590_AA0709880 | Josephin family protein | CAGCTGTACACTTCAAACCAATG | TATGATTTGATCCGAAATGCAA | Intron | × | × | × | × | √ | √ |
| 18 | TRIAE_CS42_1BS_TGACv1_049553_AA0156720 | Predicted protein | GCTATCGCCTGGTTATGAGTTC | AAGAGGATCATTTGCTTTTCCA | Exon/Intron | × | × | × | × | √ | √ |
| 19 | TRIAE_CS42_2BS_TGACv1_146572_AA0468420 | 2-dehydro-3-deoxyphosphooctonate aldolase | GCAGACATTTTTGCTCAACCTT | CATGATGATTCCCTTGATGTTG | Intron | × | × | × | × | √ | √ |
| 20 | TRIAE_CS42_2AL_TGACv1_096183_AA0317680 | ATP-dependent RNA helicase SUPV3L1, mitochondrial | ATCTACGCCTTATTTGCTCTGG | GGTAAAGTGTGCCTTTTTGAGG | Intron | × | × | × | × | √ | √ |
| 21 | TRIAE_CS42_2BL_TGACv1_129880_AA0398630 | Oligoribonuclease | TGCTAGTGGACTCAACCAAATC | TGGCCTCAGAGCCTAGTAACA | Intron | × | × | × | × | √ | √ |
| 22 | TRIAE_CS42_1AL_TGACv1_000103_AA0003430 | Predicted protein | CAGACCACAATGGGTATGGTTA | CCATAGAACTCCATCAACATCG | intron | × | × | × | × | √ | √ |
| 23 | TRIAE_CS42_5BL_TGACv1_405351_AA1325650 | MADS-box transcription factor 14 | CTAGGTCCATCTGGTCCCTAAAG | CTTTGACTACCCCACATTAGCTG | Intron | × | × | × | × | √ | √ |
| 24 | TRIAE_CS42_5BL_TGACv1_404700_AA1308770 | E3 ubiquitin-protein ligase AIP2 | AGATTTGTGATAGCAGCACCAG | TTGTGAGTTACCTTGAGCCTAGC | Intron | × | × | × | × | √ | NA |
| 25 | TRIAE_CS42_2DL_TGACv1_158055_AA0507480 | Chloride channel protein CLC-c | CAGCTGTGCTGATTTGCCTA | GAAAGGATGATGCAGAGTTTCA | Intron | × | × | × | √ | × | √ |
| 26 | TRIAE_CS42_2DS_TGACv1_177403_AA0575920 | 5′–3′ exoribonuclease 3 | CATGGCATAACATCGAACAAAA | AGCCGCCTTTTTGAATTTTACT | Intron | × | × | × | √ | × | √ |
| 27 | TRIAE_CS42_4DS_TGACv1_361106_AA1161070 | Inositol hexakisphosphate and diphosphoinositol-pentakisphosphate kinase 1 | TCTCCCTCACAGATCACTTCAA | TTAGCCACAAGTTTGGATAGGC | Intron | × | × | × | √ | × | √ |
| 28 | TRIAE_CS42_4DL_TGACv1_343519_AA1135850 | Conserved oligomeric Golgi complex subunit 6-like | GTGCCAAGTGAATGAAGTCAAG | CTGACACCATGGTACCCTAACA | Intron | × | × | × | √ | × | √ |
| 29 | TRIAE_CS42_6DL_TGACv1_526989_AA1696800 | Serine/threonine-protein kinase | TAGTCTCTAATTTGCGGGTCCA | CACGCATGTCACCAAACATTA | Intron | × | × | × | √ | × | √ |
| 30 | TRIAE_CS42_4AL_TGACv1_290382_AA0984800 | ERBB-3 BINDING PROTEIN 1 | TACCGTCAGCAGAACACCAC | TGAGATGTGTGAGAAGGGTGA | Intron | √ | × | × | × | √ | √ |
| 31 | TRIAE_CS42_5AL_TGACv1_378959_AA1255190 | Leucine zipper protein | AGATTGCTGGAAAATAAGGACAA | TTTTGGATCGTGCCTAGGAG | Intron | √ | × | × | × | √ | √ |
| 32 | TRIAE_CS42_2AL_TGACv1_093836_AA0287830 | Putative pectinesterase/ pectinesterase inhibitor 51 | TCTCCCTTTGTCACTTTTGCTT | GAAGCATCTTTCTGCCATCTTT | Intron | √ | × | × | × | √ | √ |
| 33 | TRIAE_CS42_3AS_TGACv1_211370_AA0689310 | C3H2C3 RING-finger protein | CCTCAGGAGGTGAATTGCTC | TTAACCCGCCTCACTTTGTC | Exon/Intron | √ | × | × | × | √ | NA |
| 34 | TRIAE_CS42_7AL_TGACv1_558277_AA1791890 | ELAV-like protein 1 | ATAGCCAGTGGTAGGCCACA | ACACTGACCGGATTTGAACC | Intron | × | √ | √ | × | √ | √ |
| 35 | TRIAE_CS42_2BS_TGACv1_145935_AA0449980 | Mediator of RNA polymerase II transcription subunit 14-like | CCCAAAAGATGAAATAGCAACC | GAAGAGGAAGGGCCAGTATGTT | Intron | × | × | √ | × | √ | √ |
| 36 | TRIAE_CS42_5BL_TGACv1_407697_AA1358910 | Calcineurin-like metallo-phosphoesterase superfamily protein | GTACATCAGTAGCGCAATGGAA | GAACTCCCAAGAAGAGGACCTT | intron | × | × | √ | × | √ | √ |
| 37 | TRIAE_CS42_4AL_TGACv1_288293_AA0943850 | Rho GTPase-activating protein 7 | CCCTCAAATGCAAAGCGTAT | CTCCTCATGCTACGACGACA | Intron | √ | √ | × | √ | √ | √ |
| 38 | TRIAE_CS42_7DS_TGACv1_624740_AA2063030 | Kanadaptin | CATGTGTGCTTCCAAGATCG | ACCATGACTGGAATCGAAGG | Intron | √ | × | √ | √ | √ | √ |
| 39 | TRIAE_CS42_4DL_TGACv1_342534_AA1116030 | Serine-aspartate repeat-containing protein I-like | TGGAACCTGTCGGCTCTATTAT | AAGAATGATGGTCATGGATGTG | Intron | × | √ | √ | √ | √ | √ |
| 40 | TRIAE_CS42_1BS_TGACv1_049347_AA0149810 | SIN3 transcription regulator family member B | ACACGGCATATGGGTAATTGA | ATTCTGGCCACGTGATCTCTAT | Intron | √ | × | √ | √ | √ | √ |
aGene accessions from EnsemblPlants (http://plants.ensembl.org/Triticum_aestivum)
bBased on gene annotation from EnsemblPlants with e value < 10−10.
cPrimers were designed from Au SINE-flanking sequences.
“NA (Not available)” represents monomorphic cases in which the original genome that displays Au SINE insertion in the gene cannot be determined
dLocation of Au SINE within or adjacent the gene: exon, intron, exon/intron junction or downstream of the gene
eAA = T. urartu, BB* = Ae. searsii, BB# = Ae. speltoides, DD = Ae. tauschii, AABB = T. turgidum, AABBDD = T. aestivum, “√"-full site in at least one accession of the same species; “×" empty site in all accessions of the same species, NA data not available
Fig. 3Relative expression levels of genes harboring Au SINE in two accessions of T. aestivum, as seen by realtime RT-PCR analysis. In each accession, the “regular transcript” compared to the Au SINE-containing transcript were analyzed. On top of each panel, a scheme of the analyzed gene, while the numbered boxes the exons and the black boxes note the Au SINE insertion. White arrows note the primers used to amplify the “regular transcript”, while the black arrows note the primers used to amplify the Au SINE-containing transcript. Expression levels (Y axis) were measured relative to ACTIN, and the exact relative expression (fold) is indicated by on top of each bar. Standard error on top of each bar was measured using three biological replicates. All the analyzed genes display the same trend of significantly higher expression levels of the regular transcript compared to the Au SINE-containing transcript, in the two tested T. aestivum accessions. The 4 analyzed genes: a DnaJ homolog (case 1 in Table 1), b Calcineurin-like metallo-phosphoesterase (case 36 in Table 1), c Putative Serine/threonine-protein kinase (case 8 in Table 1), and d Superoxide dismutase (case 6 in Table 1)
Fig. 4Distribution of 84 Au SINE-containing genes in the seven homoeologous chromosomes of T. aestivum. Each chromosome (1–7) is defined by its genome composition (AA, BB and DD subgenomes). A total of 28, 47 and 2 Au SINE-containing genes were retrieved from AA, BB and DD subgenomes, respectively. Note that 7 Au SINE-containing genes were not mapped in the seven homoeologous chromosomes and they are indicated as “unknown”
In-silico analysis of Au SINE-containing transcripts
| Gene (EnsemblPlants)a | Gene productb | Locationc | Number of splice variantsd | Number of splice variants containing Au SINEe |
|---|---|---|---|---|
| TRIAE_CS42_1AL_TGACv1_002495_AA0042400 | Putative methyltransferase PMT7 | 1A | 5 | 3 |
| TRIAE_CS42_1AS_TGACv1_020149_AA0075200 | Protein STIP1-like protein / Ankyrin | 1A | 3 | 3 |
| TRIAE_CS42_1BL_TGACv1_030243_AA0083440 | 3-deoxy-manno-octulosonate cytidylyltransferase, mitochondrial | 1B | 3 | 2 |
| TRIAE_CS42_1BS_TGACv1_051223_AA0178630 | Methyl-CpG-binding domain protein 4 | 1B | 9 | 3 |
| TRIAE_CS42_2AL_TGACv1_094480_AA0298520 | Heterogeneous nuclear ribonucleoprotein Q | 2A | 6 | 1 |
| TRIAE_CS42_2AL_TGACv1_095668_AA0313540 | Cysteine proteinases superfamily protein | 2A | 3 | 1 |
| TRIAE_CS42_2BL_TGACv1_130920_AA0419930 | Serine/threonine-protein kinase | 2B | 3 | 1 |
| TRIAE_CS42_2BL_TGACv1_131783_AA0432150 | AMP-activated protein kinase, gamma regulatory subunit | 2B | 2 | 1 |
| TRIAE_CS42_2BL_TGACv1_131823_AA0432620 | SNARE associated Golgi protein family | 2B | 6 | 3 |
| TRIAE_CS42_2BS_TGACv1_148673_AA0494490 | Predicted membrane protein | 2B | 2 | 1 |
| TRIAE_CS42_2DS_TGACv1_177434_AA0577040 | Predicted protein | 2D | 3 | 1 |
| TRIAE_CS42_3AL_TGACv1_193715_AA0618280 | 4-coumarate–CoA ligase-like 9 | 3A | 8 | 1 |
| TRIAE_CS42_3AL_TGACv1_194196_AA0628350 | Predicted protein | 3A | 2 | 2 |
| TRIAE_CS42_3AS_TGACv1_211514_AA0690950 | Retinol dehydrogenase 14 | 3A | 3 | 1 |
| TRIAE_CS42_3B_TGACv1_222217_AA0760010 | Potassium transporter 5 | 3B | 2 | 2 |
| TRIAE_CS42_3B_TGACv1_224095_AA0792250 | Vacuolar-processing enzyme | 3B | 6 | 3 |
| TRIAE_CS42_3B_TGACv1_227320_AA0822830 | Zinc finger CCCH domain-containing protein 12 | 3B | 4 | 1 |
| TRIAE_CS42_4AL_TGACv1_288915_AA0961300 | Protein kinase superfamily protein | 4A | 3 | 2 |
| TRIAE_CS42_4AL_TGACv1_291111_AA0992310 | Noncoding RNA | 4A | 2 | 1 |
| TRIAE_CS42_4AS_TGACv1_306183_AA1003850 | Periplasmic serine endoprotease DegP-like | 4A | 2 | 2 |
| TRIAE_CS42_4BL_TGACv1_321946_AA1067110 | Predicted protein | 4B | 3 | 2 |
| TRIAE_CS42_4BS_TGACv1_328309_AA1086060 | Protein CDC73 homolog | 4B | 3 | 1 |
| TRIAE_CS42_4BS_TGACv1_328640_AA1091470 | Predicted protein | 4B | 4 | 2 |
| TRIAE_CS42_4DL_TGACv1_343838_AA1140270 | FAR-RED IMPAIRED RESPONSE 1-like | 4D | 2 | 2 |
| TRIAE_CS42_5AL_TGACv1_374408_AA1199410 | DExH-box ATP-dependent RNA helicase DExH16, mitochondrial | 5A | 3 | 1 |
| TRIAE_CS42_5AL_TGACv1_374413_AA1199550 | Disease resistance RPP8-like protein 3 | 5A | 2 | 2 |
| TRIAE_CS42_5AL_TGACv1_375575_AA1223920 | Non-coding RNA | 5A | 2 | 1 |
| TRIAE_CS42_5BL_TGACv1_404363_AA1296950 | Predicted protein | 5B | 3 | 2 |
| TRIAE_CS42_5BL_TGACv1_406039_AA1339580 | Carbamoyl-phosphate synthase small chain, chloroplastic | 5B | 4 | 2 |
| TRIAE_CS42_5BL_TGACv1_407028_AA1352680 | FBD-associated F-box protein | 5B | 2 | 2 |
| TRIAE_CS42_5BL_TGACv1_407299_AA1355630 | Signal recognition particle-related/SRP-related | 5B | 3 | 1 |
| TRIAE_CS42_5BL_TGACv1_408403_AA1363260 | Predicted protein | 5B | 9 | 1 |
| TRIAE_CS42_5BS_TGACv1_424513_AA1390380 | Putative WRKY transcription factor 3 | 5B | 2 | 1 |
| TRIAE_CS42_6AL_TGACv1_472758_AA1525700 | U-box domain-containing protein 11 | 6A | 3 | 1 |
| TRIAE_CS42_6AS_TGACv1_485705_AA1550580 | Predicted protein | 6A | 3 | 3 |
| TRIAE_CS42_6BL_TGACv1_499355_AA1579140 | Lysyl-tRNA synthetase | 6B | 4 | 2 |
| TRIAE_CS42_6BL_TGACv1_501000_AA1612460 | F-box/FBD/LRR-repeat protein | 6B | 4 | 1 |
| TRIAE_CS42_6BS_TGACv1_514524_AA1660940 | Predicted protein | 6B | 6 | 1 |
| TRIAE_CS42_6BS_TGACv1_514925_AA1665920 | Transcription termination factor MTERF8, chloroplastic-like | 6B | 2 | 2 |
| TRIAE_CS42_7AL_TGACv1_557374_AA1780510 | Polyadenylate-binding protein RBP45-like | 7A | 4 | 3 |
| TRIAE_CS42_7AS_TGACv1_569582_AA1819670 | Predicted protein | 7A | 3 | 1 |
| TRIAE_CS42_7BL_TGACv1_577086_AA1865600 | ELAV-like protein 1 | 7B | 2 | 1 |
| TRIAE_CS42_7BL_TGACv1_577812_AA1883950 | Hydroxyproline O-galactosyltransferase HPGT1 | 7B | 2 | 1 |
| TRIAE_CS42_7BS_TGACv1_591848_AA1923570 | Transcription initiation factor TFIID subunit 10 | 7B | 3 | 1 |
| TRIAE_CS42_7BS_TGACv1_593481_AA1951940 | Putative clathrin assembly protein | 7B | 3 | 1 |
| TRIAE_CS42_U_TGACv1_640735_AA2071780 | Putative rust resistance kinase Lr10 | NA | 2 | 2 |
| TRIAE_CS42_U_TGACv1_640941_AA2079970 | disease resistance protein RPP13-like | NA | 2 | 2 |
| TRIAE_CS42_U_TGACv1_641735_AA2102860 | Methionine S-methyltransferase | NA | 2 | 2 |
| TRIAE_CS42_U_TGACv1_641821_AA2105030 | Putative Exocyst complex component 7 | NA | 2 | 1 |
| TRIAE_CS42_U_TGACv1_643249_AA2129660 | Mitochondrial inner membrane translocase complex, subunit Tim44-related protein | NA | 2 | 1 |
aGene accessions from EnsemblPlants (plants.ensembl.org/Triticum_aestivum)
bBased on gene annotation from EnsemblPlants with e value < 10−10.
cChromosome location of the gene in T. aestivum genome. NA = not available
dThe total number of splice variants for each gene detected from RNA-seq databases of T. aestivum
eThe number of Au SINE-containing transcripts out of the total number of splice variants detected for each gene
Fig. 5Splice variants (transcripts) of four Au SINE-harboring genes (a–d). The name of the gene and EnsemblPlants accessions number are indicated on top. Gray boxes represent exons and lines represent introns. White boxes represent CDS (coding sequences) regions. Note that the mature transcripts consist of exons only; thus, we kept here the intron regions to indicate the exact location of Au SINE (black boxes) in the mature transcript. The predicted protein for each splice variant is indicated on right. a Transcripts 2, 4 and 5 contain Au SINE. b Transcript 2 contains Au SINE. c Transcript 2 contains Au SINE. d Transcript 3 contains Au SINE