| Literature DB >> 22905278 |
M E Manetti, M Rossi, G M Q Cruz, N L Saccaro, M Nakabashi, V Altebarmakian, M Rodier-Goud, D Domingues, A D'Hont, M A Van Sluys.
Abstract
Mutator-like transposase is the most represented transposon transcript in the sugarcane transcriptome. Phylogenetic reconstructions derived from sequenced transcripts provided evidence that at least four distinct classes exist (I-IV) and that diversification among these classes occurred early in Angiosperms, prior to the divergence of Monocots/Eudicots. The four previously described classes served as probes to select and further sequence six BAC clones from a genomic library of cultivar R570. A total of 579,352 sugarcane base pairs were produced from these "Mutator system" BAC containing regions for further characterization. The analyzed genomic regions confirmed that the predicted structure and organization of the Mutator system in sugarcane is composed of two true transposon lineages, each containing a specific terminal inverted repeat and two transposase lineages considered to be domesticated. Each Mutator transposase class displayed a particular molecular structure supporting lineage specific evolution. MUSTANG, previously described domesticated genes, are located in syntenic regions across Sacharineae and, as expected for a host functional gene, posses the same gene structure as in other Poaceae. Two sequenced BACs correspond to hom(eo)logous locus with specific retrotransposon insertions that discriminate sugarcane haplotypes. The comparative studies presented, add information to the Mutator systems previously identified in the maize and rice genomes by describing lineage specific molecular structure and genomic distribution pattern in the sugarcane genome. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s12042-012-9104-y) contains supplementary material, which is available to authorized users.Entities:
Year: 2012 PMID: 22905278 PMCID: PMC3418495 DOI: 10.1007/s12042-012-9104-y
Source DB: PubMed Journal: Trop Plant Biol ISSN: 1935-9756 Impact factor: 1.512
R570 BAC gene index
| Sugarcane BAC | Genes1 | FS or STOP2 | Orthologous loci | Transcript5 | |
|---|---|---|---|---|---|
| Sorghum3 | Rice4 | ||||
| 095F04 (130,606 bp) R570 haplotype A | Putative DEAH (Asp-Glu-Ala-His) box polypeptide 35 (DEAD-like helicases superfamily) (rnah) | No | Sb03g002030 | LOC_Os01g11370 | BQ536744 (stem), CA217625 (seedling), CA217706 (seedling). |
| Calcium-binding protein (cbp) | No | Sb03g002020 | LOC_Os01g11414 | CA300700 (etiolated leaf), TA32438_4547 (leaf). | |
| Class III MUSTANG (SC-MUGA.2) | No | Sb03g002010 | LOC_Os01g41210 | CA246160 (inflorescence), DQ115076 (callus). | |
| 148J07 (116,781 bp) R570 haplotype B | Putative DEAH (Asp-Glu-Ala-His) box polypeptide 35 (DEAD-like helicases superfamily) (rnah) | No | Sb03g002030 | LOC_Os01g11370 | BQ536744 (stem), CA217625 (seedling), CA217706 (seedling). |
| Calcium-binding protein (cbp) | No | Sb03g002020 | LOC_Os01g11414 | CA300700 (etiolated leaf), TA32438_4547 (leaf). | |
| Class III MUSTANG (SC-MUGA.1) | No | Sb03g002010 | LOC_Os01g41210 | CA246160 (inflorescence), DQ115076 (callus). | |
| 115J16 (132,900 bp) | Protein brittle-1 chloroplast precursor ( | Yes | Sb07g027010 | LOC_Os08g40850 | CA074816 (apical meristem), CA202529 (inflorescence). |
| exocyst complex subunit family protein (exo) | Yes | Sb07g027000 | LOC_Os08g40840 | No transcript. | |
| Pumilio domain-containing protein (ppd1) | No | Sb07g026995 | LOC_Os08g40830 | CA099599 (calli), TA38834_4547 (apical meristem, inflorescence, root tips, callus and leaf roll). | |
| Class I Mutator-like transposase (SC-MuI.1) | No | Absent | Absent | DQ115055 (inflorescence and rachis). | |
| 086H20 (fragment: 143,827 bp) | 4-hydroxy-3-methylbut-2-enyl diphosphate reductase (hme) | No | Sb01g009140 | LOC_Os03g52170 | TA33519_4547 (inflorescence, root apex, stalk bark). |
| Regulatory protein ( | Yes | Sb01g009150 | LOC_Os03g52160 | TA37774_4547 (inflorescence, seedling, leaf roll, stalk bark), TA49600_4547 (root tips, seedling). | |
| Putative leishmanolysin-like protein ( | No | Sb01g009170 | LOC_Os03g52150 | CA177995 (first apical stalk), TA33915_4547 (shoot-root transition zone, inflorescence and rachis), TA33917_4547 (inflorescence and rachis). | |
| Expressed protein ( | Yes | Sb01g009180 | LOC_Os03g52130 | Sorghum: CD428305; Rice: AK070777. | |
| Class I Mutator-like transposase (SC-MuI.2) | No | Absent | Absent | DQ115055 (inflorescence and rachis). | |
| 249C12 (fragment: 75,471 bp) | Expressed protein ( | No | Sb10g024710 | LOC_Os06g42660 | Rice: AK107791 (inflorescence) |
| Class IV MUSTANG (SC-MUGB.1) | No | Sb10g024700 | LOC_Os06g42640 | TA40072_4547 (root tips, leaf roll, root apex). | |
| Protein B3 DNA binding domain containing protein (b3) | Yes | Unknown* | LOC_Os06g42630 | No transcript. | |
| Hypothetical protein (hp) | No | Sb10g024690 | LOC_Os06g42620 | No transcript. | |
| 007O13 (fragment: 14,093 bp) | Class II Mutator-like transposase (SC-MuII.1) | No | Absent | Absent | TA46073_4547 (apical meristem). |
*b3 was identified in sorghum even when is not annotated at Phytozome (http://www.phytozome.net/sorghum)
1: the gene name abbreviation is between parenthesis
2: presence of frame shifts or stop codons compared with orthologous protein sequence of sorghum
3: sorghum locus identification according to Phytozome (http://www.phytozome.net/sorghum)
4: rice locus identification according to TIGR (http://www.tigr.org/tdb/e2k1/osa1)
5: identified transcript in Plant Transcript Assemblies fhttp://plantta.tigr.orgD. the tissue is indicated in parenthesis. All transcripts belong to sugarcane except when indicated
6: a fragment of the BAC clone was sequenced and annotated
Fig. 1Genomic structure of the four Mutator-like classes found in sugarcane sequenced BACs. Schematic representation of the genomic features of Class I (ScMuI) and Class II (ScMuII) transposon units, and Class III (MUGA) and Class IV (MUGB) MUSTANG genes isolated from sugarcane R570 BAC clones. Light grey boxes represent exons drawn to scale. Dark grey boxes correspond to portions with similarity to a Mutator transposase protein domain. Black boxes represent terminal inverted repeats (TIRs) bordering transposon units. TSD; are the target site duplication sequence identified at the immediate border of the transposon adjacent to the TIR. (plus sign/minus sign) denotes presence or absence of the featured region of the transposon in the sequenced BAC. Triangles represent LTR retrotransposons inserted inside sequenced elements
Fig. 2Distribution of Mutator class II on chromosomes. In situ hybridization of the Class II Mutator-like transposase clone TE109, detected by FITC fluorescence (green) to metaphase chromosomes of sugarcane ROC10 cultivar (a and b) and a hybrid Saccharum x Miscanthus (c and d). Chromosomes are counterstained with DAPI (blue signal, b and d). Genomic DNA from Miscanthus was labeled with biotin and detected by Texas Red fluorescence (d)
Gene features predicted
| Gene1 | Number of exons2 | Total length of exons (bp) | Number of introns2 | Total length of introns (bp) | Total length of protein (aa) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| sorghum | rice |
|
| sorghum | rice |
|
| sorghum | rice | |||
|
| 23 | 2,103 | 1,7264 | 2,103 | 2,103 | 22 | 12,366 | 11,032 | 10,037 | 9,561 | 700 | 5744 | 700 | 700 |
|
| 7 | 1,722 | 1,692 | 1,725 | 1,755 | 6 | 1,890 | 2,085 | 1,927 | 2,912 | 573 | 563 | 574 | 584 |
|
| 1 | 1,785 | 1,785 | 1,785 | 1,830 | 1 | 729 | 1,756 | 1,058 | – | 594 | 594 | 594 | 555 |
|
| 5 | 1,234 | 1,149 | 1,158 | 4 | 8,477 | 3,288 | 2,776 | 410 | 382 | 385 | |||
|
| 2 | 2,065 | 2,082 | 2,064 | 1 | 118 | 105 | 531 | 687 | 693 | 687 | |||
|
| 9 | 2,295 | 2,292 | 5421 | 8 | 6,154 | 2,778 | 7,654 | 764 | 763 | 1,806 | |||
|
| 10 | 1,404 | 1,395 | 1,380 | 9 | 2,061 | 2,354 | 1,372 | 467 | 464 | 459 | |||
|
| 3 | 1,048 | 1,047 | 1,056 | 2 | 205 | 251 | 437 | 348 | 348 | 351 | |||
|
| 16 | 2,502 | 2,502 | 2,538 | 15 | 5,225 | 11,413 | 6,229 | 833 | 833 | 845 | |||
|
| 3 | 362 | 366 | 390 | 2 | 959 | 731 | 954 | 119 | 121 | 129 | |||
|
| 1 | 324 | 363 | 387 | – | – | – | – | 107 | 120 | 128 | |||
|
| 3 | 2,163 | 2,163 | 2,169 | 2 | 1,220 | 73 | 70 | 720 | 720 | 722 | |||
|
| 7 | 1,187 | 1,187 | 1,182 | 6 | 2,194 | 2,194 | 2,058 | 394 | 394 | 393 | |||
|
| 3 | 204 | 189 | 408 | 2 | 444 | 67 | 62 | 135 | |||||
1: Gene name abbreviation from Table 1
2: Number of exons and introns from the species with the most number
3: Sc. sugarcane
4: Truncated BAC end
5: haplotypes variant
Fig. 3Gene order is conserved at Class III MUSTANG locus in Poaceae. Schematic representation of orthologous region from Brachypodium, rice, sorghum, maize, sugarcane haplotype a and sugarcane haplotype b. Exons are indicated as colored boxes. Connectors mark orthologous genes. Grey boxes are inserted retrotransposons with their corresponding LTRs shown as black boxes. Gene names are indicated on the sugarcane haplotype a