A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Non-long-terminal repeat (non-LTR) retrotransposons represent a major class of retrotransposons, and transposons that move by target-primed reverse transcription lack LTRs characteristic of retroviruses and retroviral-like transposons. Yeast model systems in Candida albicans and Saccharomyces cerevisiae have been developed for the study of non-LTR retrotransposons. Non-LTR retrotransposons are divided into LINEs (long interspersed nuclear elements), SINEs (short interspersed nuclear elements), and SVA (SINE, VNTR, and Alu). LINE-1 elements have been described in fungi, and several families called Zorro elements have been detected from C. albicans. They are all members of L1 clades. Through a mechanism named target-primed reverse transcription (TPRT), LINEs translocate the new copy into the target site to initiate DNA synthesis primed by the 3' OH of the broken strand. In this article, we describe some advances in the research on structural features and origin of non-LTR retrotransposons in C. albicans, and discuss mechanisms underlying their reverse transcription and integration of the donor copy into the target site.
A number of abundant mobile genetic elements called retrotransposons reverse transcribe RNA to generate DNA for insertion into eukaryotic genomes. Non-long-terminal repeat (non-LTR) retrotransposons represent a major class of retrotransposons, and transposons that move by target-primed reverse transcription lack LTRs characteristic of retroviruses and retroviral-like transposons. Yeast model systems in Candida albicans and Saccharomyces cerevisiae have been developed for the study of non-LTR retrotransposons. Non-LTR retrotransposons are divided into LINEs (long interspersed nuclear elements), SINEs (short interspersed nuclear elements), and SVA (SINE, VNTR, and Alu). LINE-1 elements have been described in fungi, and several families called Zorro elements have been detected from C. albicans. They are all members of L1 clades. Through a mechanism named target-primed reverse transcription (TPRT), LINEs translocate the new copy into the target site to initiate DNA synthesis primed by the 3' OH of the broken strand. In this article, we describe some advances in the research on structural features and origin of non-LTR retrotransposons in C. albicans, and discuss mechanisms underlying their reverse transcription and integration of the donor copy into the target site.
Candida albicans is a major human fungal pathogen. With the spread of AIDS and the increased use of invasive surgical techniques, C. albicansinfections have become more of a problem in recent years.C. albicans is an asexual eukaryote. However, Saccharomyces cerevisiae can also reproduce sexually. Several laboratories have devoted considerable efforts over recent years toward understanding the genomic organization of C. albicans and how it varies among strains. Several results to date include the construction of a SfiI restriction map of the complete genome and a detailed physical map of chromosome 7.C. albicans is an important model system for studying pathogenic fungi and interactions between these species and their hosts. Several researchers, reported the existence of a large number of families of retrotransposons in C. albicans. Retrotransposons should be are transcribed into mRNA molecules and then be reverse transcribed into double stranded cDNA by their own reverse transcriptase before the potential mobility of retrotransposons can be approximately predicted by the presence of their mRNA transcript. Retrotransposons are a significant component of many eukaryote genomes; for instance, L1 retrotransposon comprises 15% of the human genome, and is known to cause mutations and promote genomic alterations. It is widespread in multicellular eukaryotes, and has an important effect on the structure of eukaryotic genomic and genetic evolution. Two types of transposons have been classified: transposons that encode a transposase required for transposition, and retroposons that use a retrotranscriptase encoded in their genome for retrotransposition. Transposons are found in a large variety of eukaryotes, and retrotransposons are part of different subfamilies of transposons. It is remarkable that retrotransposons are highly related to animal retroviruses with respect to gene organization and expression strategies.,Retrotransposons are divided into two major categories. First, long-terminal-repeat (LTR) retrotransposons have structures and mechanisms similar to those of vertebrate retroviruses. The integrated forms of LTR retrotransposons are flanked by LTR at the end of both sides. Second, non-long-terminal repeat (non-LTR) retrotransposons that move by target-primed reverse transcription (TPRT), which emerged from the biochemical work of Luan and Eickbush using the R2Bm model of Bombyx mori lacking the LTR retrotransposons characteristic of retroviruses and retroviral-like transposons. Non-LTR retrotransposons are divided into LINEs (long interspersed nuclear elements), SINEs (short interspersed nuclear elements), and SVA (SINE, VNTR, and Alu). Non-LTR retrotransposons also contain a reverse transcriptase domain. Unlike LTR retrotransposons, they have no LTR retrotransposons, either direct or indirect. This review summarizes the past and recent advances in the study of non-retrotransposon elements in C. albicans. Further delineation and comparison of non-LTR retrotransposons in C. albicans may provide interesting insights into more general aspects of the genome structure, function, and mechanism, though the integrated structure and mechanism remain unclear.
LINEs Elements Found in C. albicans
As we described above, LINEs (long interspersed nuclear elements) are one of the three classes of non-LTR retrotransposons that influence the evolution of eukaryote genomes. Complete mechanistic details of how LINEs duplicate and retrotranspose are unclear; however, a mechanism of the reverse transcription, termed target-primed reverse transcription (TPRT), has been reported. In history, these elements which are called LINEs in generally today have been referred to by a variety of names, including poly A retrotransposons, nonviral retroposons, or simple retroposons. The first indication is that these elements were catalyzed by the retrotransposition machinery of LTR retrotransposons or retroviruses. The rapid accumulation of more sequences eventually leads to the recovery of elements from different animals and plants with ORFs that encode intact RT domains. Phylogenetic comparison of these RT sequences with that of all other RT sequences revealed that they represented a distinct class of retrotransposons., It soon became known that RT domains of several elements could encode authentic RT DNA polymerase activity.- These elements are called LINEs retrotransposons today. The structure of LINEs is shown in Figure 1A. LINEs are 4–6 kbp in length and bounded by an untranslated region (UTR) at both ends of the element. LINEs are characterized by 3′ poly-A tails or 3′ tandem repeats as other non-retrotransposons and transcribed from a promoter within the first few nucleotides of the element. Active LINEs frequently result in 5′ truncated LINE copies. Most LINE elements are inactivated because of inefficiency of reverse transcription that is error-prone, so that ORFs encoding the transposition machinery are likely to be disabled by mutations, and not highly processive, so that 5′ truncation of the elements often occurs during transposition. A typical LINEs family consists of two open reading frames, ORF1 and ORF2 (Fig. 1A). ORF1 encodes a RNA-binding protein that associates with the LINEs transposition intermediate and nucleic acid chaperone activity,- both of which are important for LINEs activity., ORF2 encodes endonuclease, reverse transcriptase activity, zinc finger domain, and RNase H domains in some cases. Genomic LINEs, like human L1, are typically flanked by target site duplications (TSDs) as LTR-retrotransposons. ORF1 and ORF2 proteins assemble with LINEs RNA into a ribonucleoprotein (RNP) complex, which is presumably transported into the nucleus.,
Figure 1. Structure of non-LTR retrotransposons. (A) Structure of LINEs: LINEs family consists of two open reading frames, ORF1 and ORF2. ORF1 encodes a RNA-binding protein that associates with the LINE transposition intermediate. ORF2 encodes endonuclease (endo), reverse transcriptase (RT), zinc finger domain (zf), and RNase H domains in some cases (not shown). Arrows are TSDs. A represents poly-A tail. (B) Structure of Zorro3 in C. albicans: ORF1 contains two zinc knuckle (zk) motifs called type I ORF1, while human L1s contains a type II ORF1. Zorro3 has no TSDs, with poly-A tract flanking both ends. (C) Structure of Zorro1 in C. albicans. The end of 5′UTR cannot be identified. Unlike another non-LTR retrotransposons, neither a poly-A tract nor a 3′ tandem repeat is apparent at the 3′ end of Zorro1.
Figure 1. Structure of non-LTR retrotransposons. (A) Structure of LINEs: LINEs family consists of two open reading frames, ORF1 and ORF2. ORF1 encodes a RNA-binding protein that associates with the LINE transposition intermediate. ORF2 encodes endonuclease (endo), reverse transcriptase (RT), zinc finger domain (zf), and RNase H domains in some cases (not shown). Arrows are TSDs. A represents poly-A tail. (B) Structure of Zorro3 in C. albicans: ORF1 contains two zinc knuckle (zk) motifs called type I ORF1, while human L1s contains a type II ORF1. Zorro3 has no TSDs, with poly-A tract flanking both ends. (C) Structure of Zorro1 in C. albicans. The end of 5′UTR cannot be identified. Unlike another non-LTR retrotransposons, neither a poly-A tract nor a 3′ tandem repeat is apparent at the 3′ end of Zorro1.Multiple retrotransposons, consisting of non-LTR retrotransposons and LTR-retrotransposons, are flanked by 4–5 bp short direct repeats representing TSDs at 5′ and 3′ ends. For instance, 36% of the total S. cerevisiaeTy 1–4 elements were flanked by TSDs, and it is reported that Tca elements are also typically flanked by TSDs. Analyzing the sequences of all the perfect TSDs of Tca elements in C. albicans (Fig. 2A) and Ty elements in S. cerevisiae (Fig. 2B) to derive a 4–5 bp TSDs target site sequence, a strong bias for A and T: in the internal position 2 (72%), position 3 (76%), position 4 (78%), is shown in Figure 2B. In Figure 2A, a bias for A and G is found in position 1 (92%), a bias for T and C is shown in position 4 (71%). Recombination or mutation may result in the exchange of target site sequences between the elements.
Figure 2. TSDs target site sequence flanking by retrotransposons. The direction of TSDs is 5′ to 3′. (A) Four base pairs TSDs target site sequence analyzed by Tca families in Candida albicans. Sample capacity, n = 24. (B) Five base pairs TSDs target site sequence analyzed by Ty families in Saccharomyces cerevisiae. Sample capacity, n = 118.
Figure 2. TSDs target site sequence flanking by retrotransposons. The direction of TSDs is 5′ to 3′. (A) Four base pairs TSDs target site sequence analyzed by Tca families in Candida albicans. Sample capacity, n = 24. (B) Five base pairs TSDs target site sequence analyzed by Ty families in Saccharomyces cerevisiae. Sample capacity, n = 118.Many non-retrotransposons have been found in vertebrates, insects, and fungi. Human L1 element has affected both the size and complexity of the human genome, and varietal plant non-LTR retrotransposons have been reported, for instance, Cin4 in maize and BLIN (6.3 kbp in length) from barley. So far, phylogenetic analysis of non-LTR retrotransposons based on the reverse transcriptase domains has allowed for distinguishing 21 clades.- Three clades (Tad, L1, and CRE) of non-LTR retrotransposons are known in fungi. L1 clade elements were described from the genomes of C. albicans, a basidiomycete Microbotryum violaceum, and a glomeromycete Gigaspora. Unfortunately, S. cerevisiae appears to lack non-LTR retrotransposons.The existence of non-LTR retrotransposons in C. albicans has been reported., Subsequently, Goodwin et al. used a series of TBLASTN (protein query vs. nucleotide database) and BLASTN search to screen non-LTR retrotransposons in assembling 5 of the Stanford C. albicans sequence database, and identified only a small number of sequences corresponding to non-LTR retrotransposons. Only three of them appear to be full-length or nearly full-length: Zorro1, Zorro2, and Zorro3 with 25–40% amino acid identity. Zorro elements are widespread in C. albicans giving low copy numbers (data not shown by the original authors).The structures of the Zorro elements are shown in Figure 1B and C. The structure of Zorro2 is similar to that of Zorro1, except that the ORFs have suffered several nonsense frameshift mutations and highly conserved residues can be identified. The intact Zorro1 element (Fig. 1C) contains two ORFs, like many non-LTR retrotransposons. ORF1 containing two zinc-finger motifs potentially considered as putative nucleic acid-binding domains. ORF2 encodes a potential endonuclease (EN), a reverse transcriptase (RT), and a C-terminal. Upstream of ORF1 is a 5′ untranslated region (5′UTR), and the end of 5′UTR cannot be identified. Comprising with 5′UTR, downstream of ORF2 is a 3′ untranslated region (3′UTR). The end of this 3′UTR can be tentatively identified; however, neither a poly-A tract nor a 3′ tandem repeat is apparent at the 3′ end of Zorro1. The Zorro3 element is a structurally intact element., It contains ORF1 and ORF2, the first of which encodes two zinc-finger motifs (considered as putative nucleic acid-binding domains). ORF2 of Zorro3 encodes an endonuclease (EN), a reverse transcriptase (RT), and a C-terminal. Zorro3 is bounded by 5′UTR at upstream of ORF1 and 3′UTR at downstream of ORF2. The end of 5′UTR of Zorro3 is characterized by a series of A residues, and the end of 3′UTR can be identified as a short poly-A tract (itself bordered by poly-A). Interestingly, ORF2 of Zorro3 is separated from a feature-like stop codon that contains four in-phase stop codons. But it was reported that the gag and pol ORFs were separated by a UGA stop codon (gag-UGA-pol junction) in the C. albicans retrotransposon Tca2. Forbes and Gibson et al. demonstrated that the LTR promoter directed Tca2 pol protein expression and suggested that there was a non-canonical mechanism underlying gag UGA bypass in Tca2. Unfortunately, whether or not Zorro3’s ORF2 directly translates stop codon bypass remains unclear.L1-like non-LTR retrotransposons were described for all eukaryotic groups: Protista, Plantae, Fungi, and Metazoa.,,, The neighbor-joining (NJ) phylogenetic tree based on reverse transcriptase of non-LTR retrotransposons reveals the position of the Zorro elements in L1 non-LTR retrotransposons. Figure 3 shows that the phylogenetic tree in distinct families is inside L1 clade based on RT domain. Subsequently, three Zorro elements emerge as a monophyletic group shown in Figure 4. These assignments are well supported by bootstrap re-sampling. It is remarkable that the three families of Zorro elements have been evolving independently for a very long time, and that they are probably extremely ancient components of the Candida genome.
Figure 3. The neighbor-joining (NJ) phylogenetic tree based on RT amino acid sequences of L1 elements from fungi. The percentage of bootstrap support for major branches is indicated. The clade and families are shown on the right. The distance is the categories distance of the PROTDIST program of PHYLIP.
Figure 4. The neighbor-joining (NJ) phylogenetic tree based on RT sequences of Zorro elements from C. albicans. The percentage of bootstrap support for major branches is indicated. The clade and families are shown on the right. The distance is the categories distance of the PROTDIST program of PHYLIP.
Figure 3. The neighbor-joining (NJ) phylogenetic tree based on RT amino acid sequences of L1 elements from fungi. The percentage of bootstrap support for major branches is indicated. The clade and families are shown on the right. The distance is the categories distance of the PROTDIST program of PHYLIP.Figure 4. The neighbor-joining (NJ) phylogenetic tree based on RT sequences of Zorro elements from C. albicans. The percentage of bootstrap support for major branches is indicated. The clade and families are shown on the right. The distance is the categories distance of the PROTDIST program of PHYLIP.As we described below, Zorro elements in C. albicans are intact elements consisting of two ORFs, and ORF2 encodes an endonuclease (EN), a reverse transcriptase (RT), and a C-terminal. An UTR is bounded at both ends of Zorro elements in C. albicans. However, comparing with another L1 non-LTR retrotransposons, for instance, human L1, which is a classical structure, these are series of differences between Zorro elements and another L1 non-LTR retrotransposons (Fig. 1). Unlike human L1 elemnets, neither a poly-A tract nor a 3′ tandem repeat is apparent at the 3′ end of this copy of Zorro1 and Zorro2. However, Zorro3 has a short poly-A tract at the end of 3′UTR. Another distinguishing feature between Zorro3 and human L1 elements is that Zorro elements contain two zinc-finger motifs in ORF1 instead of the conserved mammalian C-terminal domain. ORF1 contains two zinc knuckle motifs called type I ORF1, while human L1s contains a type II ORF1. Another distinguishing feature is a 19-bp poly-A tract in the inter ORF region.
Translocation of LINEs Using Target-Primed Reverse Transcription
The process of how LINE elements retrotranspose is called target-primed reverse transcription (TPRT), which is a mode of duplication and transposition of non-LTR retrotransposons by spreading through reverse transcription of retrotransposon RNA primed by DNA at the target site. By extension, it is likely that this mechanism applies to numerous LINEs found in diverse lineages, like human L1. At first, a RNA binding protein with endonuclease (EN) activity encoded by ORF1, a multifunctional protein with reverse transcriptase (RT) activity encoded by ORF2, and the L1 RNA transcribed from its internal RNA polymerase II promoter located within the 5′UTR, to compose a compound called L1 ribonucleoprotein particle (RNP)., RNP enters the nucleus and nicks a chromosomal target site for integration. The sequence of events in translocation is shown in Figure 5. Recombination begins with nicking of the target DNA by the element-encoded EN that preferentially cleaves A/T rich sequences, with nicking occurring mainly at the TpA and flanking phosphodiesters. The target DNA 3′ OH exposed by endonuclease cleavage then acts as a primer for the synthesis of a new line DNA strand by reverse transcriptase using the line mRNA as a template. Thus a new line DNA strand is produced at the insertion site. And then, the nuclease makes a break in the opposite strand of chromosomal DNA a few nucleotides from the first. Template RNA is removed by RNase H allowing the new 3′ OH to prime synthesis of the second DNA strand and host repair enzymes to complete integration. Finally, a second DNA strand is synthesized, and the target DNA at each end is filled in to generate the TSDs., In addition, TPRT is mediated by the activities of both EN and RT domains; however, whether EN and RT are competitive inhibitors or non-competitive inhibitors remains unclear., In RNP, the poly G RNA could inhibit L1 EN activity. By DNA binding or action of L1 ORF1,, the poly G RNA may be removed from the L1 EN domain, and L1 EN activates to nick the chromosome at the target site. The nicked DNA moves to the RT active site and the newly generated 3′ OH primes reverse transcription and double-strand breaks (DSBs) generated in trans.
Figure 5. The mechanism of target-primed reverse transcription (TPRT).Transposition begins with the transcription of the LINE element (red) into RNA (blue) which encodes an RNA binding protein and a multifunctional protein with endonuclease and reverse transcriptase activity. These proteins (not shown) associate with the LINE RNA, and the endonuclease nicks the DNA at the target site, which contains a poly T tract, which base-pairs with the poly A sequence in the LINE RNA. The LINE RNA is then copied by the reverse transcriptase into a DNA copy (green), which is covalently attached to the target DNA. A second DNA strand is then synthesized on the template of the DNA copy, and the target DNA at each end is filled in to generate the TSDs that flank these elements.
Figure 5. The mechanism of target-primed reverse transcription (TPRT).Transposition begins with the transcription of the LINE element (red) into RNA (blue) which encodes an RNA binding protein and a multifunctional protein with endonuclease and reverse transcriptase activity. These proteins (not shown) associate with the LINE RNA, and the endonuclease nicks the DNA at the target site, which contains a poly T tract, which base-pairs with the poly A sequence in the LINE RNA. The LINE RNA is then copied by the reverse transcriptase into a DNA copy (green), which is covalently attached to the target DNA. A second DNA strand is then synthesized on the template of the DNA copy, and the target DNA at each end is filled in to generate the TSDs that flank these elements.Goodwin et al. developed a yeast model system using the Zorro3 element from C. albicans for the study of non-LTR retrotransposons. This system called retrotransposition assay for Zorro3 is outlined in Figure 6, in which the ORF of the C. albicans URA3 gene and its promoter sequence, with the ORF disrupted by an antisense intron inserted into 3′UTR of Zorro3 element, as the indicator gene. When Zorro3 is transcribing to give a full-length mRNA, and then the intron would be removed by splicing. Thus, retrotransposition events can be detected by the appearance of URA3+ colonies on the appropriate selective media. After retrotransposition assay, 30 independent transposed copies were amplified to reveal not only the 3′ and 5′ ends but their 3′ and 5′ flanking sequences of retrotransposed Zorro3 elements. Several findings from these sequences indicate that the target site of Zorro3 elements which is inserted very close to coding regions specifically integrated at poly-A sequences, and there seemed to be a bias toward promoter regions. In addition, Goodwin et al. suggested that the transposable events in Zorro3 of C. albicans are similar to TPRT in mammalian cells.
Figure 6. An assay for Zorro3 retrotransposition. The cloned Zorro3 element has a retrotransposition indicator gene (URA3 promoter, and URA3 ORF, disrupted by an antisense intron) inserted into its 3′ UTR. Reverse transcription and integration of the spliced RNA results in a functional and stably integrated URA3 gene and confers a URA3+ phenotype on the host cell.
Figure 6. An assay for Zorro3 retrotransposition. The cloned Zorro3 element has a retrotransposition indicator gene (URA3 promoter, and URA3 ORF, disrupted by an antisense intron) inserted into its 3′ UTR. Reverse transcription and integration of the spliced RNA results in a functional and stably integrated URA3 gene and confers a URA3+ phenotype on the host cell.As we described above, non-LTR retrotransposons have never been found in S. cerevisiae that has no endogenous L1 homologs or remnants. However, Poulter et al. established a model system of S. cerevisiae called retrotransposition assay for scZorro3 (Zorro3 in S. cerevisiae named ScZorro3), which has a similar process of retrotransposition assay for Zorro3 in C. albicans except using mHIS3AI as an indicator gene to confirm Zorro3 retrotransposition, and found that S. cerevisiae unexpectedly retained the basal host machinery required for L1 retrotransposition. Through this model system called scZorro3 that recapitulates the non-LTR retrotransposition process in S. cerevisiae, they found several differences between Zorro3 of C. albicans retrotransposition and scZorro3 of S. cerevisiae retrotransposition. For instance, the reverse transcription complex searches for sequences with homology to the minus strand to enable the template to jump during minus strand synthesis. In Zorro3 of C. albicans retrotransposition, this search is largely restricted to regions around the target site. In scZorro3 of S. cerevisiae retrotransposition, this search space is relaxed, and template jumping occurs in other RNAs/DNAs at a higher frequency. In addition, scZorro3 can generate a circular and episomal retrotransposition products in S. cerevisiae. Previously, circular products derived from retroviruses or LTR-retrotransposons were observed.- Han and Shao suggested that these products are likely to be formed via a variation of TPRT. For simplicity, bottom chromosome strand nick at first, and then LINE mRNA annealing, minus strand synthesis finally. Subsequently, the top strand chromosomal nick and the template jump to the top strand and then re-cleave the top and bottom strands to release the retrotransposition intermediate. These episomal products may represent an unexpected source for de novo retrotransposition. Yeast model systems of C. albicans and S. cerevisiae have been principally described, which have been developed for studying Zorro family elements and TPRT emerged by biochemical experiments with human L1 and Zorro3 retrotransposon. However, complete mechanistic details of how Zorro families of LINE elements retrotranspose remain unclear.TPRT is a process spreading through reverse transcription of retrotransposon RNA primed by DNA, effectively welding the new copy into the target site as it is made. It is a complicate process that LTR retrotransposons can move from place to place in a genome by reverse transcription of an RNA transposition mediated in cells (in this study, we do not describe in details). Distinguishing features of TPRT (as compared with the process that LTR retrotransposons transpose) are the RNP, consisting of L1 RNA, proteins encoded by ORF1 and ORF2, enters the nucleus and nicks a chromosomal target site as the first step; however, no compound similar to RNP have been found in the process that LTR retrotransposons transpose. The target DNA 3′ OH acts as a primer for the synthesis of a new line DNA strand in TPRT, whereas a tRNA base-paired to a sequence near 5′ end of the genomic RNA, as a primer to anneals to binding site on retroviral RNA for the synthesis of minus strand DNA; retroviral RNA ends in direct repeats (R), and results that a linear double-stranded DNA with an LTR at each end.
Non-LTR Retrotransposons Play an Important Role in Evolutionary Dynamics of C. albicans
The evolutionary history of a particular or related species, the population structure, ecological aspects, and the mating mode could affect the diversity of non-LTR retrotransposons and copy numbers., For instance, L1 elements play an important role in the evolution of the structure and activity of the remainder of the genome by providing dispersed sites of sequence similarity at which recombination can occur, by inserting into genes altering their structure and/or regulation, and by carrying flanking sequences with them during transposition (L1-mediated sequence transduction). In addition, there are other processes that could affect the copy number and diversity of non-LTR retrotransposons in fungi: stochastic loss of non-LTR retrotransposons, burst of retrotransposition, the limitation of copy number increase by natural selection which removes deleterious insertions, horizontal transfer, passive and active inactivation of repetitive sequences, and self-regulation of transposition.,, Low copy numbers of non-LTR retrotransposons could cause a loss of retrotransposons-like elements as a result of genetic drift, especially when the population is small and non-LTR retrotransposons degenerate copies. It is reported that the presence of retrotransposons and their large copy numbers can cause mutations and genomic rearrangements. These discoveries indicate that non-LTR retrotransposons and the transposition play an important role in evolutionary dynamics of C. albicans.The inactivation of repeated sequences is a very important factor, which leads to the shifts in diversity and copy number of non-LTR retrotransposons. For instance, non-LTR retrotransoposons represented only by degenerate copies in Drosophila could lose these elements as a result of genetic drift, especially if the population is small. In bacteria, Tn retrotransposons are likely to be principal players in the formation of tetracycline resistance by spreading drug resistance gene during genetic transfer. In addition, the relationship between resistance and virulence with reverse transposition of retrotransposons is rarely reported, but in our original research, the transposition of Zorro2 and Zorro3 in strains that are resistant to miconazole and the strains show low virulence in a systemic murinecandidiasis model, have been observed (unpublished).
SINEs and SVA Elements are Rarely Reported in C. albicans
We have summarized several past and recent advances in the study of LINEs including Zorro families in C. albicans. Unfortunately, little has been known about the distribution and properties of SINES and SVA elements in C. albicans as compared with LINEs elements. However, much has been disclosed about the biology and function of SINEs and SVA elements since these elements were discovered.SINEs are genomic sequences derived from tRNA genes or 7SL RNA, and they spread non-autonomously in the genome by TPRT mediated by LINE-encoded recombination proteins. The first described SINEs were mouseB1 and B2, and human Alu. Today these elements are also found existing in other organisms, including fungi, insects, birds, and plants. SINEs are similar to LINEs in that both move via TPRT. SINE elements are much shorter (100–300 bp) than LINEs. A typical SINE consists of three parts: 5′ ends of all SINEs families originating from one of the three types of short pol III transcripts: tRNAs, 5S rRNA, or 7SL RNA. The 3′ ends consist of poly A tails flanked by TSDs. The internal domain of the SINEs family is usually unique and has no coding capacity. To date, 4 such domains have been described: CORE domain in vertebrates, V-domain in fish, Deu-domain in deuterostomes, and Ceph-domain in cephalopods.SVA elements for another group of non-autonomous retroelements in humans and non-human primate, and are present at a relatively low copy number of a few thousand per genome. The SVA elements were originally named SINE-R. It is named “SVA” after its main components (SINE, VNTR, and Alu) by Shen et al., who identified the SINE-R element together with a stretch of sequence that shares sequence similarity with Alu sequences. The 3′ ends of full-length SVA have the human endogenous retrovirus HERV-K, including the LTR and a 3′ poly A tails, and TSDs flanking both ends of SVA elements. A (CCCTCT)n hexamer simple repeat region that is located at the 5′ end. The internal domain is composed of an Alu-like sequence, a VNTR (variable number of tandem repeats) region, and a SINE region (SINE-R) about 490 bp. It is proposed that SVA elements are non-autonomous retrotransposons that are mobilized by L1 encoded proteins in trans2.