Literature DB >> 25101670

The structure and retrotransposition mechanism of LTR-retrotransposons in the asexual yeast Candida albicans.

Lulu Zhang1, Lan Yan1, Jingchen Jiang2, Yan Wang1, Yuanying Jiang1, Tianhua Yan2, Yongbing Cao1.   

Abstract

Retrotransposons constitute a major part of the genome in a number of eukaryotes. Long-terminal repeat (LTR) retrotransposons are one type of the retrotransposons. Candida albicans have 34 distinct LTR-retrotransposon families. They respectively belong to the Ty1/copia and Ty3/gypsy groups which have been extensively studied in the model yeast Saccharomyces cerevisiae. LTR-retrotransposons carry two LTRs flanking a long internal protein-coding domain, open reading frames. LTR-retrotransposons use RNA as intermediate to synthesize double-stranded DNA copies. In this article, we describe the structure feature, retrotransposition mechanism and the influence on organism diversity of LTR retrotransposons in C. albicans. We also discuss the relationship between pathogenicity and LTR retrotransposons in C. albicans.

Entities:  

Keywords:  Candida albicans; LTR-retrotransposons; Tca; retrotransposition mechanism; structure feature

Mesh:

Substances:

Year:  2014        PMID: 25101670      PMCID: PMC4139406          DOI: 10.4161/viru.32180

Source DB:  PubMed          Journal:  Virulence        ISSN: 2150-5594            Impact factor:   5.882


Introduction

As an opportunistic fungal pathogen of humans, Candida albicans usually cause fungal infections such as thrush, vaginitis, and life-threatening bloodborne candidiasis. Of the 16 Mb haploid genome of C. albicans, 14.9 Mb has been assembled by Stanford University (Stanford DNA Sequencing and Technology Center), which facilitates understanding of the genetic structure, pathogenicity, adaptability, and evolution in C. albicans. Retrotransposons are mobile genetic elements capable of independent transposition through RNA intermediates. Retrotransposons are widespread transposable elements in eukaryotes, and constitute a major part of eukaryotic genomes in many cases.- For instance, retrotransposons constitute 42% of human genomes and 75% of maize genomes. Two primary subclasses of retrotransposons are present in eukaryotic cells: long-terminal repeat (LTR) retrotransposons and non LTR retrotransposons (LINEs, long interspersed nuclear elements, and SINEs, short interspersed nuclear elements). LTR retrotransposons in the yeast Saccharomyces cerevisiae have been extensively studied. According to the sequence similarity of reverse transcriptases (RTs), and the subunits of pol genes, there are two different types of LTR retrotransposons. One is the Ty1/copia type, where the subunits are protease (PR), integrase (IN), reverse transcriptase (RT), and RNase H (RH) in order; the other is the Ty3/gypsy type, where the subunits are PR, RT, RH, and IN (Fig. 1). In S. cerevisiae, six classes of retrotransposons, Ty1–5 and Ty3p, belong to 2 different types, respectively: Ty1/copia components (Ty1, Ty2, Ty4, and Ty5), and Ty3/gypsy components (Ty3 and Ty3p). In these retrotransposons families, Ty1, Ty2, Ty3, and Ty4 have transposition activity in S. cerevisiae as known. This review summarizes the structure, mechanism, and influence on organism diversity of LTR-retrotransposons found in C. albicans.

Figure 1. The genomic organization of different types of LTR-retrotransposon families present in C. albicans. RT, reverse transcriptase DNA polymerase domain; RH, reverse transcriptase RNase H domain; PR, proteinase; IN, integrase.

Figure 1. The genomic organization of different types of LTR-retrotransposon families present in C. albicans. RT, reverse transcriptase DNA polymerase domain; RH, reverse transcriptase RNase H domain; PR, proteinase; IN, integrase.

LTR-Retrotransposon Families in C. albicans

Compared with the 6 families present in S. cerevisiae, 34 distinct LTR-retrotransposon families pertaining to the Ty1/copia and Ty3/gypsy groups have been found in C. albicans. They are named by the Greek alphabet letters (α, β,… iota.), archaic Greek letters (sampi, san, etc.), and phonetically similar names of New Zealand birds (moa, tara, weka, etc.). According to the internal regions, the first 16 LTRs have been defined as Tca1–Tca16 (Table 1).
Table 1.

Properties of C. albicans retrotransposon LTR families

LTRLength (bp)TSD (bp)Associated internal regionCopy numberaAccession numberGag/pol ORFsb
alpha3885Tca110 (5–10)M94628No
beta3955Tca810 (6–8)Y08494Partical
gamma2805Tca29 (5–10)Y08494Intact
kappa2805Tca620 (10–15)AF069450Frag.s
zeta5085Tca719 (10–15)AF074943Partial
san3815Tca45 (1–4)AF074943Intact
omega6855Tca53 (0–5)AF093417Intact
nu2774Tca311AF119344Partial
psi4705Tca930AF119344No
chi1925Tca1011AF118059No
eta4705Tca1113AF118059N.D.
whio3485Tca128AF180289N.D.
moa5075Tca138AF180291No
lambda5125Tca144AF180284N.D.
kahu5315Tca1517AF192278Partial
huia1275Tca166AF180285Frag.s
omicron2684-9--
rho2754-13--
pi2804/5-17--
Iota2514-13--
sampia3245-9--
theta3665-7--
upsilon2645-6--
koppa2085-10--
epsilon4805-9--
phi1945-14--
episemon5185-4--
mu7805-4--
xi3875-5--
weka1655/7-9--
tui1995-19--
titi3365-3--
tara2855-9--
toroa2825-11--

a Numbers in parentheses indicate the copy number in a variety of strains as estimated by Southern blotting. bIntact, long ORFs present containing all of the motifs characteristic of full-length, functional retrotransposons; Partial, long ORFs are present but no full-length ones discovered yet; Frag.s, no long ORFs, but short regions of homology to other retroelement ORFs are present; No, no long ORFs present nor any homology to other retroelements in databases.

a Numbers in parentheses indicate the copy number in a variety of strains as estimated by Southern blotting. bIntact, long ORFs present containing all of the motifs characteristic of full-length, functional retrotransposons; Partial, long ORFs are present but no full-length ones discovered yet; Frag.s, no long ORFs, but short regions of homology to other retroelement ORFs are present; No, no long ORFs present nor any homology to other retroelements in databases. By comparing the LTR retrotransposons families in C. albicans and S. cerevisiae, we can find a great difference. The different number of families illustrates that the two species experience disparity in retrotransposons element evolution. LTR retrotransposons contain two long-terminal repeats (LTRs), as shown in Figure 1, at their ends, typically 250–600 bp in length, flanking a 5–7 kb long internal protein-coding domain. Between the two LTRs are two open reading frames (ORFs): gag and pol. The gag ORF encodes the structural proteins that make up a virus like particle (VLP). The pol ORF encodes the enzymes required for reverse transcription and integration. The enzymes are PR, IN, RT, and RH. The LTRs of retrotransposons have the transcription termination signals and promoter, and are divided into three functional areas: U3, R, and U5. U3 contains an enhancer and promoter region at the 3′ end of transcript; R contains both the start and termination sites for transcription; U5 only exists in the transcript of 5′ terminal. The process of transcription is from the left LTR U3/R boundary to the right LTR R/U5 boundary to produce a RNA molecule with both ends of R region (Fig. 2).

Figure 2. LTRs can be subdivided into three regions: U3, R, and U5. U3 contains the enhancer and promoter sequences that drive viral transcription. R domain encodes the 5′ capping sequences (5′ cap) and the polyA (pA) signal.

Figure 2. LTRs can be subdivided into three regions: U3, R, and U5. U3 contains the enhancer and promoter sequences that drive viral transcription. R domain encodes the 5′ capping sequences (5′ cap) and the polyA (pA) signal. Thirty-four LTR retrotransposon families in C. albicans share the following characteristics: (1) the length of the retrotransposon LTRs ranges from 127 to 780 bp with a mean length of 359 bp; (2) most elements have the terminal dinucleotides 5′-TG…CA-3′ and these dinucleotides tend to form part of larger terminal inverted repeats; (3) the total cope number of LTRs is 355; (4) both sides of more than half of the full-length LTR are short direct repeats representing target-site duplications (TSDs), most of which are 5 bp in length showing a tendency toward a purine in the first position and a pyrimidine in the last (data not shown); and (5) these retrotransposon families in C. albicans generally vary widely in coding capacity as Ty families in S. cerevisiae and have a low copy number. Most families have the similar copy number to Ty5 (7 copies) LTR in S. cerevisiae, but considerably lower than that of Ty1/Ty2 (251 copies), Ty3 (41 copies), and Ty4 (32 copies). Most LTR families in C. albicans only contain solo LTRs or LTR remnants in which the LTR-retrotransposon sequence has been retained and the intervening sequence has been lost. So far, just 3 intact retrotransposons have been founded. They are (1) Tca2, an active retrotransposon element which has a stop codon between two ORFs, (2) Tca5, a Ty5 like retrotransposon, and (3) Tca4, which belongs to Ty1/copia group and resembles to Tca2. Several families of highly degenerate elements appear to be still capable of transposition, presumably via transactivation. Full-length retrotransposons are usually lost when two LTRs recombine with each other. This results in isolated solo LTRs at the original sites. A survey shows that 85% of Ty insertions in S. cerevisiae are solo LTRs or LTR fragments. In C. albicans, for retrotransposon element, β, there are some 395 bp solo LTRs. Both full-length retrotransposons and solo LTRs have short (4 or 5 bp) direct repeats on either end. Most LTR retrotransposons generate 5 bp short direct repeats representing TSDs, as the Ty elements in S. cerevisiae. For instance, 36% of the total S. cerevisiae Ty1–4 elements are flanked by TSDs. Except for Ty3/gypsy elements (Ty3) that are flanked by 4 bp TSDs, Ty5 elements have no TSDs. The condition of Ty5 suggests frequent recombination between these elements at the telomeres. Recombination or mutation may have resulted in exchange of target site sequences between elements. A similar preference is apparent for the elements in C. albicans: 14 full-length LTRs are flanked by 4 bp direct repeats, like element pi, weka has a confirmed 7 bp TSD. With analysis of the 5 bp TSDs in C. albicans and S. cerevisiae, base bias can be found: purine highly is enriched in the first site (55%); and pyrimidine is enriched in the last site (53%) and there is a strong bias for A and T: in the internal position 2 (72%), position 3 (76%), and position 4 (78%) (data not shown). LTR retrotransposons have a potential tRNA primer-binding site. At the downstream from the left of 5′ LTR is a short polypurine sequence (8–49 nt), termed as the primer-binding sites (PBSs) (Table 1), which is a 10–20 nucleotide sequence that can base-pair with cytoplasmic tRNA molecule partly. Fourteen different families of LTRs in C. albicans have connection with PBSs. All PBSs share a common feature, the LTR terminus starting from 5′-TGG-3′ is base-pair with CCA sequence at the 3′ end of all tRNAs. Goodwin et al. had found the 3′ end of tRNA could be used as a primer to determine whether two LTRs were two independent insertions or a single retrotransposon. Furthermore, if the two LTRs belong to a family, they repeat in a direct orientation on a contig. According to the above mentioned common features, 14 families of LTRs in C. albicans have been found sharing the relative PBSs. There are 7 different classes of these PBSs (Fig. 3): some have an extensive region which is homologous to an internal region of tRNAArg(UCU) (e.g., gamma), some have a short region which is homologous to the same tRNA (e.g., whio), some are homologous to a tRNAiMet internal region (e.g., zeta), and some are homologous to the 3′ ends of various tRNAs such as, Ala, Gln, Ile, and iMet tRNAs. In some instances, the PBS is found to be located downstream of the LTR just 2–10 bp. A short gap between the LTR and the PBS is a common feature of gypsy-class retrotransposons.

Figure 3. The diversity of PBSs in C. albicans retrotransposons. The names of the retrotransposons and the associated LTRs are shown on the left. LTR sequences are underlined. The GenBank accession numbers of the tRNAs are as follows: tRNAArg, AF041470; tRNAiMet, AF069449; tRNAIle, Y08492; tRNAGln, AF180282; and tRNAAla, Y08493.

Figure 3. The diversity of PBSs in C. albicans retrotransposons. The names of the retrotransposons and the associated LTRs are shown on the left. LTR sequences are underlined. The GenBank accession numbers of the tRNAs are as follows: tRNAArg, AF041470; tRNAiMet, AF069449; tRNAIle, Y08492; tRNAGln, AF180282; and tRNAAla, Y08493.

Mechanisms of Retrotransposition

A general phenomenon that resides in a number of eukaryotes is that adverse living conditions can activate retrotransposons. They can move from place to place in a genome by reverse transcription of a RNA transposition intermediate to enable the organism to adapt to the environment. It has been demonstrated that the retrotransposons generate new copies by reverse transcription of their RNA transcripts. Similar to retroviruses in function, the LTRs of retrotransposons play a border role in complex reverse transcription procedure. The two DNA strands are synthesized from opposite directions: tRNA binds to the site near the 3′ end of left LTR when synthesizing the first strand, while the polypurine track (PPT), a short purine-rich sequence immediately upstream of the right LTR, as a primer binds to the upstream of the right LTR when synthesizing the second strand. Retrotransposons must synthesize mRNA at first, which can be translated into proteins related to replication, also acts as a replication template. At each end of the template, mRNA has a short repeat sequence (R, green) and U region (Fig. 4). Primer tRNA can base-pair with PBS sequence. When tRNA anneals to PBS, minus strand DNA starts synthesizing under the catalysis of reverse transcriptase. DNA synthesized can reach the U5 region (red) at the 5′ end of the template and continue as the last few bases. The newly synthesized DNA can base-pair with the repeat sequence at the 3′ end of the genomic RNA. Then, new strand DNA base-pair with 3′ end of the RNA template starting with U3 sequence, and continues until R and U5 at the 5′ end of RNA template. After that the RNase H degrades the RNA template leaving fragments at the poly-purine tract to prime second strand DNA synthesis. Using the new strand DNA as a template reverse transcriptase synthesizes another strand DNA until U5 and R. After that, the DNA fragment transfers to another end of new template. Synthesis then proceeds in both directions to give double-stranded DNA and LTR, making up of U5, R and U3, at each end. Integrase inserts this into chromosomal DNA, and transcription initiating in one LTR and terminating in the other generates genomic RNA with terminal repeats.

Figure 4. Mechanisms of retrotranspositions. The RNA transposition intermediate (brown) of an LTR retrotransposon provide plus strand RNA. The RNA has a tRNA (blue) base-paired sequence, the PBS, near its 5′ end (1). Primer tRNA anneals to binding site on RNA (2). The first sequences to be copied are the unique sequence at the 5′ end of the RNA (U5, red) and a short repeat sequence (R, green) present at both ends of the RNA. In this step, single-stranded DNA R region pairs with 3′ terminus in the first jump (3). Reverse transcriptase starts synthesis minus strand DNA (4), starting with U3 (purple) adjacent to R, and continuing until U5 and R are copied a second time. tRNA primer is removed. The RNA template is degraded by RNase H leaving a fragment at the poly-purine tract to prime second strand DNA synthesis (5). In the second jump, reverse transcriptase transfers to the other end of minus strand (7). Synthesis then proceeds (8). Integrase inserts this into chromosomal DNA, and transcription initiating in one LTR and terminating in the other generates genomic RNA with terminal repeats (9).

Figure 4. Mechanisms of retrotranspositions. The RNA transposition intermediate (brown) of an LTR retrotransposon provide plus strand RNA. The RNA has a tRNA (blue) base-paired sequence, the PBS, near its 5′ end (1). Primer tRNA anneals to binding site on RNA (2). The first sequences to be copied are the unique sequence at the 5′ end of the RNA (U5, red) and a short repeat sequence (R, green) present at both ends of the RNA. In this step, single-stranded DNA R region pairs with 3′ terminus in the first jump (3). Reverse transcriptase starts synthesis minus strand DNA (4), starting with U3 (purple) adjacent to R, and continuing until U5 and R are copied a second time. tRNA primer is removed. The RNA template is degraded by RNase H leaving a fragment at the poly-purine tract to prime second strand DNA synthesis (5). In the second jump, reverse transcriptase transfers to the other end of minus strand (7). Synthesis then proceeds (8). Integrase inserts this into chromosomal DNA, and transcription initiating in one LTR and terminating in the other generates genomic RNA with terminal repeats (9).

Specific Description

Of the 34 distinct LTR-retrotransposon families in C. albicans, 16 families still retain some internal sequences which have very different coding capacity (Tca1–Tca16). Tca2, Tca4, and Tca5 are intact and possess all the characteristic features of functional retrotransposons. Tca3 and Tca8 have no full-length elements, but still retain long and continuous ORFs like other retroelements; Tca6 still contains ORF fragments in the internal region; Tca9 and Tca13 have 5 bp direct repeats at each end, intact PBSs and PPTs and identical LTRs; Tca10 is a compound element, whose LTR shares 99.5% identity and is flanked by a 5 bp direct repeat. Some extensive ORFs, the PBS and PPT regions make up the ~2 kb long internal region. The other 18 LTR families do not have the sequences similar to retrotransposons internal regions. These retrotransposon families may remain solo LTRs and LTR fragments, resulting in some of these LTRs escaping from detection. Many researchers have studied a few LTR retrotransposons in C. albicans. Now we summarize the characteristics of Tca1, Tca2, Tca3, Tca4, and Tca5.

Tca1, an Inactive Element without ORF

Tca1, as the first C. albicans retroelement, is of 5614 bp and has 388 bp LTRs. No significant ORF sequence presents in the internal region, suggesting that Tca1 is a degenerate and inactive element. It has been demonstrated that Tca1 has two loci in the strain SC5314. One is referred to as Tca1–1, and the other is designated as Tca1–2. The Tca1 elements of the two loci have more than 99% similarity in sequence and almost the same structure.Tca1–1 was isolated from a lambda phage genomic library clone called CJY-3. This clone contains 3.25 kb and 4.8 kb long fragments (Fig. 5). Tca1–1 lacks any retrotransposon-like ORF. This differentiates Tca1–1 from Tca1–2. Tca1–2 was found in an approximately 6 kb clone, called CJY-4. It is highly similar to the insert of CJY-3, besides minor restriction site polymorphisms (Fig. 6).

Figure 5. Southern blot analysis of DNA from lambda clones CJY-3 and CJY-4. Genomic DNA from strain SC5314 or purified DNA from lambda clones CJY-3 and CJY-4 was digested with EcoRI and hybridized with the α-element probe.

Figure 6. Restriction site maps of the genomic inserts from lambda clones CJY-3 and CJY-4 containing Tca1–1 and Tca1–2, respectively. The locations of elements (LTRs) are indicated by the boxed regions. Also indicated are the plus- and minus-strand primer binding sites (1PBS and 2PBS), the 5 bp direct repeats flanking the elements, the EcoRI fragments associated with each locus, and the 1.8 kb HindIII-EcoRI and 2.2kb EcoRV fragments that are used together as a hybridization probe of the internal region. The direction of transcription of Tca1 is from left to right.

Figure 5. Southern blot analysis of DNA from lambda clones CJY-3 and CJY-4. Genomic DNA from strain SC5314 or purified DNA from lambda clones CJY-3 and CJY-4 was digested with EcoRI and hybridized with the α-element probe. Figure 6. Restriction site maps of the genomic inserts from lambda clones CJY-3 and CJY-4 containing Tca1–1 and Tca1–2, respectively. The locations of elements (LTRs) are indicated by the boxed regions. Also indicated are the plus- and minus-strand primer binding sites (1PBS and 2PBS), the 5 bp direct repeats flanking the elements, the EcoRI fragments associated with each locus, and the 1.8 kb HindIII-EcoRI and 2.2kb EcoRV fragments that are used together as a hybridization probe of the internal region. The direction of transcription of Tca1 is from left to right. Nucleotide sequence analysis suggests that the insert of CJY-4 is flanked by identical direct repeat sequences, which was 99% identical to the LTRs of Tca1–1. Each LTR of Tca1–2 is flanked by 6 bp inverted repeat sequence (TGTTCG) like LTRs of other retrotransposons. The sequence in the front of 5′ LTR and behind of 3′ LTR of Tca1–2 is ATTGC. This suggests that the sequence is a copy of integration target site. The sequence is different from the one of Tca1–1, TTGGT. Sequence analysis of entire Tca1–2 has not found any extended ORFs or potential splice sites. This suggests that Tca1–2 has been degenerated highly. Compared with about 1400 bp of insertion region of Tca1–1, Tca1–2 is just differing in less than 1% (data not shown). Previous work indicated that the transcription of Tca1–1 is affected by growth temperature. Northern blot analysis suggested that the transcription of Tca1 is also influenced by temperature in strains lacking Tca1–1 or Tca1–2. Thus, Tca1 elements are temperature-regulated retrotransopsons. The expression level of Tca1 at 25 °C is higher than that at 37 °C. It has been reported that the virulence determinants of a lot of pathogenic bacteria (Shigella app, Yersinia app, Vibrio cholerae, Bordetella pertussis, and Staphylococcus aureus) were regulated by temperature. Antley et al. found C. albicans cells grown at 25 °C are more virulent than those grown at 37 °C. As Ty elements can regulate adjacent gene transcription the Tca1 may control adjacent genes temperature-dependently. The expression of Tca1 could be similar to the invasion gene in Yersinia pseudotuberculosis. Southern blot analysis suggests that deletion of Tca1 elements does not result in obvious growth defects and the recombination between LTRs occurs readily. Furthermore, silencing a single element would lead to complete loss at that locus, suggesting that the Tca1 elements at both loci are hemizygous. It is known that elements may be abundant in natural populations if they provide some advantage to the microorganism. However, there is no clear reason to maintain these degenerate elements. Neither copy of Tca1 choose silent chromatin as preferential integration region as the Ty5 retrotransposon in S. cerevisiae.- Although nearly 40% of the strains lack Tca1 elements, solo LTRs in the genome suggest that they have existed in these strains for a time, and moreover, it is not essential to maintain Tca1 in natural populations.

Tca2 is an Abundant, Extrachromosomal DNA Molecule with Intact ORF

Tca2 is widespread in C. albicans and was first identified in the strain hOG1042, known as pCal. The pCal was a distinct band when the uncut C. albicans DNA was examined on an agarose gel, so it is an extrachromosomal retrotransposon.Tca2 belongs to Ty1/copia-type retrotransposon, and was originally identified as a linear double-stranded DNA molecule. It is the most abundant DNA copy (about 50 copies per cell).Tca2 is 6426 bp long and has 280 bp LTRs flanked by a six-nucleotide imperfect inverted repeat, TGTTGG…CCATCA. The internal domain usually has two long gag and pol-like ORFs. ORF1 and ORF2 are separated by a stop-codon (UGA). The structure feature is similar to mammalian retroviruses but is unique in LTR retrotransposons. However, the pol gene can translate into protein. Forbes and coworkers demonstrated that the LTR promoter is heat-shock induced. Like pol proteins, the expression of Tca2 pol protein is regulated by LTR promoter. There is a novel stop codon bypass mechanism to made gag-pol RNA translate pol proteins. The mechanism is read-through suppression of the UAG codon. In Tca2, the sequences between the gag and pol ORFs are an 8 bp purine-rich sequence, AAAACAGG, and a potential pseudoknot. They are tightly followed the UGA stop codon, which is essential to suppression. Tca2 has retrotranspositional activity, and is strongly dependent on the growth temperature. There is substantially more extrachromosomal DNA at 37 °C than that at 27 °C. In addition, northern blot analysis on the transcriptional activity of Tca2 showed that the most prominent band is full-length transcripts. Interestingly, the number of full-length transcripts at 37 °C is more than that at 27 °C, suggesting that Tca2 has transposition activity and is in favor of higher temperature (37 °C). There is also a correlation between the level of Tca2 RNA and extrachromosomal DNA, suggesting that the synthesis of extrachromosomal DNA needs Tca2 RNA as intermediate. The level of Tca2 RNA is relative to the situation of Tca2 elements. For instance, the levels of Tca2 RNA are different at various sites in genome. In addition, the levels of RNA are disparate in different strains. The level of transcription is probably affected by strains, sequences and genomic context. Thus, we speculate that there may be close relationship between the Tca2 expression and virulence in the process of C. albicans evolution.

Tca3 has a Partial Coding Region

Tca3 is a Ty3/gypsy-like retrotransposon widely existing in C. albicans. It has 277bp LTRs and contains a little part of a coding region. There are two major data sets of genome sequence have been determined for C. albicans strain SC5314. The first is Assembly 6 and another is Assembly 19. Assembly 6 is a high coverage sequence (~10-fold) in the genome. The database has two composites called as Tca3Δ, sharing ~99% identity with each other. Both of them are ~4.55 kb, having an internal region flanked by LTRs (Fig. 7A). The left LTR of Tca3Δ6–2328 has the same sequence as the right LTR of Tca3Δ6–1874, higher than its right LTR (99.4%); the right LTR of Tca3Δ6–2328 has 99.7% similarity to the left LTR of Tca3Δ6–1874 than its left LTR (99.0%). Figure 7A also shows that in the internal regions of both Tca3Δ elements there are extensive ORFs. The internal region of Tca3Δ6–2328 contains one ORF almost whole length of itself, and in Tca3Δ6–1874 a frameshift ORF locate in the internal region. These ORFs contain the RT, RH, and IN domains like Ty3/gypsy retrotransposons in sequence.

Figure 7. Structures of Tca3 elements. (A) The two Tca3 elements in Assembly 6 of the Stanford C. albicans sequencing project database. (B) A full-length Tca3 element from C. albicans strain ATCC10261. In all panels shaded boxes represent the ORFs of the elements. The locations of the conserved domains are indicated. Offset boxes and vertical lines within the boxes represent frameshifts and premature stop codons, respectively. The LTRs are represented by the boxed triangles. The common scale is shown at the bottom.

Figure 7. Structures of Tca3 elements. (A) The two Tca3 elements in Assembly 6 of the Stanford C. albicans sequencing project database. (B) A full-length Tca3 element from C. albicans strain ATCC10261. In all panels shaded boxes represent the ORFs of the elements. The locations of the conserved domains are indicated. Offset boxes and vertical lines within the boxes represent frameshifts and premature stop codons, respectively. The LTRs are represented by the boxed triangles. The common scale is shown at the bottom. Assembly 19 represents an initial draft of the diploid genome, sharing a similar Tca3 complement to Assembly 6. However, it has some information which Assembly 6 does not contain, such as a composite on contig 19–10140. Assembly 19 contains identical LTRs flanking a target site duplication instead of the internal region. Data suggest that the sequence of internal region is substituted by a run of Ns. Highly similarity to other Tca3 elements may lead the internal sequence co-assemble with them mistakenly. Tca3 is roughly the same as other Ty3/gypsy elements in structure except for some special features. For example, elements of Tca3 seem to have a distinct mechanism for DNA synthesis which is involve in minus-strand DNA. Second, elements have a distant relationship with other Ty3/gypsy-like elements. Third, the ORFs of Tca3Δ elements have no obvious pol domain and shorter RT domain as typical Ty3/gypsy elements. Finally, losing the pol domain and large part of gag domain is another special feature. A copy of Tca3 obtained from ATCC10261 that retains the gag/pol region is referred to as Tca3F-10261 (Fig. 7B). It is found that Tca3Δ is more common than Tca3F. Tca3F is 6134 bp long. The internal region contains two long ORFs flanking by identical 313 bp LTRs. Tca3F-10261 has 99% identity with Assembly 6 Tca3Δ elements, except for the ~1.6 kb indel covering the gag/pol region, a 42 bp indel upwards to the large indel, and some variation in the 5′ untranslated regions. Goodwin et al. have found 5 of 6 strains emerged Tca3 bands, suggesting that Tca3 is quite common in C. albicans. In the five examined strains, only 1 to 3 bands were produced, which indicates that the elements are present in low copy number (data not shown). This might be the chance events in evolutionary process leading to excessive Tca3 elements produced. It may also be the result of selection, or may have connection with asexual reproduction in C. albicans., However, it is difficult to evaluate the relationship between different forms of Tca3 and host’s reproductive mode, on the base of indefinite reproduction in C. albicans. Sequence comparison and Southern analysis have shown that Tca3 is much conserved in sequence and structure, either in loci or in strains, suggesting that the original deletion event has occurred not long ago.

Tca4, an Active Element, Contains Intact ORF

Tca4 is a Ty1/copia element closely related to Tca2, which is flanked by 381 bp LTRs and contains an intact gag/pol ORFs. Tca4 is simple in sequence, suggesting that it has recently emerged. Like Tca2, Tca4 is a retrotranspositionally active element that strongly depends on the growth temperature. High temperature can induce Tca4 transcription.

Tca5, an Intact LTR Retrotransposon

Tca5 is of ~5.6 kb and 4218 bp of internal sequence flanked by identical 685 bp LTRs. Only one single long ORF lies in the internal region. It is larger than most other yeast retrotransposon LTRs. Phylogenetic analysis and sequence comparison imply that Tca5 has close connection with Ty5 element in S. cerevisiae. The S. cerevisiae Ty LTRs range in length from 251 bp for Ty5 to 371bp for Ty4., The terminal inverted repeat sequences are the most conserved in Ty1-like elements, containing at least 5 nt: 5′-TGTTG…CAACA-3′. Therefore, the Tca5 retrotransposon begins with TG and ends with CA, as most retrotransposons do. Sequencing of the PCR product revealed the left LTR, which was identical to the right LTR, and these LTR are designated as omega. Close behind the downstream of left LTR is PBS which is complementary to methionine tRNAiMet15 of initiator. Upstream of right LTR is a tract enriched in purines (ATGGGGAAG) and a similar sequence (ATGGGGAGG) at position 3132 bp. The internal region contains a single and long uninterrupted ORF with a relatively low copy number, and this ORF extends from within the left LTR (382 bp) to within 60 bp of the right LTR, with no inframe stop codon or frame shift, predicting a protein with motifs characteristic of gag, protease, integrase, reverse transcriptase, and RNase H proteins of other retrotransposons. However, it is unclear where the ORF might initiate translation in vivo. There are several possible sites, all of which are ATG in sequence. The first one within the ORF (382 bp from the 5′ end of the left LTR) is the unusually deep inset within the LTR. The second is also within the LTR three codons beyond the first. The third one within the ORF is unusually far beyond the end of the LTR, the three ATG are in-phase. Besides, there are several out-of-phase ATG triplets 5′ to this candidate. Southern analysis (data not shown by original authors) demonstrated that Tca5 is a low copy number retrotransposon, with very few solo LTRs. It is possible that these elements contribute to the heterogeneity of the Candida genome, either by recombination between the LTRs of one element or by chromosomal rearrangements between two elements. Tca5 may still be active which is supported by the detection of full-length transcript in northern blot. In C. albicans, Tca5 is the only element showing significant similarity to Ty5. It is a suitable candidate for comparisons because it has all the expected structures of an active element and is probably intact. Comparative genomic analysis between Tca5 and Ty5 elements will be helpful in understanding them.

LTR-Retrotransposons Improve Genetic Diversity

C. albicans is a diploid organism. Clinical isolates of C. albicans show variform phenotypic characteristics associated with infection and virulence.- These complex phenotypes suggest genetic diversity. However, C. albicans is an asexual fungus; it must have other mechanisms to generate various genomes. Retrotransposons have contributed to the evolution of genomes and genes in ways that go well beyond simply increasing genome size. It also affects genomes on a small scale, such as mobilization and integration at a locus directly, or recombination with heterotopic element indirectly. These changes contain gene inactivation, variance in transcription ability of gene, gene deletions and inversions. It has been well documented that retrotransposons would mobilize in response to stress.- Servant et al. have shown that the relationship between transposable activity of Ty1 LTR retrotransposons and genome expression in S. cerevisiae. It seems to have a connection between stress reaction and the diversity of retrotransposons in C. albicans. Many organisms have the ability to recruit LTR-retrotransposons as alternative exons or promoters to drive genome evolution.- The LTRs of Ty elements have the ability to modify genome transcription and transposable elements can influence the expression of neighboring host genes., In gram-positive bacteria, circularized form of Tn retrotransposons can promote organisms resistance to antimicrobial. There is an article about the relationship between virulence and reverse transposition of LTR retrotransposons in C. albicans. Take the Tca1 for example, the expression level of Tca1 is related to temperature, suggesting that Tca1 may place adjacent to the temperature-dependent genes. Since Tca1 exhibits higher expression at 25 °C than at 37 °C, Tca1 could upregulate the expression of genes required for the establishment of infection. Furthermore, in our original research, we found the transposition phenomenon of LTR-retrotransposon in strains which are resistant to miconazole (unpublished observation). It is suggested that there is a relationship between virulence and LTR-retrotransposons.
  64 in total

1.  Evolutionary analyses of the human genome.

Authors:  W H Li; Z Gu; H Wang; A Nekrutenko
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

2.  Molecular markers reveal that population structure of the human pathogen Candida albicans exhibits both clonality and recombination.

Authors:  Y Gräser; M Volovsek; J Arrington; G Schönian; W Presber; T G Mitchell; R Vilgalys
Journal:  Proc Natl Acad Sci U S A       Date:  1996-10-29       Impact factor: 11.205

Review 3.  Retroviral insertional mutagenesis: past, present and future.

Authors:  A G Uren; J Kool; A Berns; M van Lohuizen
Journal:  Oncogene       Date:  2005-11-21       Impact factor: 9.867

Review 4.  The evolution of Ty1-copia group retrotransposons in eukaryote genomes.

Authors:  A J Flavell; S R Pearce; P Heslop-Harrison; A Kumar
Journal:  Genetica       Date:  1997       Impact factor: 1.082

Review 5.  Transcription and reverse transcription of retrotransposons.

Authors:  J D Boeke; V G Corces
Journal:  Annu Rev Microbiol       Date:  1989       Impact factor: 15.500

6.  Relationship between cell surface composition, adherence, and virulence of Candida albicans.

Authors:  J McCourtie; L J Douglas
Journal:  Infect Immun       Date:  1984-07       Impact factor: 3.441

7.  Ty4, a new retrotransposon from Saccharomyces cerevisiae, flanked by tau-elements.

Authors:  B Janetzky; L Lehle
Journal:  J Biol Chem       Date:  1992-10-05       Impact factor: 5.157

8.  Murine leukemia virus protease is encoded by the gag-pol gene and is synthesized through suppression of an amber termination codon.

Authors:  Y Yoshinaka; I Katoh; T D Copeland; S Oroszlan
Journal:  Proc Natl Acad Sci U S A       Date:  1985-03       Impact factor: 11.205

Review 9.  Pathogenesis of Candida infections.

Authors:  F C Odds
Journal:  J Am Acad Dermatol       Date:  1994-09       Impact factor: 11.527

Review 10.  Tissue infection and site-specific gene expression in Candida albicans.

Authors:  Chantal Fradin; Bernhard Hube
Journal:  Adv Appl Microbiol       Date:  2003       Impact factor: 5.086

View more
  9 in total

1.  The landscape and structural diversity of LTR retrotransposons in Musa genome.

Authors:  Faisal Nouroz; Shumaila Noreen; Habib Ahmad; J S Pat Heslop-Harrison
Journal:  Mol Genet Genomics       Date:  2017-06-10       Impact factor: 3.291

2.  A D-genome-originated Ty1/Copia-type retrotransposon family expanded significantly in tetraploid cottons.

Authors:  Qian Li; Yue Zhang; Zhengsheng Zhang; Xianbi Li; Dan Yao; Yi Wang; Xufen Ouyang; Yaohua Li; Wu Song; Yuehua Xiao
Journal:  Mol Genet Genomics       Date:  2017-08-28       Impact factor: 3.291

3.  A first insight into the genome of Prototheca wickerhamii, a major causative agent of human protothecosis.

Authors:  Zofia Bakuła; Paweł Siedlecki; Robert Gromadka; Jan Gawor; Agnieszka Gromadka; Jan J Pomorski; Hanna Panagiotopoulou; Tomasz Jagielski
Journal:  BMC Genomics       Date:  2021-03-09       Impact factor: 3.969

4.  Insights from the genome of Ophiocordyceps polyrhachis-furcata to pathogenicity and host specificity in insect fungi.

Authors:  Duangdao Wichadakul; Noppol Kobmoo; Supawadee Ingsriswang; Sithichoke Tangphatsornruang; Duriya Chantasingh; Janet Jennifer Luangsa-ard; Lily Eurwilaichitr
Journal:  BMC Genomics       Date:  2015-10-28       Impact factor: 3.969

5.  Chromatin Profiling of the Repetitive and Nonrepetitive Genomes of the Human Fungal Pathogen Candida albicans.

Authors:  Robert Jordan Price; Esther Weindling; Judith Berman; Alessia Buscaino
Journal:  mBio       Date:  2019-07-23       Impact factor: 7.867

6.  Annotation of Siberian Larch (Larix sibirica Ledeb.) Nuclear Genome-One of the Most Cold-Resistant Tree Species in the Only Deciduous GENUS in Pinaceae.

Authors:  Eugenia I Bondar; Sergey I Feranchuk; Ksenia A Miroshnikova; Vadim V Sharov; Dmitry A Kuzmin; Natalya V Oreshkova; Konstantin V Krutovsky
Journal:  Plants (Basel)       Date:  2022-08-06

7.  Genome-wide characterization of LTR retrotransposons in the non-model deep-sea annelid Lamellibrachia luymesi.

Authors:  Oluchi Aroh; Kenneth M Halanych
Journal:  BMC Genomics       Date:  2021-06-23       Impact factor: 3.969

Review 8.  Retrotransposon targeting to RNA polymerase III-transcribed genes.

Authors:  Stephanie Cheung; Savrina Manhas; Vivien Measday
Journal:  Mob DNA       Date:  2018-04-23

9.  The Genome of the Human Pathogen Candida albicans Is Shaped by Mutation and Cryptic Sexual Recombination.

Authors:  Joshua M Wang; Richard J Bennett; Matthew Z Anderson
Journal:  MBio       Date:  2018-09-18       Impact factor: 7.867

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.