| Literature DB >> 20587589 |
Matej Vesteg1, Rostislav Vacula, Jürgen M Steiner, Bianka Mateásiková, Wolfgang Löffelhardt, Brona Brejová, Juraj Krajcovic.
Abstract
The chloroplasts of Euglena gracilis bounded by three membranes arose via secondary endosymbiosis of a green alga in a heterotrophic euglenozoan host. Many genes were transferred from symbiont to the host nucleus. A subset of Euglena nuclear genes of predominately symbiont, but also host, or other origin have obtained complex presequences required for chloroplast targeting. This study has revealed the presence of short introns (41-93 bp) either in the second half of presequence-encoding regions or shortly downstream of them in nine nucleus-encoded E. gracilis genes for chloroplast proteins (Eno29, GapA, PetA, PetF, PetJ, PsaF, PsbM, PsbO, and PsbW). In addition, the E. gracilis Pbgd gene contains two introns in the second half of presequence-encoding region and one at the border of presequence-mature peptide-encoding region. Ten of 12 introns present within presequence-encoding regions or shortly downstream of them identified in this study have typical eukaryotic GT/AG borders, are T-rich, 45-50 bp long, and pairwise sequence identities range from 27 to 61%. Thus single recombination events might have been mediated via these cis-spliced introns. A double crossing over between these cis-spliced introns and trans-spliced introns present in 5'-UTRs of Euglena nuclear genes is also likely to have occurred. Thus introns and exon-shuffling could have had an important role in the acquisition of chloroplast targeting signals in E. gracilis. The results are consistent with a late origin of photosynthetic euglenids.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20587589 PMCID: PMC2920757 DOI: 10.1093/dnares/dsq015
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
List of primers derived from E. gracilis mRNA sequences
| mRNAa | Referenceb | Accession numberc | Forward primersd | Reverse primerse |
|---|---|---|---|---|
| AJ272112 | 65–84 | 298–279 | ||
| L21904 | 23–42 | 468–449 | ||
| X15743 | 183–202 | 511–492 | ||
| AF443625 | 49–68 | 422–403 | ||
| AJ130725 | 89–108 | 386–367 | ||
| D14702 | 40–59 | 674–655 |
aPrimers were derived from mRNA sequences of nucleus-encoded genes (Eno29, GapA, Pbgd, PetA, PetJ and PsbO) for chloroplast proteins (enolase, glyceraldehyde-3-phosphate dehydrogenase, porphobilinogen deaminase, cytochrome f, cytochrome c6, and 30 kDa protein of the oxygen-evolving complex, respectively).
bNumber of reference in the reference list in which the corresponding mRNA was characterized.
cAccesion numbers of mRNAs.
d,eThe numbers of primers correspond to the positions in mRNA sequences that can be found under the accession numbers (accession number) listed in the third column. For example, forward primer 65–84 (first row, fourth column) is identical to positions 65–84 of Eno29 mRNA sequence, which can be found under accession number AJ272112 and reverse primers 298–279 (first row, fifth column) is complementary to the sequence 279–298 of Eno29 mRNA.
List of primers derived from E. gracilis EST sequences
| EST producta | Accession numberb | Organism with the best BLASTX hit | Forward primersc | Reverse primersd | |
|---|---|---|---|---|---|
| PetF | EG565162 | 100E−40 | 57–76 | 627–608 | |
| PsaF | EG565174 | 900E−36 | 30–49 | 499–480 | |
| PsbM | EG565161 | 700E−11 | 85–104 | 575–556 | |
| PsbW | EG565140 | 300E−14 | 68–86 | 560–543 |
aThe name of protein product of ESTs from which primers were derived. PetF is plastid-targeted ferredoxin, PsaF is subunit F of photosystem I, and PsbM and PsbW are subunits M and W of photosystem II.
bAccession numbers of ESTs.
c,dThe numbers of primers correspond to the positions in corresponding ESTs. E.g. forward primer 57–76 (first row, fifth column) is identical to positions 57–76 of EST with accession number EG56162, and reverse primer 627–608 (first row, sixth column) is complementary to nucleotides 627–608 of this EST.
Introns in E. gracilis nucleus-encoded genes for chloroplast proteins identified in this study
| Intron | Accession number | Length | Borders | Percent AT | Percent T | Nucleotide position | Phase |
|---|---|---|---|---|---|---|---|
| eno29-i1 | GQ925702 | 48 | GT/AG | 62.50 | 37.50 | 166 | 1 |
| gapA-i1 | GQ925704 | 50 | GT/AG | 54.00 | 44.00 | 243 | 0 |
| pbgd-i1 | GQ925705 | 48 | GT/AG | 60.42 | 33.33 | 276 | 1 |
| pbgd-i2 | GQ925705 | 46 | GT/AG | 67.39 | 45.65 | 377 | 0 |
| pbgd-i3 | GQ925705 | 50 | GT/AG | 68.00 | 46.00 | 462 | 1 |
| petA-i1 | GQ925706 | 45 | GT/AG | 64.44 | 37.78 | 261 | 0 |
| petF-i1 | GQ925703 | 46 | GT/AG | 63.04 | 47.83 | 423 | 1 |
| petJ-i1 | GQ925707 | 41 | GT/TC or TT/CG | 63.41 | 34.15 | 304 or 305 | 1 or 2 |
| psaF-i1 | GQ925708 | 47 | GT/AG | 72.34 | 44.68 | 307 | 0 |
| psbM-i1 | GQ925709 | 47 | GT/AG | 55.32 | 38.30 | 411 | 1 |
| psbO-i1 | GQ925710 | 93 | ? | 68.82 | 44.09 | 517, 518 or 519 | ? |
| psbW-i1 | GQ925711 | 48 | GT/AG | 64.58 | 39.58 | 198 | 0 |
| psbW-i2 | GQ925711 | 195 | GA/GT | 54.36 | 31.79 | 505 | 1 |
All introns, except for psbW-i2, are present either in the presequence-encoding regions or shortly upstream of them. The table includes accession numbers of partial gene sequences containing corresponding introns, intron length (in nucleotides), intron borders, AT- and T-content of introns, and intron phase. Nt position is the position downstream of which the intron is inserted into the corresponding mRNA or EST sequence (for the accession numbers of mRNAs and ESTs see Tables 1 and 2).
Primary sequence identity (top-right half) and alignment P-value (bottom-left half) of the selected introns from E. gracilis (44–50 bp long, with consensus GT/AG borders)
| eno29-i1 | gapA-i1 | pbgd-i1 | pbgd-i2 | pbgd-i3 | petA-i1 | petF-i1 | psaF-i1 | psbM-i1 | psbW-i1 | nop1p-i1 | nop1p-i3 (%) | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| eno29-i1 | 45% | 27% | 42% | 36% | 56% | 42% | ||||||
| gapA-i1 | 0.07 | 46% | 37% | 39% | 54% | 55% | 45% | 37% | ||||
| pbgd-i1 | 0.39 | 55% | 40% | 43% | 51% | 49% | 42% | 46% | 32% | |||
| pbgd-i2 | 0.32 | 0.35 | 0.20 | 55% | 33% | 46% | 29% | 47% | 41% | 41% | ||
| pbgd-i3 | 0.22 | 0.93 | 0.23 | 42% | 47% | 50% | 37% | 37% | 47% | |||
| petA-i1 | 0.20 | 0.25 | 0.27 | 0.25 | 38% | 34% | 46% | 49% | 38% | |||
| petF-i1 | 0.21 | 0.15 | 0.90 | 0.63 | 55% | 46% | 48% | 47% | ||||
| psaF-i1 | 0.33 | 0.15 | 0.74 | 0.13 | 38% | 40% | 39% | |||||
| psbM-i1 | 0.09 | 0.16 | 0.83 | 0.37 | 0.28 | 0.11 | 0.91 | 47% | 39% | |||
| psbW-i1 | 0.06 | 0.20 | 0.34 | 0.24 | 0.26 | 0.07 | 0.27 | 50% | 46% | |||
| nop1p-i1 | 0.26 | 0.44 | 0.83 | 0.28 | 0.36 | 0.35 | ||||||
| nop1p-i3 | 0.38 | 0.31 | 0.34 | 0.95 | 0.58 | 0.65 | 0.23 | 0.62 | 0.25 | 0.93 |
Except for nop1p-i1 and nop1p-i3 (introns present in the gene-encoding nucleolar protein fibrillarin), all these introns are present either in the presequence-encoding regions or shortly upstream of them. The primary sequence identity was calculated as the number of identical nucleotide oppositions of two introns in a pairwise alignment divided by the length of the alignment. Statistically significant alignments with P-value ≤0.05 are shown in bold (see section Methods).
Figure 1Possible mechanisms for the acquisition of presequences in the ancestor of phototrophic euglenids. A single recombination event mediated via cis-spliced introns (Cis i1 and Cis i2) can result in the addition of presequence (or its part) from donor gene to acceptor gene (A). The donor gene would also acquire trans-spliced intron (Trans i) necessary for the addition of capped SL-leader ensuring translation of acceptor gene mRNA. Note that intron Cis i1 is shown to be present exactly at the presequence-mature peptide border encoding region for illustration in (A), but it can be present also within second half of presequence-encoding region or shortly downstream of it. Intron Cis i2 is also shown to be present at 5′-end of the acceptor gene for illustration in (A), but it can be also present within 5′-end of the acceptor gene. Another possible mechanism for the acquisition of chloroplast-targeting signals (B) may involve double crossing over, i.e. the first recombination event occurring between trans-spliced introns Trans i1 and Trans i2, present at the 5′-ends of donor and acceptor gene, respectively, and the second recombination event occurring between cis-spliced intron (Cis i1) present at the presequence-mature peptide border-encoding region of the donor gene and cis-spliced intron (Cis i2) present somewhere within the 5′-end of the protein-coding region of the acceptor gene.