| Literature DB >> 17083741 |
Pierre M Durand1, Andries J Oelofse, Theresa L Coetzer.
Abstract
BACKGROUND: The completed genome sequences of the malaria parasites P. falciparum, P. y. yoelii and P. vivax have revealed some unusual features. P. falciparum contains the most AT rich genome sequenced so far--over 90% in some regions. In comparison, P. y. yoelii is approximately 77% and P. vivax is approximately 55% AT rich. The evolutionary reasons for these findings are unknown. Mobile genetic elements have a considerable impact on genome evolution but a thorough investigation of these elements in Plasmodium has not been undertaken. We therefore performed a comprehensive genome analysis of these elements and their derivatives in the three Plasmodium species.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17083741 PMCID: PMC1636048 DOI: 10.1186/1471-2164-7-282
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Schematic representation of MGEs. ORFs are represented by coloured blocks. Non-autonomous retrotransposons and mobile introns are not depicted here. Arrows represent flanking repeats.
InterPro and Pfam mobile genetic element signature domains identified in P. falciparum and P. y. yoelii.
| IPR004244 (PF02994) | Transposase_22/L1 element | PY07579 (PORF1) |
| IPR002104 (PF00589) | Phage integrase catalytic domain | MAL13P1.42 (hypothetical) PY04273 (hypothetical) |
| IPR010998 | Lambda phage integrase-like domain | PY05943 (yir3 protein, putative), MAL13P1.42 (hypothetical), PY04273 (hypothetical) |
| IPR001207 (PF00872) | Transposase, mutator type | PY07767 (transposase, mutator type) |
| IPR003611 (PF07460) | Intron-encoded nuclease 2 | PFC0925w (hypothetical), PY00434 (hypothetical) |
| IPR011940 | Meiotic recombinase | PY05593 (DNA repair protein rhp51), MAL8P1.76 (meiotic recombination protein dmc1-like) |
| IPR011941 | DNA repair protein rad51 | PY03786 (rad51 homologue), PF11_0085 (rad51 homologue) |
Seven P. y. yoelii and four P. falciparum ORFs were identified. The ORFs MAL13P1.42 and PY04273 were duplicated by the two InterPro domains IPR002104 and IPR010998.
Figure 2Multiple sequence alignment of conserved RT domains and two potential RT-encoding genes in . #RT_domains is the RT consensus sequence developed in this study. It includes the fingers (1 and 2) and palm (A, B', C and D) domains. #PF13_0080 is the putative PfTERT identified by Figueiredo et al, 2005 [12]. #PFE1555c is a hypothetical protein with similarity to the RT consensus sequence developed in this study. The alignment was performed using ClustalW with default parameters. Residues shaded in yellow have been demonstrated to be in contact with nucleic acid [29]; residues shaded in red are critical for RT activity [30]. * residues common to all threesequences. : conserved residues. . semi-conserved residues
Figure 3Multiple sequence alignment of PyTERT, PfTERT and PvTERT. Evidence for the putative P. y. yoelii (PY00479) and P. falciparum (PF13_0080) TERTs has been published previously [10-12]. The candidate P. vivax TERT (Pv122530) was identified in this study. Shaded blocks represent RT conserved domains. * residues common toall three sequences. : conserved residues. . semi-conserved residues
P. vivax ORFs containing MGE domains.
| PF03184 Endonuclease CENP-B protein | 2 | E values -9 and -10 |
| PF00692 DeoxyUTP pyrophosphatase | 1 | E value -8 |
| PF00078 Reverse transcriptase | 23 | E values -8 to -27 |
| PF00589 phage integrase catalytic domain | 1 | E value -13 |
| PF07460 Intron-encoded nuclease 2 | 1 | E value -3 |
| Determined by BLASTP search | 2 | E values -2 and -3 |
| Determined by BLASTP search | 10 | E values -4 to -168 |
ORFs containing MGE domains were identified in the genome in three ways: i) a HMMER search using Pfam_fs HMMs ii) a BLASTP search using the RT consensus sequence developed in this study, and iii) a BLASTP search using the P. y. yoelii and P. falciparum MGE domain sequences identified from InterPro but not located in Pfam. Only the Pfam entries with significant hits are listed.
The six conserved domains used in this study to represent the RT sequence of RT-encoding MGEs.
| Fingers | 1 | II1XXL1[KG]S[GY]P1E1N[FP]K1XG1[RP][IV]XXI2,3[PL]K2,3[PK]D1[GK][GS] |
| Fingers | 2 | XR2,3P2[LI]2,3[TS]XXS3XXXK2,3IXQ1XV[IL]X[IK]XL2XXXXEXK |
| Palm | A | K1XXX[WS]X[TV]XL1D2,4I1K2K1,3CF2,3D2S2I2P1H2D1KL2I1XXL2XXY |
| Palm | B' | R1XXXXXXG2XP1Q2,3G2,3SX3XS2P2,3XL1F[NS]IYL1XXL2XE1XLXNXX |
| Palm | C | [YL]XR1Y2,3A1,3D2,4D2,4I[IL]V1X[VS]XXXKN |
| Palm | D | KXXLDXXTXFCXXIG2L1T1VN1XXK2T1K[IT] |
Alternative amino acid residues are in brackets. 'X' is any residue. 1 – residue present in three or more of the six aligned sequences; 2 – residue present in four or more of the six aligned sequences; 3 – residue in contact with nucleic acid template, dNTP or Mg2+ ; 4 – point mutations of this residue abolish catalytic activity.