| Literature DB >> 19068121 |
Andrew P Jackson1, Michael A Quail, Matthew Berriman.
Abstract
BACKGROUND: Bodo saltans is a free-living kinetoplastid and among the closest relatives of the trypanosomatid parasites, which cause such human diseases as African sleeping sickness, leishmaniasis and Chagas disease. A B. saltans genome sequence will provide a free-living comparison with parasitic genomes necessary for comparative analyses of existing and future trypanosomatid genomic resources. Various coding regions were sequenced to provide a preliminary insight into the bodonid genome sequence, relative to trypanosomatid sequences.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19068121 PMCID: PMC2621209 DOI: 10.1186/1471-2164-9-594
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Description of genomic fragments sequenced in this study.
| Clone | Probe† | Length (bp) | Genes by taxonomic affinity* | GenBank accession number | ||||
| K | E | P | U | TOTAL | ||||
| 5 e 15 | Rab1 | 35803 | 4 | 2 | 0 | 7 | 13 | |
| 14 l 17 | α-tubulin | 38184 | 8 | 1 | 0 | 5 | 14 | |
| 16 k 02 | HSP70 | 32908 | 8 | 2 | 0 | 2 | 12 | |
| 23 g 24 | GPDH | 37591 | 11 | 1 | 0 | 7 | 19 | |
| 45 a 09 | 38203 | 7 | 1 | 1 | 5 | 14 | ||
| 45 a 12 | 41499 | 16 | 3 | 0 | 5 | 24 | ||
| 46 a 11 | 30067 | 8 | 3 | 1 | 3 | 15 | ||
| 5 m 18 | β-tubulin | 28046 | 9 | 1 | 0 | 3 | 13 | |
| 93 d 02 | 18199 | 6 | 2 | 0 | 2 | 10 | ||
| 93 e 01 | 40818 | 12 | 2 | 0 | 3 | 17 | ||
| 96 f 03 | 38993 | 12 | 0 | 0 | 2 | 14 | ||
| 96 g 09 | 31028 | 5 | 2 | 0 | 6 | 13 | ||
Note: † Clones for which no probe is stated were selected after end-sequencing.* Genes were classified according to their affinity: kinetoplastid (K), other eukaryotic (E), prokaryotic (P) or unknown (U).
Figure 1Schematic representation of three regions of the . Six reading frames are shown as parallel grey bars; scale in base-pairs. Base composition is plotted above. Putative coding sequences are shown as coloured boxes: red (homolog of trypanosomatid gene with known function), orange (homolog of hypothetical trypanosomatid gene), green (hypothetical gene with no trypanosomatid homolog but a positive functional match to a sequence database), blue (hypothetical gene with no matches to sequence databases). Labels attending these coding sequences contain the GeneDB identification numbers of homologous trypanosomatid genes where possible, or the description of homologous genes detected by BLAST comparisons (with % identity). Predicted transmembrane helices (blue) and signal peptides (purple) are shown on the DNA strands below the coding sequence. a. Clone '16k02' containing a tandem gene array of heat-shock protein 70. b. Clone '14l17' containing a tandem gene array of α- and β-tubulin. An asterisk * denotes a β-tubulin gene disrupted by a single base deletion at position 589. c. Clone '5m18' containing a second tandem gene array of α- and β-tubulin.
Figure 2Screenshot from the Artemis Comparison Tool (ACT), showing a 41.5 Kb fragment of . Key to B. saltans coding sequence annotation: 1. RNA-binding protein (homolog of Tb927.7.5380); 2. Hypothetical, no matches; 3. Serine-threonine protein kinase (Metazoa 46%, Plantae 43%); 4. Hypothetical, no matches; 5. Homolog of Tb10.61.3155; 6. Hypothetical lipase; 7. Serine-threonine protein kinase (Tb10.61.3140); 8. Homolog of Tb10.61.3130; 9. Possible ornithine decarboxylase (Bacteria 27%); 10. Dephospho-CoA kinase (Tc00.1047053511277.500); 11. Homolog of Tb10.61.3120; 12. Homolog of Tb10.61.3115; 13. Homolog of Tb10.61.3110; 14. DNAJ chaperone (Tb10.61.3100); 15. Homolog of Tb10.61.3080; 16. Homolog of Tb10.61.3070; 17. GPI-anchor transamidase (Tb10.61.3060); 18. Tubulin tyrosine ligase (Tb10.61.3050); 19. Ubiquitin-conjugating enzyme (Tb927.5.1000); 20. Homolog of Tb10.61.3040.
Bodo-specific hypothetical genes, with evidence of protein domains, transmembrane domains (TMH) signal peptides (SP) and affinities to sequence databases where available.
| Clone | Position | Size (bp) | Interpro match | TMH | SP | Best BLASTp match | Best taxonomic match (% amino acid identity) |
| 14 l 17 | 3085 | 645 | Oxygenase (IPR005123) | Prolyl hydroxylase | Metazoa (53%) | ||
| 16869 | 1143 | Zn-binding protein (IPR001841) | 1 | ||||
| 21613 | 7356 | Leucine-rich repeat (IPR001611) | 2 | ||||
| 16 k 02 | 14573 | 1125 | Yes | ||||
| 16273 | 1596 | Yes | GPI-anchored protein | Plantae/Fungi (64%) | |||
| 18275 | 1212 | Hypothetical protein | Metazoa (50%) | ||||
| 27014 | 1050 | 7 | Yes | ||||
| 23 g 24 | 391 | 1626 | Pleckstrin-like (IPR001849) | ||||
| 7655 | 2781 | Zinc metallopeptidase (IPR006025) | |||||
| 21361 | 23428 | EF-hand protein | |||||
| 45 a 09 | 4161 | 535 | Methyltransferase (IPR013216) | ||||
| 30026 | 1134 | Phosphoribulokinase (IPR006083) | Bacteria (50%) | ||||
| 45 a 12 | 12191 | 1695 | Lipase (IPR008265) | 1 | |||
| 31394 | 455 | Serine-Threonine protein kinase (IPR017442) | Protein kinase | Metazoa (46%) | |||
| 41227 | 264 | Nucleotide-sugar transporter (IPR007271) | 3 | Yes | |||
| 46 a 11 | 211 | 753 | Endonuclease (IPR001604) | 1 | Yes | Endonuclease | Bacteria (53%) |
| 1166 | 1416 | Thioredoxin (IPR000866) | Nucleoredoxin | Plantae (53%) | |||
| 12286 | 1092 | Fucosyltransferase | Plantae (36%) | ||||
| 14365 | 765 | Hypothetical protein | Metazoa (55%) | ||||
| 5 e 15 | 477 | 10617 | Phosphatidylinositol-4-phosphate kinase (IPR003409) | ||||
| 14913 | 1296 | Lyzosomal lipase (IPR006693) | 1 | Lipase | Metazoa (59%) | ||
| 16510 | 2199 | Serine endopeptidase (IPR001254) | Serine protease | Plantae (46%) | |||
| 21065 | 546 | ABC transporter (IPR013525) | 4 | Yes | |||
| 32153 | 2613 | 1 | |||||
| 35286 | 546 | Zinc-finger protein (IPR001841) | |||||
| 5 m 18 | 5577 | 1857 | Sodium/hydrogen exchanger (IPR006153) | 14 | Yes | Sodium/hydrogen exchanger | Metazoa (50%) |
| 17069 | 321 | Ankyrin repeat protein (IPR002110) | |||||
| 93 d 02 | 5528 | 1758 | Yes | ||||
| 7529 | 2589 | Chloride channel (IPR0014743) | 9 | Chloride channel | Metazoa (56%) | ||
| 14924 | 1788 | WD40 repeat (IPR001680) | Hypothetical protein | Plantae (56%) | |||
| 93 e 01 | 56 | 5505 | 14 | ||||
| 16439 | 1995 | Forkhead associated protein (IPR000253) | |||||
| 18802 | 4260 | 8 | Yes | ||||
| 24535 | 855 | Protein tyrosine phosphatase (IPR000340) | Protein phosphatase | Plantae (56%) | |||
| 96 g 09 | 10627 | 2559 | PDZ protein binding motif (IPR001478) | Yes | |||
| 16737 | 1242 | 7 | Yes | ||||
| 26182 | 948 | Leucine-rich repeat kinase (IPR001611) | Receptor-type protein kinase | Plantae (54%) | |||
| 27597 | 483 | Leucine-rich repeat kinase (IPR001611) | Receptor-type protein kinase | Plantae (34%) |
Figure 3a. Schematic representation of a 32.9 Kb fragment of . The asterisk * denotes a physical sequence gap of uncertain length. b. Protein sequence alignment showing the repetitive region of 11 'surface antigen' proteins from L. major and their putative homolog from B. saltans, shown in panel a. (13440–16665 bp).