Literature DB >> 19068121

Insights into the genome sequence of a free-living Kinetoplastid: Bodo saltans (Kinetoplastida: Euglenozoa).

Andrew P Jackson1, Michael A Quail, Matthew Berriman.   

Abstract

BACKGROUND: Bodo saltans is a free-living kinetoplastid and among the closest relatives of the trypanosomatid parasites, which cause such human diseases as African sleeping sickness, leishmaniasis and Chagas disease. A B. saltans genome sequence will provide a free-living comparison with parasitic genomes necessary for comparative analyses of existing and future trypanosomatid genomic resources. Various coding regions were sequenced to provide a preliminary insight into the bodonid genome sequence, relative to trypanosomatid sequences.
RESULTS: 0.4 Mbp of B. saltans genome was sequenced from 12 distinct regions and contained 178 coding sequences. As in trypanosomatids, introns were absent and %GC was elevated in coding regions, greatly assisting in gene finding. In the regions studied, roughly 60% of all genes had homologs in trypanosomatids, while 28% were Bodo-specific. Intergenic sequences were typically short, resulting in higher gene density than in trypanosomatids. Although synteny was typically conserved for those genes with trypanosomatid homologs, strict colinearity was rarely observed because gene order was regularly disrupted by Bodo-specific genes.
CONCLUSION: The B. saltans genome contains both sequences homologous to trypanosomatids and sequences never seen before. Structural similarities suggest that its assembly should be solvable, and, although de novo assembly will be necessary, existing trypanosomatid projects will provide some guide to annotation. A complete genome sequence will provide an effective ancestral model for understanding the shared and derived features of known trypanosomatid genomes, but it will also identify those kinetoplastid genome features lost during the evolution of parasitism.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19068121      PMCID: PMC2621209          DOI: 10.1186/1471-2164-9-594

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

The Kinetoplastida (Euglenozoa) are unicellular flagellates that include the trypanosomatid parasites, most notably Trypanosoma brucei, T. cruzi and Leishmania spp. These organisms cause substantial mortality and morbidity in humans and their livestock worldwide as the causative agents of African sleeping sickness, Chagas disease and leishmaniasis respectively. Bodo saltans is a free-living heterotroph found worldwide in freshwater and marine habitats. It possesses the diagnostic kinetoplastid features, such as flagella sited within a specialised flagellar pocket, glycolytic processes confined to a dedicated organelle (the 'glycosome'), and the characteristic concentration of mitochondrial DNA at the base of the flagellum (the 'kinetoplast') [1,2]. When comparing trypanosomatid parasites with each other, or collectively with other eukaryotes, the value of B. saltans is as a non-parasitic near relative, (i.e., an 'outgroup'), that can illuminate their key evolutionary transitions. Five draft genome sequences exist for Trypanosoma spp. and four for Leishmania spp. [3-7]; these will be augmented with further strains and other non-human parasites in the coming years [8]. With such excellent comparative resources in place or in development, there is a critical need for a non-trypanosomatid outgroup. In effect, it will provide a model of the ancestral trypanosomatid to distinguish those derived parts of the parasite genomes (i.e., unique trypanosomatid adaptations) from those which are a legacy of the free-living ancestor. For instance, such a model will help to resolve whether trypanosomatids previously possessed an algal plastid from which 'plant-like' genes in trypanosomatid genomes are derived [9-11]. As a prelude to a complete B. saltans genome sequencing effort, this study sought to establish an initial understanding of the bodonid genome, its structure and content relative to the trypanosomatids. The most recent kinetoplastid phylogeny has shown that trypanosomatid parasites are just one of many independent acquisitions of parasitism, indeed, a relatively minor component of total diversity [12-15]. Nonetheless, they are, naturally, the most important aspect of kinetoplastid diversity. Many features of their completed genome sequences emphasised the common ancestry of T. brucei, T. cruzi and Leishmania spp., especially with respect to gene repertoire and order [16], but their critical pathological differences were also evident at the genomic level. The three human parasites cause distinct diseases; their genomes contain enigmatic adaptations related to pathogenesis and immune evasion, for instance the bloodstream expression site in T. brucei from which its variant surface glycoproteins (VSG) are expressed [17,18], and surface antigen families in general [16]. Without an historical dimension, these features cannot be compared, nor understood in an evolutionary context. As it is among the closest bodonid relatives of the trypanosomatids [19], Bodo saltans is a suitable outgroup to address three principal comparative issues: i) understanding how human trypanosomatid parasites acquired their distinct pathological strategies; ii) understanding how the ancestral trypanosomatid became parasitic in terms of derived innovations (e.g., cell surfaces) and loss of genomic repertoire; iii) understanding how typical kinetoplastid features (e.g., glycosomes) evolved and how these might have been modified for parasitism. Quite what to expect from a bodonid genome sequence is an open question. Beyond the basic kinetoplastid features named above, the biological differences between bodonids and trypanosomatids are striking. While B. saltans is a bacteriovore, especially prevalent in polluted waters or other environments with high bacterial densities [1], trypanosomatids are obligate parasites inhabiting a nutrient-rich, but ultimately hostile, host environment, and adept at exploiting their eutrophic environment to maximise proliferation and transmission. By contrast, B. saltans preys on bacterial cells [1,2] and is probably adapted for resource acquisition within its relatively oligotrophic environment. Although bodonids and trypanosomatids are all flagellates, trypanosomatids attach their single flagellum to the cell surface to generate motile force, whereas the anterior flagellum in B. saltans is modified with hair-like mastigonemes, which may assist prey location during feeding [2,20-22]. There are wider cytoskeletal differences also; the subpellicular microtubular cortex is instrumental in maintaining the numerous cell forms adopted by trypanosomatids [23], but is reduced in bodonids, (which lack complex developmental stages), to the region around the cytostome [2,24]. Perhaps most importantly for understanding the evolution of parasitism, we can expect substantial differences between trypanosomatid cell surfaces that function primarily to manipulate and frustrate the host immune response and bodonid membranes that are perhaps largely concerned with cellular homeostasis. Rather than providing definitive answers to these questions, the preliminary sequence data presented here provides an initial insight into a few comprehensively resolved locations in the B. saltans genome, indicating what to expect from gene content and arrangement, and testing the feasibility of a complete sequence project. The sequence contigs were compared with corresponding regions in trypanosomatids (based on conserved gene order, where this existed), to examine gene content and the conservation of gene order (i.e., colinearity) and, therefore, the potential for using trypanosomatid genome sequences as scaffolds to assist assembly and annotation of the B. saltans sequence.

Results

Gene structure

Clones were selected from the B. saltans fosmid library according to random end-sequences and positive results for specific PCR probes. Inserts from 12 fosmid clones were shotgun sequenced, comprising 0.403 Mbp in total and an average size of 33.6 Kbp. Table 1 describes the composition of the 12 contigs in terms of the affinity shown by each putative coding sequence to sequence databases. 178 putative coding sequences are specified; genes could be predicted by eye because of a definite elevation in GC content in coding regions. Subsequent matches to sequence databases showed these features to be correct. The boundaries between coding and flanking regions are marked by a transition from GC-rich to AT-rich signatures; the sequences shown in Figure 1 clearly demonstrate the GC troughs that appear between coding sequences. This pattern is repeated in other contigs, as shown in subsequent figures. Gene density is high relative to corresponding regions in the L. major and T. brucei genome sequences, reflected by the consistently short intercoding sequences across all contigs (average = 377.2 bp). Figure 2 compares the gene order of one region (average interceding sequence length = 439.7 bp) with positionally orthologous regions in L. major (average = 1480.6 bp) and T. brucei (average = 1129.4 bp); this, like most fosmid inserts, contains more genes in Bodo than in trypanosomatids.
Table 1

Description of genomic fragments sequenced in this study.

CloneProbeLength (bp)Genes by taxonomic affinity*GenBank accession number
KEPUTOTAL
5 e 15Rab135803420713FJ168547
14 l 17α-tubulin38184810514FJ168548
16 k 02HSP7032908820212FJ168549
23 g 24GPDH375911110719FJ168550
45 a 0938203711514FJ168551
45 a 12414991630524FJ168552
46 a 1130067831315FJ168553
5 m 18β-tubulin28046910313FJ168554
93 d 0218199620210FJ168555
93 e 01408181220317FJ168556
96 f 03389931200214FJ168557
96 g 0931028520613FJ168558

TOTAL41133910620250178

Note: † Clones for which no probe is stated were selected after end-sequencing.* Genes were classified according to their affinity: kinetoplastid (K), other eukaryotic (E), prokaryotic (P) or unknown (U).

Figure 1

Schematic representation of three regions of the . Six reading frames are shown as parallel grey bars; scale in base-pairs. Base composition is plotted above. Putative coding sequences are shown as coloured boxes: red (homolog of trypanosomatid gene with known function), orange (homolog of hypothetical trypanosomatid gene), green (hypothetical gene with no trypanosomatid homolog but a positive functional match to a sequence database), blue (hypothetical gene with no matches to sequence databases). Labels attending these coding sequences contain the GeneDB identification numbers of homologous trypanosomatid genes where possible, or the description of homologous genes detected by BLAST comparisons (with % identity). Predicted transmembrane helices (blue) and signal peptides (purple) are shown on the DNA strands below the coding sequence. a. Clone '16k02' containing a tandem gene array of heat-shock protein 70. b. Clone '14l17' containing a tandem gene array of α- and β-tubulin. An asterisk * denotes a β-tubulin gene disrupted by a single base deletion at position 589. c. Clone '5m18' containing a second tandem gene array of α- and β-tubulin.

Figure 2

Screenshot from the Artemis Comparison Tool (ACT), showing a 41.5 Kb fragment of . Key to B. saltans coding sequence annotation: 1. RNA-binding protein (homolog of Tb927.7.5380); 2. Hypothetical, no matches; 3. Serine-threonine protein kinase (Metazoa 46%, Plantae 43%); 4. Hypothetical, no matches; 5. Homolog of Tb10.61.3155; 6. Hypothetical lipase; 7. Serine-threonine protein kinase (Tb10.61.3140); 8. Homolog of Tb10.61.3130; 9. Possible ornithine decarboxylase (Bacteria 27%); 10. Dephospho-CoA kinase (Tc00.1047053511277.500); 11. Homolog of Tb10.61.3120; 12. Homolog of Tb10.61.3115; 13. Homolog of Tb10.61.3110; 14. DNAJ chaperone (Tb10.61.3100); 15. Homolog of Tb10.61.3080; 16. Homolog of Tb10.61.3070; 17. GPI-anchor transamidase (Tb10.61.3060); 18. Tubulin tyrosine ligase (Tb10.61.3050); 19. Ubiquitin-conjugating enzyme (Tb927.5.1000); 20. Homolog of Tb10.61.3040.

Schematic representation of three regions of the . Six reading frames are shown as parallel grey bars; scale in base-pairs. Base composition is plotted above. Putative coding sequences are shown as coloured boxes: red (homolog of trypanosomatid gene with known function), orange (homolog of hypothetical trypanosomatid gene), green (hypothetical gene with no trypanosomatid homolog but a positive functional match to a sequence database), blue (hypothetical gene with no matches to sequence databases). Labels attending these coding sequences contain the GeneDB identification numbers of homologous trypanosomatid genes where possible, or the description of homologous genes detected by BLAST comparisons (with % identity). Predicted transmembrane helices (blue) and signal peptides (purple) are shown on the DNA strands below the coding sequence. a. Clone '16k02' containing a tandem gene array of heat-shock protein 70. b. Clone '14l17' containing a tandem gene array of α- and β-tubulin. An asterisk * denotes a β-tubulin gene disrupted by a single base deletion at position 589. c. Clone '5m18' containing a second tandem gene array of α- and β-tubulin. Screenshot from the Artemis Comparison Tool (ACT), showing a 41.5 Kb fragment of . Key to B. saltans coding sequence annotation: 1. RNA-binding protein (homolog of Tb927.7.5380); 2. Hypothetical, no matches; 3. Serine-threonine protein kinase (Metazoa 46%, Plantae 43%); 4. Hypothetical, no matches; 5. Homolog of Tb10.61.3155; 6. Hypothetical lipase; 7. Serine-threonine protein kinase (Tb10.61.3140); 8. Homolog of Tb10.61.3130; 9. Possible ornithine decarboxylase (Bacteria 27%); 10. Dephospho-CoA kinase (Tc00.1047053511277.500); 11. Homolog of Tb10.61.3120; 12. Homolog of Tb10.61.3115; 13. Homolog of Tb10.61.3110; 14. DNAJ chaperone (Tb10.61.3100); 15. Homolog of Tb10.61.3080; 16. Homolog of Tb10.61.3070; 17. GPI-anchor transamidase (Tb10.61.3060); 18. Tubulin tyrosine ligase (Tb10.61.3050); 19. Ubiquitin-conjugating enzyme (Tb927.5.1000); 20. Homolog of Tb10.61.3040. Description of genomic fragments sequenced in this study. Note: † Clones for which no probe is stated were selected after end-sequencing.* Genes were classified according to their affinity: kinetoplastid (K), other eukaryotic (E), prokaryotic (P) or unknown (U).

Gene content

Table 1 shows that 106/178 coding sequences (59.6%) are homologs of known trypanosomatid genes. The percentage nucleotide identity between bodonid and trypanosomatid proteins varies greatly; genes of known conservatism display high identity (α-tubulin, 98%; β-tubulin, 99%; HSP70, 95%; GAPDH, 81%), but on average coding sequences are 44.38% identical and the most abundant identity class is 30–39%. Hence, most orthologs in these two classes have diverged by two-thirds or more. Of those coding sequences without trypanosomatid homologs, 20 show homology with other eukaryotes, 2 are of bacterial affinity, and the remainder (28.1%) are without matches to any database, i.e., Bodo-specific. Despite the bacterial contamination inevitable in DNA preparations (see methods), we can be certain that these bacterial-type coding sequences are not artefacts because they are present in fosmid inserts otherwise composed of eukaryotic sequences, and individual sequence clones span both the bacterial-type gene and surrounding eukaryotic-type sequence. Although present in B. saltans, some of the familiar genes intensively studied in trypanosomatids are found in novel contexts. Figure 1 describes tandem gene arrays of HSP70 and tubulin, which are found in locations unlike those in trypanosomatids. An alternating tandem array containing α and β-tubulin is found in two distinct inserts (Figure 1b and 1c); α and β-tubulin isoforms contain no amino acid differences but had dissimilar (unalignable) 3' untranscribed regions. Coding sequences without trypanosomatid homologs were compared to sequence and structural databases (see Table 2). Many of the gene products are homologous to proteins beyond the Kinetoplastida, suggesting that they are core eukaryotic proteins subsequently lost from trypanosomatids; for example, contiguous genes in Figure 1a homologous to a GPI-anchored protein in plants and fungi (16273–17686 bp) and a hypothetical gene in Metazoa (18275–19486 bp). Gene products in other regions (not shown) contain protein domains known elsewhere, for example an ABC transporter protein (clone '5 e 15', 21065 bp) and a nucleotide-sugar transporter protein (clone '45 a 12', 41227 bp), strongly indicating that these are Bodo-specific members of ubiquitous gene families. Some otherwise uncharacterised hypothetical proteins are predicted to expressed on the cell surface. The region shown in Figure 3a is notable not only for the base composition of coding regions and admixture of trypanosomatid and Bodo-specific genes mentioned above, but also for a hypothetical protein (16737–17978 bp) with 7 predicted transmembrane helices and a signal peptide. Table 2 contains other examples of Bodo-specific hypothetical genes predicted to be surface expressed, including those shown in Figure 1a (27014–28063 bp) and b (16869–18011 bp).
Table 2

Bodo-specific hypothetical genes, with evidence of protein domains, transmembrane domains (TMH) signal peptides (SP) and affinities to sequence databases where available.

ClonePositionSize (bp)Interpro matchTMHSPBest BLASTp matchBest taxonomic match (% amino acid identity)
14 l 173085645Oxygenase (IPR005123)Prolyl hydroxylaseMetazoa (53%)
168691143Zn-binding protein (IPR001841)1
216137356Leucine-rich repeat (IPR001611)2
16 k 02145731125Yes
162731596YesGPI-anchored proteinPlantae/Fungi (64%)
182751212Hypothetical proteinMetazoa (50%)
2701410507Yes
23 g 243911626Pleckstrin-like (IPR001849)
76552781Zinc metallopeptidase (IPR006025)
2136123428EF-hand proteinTetrahymena (47%)
45 a 094161535Methyltransferase (IPR013216)
300261134Phosphoribulokinase (IPR006083)Bacteria (50%)
45 a 12121911695Lipase (IPR008265)1
31394455Serine-Threonine protein kinase (IPR017442)Protein kinaseMetazoa (46%)
41227264Nucleotide-sugar transporter (IPR007271)3Yes
46 a 11211753Endonuclease (IPR001604)1YesEndonucleaseBacteria (53%)
11661416Thioredoxin (IPR000866)NucleoredoxinPlantae (53%)
122861092FucosyltransferasePlantae (36%)
14365765Hypothetical proteinMetazoa (55%)
5 e 1547710617Phosphatidylinositol-4-phosphate kinase (IPR003409)
149131296Lyzosomal lipase (IPR006693)1LipaseMetazoa (59%)
165102199Serine endopeptidase (IPR001254)Serine proteasePlantae (46%)
21065546ABC transporter (IPR013525)4Yes
3215326131
35286546Zinc-finger protein (IPR001841)
5 m 1855771857Sodium/hydrogen exchanger (IPR006153)14YesSodium/hydrogen exchangerMetazoa (50%)
17069321Ankyrin repeat protein (IPR002110)
93 d 0255281758Yes
75292589Chloride channel (IPR0014743)9Chloride channelMetazoa (56%)
149241788WD40 repeat (IPR001680)Hypothetical proteinPlantae (56%)
93 e 0156550514
164391995Forkhead associated protein (IPR000253)
1880242608Yes
24535855Protein tyrosine phosphatase (IPR000340)Protein phosphatasePlantae (56%)
96 g 09106272559PDZ protein binding motif (IPR001478)Yes
1673712427Yes
26182948Leucine-rich repeat kinase (IPR001611)Receptor-type protein kinasePlantae (54%)
27597483Leucine-rich repeat kinase (IPR001611)Receptor-type protein kinasePlantae (34%)
Figure 3

a. Schematic representation of a 32.9 Kb fragment of . The asterisk * denotes a physical sequence gap of uncertain length. b. Protein sequence alignment showing the repetitive region of 11 'surface antigen' proteins from L. major and their putative homolog from B. saltans, shown in panel a. (13440–16665 bp).

a. Schematic representation of a 32.9 Kb fragment of . The asterisk * denotes a physical sequence gap of uncertain length. b. Protein sequence alignment showing the repetitive region of 11 'surface antigen' proteins from L. major and their putative homolog from B. saltans, shown in panel a. (13440–16665 bp). Bodo-specific hypothetical genes, with evidence of protein domains, transmembrane domains (TMH) signal peptides (SP) and affinities to sequence databases where available.

Colinearity

The extent of conserved gene order, or colinearity, between bodonid and trypanosomatid genome sequences was assessed using the Artemis Comparison Tool (ACT, see methods). One region of excellent colinearity is shown in Figure 2 and, despite disruption by some eukaryotic genes not seen in trypanosomatids, this contig corresponds unmistakably with chromosome 18 in L. major and chromosome 10 in T. brucei. Conversely, the presence of so many non-trypanosomatid genes meant that colinearity disappears entirely in some locations, as shown in Figure 1. Across the 12 genomic regions however, both patterns were atypical; most regions shows brief patches of colinearity, perhaps 2 or 3 genes with conserved synteny, set among larger regions of Bodo-specific genes or homologs to trypanosomatid genes from elsewhere in the genome. In this sense, the sequence presented in Figure 3 is representative because several coding sequences are homologs of trypanosomatid genes on chromosomes 13 (L. major) and 11 (T. brucei); these are roughly colinear but the order is disrupted by genes present on other chromosomes or by Bodo-specific genes.

Discussion

In this study, various locations in B. saltans genome, amounting to ~0.4 Mbp, were sequenced. Assuming that the bodonid genome is approximately the same size as a trypanosomatid haploid genome, i.e., 35–55 Mbp [16], these sequences comprise ~1% of the complete genome sequence, which will therefore contain roughly 14,000 genes. The success and utility of a B. saltans genome project will depend on its relationship with existing trypanosomatid genome sequences. This study shows that coding regions of the B. saltans genome share several structural features with trypanosomatids, indicating that the project is both feasible and likely to provide a useful comparative resource. Putative B. saltans genes lack introns, as in most trypanosomatid genes [3-5]. They display a conspicuous elevation in GC content, which will greatly assist gene finding. No evidence of strand-switching was observed in B. saltans, corroborating the view that it operates polycistronic transcription [25], i.e., transcription of many contiguous loci within a single nascent transcript [26-28], which is subsequently trans-spliced and polyadenylated to produce mature mRNA, as in trypanosomatids [29-33]. Although the arrangement of coding regions along the bodonid chromosome may be conserved with trypanosomatids, it is clear that gene order was not. The extent of conserved synteny, or rather colinear gene order, between bodonid and trypanosomatid genomes is of particular importance to the assembly of any B. saltans genome sequence. The coding regions presented here indicate that trypanosomatid genome sequences will be of limited value in the global assembly of a B. saltans genome sequence. Strict colinearity was not normally observed, if only because of the large number of Bodo-specific genes interposed between trypanosomatid homologs. Colinearity tended to persist over a distance of 3–5 genes, although some regions displayed conspicuous conservation (e.g., Figure 2), while others showed none at all (e.g., Figure 1). Therefore, this initial exploration of the B. saltans genome demonstrates that it should be possible to resolve a complete genome sequence, but, while the existing trypanosomatid resources will provide some useful guides for annotation, they could not be used as scaffolds for assembly, which should proceed de novo. The purpose of a completed B. saltans genome sequence would be for understanding the evolution of trypanosomatid genome sequences. The mixture of familiar and novel features in the regions sequenced here indicates the value of a bodonid genome sequence in distinguishing trypanosomatid characters inherited from free-living ancestors (and still shared with them) from characters evolved since the origin of trypanosomatids. Hence, the first application would be in determining which parts of the trypanosomatid genome reflect the genomic legacy inherited from free-living ancestors, and show how they have been co-opted and modified for parasitism. Bodonid and trypanosomatid cells share various structural features, principally those that characterise kinetoplastid cells. Bodonids arrange their mitochondrial DNA in kinetoplasts, although their position within the cell differs from trypanosomatids [1], and conduct their glycolytic pathways within a dedicated organelle (the glycosome) [2]. Bodonids construct their flagella in a similar manner to trypanosomatids, but deploy them very differently [1]. While B. saltans uses one flagellum for movement and another for feeding, trypanosomatids flagella perform their motility function within the context of their sophisticated cell forms. One might expect these structural similarities to be reflected at the genomic level. α- and β-tubulin, the proteins that facilitate the development of flagella in trypanosomatids, are known to be arranged in tandem gene arrays, with an alternating, heterotypic α-β array in Trypanosoma spp. and distinct, monotypic α and β arrays in Leishmania spp. [34-37]. Bodonids were shown to share the alternating conformation, suggesting that Leishmania spp. and their relatives had abolished the ancestral locus and evolved novel genomic repertoires [38]. However, two B. saltans regions containing tubulin in this study show that modification of tubulin repertoire has also occurred in Trypanosoma, since neither of the α-β arrays in B. saltans was found at the genomic position occupied in trypanosomes. This demonstrates the utility of the B. saltans genome in resolving the evolutionary causes of structural or compositional differences between trypanosomatid genomes. The second application of a B. saltans genome sequence would be to identify which components of the free-living legacy have been lost from trypanosomatids, and therefore, how reductive genome evolution has contributed to the parasite genomes. Table 2 describes many predicted proteins identified in B. saltans that have no trypanosomatid homologs. Among these, mostly Bodo-specific, genes are membrane transporters, various protein kinases, and other proteins containing domains commonly associated with cell surfaces. These and other Bodo-specific proteins must include those metabolism pathways, intracellular transport, cellular signalling and subcellular structures that exist in free-living kinetoplastids, but which have been deleted during the evolution of parasitism. Many of these proteins will be widespread among eukaryotic lineages, as is evident in Table 2; yet we should also expect to encounter a considerable genetic repertoire unique to the Kinetoplastida and so entirely new. Having identified those features of trypanosomatid genomes that reflect their free-living ancestry, a B. saltans genome sequence would also reveal the additions to each parasite genome; structures derived from existing genes and co-opted for novel uses, and genuinely novel genes involved in parasite-specific adaptations. These enigmatic genes include the numerous and diverse families of surface glycoprotein that form the protective coats around trypanosomatid parasites. T. brucei, T. cruzi and L. major each display highly derived and complex surface coats to frustrate host immunity, yet they differ in structure and substance and it is not known how each acquired its distinct solution to their common problem. Understanding the origins of these surface architectures will only be achieved with an historical perspective; one principal objective of a B. saltans genome project would be to identify the precursors of proteins such as VSG in T. brucei, mucins and trans-sialidase in T. cruzi, and proteophosphoglycans in Leishmania spp. (amongst others). A glimpse of this potential is seen in Figure 3, which includes a predicted protein with a complex 24 amino acid repeat (13440–16655 bp). The protein had a high affinity (42% amino acids identical) with a gene family on chromosome 12 in Leishmania spp., (currently annotated as 'surface antigens'), and a more distant affinity with proteophosphoglycans. Figure 3b shows a sequence alignment of the repeat domain from the B. saltans protein and its leishmanial homologs, where the level of amino acid identity rises to 50%.

Conclusion

Thorough sequencing of a few locations in the B. saltans genome has revealed clear similarities with trypanosomatids, but has also shown that trypanosomatid genome sequences will not be effective guides for any complete bodonid project, due to significant differences in content and gene order. This mixture of familiar and novel features suggests that B. saltans will indeed provide an effective outgroup for comparisons of trypanosomatid parasites, and, as with the evolution of tubulin repertoire, the historical perspective to understand which aspects of trypanosomatid biology have been retained from their common ancestry, which have been lost, and what has been uniquely derived since.

Methods

Fosmid library preparation

A freshwater strain of Bodo saltans ('Lake Konstanz'; courtesy of Dr Julius Lukes, University of South Bohemia, Czech Republic), was cultured in tap water in the presence of environmental bacteria. Bodonid cells were concentrated through a gentle centrifugation step (3,000 g for 2 minutes). Genomic DNA was prepared after resuspension of the pellet using phenol-chloroform extraction. This preparation contained a residuum of bacterial DNA. Genomic DNA was sheared and blunt-end repaired before being electrophoresed on a CHEF gel, from which the 25–40 kb band region was excised. The DNA was electroluted from the gel slice and ligated into a pCC1 fosmid vector (CopyControl Fosmid Kit; Epicentre Biotechnologies). Fosmid ligations were packaged into lambda bacteriophage (Gigapack XL2 Packaging Extract; Stratagene) and used to transform XL2-Blue MRF ultracompetent cells. Positive transformants were picked from chloramphenicol plates and cultured under drug selection. The B. saltans genome library contained 9600 individual clones (approximately 300 Mb).

Clone selection and sequencing

As B. saltans cannot be grown axenically, 96 fosmid inserts were end-sequenced to examine the relative contributions of bodonid and bacterial DNA to the library. 16% of clones had end-sequences with affinity for eukaryotic coding sequences when compared to databases. Another 19% of clones had matches to bacterial sequences. Hence, although a larger proportion of end-sequences may have been genuine bodonid non-coding sequences (without representation in sequence databases), the library included a considerable, perhaps equal, component of bacterial DNA. Seven clones with positive end-sequence matches to trypanosomatid genes were sequenced in full (see below). Filters were prepared for the library by spotting bacterial culture on to a charged nylon membrane (Nytran Supercharge membrane: Schleicher and Schuell Bioscience) and lysing the cells; denaturation and fixation of the fosmid DNA produced a filter representing 8,832 genome fragments of 25–40 Kb. The filter was probed for five known B. saltans genes using radiolabelled PCR products, generated with the following primers: α-tubulin (F: AACGCSTGCTGGGAGYTGT; R: GTTGATRCCGCACTTGAAGCC; 1 kb), β-tubulin (F: AACCAGATCGGCTCNAAGTT; R: GATGTTGTTSGGGATCCACTC; 1 kb), Glyceraldehyde-6-phosphate dehydrogenase (F: CGGTCAAGGTAGGCATCAAC; R: TTGGGAAGGTTGTTCTGGAG; 800 bp), Heat shock protein 70 (F: TTCAAGAACGACCAGGTTGA; R: ACCAAGTCCGGCAACAATAG; 1.4 kb) and Rab1 (F: TTTGACAACCGBTACAAGGC; R: CCTTTGCGGACGTCTCAAAGTA; 500 bp). Clones corresponding to positive spots on the filter were cultured, and purified fosmid DNA was fully sequenced using a shotgun sequencing method to approximately 8× coverage.

Assembly and analysis

Fosmid inserts were assembled using Phrap [39] and arranged within Gap4 [40]. PCR products were generated to close residual gaps between finished contigs. Finished sequence was annotated within Artemis [41,42] and coding sequences were initially defined by eye. Whole sequences were compared to EMBL sequence databases using both BLASTn and BLASTp algorithms. Coding sequences were scrutinised for possible transmembrane helices and signal peptides using TMHMM [43] and SignalP [44] respectively. Each coding sequence was checked for known protein domains using all options within the Interproscan suite [45]. Conserved synteny was assessed by aligning B. saltans contigs with T. brucei and L. major chromosomal regions using ACT [46] and existing trypanosomatid sequences downloaded from the GeneDB website [7].

Abbreviations

ACT: Artemis comparison tool; PCR: Polymerase chain reaction; BLAST: Basic local alignment search tool; CDS: Coding sequence; IGS: Intergenic sequence; CHEF: Clamped homogeneous electric field.

Authors' contributions

APJ prepared and probed the genomic library, assembled and annotated sequence contigs, analysed the data and prepared the manuscript. MAQ oversaw DNA preparation and sequencing. MB provided funds for sequencing and editorial guidance in producing the manuscript.
  35 in total

1.  Comparative genomics of trypanosomatid parasitic protozoa.

Authors:  Najib M El-Sayed; Peter J Myler; Gaëlle Blandin; Matthew Berriman; Jonathan Crabtree; Gautam Aggarwal; Elisabet Caler; Hubert Renauld; Elizabeth A Worthey; Christiane Hertz-Fowler; Elodie Ghedin; Christopher Peacock; Daniella C Bartholomeu; Brian J Haas; Anh-Nhi Tran; Jennifer R Wortman; U Cecilia M Alsmark; Samuel Angiuoli; Atashi Anupama; Jonathan Badger; Frederic Bringaud; Eithon Cadag; Jane M Carlton; Gustavo C Cerqueira; Todd Creasy; Arthur L Delcher; Appolinaire Djikeng; T Martin Embley; Christopher Hauser; Alasdair C Ivens; Sarah K Kummerfeld; Jose B Pereira-Leal; Daniel Nilsson; Jeremy Peterson; Steven L Salzberg; Joshua Shallom; Joana C Silva; Jaideep Sundaram; Scott Westenberger; Owen White; Sara E Melville; John E Donelson; Björn Andersson; Kenneth D Stuart; Neil Hall
Journal:  Science       Date:  2005-07-15       Impact factor: 47.728

2.  ACT: the Artemis Comparison Tool.

Authors:  Tim J Carver; Kim M Rutherford; Matthew Berriman; Marie-Adele Rajandream; Barclay G Barrell; Julian Parkhill
Journal:  Bioinformatics       Date:  2005-06-23       Impact factor: 6.937

3.  Were class C iron-containing superoxide dismutases of trypanosomatid parasites initially imported into a complex plastid? A hypothesis based on analyses of their N-terminal targeting signals.

Authors:  A Bodył; P Mackiewicz
Journal:  Parasitology       Date:  2008-07-14       Impact factor: 3.234

4.  Genomic and transcriptional linkage of the genes for calmodulin, EF-hand 5 protein, and ubiquitin extension protein 52 in Trypanosoma brucei.

Authors:  S Wong; T H Morales; J E Neigel; D A Campbell
Journal:  Mol Cell Biol       Date:  1993-01       Impact factor: 4.272

5.  The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease.

Authors:  Najib M El-Sayed; Peter J Myler; Daniella C Bartholomeu; Daniel Nilsson; Gautam Aggarwal; Anh-Nhi Tran; Elodie Ghedin; Elizabeth A Worthey; Arthur L Delcher; Gaëlle Blandin; Scott J Westenberger; Elisabet Caler; Gustavo C Cerqueira; Carole Branche; Brian Haas; Atashi Anupama; Erik Arner; Lena Aslund; Philip Attipoe; Esteban Bontempi; Frédéric Bringaud; Peter Burton; Eithon Cadag; David A Campbell; Mark Carrington; Jonathan Crabtree; Hamid Darban; Jose Franco da Silveira; Pieter de Jong; Kimberly Edwards; Paul T Englund; Gholam Fazelina; Tamara Feldblyum; Marcela Ferella; Alberto Carlos Frasch; Keith Gull; David Horn; Lihua Hou; Yiting Huang; Ellen Kindlund; Michele Klingbeil; Sindy Kluge; Hean Koo; Daniela Lacerda; Mariano J Levin; Hernan Lorenzi; Tin Louie; Carlos Renato Machado; Richard McCulloch; Alan McKenna; Yumi Mizuno; Jeremy C Mottram; Siri Nelson; Stephen Ochaya; Kazutoyo Osoegawa; Grace Pai; Marilyn Parsons; Martin Pentony; Ulf Pettersson; Mihai Pop; Jose Luis Ramirez; Joel Rinta; Laura Robertson; Steven L Salzberg; Daniel O Sanchez; Amber Seyler; Reuben Sharma; Jyoti Shetty; Anjana J Simpson; Ellen Sisk; Martti T Tammi; Rick Tarleton; Santuza Teixeira; Susan Van Aken; Christy Vogt; Pauline N Ward; Bill Wickstead; Jennifer Wortman; Owen White; Claire M Fraser; Kenneth D Stuart; Björn Andersson
Journal:  Science       Date:  2005-07-15       Impact factor: 47.728

6.  The genome of the kinetoplastid parasite, Leishmania major.

Authors:  Alasdair C Ivens; Christopher S Peacock; Elizabeth A Worthey; Lee Murphy; Gautam Aggarwal; Matthew Berriman; Ellen Sisk; Marie-Adele Rajandream; Ellen Adlem; Rita Aert; Atashi Anupama; Zina Apostolou; Philip Attipoe; Nathalie Bason; Christopher Bauser; Alfred Beck; Stephen M Beverley; Gabriella Bianchettin; Katja Borzym; Gordana Bothe; Carlo V Bruschi; Matt Collins; Eithon Cadag; Laura Ciarloni; Christine Clayton; Richard M R Coulson; Ann Cronin; Angela K Cruz; Robert M Davies; Javier De Gaudenzi; Deborah E Dobson; Andreas Duesterhoeft; Gholam Fazelina; Nigel Fosker; Alberto Carlos Frasch; Audrey Fraser; Monika Fuchs; Claudia Gabel; Arlette Goble; André Goffeau; David Harris; Christiane Hertz-Fowler; Helmut Hilbert; David Horn; Yiting Huang; Sven Klages; Andrew Knights; Michael Kube; Natasha Larke; Lyudmila Litvin; Angela Lord; Tin Louie; Marco Marra; David Masuy; Keith Matthews; Shulamit Michaeli; Jeremy C Mottram; Silke Müller-Auer; Heather Munden; Siri Nelson; Halina Norbertczak; Karen Oliver; Susan O'neil; Martin Pentony; Thomas M Pohl; Claire Price; Bénédicte Purnelle; Michael A Quail; Ester Rabbinowitsch; Richard Reinhardt; Michael Rieger; Joel Rinta; Johan Robben; Laura Robertson; Jeronimo C Ruiz; Simon Rutter; David Saunders; Melanie Schäfer; Jacquie Schein; David C Schwartz; Kathy Seeger; Amber Seyler; Sarah Sharp; Heesun Shin; Dhileep Sivam; Rob Squares; Steve Squares; Valentina Tosato; Christy Vogt; Guido Volckaert; Rolf Wambutt; Tim Warren; Holger Wedler; John Woodward; Shiguo Zhou; Wolfgang Zimmermann; Deborah F Smith; Jenefer M Blackwell; Kenneth D Stuart; Bart Barrell; Peter J Myler
Journal:  Science       Date:  2005-07-15       Impact factor: 47.728

7.  The genome of the African trypanosome Trypanosoma brucei.

Authors:  Matthew Berriman; Elodie Ghedin; Christiane Hertz-Fowler; Gaëlle Blandin; Hubert Renauld; Daniella C Bartholomeu; Nicola J Lennard; Elisabet Caler; Nancy E Hamlin; Brian Haas; Ulrike Böhme; Linda Hannick; Martin A Aslett; Joshua Shallom; Lucio Marcello; Lihua Hou; Bill Wickstead; U Cecilia M Alsmark; Claire Arrowsmith; Rebecca J Atkin; Andrew J Barron; Frederic Bringaud; Karen Brooks; Mark Carrington; Inna Cherevach; Tracey-Jane Chillingworth; Carol Churcher; Louise N Clark; Craig H Corton; Ann Cronin; Rob M Davies; Jonathon Doggett; Appolinaire Djikeng; Tamara Feldblyum; Mark C Field; Audrey Fraser; Ian Goodhead; Zahra Hance; David Harper; Barbara R Harris; Heidi Hauser; Jessica Hostetler; Al Ivens; Kay Jagels; David Johnson; Justin Johnson; Kristine Jones; Arnaud X Kerhornou; Hean Koo; Natasha Larke; Scott Landfear; Christopher Larkin; Vanessa Leech; Alexandra Line; Angela Lord; Annette Macleod; Paul J Mooney; Sharon Moule; David M A Martin; Gareth W Morgan; Karen Mungall; Halina Norbertczak; Doug Ormond; Grace Pai; Chris S Peacock; Jeremy Peterson; Michael A Quail; Ester Rabbinowitsch; Marie-Adele Rajandream; Chris Reitter; Steven L Salzberg; Mandy Sanders; Seth Schobel; Sarah Sharp; Mark Simmonds; Anjana J Simpson; Luke Tallon; C Michael R Turner; Andrew Tait; Adrian R Tivey; Susan Van Aken; Danielle Walker; David Wanless; Shiliang Wang; Brian White; Owen White; Sally Whitehead; John Woodward; Jennifer Wortman; Mark D Adams; T Martin Embley; Keith Gull; Elisabetta Ullu; J David Barry; Alan H Fairlamb; Fred Opperdoes; Barclay G Barrell; John E Donelson; Neil Hall; Claire M Fraser; Sara E Melville; Najib M El-Sayed
Journal:  Science       Date:  2005-07-15       Impact factor: 47.728

8.  Early evolution within kinetoplastids (euglenozoa), and the late emergence of trypanosomatids.

Authors:  Alastair G B Simpson; Erin E Gill; Heather A Callahan; R Wayne Litaker; Andrew J Roger
Journal:  Protist       Date:  2004-12

9.  The tubulin genes of Trypanosoma cruzi.

Authors:  R Maingon; R Gerke; M Rodriguez; J Urbina; J Hoenicka; S Negri; T Aguirre; J Nehlin; T Knapp; J Crampton
Journal:  Eur J Biochem       Date:  1988-01-15

Review 10.  Control of gene expression in trypanosomes.

Authors:  L Vanhamme; E Pays
Journal:  Microbiol Rev       Date:  1995-06
View more
  18 in total

1.  Genomic characterization of Neoparamoeba pemaquidensis (Amoebozoa) and its kinetoplastid endosymbiont.

Authors:  Goro Tanifuji; Eunsoo Kim; Naoko T Onodera; Rebecca Gibeault; Marlena Dlutek; Richard J Cawthorn; Ivan Fiala; Julius Lukes; Spencer J Greenwood; John M Archibald
Journal:  Eukaryot Cell       Date:  2011-06-10

2.  Trypanosoma brucei translation initiation factor homolog EIF4E6 forms a tripartite cytosolic complex with EIF4G5 and a capping enzyme homolog.

Authors:  Eden R Freire; Amaranta M Malvezzi; Ajay A Vashisht; Joanna Zuberek; Edwin A Saada; Gerasimos Langousis; Janaína D F Nascimento; Danielle Moura; Edward Darzynkiewicz; Kent Hill; Osvaldo P de Melo Neto; James A Wohlschlegel; Nancy R Sturm; David A Campbell
Journal:  Eukaryot Cell       Date:  2014-05-16

3.  Identification of two voltage-dependent anion channel-like protein sequences conserved in Kinetoplastida.

Authors:  Nadine Flinner; Enrico Schleiff; Oliver Mirus
Journal:  Biol Lett       Date:  2012-01-04       Impact factor: 3.703

4.  The intraflagellar transport dynein complex of trypanosomes is made of a heterodimer of dynein heavy chains and of light and intermediate chains of distinct functions.

Authors:  Thierry Blisnick; Johanna Buisson; Sabrina Absalon; Alexandra Marie; Nadège Cayet; Philippe Bastin
Journal:  Mol Biol Cell       Date:  2014-07-02       Impact factor: 4.138

5.  Myosin repertoire expansion coincides with eukaryotic diversification in the Mesoproterozoic era.

Authors:  Martin Kollmar; Stefanie Mühlhausen
Journal:  BMC Evol Biol       Date:  2017-09-04       Impact factor: 3.260

6.  De Novo Transcriptome Meta-Assembly of the Mixotrophic Freshwater Microalga Euglena gracilis.

Authors:  Javier Cordoba; Emilie Perez; Mick Van Vlierberghe; Amandine R Bertrand; Valérian Lupo; Pierre Cardol; Denis Baurain
Journal:  Genes (Basel)       Date:  2021-05-29       Impact factor: 4.096

7.  RNA-seq-based metatranscriptomic and microscopic investigation reveals novel metalloproteases of Neobodo sp. as potential virulence factors for soft tunic syndrome in Halocynthia roretzi.

Authors:  Ho Bin Jang; Young Kyu Kim; Carmelo S Del Castillo; Seong Won Nho; In Seok Cha; Seong Bin Park; Mi Ae Ha; Jun-Ichi Hikima; Sung Jong Hong; Takashi Aoki; Tae Sung Jung
Journal:  PLoS One       Date:  2012-12-27       Impact factor: 3.240

Review 8.  Evolutionary cell biology of chromosome segregation: insights from trypanosomes.

Authors:  Bungo Akiyoshi; Keith Gull
Journal:  Open Biol       Date:  2013-05-01       Impact factor: 6.411

9.  Heme pathway evolution in kinetoplastid protists.

Authors:  Ugo Cenci; Daniel Moog; Bruce A Curtis; Goro Tanifuji; Laura Eme; Julius Lukeš; John M Archibald
Journal:  BMC Evol Biol       Date:  2016-05-18       Impact factor: 3.260

Review 10.  Lateral gene transfers and the origins of the eukaryote proteome: a view from microbial parasites.

Authors:  Robert P Hirt; Cecilia Alsmark; T Martin Embley
Journal:  Curr Opin Microbiol       Date:  2014-12-05       Impact factor: 7.934

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.