| Literature DB >> 18700033 |
Rosanna Young1, Jesse E Taylor, Ayako Kurioka, Marion Becker, Edward J Louis, Gloria Rudenko.
Abstract
BACKGROUND: African trypanosomes (including Trypanosoma brucei) are unicellular parasites which multiply in the mammalian bloodstream. T. brucei has about twenty telomeric bloodstream form Variant Surface Glycoprotein (VSG) expression sites (BESs), of which one is expressed at a time in a mutually exclusive fashion. BESs are polycistronic transcription units, containing a variety of families of expression site associated genes (ESAGs) in addition to the telomeric VSG. These polymorphic ESAG families are thought to play a role in parasite-host adaptation, and it has been proposed that ESAG diversity might be related to host range. Analysis of the genetic diversity of these telomeric gene families has been confounded by the underrepresentation of telomeric sequences in standard libraries. We have previously developed a method to selectively isolate sets of trypanosome BES containing telomeres using Transformation associated recombination (TAR) cloning in yeast.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18700033 PMCID: PMC2533676 DOI: 10.1186/1471-2164-9-385
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Isolation of trypanosome BESs in yeast using Transformation Associated Recombination (TAR) cloning as described in[]. A schematic of a typical BES is shown above with the promoter indicated with a flag, and different expression site associated genes (ESAGs) indicated with coloured boxes. Characteristic repeat arrays either upstream of the BES promoter (50 bp repeats) or adjacent to the telomeric VSG gene (70 bp repeats) are indicated with vertically hatched boxes. Trypanosome telomere repeats are indicated with white arrows. The linearised pEB4 TAR vector is shown below with yeast telomere repeats (black triangles) as well as a positive (+) and a negative (-) selectable marker flanking a BES promoter containing fragment. Recombination between the vector and genomic DNA on the BES promoter fragment results in a yeast artificial chromosome (YAC) which can be stably maintained.
T. brucei gambiense DAL 972 BES TAR clone library.
| 1 | 1 | 1 | 18 | A41 | 1 | 1 | 1 | 1 | |
| 2 | 2a | - | 1 | 2.2–37 | 2 | 2 | -b | - | |
| 2 | 3 | 1 | 27 | A120 | 2 | 2 | -b | - | |
| 3 | 4 | 1 | 15 | D18 | 3 | 3 | 2c | 1 | |
| 4 | 5 | 1 | 17 | B49 | 4 | 3 | 3 | 2 | |
| 5 | 5 | 1 | 35 | D30 | 4 | 3 | 4 | 3 | |
| 6 | 6 | 2 | 22 | C10 | 4 | 3 | 5 | 4 | |
| 7 | 7 | 3 | 8 | D102 | 5 | 4 | 6 | 5 | |
| 8 | 8 | 4 | 3 | 2.1–9 | 4 | 3 | 7 | 6 | |
| 8 | 9 | 5 | 24 | B46 | 6 | 5 | 8 | 6 | |
| 9 | 10 | 6 | 16 | C49 | 4 | 3 | 9 | 7 | |
| 10 | 11 | 7 | 17 | 98 | 7 | 6 | 10 | 8 | |
| 11 | 11 | 7 | 1 | A121 | 7 | 6 | 10 | 8 | |
Overview of T. b. gambiense DAL 972 TAR clone library. A total of 204 T. b. gambiense TAR clones were isolated in yeast and typed into 13 different BES sets after sequencing over the promoter and ESAG6 regions. The different BES promoter and ESAG6 sequence types (either at the DNA or protein level) are indicated with numbers. Two TAR clones were chosen from each BES type (the name of one representative clone is indicated in the chart), and the full length ESAG6, ESAG5 and ESAG2 ORFs were isolated and sequenced from both of these two clones. The sequence types of the ESAG5 and ESAG2 are indicated.
aPartial sequence.
bPresence not detected by PCR. See Materials and Methods for diagnostic primers that were used for each ESAG.
cApproximately 300 bp is single stranded sequence.
T. brucei brucei EATRO 2340 BES TAR clone library.
| 1 | 1 | 1 | 11 | F44 | 1 | 1 | 1 | 1 | |
| 2 | 2 | 2 | 9 | E42 | 2 | 2 | 2 | 2 | |
| 3 | 3 | 3 | 9 | G37 | 3 | 3 | 3 | 3 | |
| 4 | 4a | 4a | 3 | E16 | 4 | 4 | -b | - | |
| 5 | 5 | 5 | 19 | G39 | 5c | - | -b | - | |
| 5 | 6 | 6 | 5 | K13 | -d | - | -b | - | |
| 6 | 7 | 7 | 1 | V6 | 6 | 5 | 4 | 4 | |
| 7 | 8 | 8 | 8 | E51 | 7 | 6 | 5 | 5 | |
| 8 | 9 | 9 | 7 | S32 | 8 | 7 | 6e | 6e | |
| 9 | 10 | 10 | 4 | U19 | 9 | 8 | 7 | 7 | |
| 9 | 11 and 12f | 11 and 12f | 1 | S43 | 10 | 9 | 8 | 8 | |
| 10 | 13 | 13 | 16 | E23 | 11 | 10 | 9 | 9 | |
| 11 | 14 | 14 | 8 | J12 | 12 | 11 | 10 | 10 | |
| 11 | 15 | 15 | 9 | F36 | 13 | 12 | 11 | 11 | |
| 12 | 16 | 16 | 17 | E12 | 14 | 13 | 12 | 12 | |
| 13 | 17 | 17 | 2 | R23 | 10 | 9 | 8 | 8 | |
| 13 | 11 | 11 | 2 | E40 | 10 | 9 | 8 | 8 | |
| 13 | 18 | 18 | 4 | J4 | 10 | 9 | 13e | 13e | |
| 14 | 19 | 19 | 13 | E19 | 15 | 14 | 14 | 14 | |
| 15 | 20 | 20 | 12 | F35 | 16 | 15 | 15 | 15 | |
| 16 | 21 | 21 | 16 | E48 | 17 | 16 | 16 | 16 | |
| 17 | 22 | 22 | 31 | F27 | 18 | 17 | 17 | 17 | |
| 18 | 11 | 11 | 1 | E44 | -b | - | -b | - | |
Overview of T. b. brucei EATRO 2340 TAR clone library. A total of 208 TAR clones were isolated and typed into 23 different BES sets. The sequence typing proceeded further as described for Table 1. As before, two TAR clones were chosen for each BES type (the name of one clone is indicated in the chart), and the full length ESAG6, ESAG5 and ESAG2 ORFs were isolated and sequenced from both of these two clones.
aFirst 1000 bp only.
bPresence not detected by PCR. See Materials and Methods for diagnostic primers that were used for each ESAG.
cPseudogene.
dPresent but not sequenced.
eOne clone.
fTwo ESAG6 in one BES, both assembled.
T. equiperdum STIB 818 BES TAR clone library.
| 1 | 1a | - | 9 | 10 | 1b | - | -c | - | |
| 2 | 2 | 1 | 1 | 3 | -c | - | -c | - | |
| 3 | 3d | - | 11 | D27 | 2 | 1 | 1 | 1 | |
| 4 | 4d | - | 2 | 25 | 3 | 2 | -c | - | |
| 5 | 5b | - | 2 | 17 | -c | - | 2 | 2 | |
| 6 | 6 | 2 | 8 | 14 | 4 | 3 | 3 | 3 | |
| 6 | 7e | - | 7 | D44 | 5 | 4 | 4 | 4 | |
| 6 | 8b | - | 3 | 16 | 6 | 5 | 2 | 2 | |
| 7 | 9 | 3 | 7 | 7 | 7 | 6 | 1 | 1 | |
| 8 | 9 | 3 | 9 | 1 | 8 | 7 | 1 | 1 | |
| 9 | 10 | 4 | 10 | 53 | 9b | - | -c | - | |
| 10 | 11 | 5 | 8 | C23 | 9b | - | 2 | 2 | |
| 10 | 12 | 6 | 1 | K13 | 9b | - | 2 | 2 | |
| 11 | 13 | 7 | 4 | G47 | 6 | 5 | -c | - | |
| 11 | -f | 8 | E6 | 10 | 8 | 2 | 2 | ||
| 12g | -f | - | 1 | J46 | 11 | 9 | -c | - | |
Overview of T. equiperdum STIB 818 TAR clone library. A total of 91 TAR clones were isolated and typed into 16 different BES sets. The sequence typing was performed as described for Table 1. As done previously, two TAR clones were chosen for each BES type (the name of one clone is indicated in the chart), and the full length ESAG6, ESAG5 and ESAG2 ORFs were isolated and sequenced from both of these two clones.
aDouble sequence in the centre indicating two sequence types.
bPseudogene.
cPresence not detected by PCR. See Materials and Methods for diagnostic primers that were used for each ESAG.
dPartial sequence.
eDouble sequence indicating two sequence types within the same BES.
f Sequence failed.
gAnti-sense sequence strand only.
hProbably an underestimate.
BES nucleotide diversity.
| T. b. g. | 13 | 621 | 5.6 | 0.015 | 1 | 311 | |
| T. b. b. | 23 | 24.3 | 0.058 | 2 | 207 | ||
| T. eq. | 15 | 13.8 | 0.046 | 2 | 207 | ||
| T. b. g. | 12 | 1197 | 6.0 | 0.012 | 1 | 599 | |
| T. b. b. | 23 | 17.3 | 0.051 | 5 | 200 | ||
| T. eq. | 10 | 8.4 | 0.030 | 3 | 299 | ||
| T. b. g. | 13 | 1429 | 1.5 | 0.002 | 0 | 1429 | |
| T. b. b. | 21 | 20.0 | 0.055 | 7 | 179 | ||
| T. eq. | 14 | 14.6 | 0.061 | 7 | 179 | ||
| T. b. g. | 11 | 1291 | 15.3 | 0.047 | 7 | 161 | |
| T. b. b. | 19 | 28.7 | 0.101 | 9 | 129 | ||
| T. eq. | 10 | 16.9 | 0.044 | 9 | 129 |
Nucleotide diversity and recombination in BES sequences from three trypanosomatid protozoa. The BES loci studied are indicated on the left. The strains are Trypanosoma brucei gambiense DAL 972 (T.b.g.), T. b. brucei EATRO 2340 (T.b.b.) and T. equiperdum STIB 818 (T. eq.). N° seqs. = number of nucleotide sequences in alignment; L = number of nucleotides in gap-stripped alignment; %S = percentage of polymorphic sites; π = average pairwise nucleotide diversity; N° rec. break points and tract length = number of recombination breakpoints and mean tract length inferred by GARD analysis.
BES amino acid diversity.
| T. b. g. | 12 | 401 | 9.2 | 0.018 | 15 | 3.40 | |
| T. b. b. | 23 | 24.7 | 0.079 | 24 | 3.20 | ||
| T. eq. | 8 | 9.2 | 0.040 | 38 | 2.15 | ||
| T. b. g. | 13 | 469 | 2.8 | 0.005 | 0 | n.a. | |
| T. b. b. | 19 | 27.5 | 0.087 | 48 | 3.75 | ||
| T. eq. | 10 | 19.4 | 0.087 | 49 | 2.29 | ||
| T. b. g. | 11 | 424 | 19.1 | 0.059 | 15 | 3.38 | |
| T. b. b. | 19 | 37.0 | 0.130 | 11 | 3.98 | ||
| T. eq. | 10 | 19.6 | 0.053 | 11 | 4.39 |
Amino acid diversity and selection in ESAG sequences from three trypanosomatid protozoa. The BES loci studied are indicated on the left. The strains are Trypanosoma brucei gambiense DAL 972 (T.b.g.), T. b. brucei EATRO 2340 (T.b.r.) and T. equiperdum STIB 818 (T. eq.). N° seqs. = number of protein sequences in alignment; codons = number of codons in gap-stripped alignment; % S = percentage of polymorphic sites; π = average pairwise amino acid diversity; dN/dS ratio ω > 2 = number of amino acid residues with dN/dS ratio (ω) estimated to be greater than 2 indicating evidence for positive selection; Average dN/dS ω (avg.) = average ω estimate over residues with ω > 2.
Figure 2Sequence alignment over a particularly polymorphic region of ESAG6 shows that The ESAG6 protein sequence types as listed in Tables 1-3 are indicated on the left with amino acid residue position above. The ESAG6 hypervariable region (HV) as described in [38,40] as well as boxes I and II of the ESAG6 transferrin binding site (Tf binding) as described in [39] are indicated with red boxes. Residues that are the most dissimilar to the consensus within a given trypanosome subspecies are highlighted in white. Residues that are completely conserved within a given subspecies are indicated in dark blue, while residues with intermediate degrees of sequence conservation are highlighted with intermediate shades of blue.
Figure 3Maximum likelihood phylogenetic trees show that BES promoter and The DNA sequence types are as listed in Tables 1-3. Sequence accession numbers are listed in the Materials and Methods. Alignments of the BES promoter sequences used are shown in the supplementary material (Additional file 2). Bootstrap support values are shown for key nodes only.
Figure 4Maximum likelihood phylogenetic trees of . Most of the T. b. brucei ESAG2 sequences also appear to cluster separately. The ESAG DNA sequence types are as listed in Tables 1-3. Sequence accession numbers are listed in the Materials and Methods. Bootstrap support values are shown for key nodes only. Type A and Type B sequences referred to in the text are indicated with brackets.