| Literature DB >> 20144183 |
Christophe J Noël1, Nicia Diaz, Thomas Sicheritz-Ponten, Lucie Safarikova, Jan Tachezy, Petrus Tang, Pier-Luigi Fiori, Robert P Hirt.
Abstract
BACKGROUND: Trichomonas vaginalis is the most common non-viral human sexually transmitted pathogen and importantly, contributes to facilitating the spread of HIV. Yet very little is known about its surface and secreted proteins mediating interactions with, and permitting the invasion and colonisation of, the host mucosa. Initial annotations of T. vaginalis genome identified a plethora of candidate extracellular proteins.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20144183 PMCID: PMC2843621 DOI: 10.1186/1471-2164-11-99
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Taxonomic distribution of genomes encoding BspA-like proteins
| Taxaa | Best Bit score | No. hitb | Higher taxon | Habitatc |
|---|---|---|---|---|
| 774 | 595 (908) | Parabasala | Urogenital (human) | |
| 205 | 34 (298) | Amoebozoa | Intestinal (human) | |
| 143 | 27 (124) | Amoebozoa | Intestinal (human) | |
| 251 | 2 (2) | Euryarchaeotes | Aquatic & rumen | |
| 244 | 3 (3) | Euryarchaeotes | Aquatic & rumen | |
| 126 | 1 (2) | Euryarchaeotes | Aquatic | |
| 100 | 2 (2) | Euryarchaeotes | Aquatic | |
| 249 | 7 (7) | Firmicutes | Intestine (human, HMP) | |
| 236 | 15 (15) | CFB groupd | Fish pathogen | |
| 229 | 1 (1) | Firmicutes | Intestine (human, HMP) | |
| 199 | 2 (2) | Verrucomicrobia | Soil | |
| 193 | 1 (1) | Firmicutes | Aquatic & rumen | |
| 178 | 1 (1) | Firmicutes | Intestine (human, HMP) | |
| 174 | 5 (5) | Firmicutes | Intestine, aquatic | |
| 174 | 19 (19) | Firmicutes | Intestine (fish) | |
| 167 | 3 (3) | Verrucomicrobia | Intestine (human) | |
| 155 | 3 (3) | γ-proteobacteria | Nidamental glands (squid) | |
| 153 | 12 (15) | Firmicutes | Intestine (human, HMP) | |
| 148 | 2 (4) | CFB group | Intestine (human) | |
| 146 | 5 (11) | Spirochetes | Oral cavity | |
| 146 | 6 (7) | CFB group | Intestine (human, HMP) | |
| 141 | 2 (3) | Cyanobacteria | Marine & host associated | |
| 138 | 4 (6) | Firmicutes | Intestine (human, HMP) | |
| 134 | 4 (5) | CFB group | Intestine (human, HMP) | |
| 117 | 4 (10) | Firmicutes | Intestine (human, HMP) | |
| 113 | 4 (6) | Firmicutes | Intestine (human, HMP) | |
| 112 | 2 (2) | Firmicutes | Intestine (human) | |
| 107 | 3 (6) | Firmicutes | Intestine (human, HMP) | |
| 104 | 1 (1) | γ-proteobacteria | Aquatic, host associated | |
| 103 | 1 (1) | CFB group | Aquatic, algae pathogen | |
| 99 | 4 (6) | CFB group | Intestine (human, HMP) | |
| 99 | 4 (6) | Firmicutes | Soil | |
| 2 (3) | Firmicutes | Intestine (human, HMP) | ||
| 91 | 1 (1) | Firmicutes | Soil | |
aThe order of appearance of the three Domains of life is defined by the taxon with the highest PHI-Blast bit score. The taxa of a given Domain are then listed by decreasing bit score. Strains/isolates are indicated.
bTaxa with bit score ≥ 90 in the initial PHI-Blast (see Methods section) are listed here. additional file 2, Table S2 has the complete taxa and sequences accessions list. The listed eukaryotes and Archaea were the only taxa recovered in the search. The numbers in brackets are the corresponding numbers of hits from a PSI-Blast search - two iterations with e-score ≤ 1e-5 following the aforementioned PHI-Blast search. The complete list of taxa and sequences accessions for the PSI-Blast search are in additional file 3, Tables S3 (Trichomonas and Entamoeba) and additional file 9, Table S6 (Bacteria and Archaea).
cTaxa isolated from human tissues might be present in other mammalian species as well. HMP: indicates species being sequenced in the context of the Human Microbiome Project [101].
dCFB, Cytophagales/Green sulfur bacteria group syn. Bacteroidetes/Chlorobi group.
eComplete denominations: Syntrophomonas wolfei subsp wolfei str Goettingen
Features of the 18 TvBspA proteins encoded by scaffold DS113361
| Locus taga | Alignb | Protein | LRRc | LRRc | Pa/Prc | E-valued | TMDe | Notable |
|---|---|---|---|---|---|---|---|---|
| TVAG_133050 | 110 | 837 | 15 | 796 | 1 | 2.0E-69 | Conserved | |
| TVAG_133260 | 649 | 285 | 56 | 273 | 0 | 3.0E-21 | ||
| TVAG_133330 | 650 | 666 | 23 | 656 | 0 | 7.0E-43 | ||
| TVAG_133300 | 651 | 566 | 65 | 517 | 1 | 4.0E-46 | 468-490, | |
| TVAG_133380 | 652 | 174 | 38 | 173 | 0 | 9.0E-16 | ||
| TVAG_133420 | 653 | 392 | 66 | 390 | 0 | 3.0E-29 | SP | |
| TVAG_133230 | 678 | 205 | 3 | 201 | 1 | 9.0E-26 | ||
| TVAG_133270 | 679 | 205 | 3 | 201 | 1 | 9.0E-26 | ||
| TVAG_133390 | 680 | 638 | 3 | 555 | 2 | 4.0E-43 | ||
| TVAG_133240 | 681 | 601 | 5 | 358 | 0 | 3.0E-26 | 513-535 | Conserved CT |
| TVAG_133280 | 682 | 583 | 5 | 352 | 0 | 7.0E-26 | 495-517 | Conserved CT |
| TVAG_133310 | 683 | 595 | 4 | 382 | 1 | 4.0E-30 | 505-527 | Conserved CT |
| TVAG_133340 | 684 | 448 | 3 | 210 | 1 | 3.0E-17 | 358-380 | Conserved CT |
| TVAG_133430 | 685 | 734 | 3 | 474 | 1 | 2.0E-35 | 744-666 | Conserved CT |
| TVAG_133290 | 689 | 318 | 3 | 227 | 2 | 8.0E-34 | ||
| TVAG_133410 | 690 | 562 | 1 | 426 | 1 | 8.0E-40 | ||
| TVAG_133360 | 691 | 333 | 2 | 263 | 1 | 5.0E-26 | ||
| TVAG_133220 | 692 | 363 | 51 | 362 | 0 | 2.0E-31 | ||
aThe locus tag numbers give an indication of the proximity of the different TvBspA encoding genes on the scaffold, with an incremental of 10 for each neighbouring genes (see additional file 8, Figure S3). The numbers of entries for each of the four proteins subfamilies identified by CLUSS2 (additional file 1, Table S1) are indicated.
bAlignment position in the output file generated by ClustalW. The position of the 18 TvBspA proteins in the global TvBspA alignment (additional file 1, Table S1) is indicated.
cThe position of TpLRR inferred by BlastP (see Methods section). The numbers indicate the start and end of the TpLRR. The entries positive for either the TpLRR profile or pattern (Pa/Pr), or both, are indicated by a 1 and 2 respectively.
dE-values for the PSI-Blast search (2nd iteration) that followed a PHI-Blast (see Methods section).
eEntries with inferred transmembrane domains (TMD) are indicated with the shown start and end position of the TMD. All such entries had the N-termini with the TpLRR domain inferred to be "external" by two or more TMD predictors (see Methods section).
fTVAG_133420 has an inferred signal peptide (SP). The five entries from subfamily #228 with a TMD all share a similar cytoplasmic tail - see Figure 3.
Figure 1Structural diversity of the TvBspA proteins. (A) Proteins with an inferred TMD (black boxes) are contrasted with those without them. In both case a fraction of entries are inferred to posses a SP (red boxes, brackets illustrate variability). A few TvBspA have more then one TMD but these are not depicted. In addition to the TpLRR (green boxes) some proteins also possessed other type of repeats (R) or low complexity segments (LC) of various lengths (pink box) typically located between the TpLRR and the TMD (additional file 10, Table S7). Notable structural variants are illustrated in panels B and C and variation in the length of the TpLRR in panels D and E. (B) The 12 TvBspA entries forming a subfamily that share a glycine-rich domain (blue box, GRD). Their TpLRR lengths range from 16 to 20 repeats. Four proteins have an inferred SP and one has a TMD. (C) A TvBspA entry with InterProScan hits indicating a partial peptidase S8/S53-like domain (purple box - Pfam domain PF00082) and a galactose-binding domain (red box - SUPERFAMILY domain SSF49785), both of which are typically found in extracellular proteins. One TvBspA proteins was also identified to possess ankirin-like repeats at its C-terminus end (data not shown). (D) Graph illustrating the relationship between the TvBspA TpLRR length and corresponding total length for proteins without TMD. The linear regression is shown (black line) and is contrasted with the y = x line (red dashed) and y = 1/2 x line (black dashed). (E) Corresponding graph (as shown in D) for the 193 TvBspA with TMD-CCT.
Figure 2Comparison of two TvBspA paralogues in isolate G3 and TvBspA orthologue between isolate G3 and SS-22. The two proteins TvBspA625 (TVAG_073760) and TvBspA805 (TVAG_154640) have the highest level of identity, 72%, between members of subfamily #13 (see text). (A) The TvBspA805 protein sequence with the TpLRR (green) and P/NRR (violet) aligned to compare individual repeats and their corresponding features in TvBspA625. The star indicates the first TpLRR that is part of the BlastP TvBspA625/TvBspA805 alignment (underlined segment) and that extends up to the C-termini. The inferred TMD is highlighted in yellow. (B) The corresponding TpLRR and P/NRR alignments for TvBspA625. The number of TpLRR is greater in TvBspA805, whereas the P/NRR is more extensive in TvBspA625. (C) Alignment comparing the C-terminal end of the TvBspA805 (G3 isolate: 805_G3) and TvBspA625 (G3 isolate: 625_G3, identical to sequence of isolate C-1:NIH, GenBank accession: AAM51159) proteins with the corresponding partial sequence of TvBsp625 derived from the clinical isolate SS-22 (625_SS-22, GenBank accession FJ881695). In TvBspA805 every second P/NRR (aligned in A) are underlined whereas for the TvBspA625 proteins every second P/NRR (aligned in B) are boxed. TvBspA625_SS-22 (7 P/NRR) has five fewer P/NRR compared to TvBspA625_G3 (12 P/NRR). The alignments of repeats were manually adjusted to maximise identity levels between repeats. Missing sequence data for the sequence from isolate SS-22 are indicated by question marks. The aligned sequences correspond to the segments between [----] in panels A and B. The sequences of the four peptides synthesized to generate mouse antisera and used to test antibody response in T. vaginalis infected patients are shaded in grey with differences highlighted in white (see text).
Figure 3Alignment of TvBspA with TMD and similar cytoplasmic tails with potential signal for endocytosis. (A) The 21 identified TvBspA with similar CT (sequence and length) all ending with the pattern [DE]FG (green). Most sequences possess a motif related to [DE]XXXL [LI] (blue), a dileucine-based type endocystosis and lysososmal-like sorting signal (see text). Residues replacing [DE] in known functional endocytosis signal in other species are also shown in blue (see text). (B) The 15 identified TvBspA sequences with a similar TMD (yellow) and CT. Most entries share the FDNPX[YF] (blue) an NPXY-type endocytosis signal (see text). The five entries that are part of subfamily #228 (bottom five sequences, see text and Table 2) also share a more similar TMD compared to the other 10 entries. Conserved residues in the TMD differentiating the two sets of sequences are highlighted in dark blue.
Taxonomic distribution of proteins sharing a glycine-rich domain found in 12 TvBspA.
| Taxa | Bit Scorea | E-valuea | No. of | Top hit annotation |
|---|---|---|---|---|
| 335 | 2.0E-90 | 12 (51) | Surface antigen BspA-like | |
| 84 | 8.0E-15 | 1 | Hypothetical protein | |
| 71 | 6.0E-11 | 1 | Hypothetical protein | |
| 71 | 7.0E-11 | 1 | Hypothetical protein | |
| 70 | 7.0E-10 | 2 | Hypothetical protein | |
| 70 | 2.0E-10 | 3 | Hypothetical protein | |
| 68 | 6.0E-10 | 1 | Hypothetical protein | |
| 65 | 6.0E-09 | 2 | Hypothetical protein | |
| 58 | 4.0E-07 | 1 | FG-GAP repeat protein | |
aThe values for the top BlastP hits (query: TVAG_174900 GRD, residues 1-267, E-value <1.0E-4) only are shown. The complete taxonomic report listing all hits is given in additional file 13, Table S8. The taxa are listed by decreasing bit scores. The shown values for T. vaginalis correspond to the non-self top hit, entry TVAG_191460.
bThe values give the number of hits per genome. The 12 TvBspA possessing the GRD were hit and include the query sequence. There is an additional 51 T. vaginalis proteins that possess a related GRD that do not possess TpLRR.
Figure 4Evidence for differential TvBspA gene expression from EST. (A) EST derived from different growth conditions were scanned for entries corresponding to TvBspA genes. The total numbers of distinct TvBspA EST (blue bar) or different unique TvBspA genes (orange bar) for a given condition are indicated. For each condition ~10,000 EST were compared (numbers in brackets). (B) The different libraries are characterised by little overlap in terms of EST encoding TvBspA with only two TvBspA genes shared between all five compared conditions and most found in only one library.
Microarray data for 13 TvBspA with significant modulation in their mRNA concentration upon exposure to different iron concentration
| Locus tag | Annotationa | Subfamily membershipa, b | ESTa | p-value | qRT-PCRd | |
|---|---|---|---|---|---|---|
| High iron culture condition | ||||||
| TVAG_397210 | BspA-like | #43, 10 memberse | 1 | 2.2 | 1.5 × 10-4 | 2.16 ± 0.002 |
| TVAG_441420 | BspA-like, SP, TMD | #374, singleton, divergent TpLRR | 1 | 1.55 | 8.4 × 10-3 | 2.02 ± 0.04 |
| TVAG_191490 | BspA-like, SP, PG, GRD | #168, 12 members | 1 | 1.29 | 5.5 × 10-3 | --- |
| TVAG_396970 | BspA-like | #44, 15 members* | 2 | 1.23 | 4.8 × 10-3 | --- |
| TVAG_080240 | BspA-like | #44, 15 members* | 6 | 1.18 | 8.5 × 10-3 | --- |
| Low iron culture condition | ||||||
| TVAG_093850 | BspA-like | #184, 3 members | 2 | -1.27 | 2.9 × 10-3 | --- |
| TVAG_299910 | BspA-like | #69, 3 members * | 16 | -1.25 | 6.8 × 10-3 | --- |
| TVAG_493590 | BspA-like | #384, 6 members | 4 | -1.24 | 3.7 × 10-3 | --- |
| TVAG_129450 | BspA-like, PG | #30, 15 members | 1 | -1.18 | 2.4 × 10-5 | --- |
| TVAG_530030 | BspA-like | #99, 2 members | 7 | -1.15 | 2.5 × 10-3 | -2.06 ± 0.02 |
| TVAG_341990 | BspA-like | #215, 3 members | 1 | -1.14 | 3.7 × 10-3 | --- |
| TVAG_176920 | BspA-like, TMD | #341, singleton, divergent TpLRR | 1 | -1.13 | 7.8 × 10-3 | --- |
| TVAG_235070 | BspA-like | #73, Singleton | 1 | -1.12 | 3.9 × 10-4 | --- |
| Control genes | ||||||
| TVAG_238830 | Hydrogenosomal malic enzyme subunit B | --- | --- | 1.67 | 1.5 × 10-4 | 2.51 ± 0.01 |
| TVAG_165030 | Cytosolic | --- | --- | -1.33 | 2.8 × 10-4 | -1.62 ± 0.03 |
aSee additional file 1, Table S1 for full details. SP, signal peptide; TMD, transmembrane domain; PG, pseudogene; GRD, glycine-rich domain.
bSF, subfamilies identified with CLUSS2. *Related sequences identified by visual inspection of the alignment.
cPositive values indicate up-regulated genes and negative values down-regulated genes. All values are derived from quadruplets with indicated mean value (Log scale) and significance (p-values) of modulation.
dqRT-PCR was performed in quadruplets with indicated mean and standard deviations.
Figure 5Semi-quantitative RT-PCR assessment of TvBspA expression in . Upon contact to the ECM proteins T. vaginalis cells strongly bind to the substrate and the majority of cells cannot be washed off after 60 min of incubation. RT-PCR for nine selected TvBspA-like genes and control actin (no major change) and alpha-actinin (up-regulated) genes was performed on T. vaginalis ECM proteins bound cells after 60 min of incubation (BC) and control free swiming trophozoites incubated in parallel (T). The shown results are representative of 6 to 15 independent PCR and at least five independent binding experiments/RNA extractions. TvBspA gene-specific primers were designed to produce amplicons of different sizes and avoid cross-amplification between TvBspA genes, in particular between the closely related TvBspA625 (TVAG_073760) and TvBspA805 (TVAG_154640) indicated by a star. A control PCR was also performed on genomic DNA (gDNA) confirming that the designed primers are efficient in generating the specific amplicons (size are indicated). Actin amplicons are equivalent for cDNA preparations from both trophozoite and ECM bound cells indicating similar total cDNA load whereas the cDNA specific for alpha-actinin increases upon T. vaginalis binding to substrate as described [97]. Five TvBspA genes show clear increase in the amount of amplicons for the ECM bound cells suggesting transcription up-regulation or higher stability of their mRNA in this condition. For TvBspA605 no amplicon could be detected suggesting that it is not transcribed in either tested conditions. For TvBspA805 the amount of cDNA was doubled to allow the detection of the shown signal indicating that the mRNA encoding this protein is not as abundant as for the paralogue TvBspA625.
Figure 6Cellular localization of TvBspA625 by indirect immunofluorescence analysis. T. vaginalis cells (G3 isolate) grown in vitro were processed for IFA (ethanol fixation) with the four mouse antisera raised against the four peptides designed from TvBspA625 sequence (see Figure 2) and imaging performed with a confocal microscope. Three antisera (EXT-1, CT-1 and CT-2) gave consistent signals on the cell surface of the parasites whereas EXT-2 often gave strong labelling over most of the cells structures (additional file 18, Figure S7). (A) The mouse antisera raised against a peptide derived from the TvBspA625 cytoplasmic tail (CT-2, green) gave a clear surface labelling. Co-labelling of the hydrogenosomal malic enzyme (red) with a specific rabbit antiserum led to the labelling of intracellular structures as expected [49]. Controls consisted of specific mouse pre-sera or secondary anti-mouse antisera alone (no signal in both cases, data no shown) and a mouse monoclonal antibody raised against tubulin decorated with same secondary anti-mouse antibody (panel C), clearly demonstrated the specificity of the signal attributed to the anti CT-2 peptide antisera. (B) The corresponding DIC image of the panel A. DAPI was added to the mounting media to label the nuclei (blue). The scale bar is shown. (C) Corresponding signal identified for the antisera raised against the TvBspA625 extracellular peptide (EXT-1, green) and processed as in A. (D) The corresponding DIC image of the panel B. (E) Corresponding signal identified with the mouse monoclonal VG2 raised against a T. vaginalis tubulin protein and processed with the same secondary anti-mouse antisera (green) and anti-ME rabbit antisera (red). (F) The corresponding DIC image of the panel E.
Patients antisera response to peptides derived from TvBspA625.
| 89 | 59 | 148* (88%) | 3 | 7 | 10*(16%) | |
| 15 | 6 | 21* (12%) | 33 | 18 | 51* (84%) | |
| Total | 104 | 65 | 36 | 25 | ||
| 71 | 56 | 127** (75%) | 15 | 18 | 33** (54%) | |
| HIV positive & positive ≥ 1 peptide | 62 | 53 | 115 | 0 | 4 | 4 |
| HIV positive & negative all peptides | 9 | 3 | 12 | 15 | 14 | 29 |
| 33 | 9 | 42** (25%) | 21 | 7 | 28** (46%) | |
| HIV negative & positive ≥ 1 peptide | 27 | 6 | 33 | 3 | 3 | 6 |
| HIV negative & negative all peptides | 6 | 3 | 9 | 18 | 4 | 22 |
aA total of 169 patients with sera positive for T. vaginalis total protein extracts were contrasted to 61 patients with sera negative for T. vaginalis total protein extracts. Fractions of respective samples are indicated (in %). The proportion of HIV positive sera from the same set of patients was also compared. Sets of values used for statistical test and calculation of respective % are indicated by * or **.