Literature DB >> 30211125

Comparative Evolution of Sand Fly Salivary Protein Families and Implications for Biomarkers of Vector Exposure and Salivary Vaccine Candidates.

Iliano V Coutinho-Abreu1, Jesus G Valenzuela1.   

Abstract

Sand fly salivary proteins that produce a specific antibody response in humans and animal reservoirs have been shown to be promising biomarkers of sand fly exposure. Furthermore, immunity to sand fly salivary proteins were shown to protect rodents and non-human primates against Leishmania infection. We are missing critical information regarding the divergence amongst sand fly salivary proteins from different sand fly vectors, a knowledge that will support the search of broad or specific salivary biomarkers of vector exposure and those for vaccines components against leishmaniasis. Here, we compare the molecular evolution of the salivary protein families in New World and Old World sand flies from 14 different sand fly vectors. We found that the protein families unique to OW sand flies are more conserved than those unique to NW sand flies regarding both sequence polymorphisms and copy number variation. In addition, the protein families unique to OW sand flies do not display as many conserved cysteine residues as the one unique to the NW group (28.5% in OW vs. 62.5% in NW). Moreover, the expression of specific protein families is restricted to the salivary glands of unique sand fly taxon. For instance, the ParSP15 family is unique to the Larroussius subgenus whereas phospholipase A2 is only expressed in member of Larroussius and Adlerius subgenera. The SP2.5-like family is only expressed in members of the Phlebotomus and Paraphlebotomus subgenera. The sequences shared between OW and NW sand flies have diverged at similar rates (38.7 and 45.3% amino acid divergence, respectively), yet differences in gene copy number were evident across protein families and sand fly species. Overall, this comparative analysis sheds light on the different modes of sand fly salivary protein family divergence. Also, it informs which protein families are unique and conserved within taxon for the choice of taxon-specific biomarkers of vector exposure, as well as those families more conserved across taxa to be used as pan-specific vaccines for leishmaniasis.

Entities:  

Keywords:  Leishmania; Leishmaniasis; markers of exposure; molecular evolution; salivary proteins; sand flies; vaccines

Mesh:

Substances:

Year:  2018        PMID: 30211125      PMCID: PMC6123390          DOI: 10.3389/fcimb.2018.00290

Source DB:  PubMed          Journal:  Front Cell Infect Microbiol        ISSN: 2235-2988            Impact factor:   5.293


Introduction

Vector borne diseases represent almost half of the neglected tropical infectious diseases. When attempting to get a blood meal, most vectors of disease deliver the pathogen in the host skin and together with the pathogen these arthropods deliver saliva (Coutinho-Abreu et al., 2015; de Castro et al., 2017). Blood sucking arthropods secrete a plethora of bioactive compounds in their saliva to counteract the mammalian host hemostatic system in order to get a successful blood meal. Salivary anti-hemostatic components such as anticoagulants (Chagas et al., 2014), vasodilators (Ribeiro et al., 1989; Lerner et al., 1991; Champagne and Ribeiro, 1994), and inhibitors of platelet aggregation (Calvo et al., 2007, 2011; Assumpcao et al., 2013) have been described and some of these proteins characterized at the molecular level (Coutinho-Abreu et al., 2015). In addition, the biological activity of some arthropod salivary proteins was shown to promote the establishment of pathogens in the mammalian host (Coutinho-Abreu et al., 2015; de Castro et al., 2017). Furthermore, immunity to specific sand fly salivary proteins was shown to protect rodents and non-human primates against leishmaniasis (Kamhawi et al., 2000; Gomes et al., 2008; Collin et al., 2009; Oliveira et al., 2015). It is well established that humans and animal reservoirs make antibodies to proteins in the saliva of insects including those present in sand flies. These findings have prompted research groups to explore the use of sand fly salivary proteins as markers of sand fly exposure for humans and animal reservoirs (Teixeira et al., 2010; Drahota et al., 2014; Marzouki et al., 2015; Sima et al., 2016; Kostalova et al., 2017). A recombinant sand fly salivary protein of 43 kDa (rSP03B), from the sand fly P. pernicious, belonging to the yellow family of proteins was shown to be a marker of sand fly exposure in dogs living in Southern and Central Italy and in Portugal (Drahota et al., 2014; Kostalova et al., 2017). Interestingly, the yellow related protein from P. orientalis (rPorSP24) was demonstrated to be a good marker of sand fly exposure in domestic animals including dogs from a L. donovani foci in Ethiopia (Sima et al., 2016). For humans living in visceral leishmaniasis disease endemic areas in Brazil, the combination of salivary proteins LJM11 and LJM17 (yellow related proteins) was demonstrated to be the best biomarker of Lutzomyia longipalpis exposure in humans (Teixeira et al., 2010), while Linb13, a protein of 30 kDa belonging to the antigen-5 family of proteins, was shown to be the best biomarker of Lutzomyia intermedia exposure in humans living in a cutaneous leishmaniasis endemic area (Carvalho et al., 2017). The salivary protein PpSP32 from P. papatasi was demonstrated to be the marker of P. papatasi exposure for humans living in cutaneous leishmaniasis endemic areas in Tunisia (Marzouki et al., 2015) and in Saudi Arabia (Mondragon-Shem et al., 2015). Based on their continental separation, sand flies are classified as belonging to New World (NW) and Old World (OW) groups. Thus far, over a dozen sand fly salivary gland transcriptomes have been obtained. From the OW group, 10 species have had their salivary transcriptomes decoded. These species belong to five subgenera of the genus Phlebotomus, including the subgenera Phlebotomus, Paraphlebotomus, Larroussius, Adlerius, and Euphlebotomus (Valenzuela et al., 2001; Anderson et al., 2006; Kato et al., 2006; Oliveira et al., 2006; Hostomská et al., 2009; Abdeladhim et al., 2012; Rohousova et al., 2012; Martín-Martín et al., 2013; Vlkova et al., 2014). For NW sand flies, salivary gland transcriptomes were sequenced from four species. Within the genus Lutzomyia, the salivary transcriptomes of Lu. longipalpis (subgenus Lutzomyia) and Lutzomyia ayacuchensis (subgenus Helcocyrtomyia), as well as two other transcriptomes from sand flies in the genera Nyssomyia (Nyssomyia intermedia) and Bichromomyia (Bichromomyia olmeca) have been obtained (Valenzuela et al., 2004; de Moura et al., 2013; Kato et al., 2013; Abdeladhim et al., 2016). The use of sand fly salivary proteins as markers of vector exposure represents therefore a practical application that can be implemented in epidemiological studies as well as for vector control programs. Therefore, having a well-defined catalog of sandfly salivary proteins as well as a better understanding of the evolutionary relationship of salivary proteins from different sand fly vectors will allow us to make more precise selection for these appealing biomarkers. In the current study, we focused on the evolutionary analysis of protein families unique to Old World (OW) sand flies comparing that with the protein families shared between OW and New World (NW) sand flies as well as those unique to NW sand flies. This comparative analysis also unveiled that the some salivary protein families unique to OW sand flies emerged and diversified in different manners than their counterparts unique to NW sand flies. These findings can inform which proteins are the best candidates to be used as a pan-specific or species-specific biomarkers of sand fly exposure or as well as a pan-specific vaccine against leishmaniasis.

Materials and methods

Sequences

Nucleotide and amino acid sequences were retrieved from the NCBI databases from sand fly salivary gland transcriptomes (Valenzuela et al., 2001, 2004; Anderson et al., 2006; Oliveira et al., 2006; Hostomská et al., 2009; Abdeladhim et al., 2012, 2016; Rohousova et al., 2012; de Moura et al., 2013; Kato et al., 2013; Martín-Martín et al., 2013; Vlkova et al., 2014). Signal peptides were removed from the protein sequences whereas sequences encoding signal peptides and stop codons were removed from the nucleotide sequences for further analyses. Only sequences displaying more than 5% divergence at the amino acid level were assumed to be encoded by true paralog genes and included in the analyses, rather than being alleles of the same gene and otherwise discarded. Sand fly groups, species, and sequence accession numbers are provided in Supplementary Table 1.

Sequence alignment

Multiple sequence alignments of putative peptides were carried out using Clustal Omega built in the MacVector software 15.8 (Olson, 1994) and in MEGA7 (Kumar et al., 2016). For the construction of phylogenetic trees, the gap penalties were not taken into account in the multiple sequence alignments.

DNA polymorphism, protein divergence, and phylogenetic analysis

The evolutionary analyses were performed in the DnaSP 5.10 software (Librado and Rozas, 2009). The parameter ω refers to the rate of non-synonymous nucleotide polymorphisms (Ka) over the synonymous rate of nucleotide polymorphisms (Ks) (Nei, 1987). Slide window analyses of ω along the nucleotide sequences encoding such proteins were also obtained. The diversity of the protein family sequences refers to the p-distance [proportion (p) of amino acid sites at which the two sequences to be compared are different (Nei, 2000)] obtained in the MEGA7 software (Kumar et al., 2016). The evolutionary histories of salivary protein families were inferred by using the Maximum Likelihood method and conducted in MEGA7 (Kumar et al., 2016). The amino acid substitution model was selected based on the best fit provided by the Model Selection tool built in the MEGA 7 software. The bootstrap consensus trees inferred from 1,000 replicates (Felsenstein, 1985) were taken to represent the evolutionary history of the taxa analyzed (Felsenstein, 1985). Branches corresponding to partitions reproduced in <50% bootstrap replicates are collapsed. Initial tree(s) for the heuristic search were obtained by applying the Neighbor-Joining method to a matrix of pairwise distances estimated using a JTT model (Felsenstein, 1985).

Statistical analysis

D'Agostino and Pearson normality test was performed to assess whether or not data follow a normal distribution, and the Mann–Whitney test was carried out in order to test the significance of the differences in protein divergence and omega (ω) values. Both tests were performed using the Prism 7 software (GraphPad).

Results

Updated sand fly salivary protein catalog

From 14 sand fly salivary gland transcriptomes we compiled the sand fly salivary proteins unique to either NW or OW sand flies as well as protein families common to all sand fly species and developed an updated catalog for all the analyzed sand fly salivary proteins (Table 1). With this extensive catalog of sand fly salivary proteins we identified 12 protein families shared between OW and NW sand flies, lufaxin, apyrase, yellow-related protein, silk-related protein or PpSP32, endonuclease, hyaluronidase, adenosine deaminase, small odorant binding-like proteins, D7 family of proteins, antigen-5 related protein, ParSP17 (39 kDa protein of unkown function), ParSP80 (16 kDa protein of unkown function) (Figure 1 and Supplementary Figures 4–15), seven protein families unique to OW sand flies, pyrophosphatase, phospholipase A2, ParSp15 (5 kDa protein of unknown function), ParSP25 (32 kDa protein of unknown function), SP16 (14.3 kDa protein of unknown function), ParSP23 (2 kDa protein of unknown function), and SP2.5 (2.5 kDa protein of unknown function, Figures 7–10 and Supplementary Figures 1–3). Two protein families unique to OW sand flies that are only shared by two sand fly species (ParSP23-like and ParSP2.5-like; Supplementary Figures 2,3). We also identified 12 protein families unique to NW sand flies: toxin, RGD, c-type lectin, 14 kDa protein family, ml domain, 5′ nucleotidase, 9 kDa protein family, salo, 11.5 kDa protein family, maxadilan, 71 kDa protein family, and 5 kDa protein family.
Table 1

Sand fly salivary protein families across multiple sand fly species.

Family of proteinsB. olmecaL. longipalpisLu. ayacuyensisN. intermediaP. papatasiP. duboscqiP. sergentiP. ariasiP. orientalisP. tobbiP. perniciosusP. kandelakkiP. arabicusP. argentipes
PROTEINS UNIQUE TO NEW WORLD SAND FLIES
ToxinXX
RGD-containingXXXX
C-type lectinXXXX
MaxadilanXX
14kDa SPXXX
ML domainXX
5′ NucleotidaseX
9 kDa SPXX
SALOXXX
11.5 kDa SPXXX
71 kDa proteinsXXX
5 kDa proteinXX
PROTEINS UNIQUE TO OLD WORLD SAND FLIES
PyrophosphataseXXXXXX
Phospholipase A2XXXXX
SP16XXXXX
ParSP23-XX
SP2.5XX
ParSP25XXXXXXX
ParSP15XXXXX
PROTEINS SHARED BETWEEN NEW WORLD AND OLD WORLD SAND FLIES
OBPsXXXXXXXXXXXXXX
Yellow-relatedXXXXXXXXXXXXXX
Antigen 5XXXXXXXXXXXXXX
LufaxinXXXXXXXXXXXXXX
D7-relatedXXXXXXXXXXXXXX
ApyraseXXXXXXXXXXXXXX
Silk relatedXXXXXXXXXXXXXX
HyaluronidaseXXXXXXXX
EndonucleaseXXXXXXXXXX
Adenosine deaminaseXXXX
ParSP17XXXXXXXXX
ParSP80XXXX

X, protein expressed in the saliva.

–, not expressed or not detected.

Figure 1

Phylogenetic analysis of ParSP17 protein family. The evolutionary history was inferred by using the Maximum Likelihood method based on the Le Gascuel matrix-based model (Gomes et al., 2008). Sand fly species are indicated by the different symbols in the legend on the right. Tree branches were color-coded so as to represent specific taxon: Green color represents the Larroussius and Adlerius subgenera; Blue color points to proteins of the Phlebotomus and Paraphlebotomus subgenera; and Black color indicates the proteins belonging to New World sand flies. Although the PduK84 sequence is illustrated here, it was not included in further analyses because it is truncated.

Sand fly salivary protein families across multiple sand fly species. X, protein expressed in the saliva. –, not expressed or not detected. Phylogenetic analysis of ParSP17 protein family. The evolutionary history was inferred by using the Maximum Likelihood method based on the Le Gascuel matrix-based model (Gomes et al., 2008). Sand fly species are indicated by the different symbols in the legend on the right. Tree branches were color-coded so as to represent specific taxon: Green color represents the Larroussius and Adlerius subgenera; Blue color points to proteins of the Phlebotomus and Paraphlebotomus subgenera; and Black color indicates the proteins belonging to New World sand flies. Although the PduK84 sequence is illustrated here, it was not included in further analyses because it is truncated.

Salivary proteins shared between old world and new world sand flies

The phylogenetic trees of 9 out of 12 salivary protein-encoding gene families shared between OW and NW sand flies (Supplementary Figures 4–15) correlated well with the sand fly species phylogeny, constructed based on ITS-2 sequences (Aransay et al., 2000). The resulting phylogenetic tree indicates that these shared gene families of sand fly salivary proteins have been evolving under purifying (negative) selection or are selective neutral (ω < 1; Barton and Etheridge, 2004). Such salivary protein families encompassed lufaxin, apyrase, yellow-related protein, antigen-5, endonuclease, hyaluronidase, ParSP80, odorant binding proteins (OBPs), and adenosine deaminase. On the other hand, Silk-related protein or PpSP32, the D7 family of proteins (Supplementary Figures 6,12; Abdeladhim et al., 2016) and the ParSP17-like family of proteins (Figure 1) displayed phylogenies diverging from the sand fly species phylogeny, pointing that strong selective forces related to the function and/or immunogenicity of such proteins have driven their evolution since the split of NW and OW sand flies that likely took place about 100MYs with the continental separation of the American continent from the European and African ones.

Molecular evolution between OW and NW sand fly salivary proteins

The overall rates of molecular evolution of the salivary protein families unique to OW sand flies were similar to the families shared with NW sand flies (Figures 2A,B). The median protein divergences (% divergence) for the families unique to OW sand flies and shared with the NW ones were 41.4 and 35.5%, respectively (Figure 2A). Likewise, the median ratios of non-synonymous over synonymous replacements (ω) for the families unique to OW sand flies and shared with the NW ones were 0.34 and 0.23, respectively (Figure 2B). This is in sharp contrast to the pattern of divergence observed in the protein families unique to NW sand flies when compared with their counterparts shared with Old World sand flies (Figures 2C,D), as observed elsewhere (Abdeladhim et al., 2016). In the protein families unique to NW sand flies, the median protein divergence was 54.7% whereas 39.3% divergence was noticed for the protein families shared with OW sand flies (Figure 2C, p < 0.0008). By the same token, the median ω value was significantly greater in the protein families unique to NW sand flies (median ω = 0.49) as compared to those shared with OW sand flies (median ω = 0.2, p < 0.0008; Figure 2D).
Figure 2

Average molecular divergence of unique and shared Old World (A) and New World (C) salivary proteins (top), and rate of non-synonymous over synonymous replacements (omega – ω) of unique and shared Old World (B) and New World (D) salivary proteins. *p < 0.05.

Average molecular divergence of unique and shared Old World (A) and New World (C) salivary proteins (top), and rate of non-synonymous over synonymous replacements (omega – ω) of unique and shared Old World (B) and New World (D) salivary proteins. *p < 0.05.

Molecular evolution within OW and NW sand flies

The molecular evolution of the protein families was also taken into account individually, in regard to the analyses of % divergence and the ratio of non-synonymous over synonymous replacements (ω; Figure 3). Among the protein families unique to OW sand flies, phospholipase A2, pyrophosphatase, and ParSP25-like displayed the lowest rates of diversification (divergence < 30%; ω < 0.35; Figures 3A,B) whereas ParSP2.5, ParSP23, and SP16 were among the most divergent families (divergence > 50%; ω > 0.44; Figures 3A,B). Within the OW sand fly protein families shared with NW sand flies, ParSP80, and hyaluronidase families showed consistently the lowest rates of sequence divergence and non-synonymous to synonymous codon replacement (divergence < 12%; ω < 0.12; Figures 3A,B). In contrast, the OBPs, silk-related, D7, and ParSP17 families exhibited the greatest rates of diversification (divergence > 39%; ω > 0.26; Figures 3A,B). Regarding the protein families unique to NW sand flies, the SALO, maxadilan, and ML domain families presented the highest rates of molecular evolution (divergence > 60%; ω > 0.62; Figures 3C,D; Abdeladhim et al., 2016) whereas the 5 kDa protein family presented itself as the least divergent protein family (divergence = 36%; ω = 0.26; Figures 3C,D; not previously assessed). Amongst the protein families of NW sand flies shared with OW species, the D7 family presented the highest levels of diversification (divergence = 43%; Figure 3C) whereas the antigen-5 protein family exhibited the lowest divergence at the amino acid level (divergence = 19%; Figure 3C). Regarding the ω values, the silk family displayed the highest ratio of non-synonymous over synonymous replacements (ω = 0.30; Figure 3D) whereas the lowest ratio was exhibited by the apyrase family. Regarding the OBP family, it is actually a superfamily, encompassing at least six protein families (Supplementary Figure 15). Hence, the high rates of divergence for the OBPs should be view with caution, as noted below.
Figure 3

Molecular divergence across salivary protein in Old World (A) and New World (C) sand flies. Rate of non-synonymous over synonymous replacements (omega – ω) across salivary proteins in Old World (B) and New World (D) sand flies.

Molecular divergence across salivary protein in Old World (A) and New World (C) sand flies. Rate of non-synonymous over synonymous replacements (omega – ω) across salivary proteins in Old World (B) and New World (D) sand flies.

Molecular diversification along protein lengths

As specific regions of a protein can be subjected to different selective constraints and in turn evolve at different rates, slide-window analyses of non-synonymous over synonymous replacements (ω) for the genes encoding the salivary protein unique to OW sand flies and shared with NW sand flies were carried out (Figures 4, 5). Contrasting to the genes encoding the protein families unique to NW sand flies, which display multiple codons under positive (diversifying) or relaxed purifying selection (ω ≥ 1; Abdeladhim et al., 2016), the gene families unique to Old World sand flies exhibited for the most part codons under purifying (negative) selection (ω < 1; Figure 4). In the latter group, only ParSP15 and phospholipase A2 bore a few codons under positive selection (ω ≥ 1; Figure 4), mostly in the 5′ portion of the genes. The gene families shared with NW sand flies also displayed low levels of sequence divergence (Figure 3), with exception of the ParSP17 family which bear multiple codons under positive selection (Figure 5). In such (ParSP17; Figure 5A), a greater number of codons under positive selection were noticed in the gene sequences for the sand flies belonging to the related Phlebotomus and Paraphlebotomus subgenera (P/P; Figure 5B) than for the sand flies belonging to related Larrossious, Adhelerius, and Euphlebotomus subgenera (L/A/E; Figure 5C).
Figure 4

Slide window analyses of omega (ω) for the salivary protein families unique to Old World sand flies. Dotted lines point to the threshold for positive selection (ω ≥ 1). (A) ParSP15. (B) Phospholipase A2. (C) SP16. (D) ParSP23. (E) ParSP25. (F) ParSP2.5. (G) Pyrophosphatase.

Figure 5

Slide window analyses of omega (ω) for the salivary protein ParSP17 in all the sand fly species (A), Old World sand flies (B), Phlebotomus and Paraphlebotomus sub-genera (C), and Larrossious, Adlerius, and Euphlebotomus sub-genera. Dotted lines point to the threshold for positive selection (ω ≥ 1).

Slide window analyses of omega (ω) for the salivary protein families unique to Old World sand flies. Dotted lines point to the threshold for positive selection (ω ≥ 1). (A) ParSP15. (B) Phospholipase A2. (C) SP16. (D) ParSP23. (E) ParSP25. (F) ParSP2.5. (G) Pyrophosphatase. Slide window analyses of omega (ω) for the salivary protein ParSP17 in all the sand fly species (A), Old World sand flies (B), Phlebotomus and Paraphlebotomus sub-genera (C), and Larrossious, Adlerius, and Euphlebotomus sub-genera. Dotted lines point to the threshold for positive selection (ω ≥ 1).

Copy number variation of salivary protein-encoding genes across species

As important as sequence polymorphism (SNPs and INDELs) for the evolution of gene families and emergence of new molecular functions are gene duplication events (Innan and Kondrashov, 2010). Although less than two gene duplication events per species were noticed for two NW sand fly gene families (yellow-related and the small OBPs) shared with OW sand flies (Figure 6; yellow-related, Supplementary Figure 13; OBPs, Supplementary Figure 15; Abdeladhim et al., 2016), up to five gene duplication events were accounted for in six OW sand fly gene families (antigen-5, apyrase, D7, Silk, yellow-related, OBPs) shared with NW sand flies (Figure 6; antigen-5, Supplementary Figure 4; apyrase, Supplementary Figure 5; D7, Supplementary Figure 6; Silk, Supplementary Figure 12; yellow-related, Supplementary Figure 13; and OBPs, Supplementary Figure 15). In the gene families unique to NW sand flies (SALO, RGD, mannose-binding lectin, c-type lectin, and spider toxin-like) up to eight gene duplication events were detected in six gene families (Figure 6; SALO, RGD, mannose-binding lectin, c-type lectin, and spider toxin-like; Abdeladhim et al., 2016). In sharp contrast to the high rate of copy number variation in gene families unique to NW sand flies, no gene duplication event was noticed for the sand fly salivary protein encoding gene families unique to OW sand flies (Figures 6–10 and Supplementary Figures 1–3).
Figure 6

Copy number variation across sand fly salivary protein families. The number of gene duplication events were enumerated from the sequences within New World (NW—black boxes) as well as Old World (OW) sand flies (L/A/E—Larrossious, Adlerius, and Euphlebotomus subgenera—green and red boxes; P/P—Phlebotomus and Paraphlebotomus subgenera—blue boxes). Gene duplication events were normalized by the number of species represented in each group.

Figure 10

Multiple sequence alignment and molecular phylogenetic analysis of the sand fly Pyrophosphatase salivary protein family. (A) Multiple sequence alignment of Pyrophosphatase A2 proteins. PduM54 (P. duboscqi), PpePyrophosphatase (P. perniciosus), PorASP262 (P. orientalis), PkanSP24 (P. kandelakki), PabSP288 (P. arabicus), and PagSP132 (P. argentipes). Black background shading represents identical amino acids. Gray background shading represents similar amino acids. (B) The evolutionary history of Pyrophosphatase salivary protein family was inferred by using the Maximum Likelihood method based on the Whelan And Goldman model (Whelan and Goldman, 2001). Sand fly species are indicated by the different symbols in the legend on the right. Tree branches were color-coded so as to represent specific taxon: Green color represents the Larroussius and Adlerius subgenera; Red color indicates the Euphlebotomus subgenus; Blue color points to proteins of the Phlebotomus and Paraphlebotomus subgenera.

Copy number variation across sand fly salivary protein families. The number of gene duplication events were enumerated from the sequences within New World (NW—black boxes) as well as Old World (OW) sand flies (L/A/E—Larrossious, Adlerius, and Euphlebotomus subgenera—green and red boxes; P/P—Phlebotomus and Paraphlebotomus subgenera—blue boxes). Gene duplication events were normalized by the number of species represented in each group. Multiple sequence alignment and molecular phylogenetic analysis of the sand fly ParSP15 salivary protein family. (A) Multiple sequence alignment of ParSP15 proteins. ParSP15 (P. ariasi), PorMSP104 (P. orientalis), PpeSP12 (P. perniciosus), PtSP71 (P. tobbi), and PkanSP19 (P. kandelakii). Black background shading represents identical amino acids. Gray background shading represents similar amino acids. (B) The evolutionary history of ParSP15 salivary protein family was inferred by using the Maximum Likelihood method based on the JTT matrix-based model (Jones et al., 1992). Sand fly species are indicated by the different symbols in the legend on the right. Multiple sequence alignment and molecular phylogenetic analysis of the sand fly ParSP25 salivary protein family. (A) Multiple sequence alignment of ParSP25 proteins. ParSP25 (P. ariasi), PorASP106 (P. orientalis), PpeSP08 (P. perniciosus), PtSP73 (P. tobbi), PkanSP20 (P. kandelakki), and PabSP12 (P. arabicus). Black background shading represents identical amino acids. Gray background shading represents similar amino acids. (B) The evolutionary history of ParSP25 salivary protein family was inferred by using the Maximum Likelihood method based on the JTT matrix-based model (Jones et al., 1992). Sand fly species are indicated by the different symbols in the legend on the right. Multiple sequence alignment and molecular phylogenetic analysis of the sand fly Phospholipase A2 salivary protein family. (A) Multiple sequence alignment of Phospholipase A2 proteins. ParSP11 (P. ariasi), PorMSP129 (P. orientalis), PpeSP18 (P. perniciosus), PkanSP25 (P. kandelakki), and PabSP52 (P. arabicus). Black background shading represents identical amino acids. Gray background shading represents similar amino acids. Asterisks indicate the conserved cysteine residues. (B) The evolutionary history of Phospholipase A2 salivary protein family was inferred by using the Maximum Likelihood method based on the JTT matrix-based model (Jones et al., 1992). Sand fly species are indicated by the different symbols in the legend on the right. Multiple sequence alignment and molecular phylogenetic analysis of the sand fly Pyrophosphatase salivary protein family. (A) Multiple sequence alignment of Pyrophosphatase A2 proteins. PduM54 (P. duboscqi), PpePyrophosphatase (P. perniciosus), PorASP262 (P. orientalis), PkanSP24 (P. kandelakki), PabSP288 (P. arabicus), and PagSP132 (P. argentipes). Black background shading represents identical amino acids. Gray background shading represents similar amino acids. (B) The evolutionary history of Pyrophosphatase salivary protein family was inferred by using the Maximum Likelihood method based on the Whelan And Goldman model (Whelan and Goldman, 2001). Sand fly species are indicated by the different symbols in the legend on the right. Tree branches were color-coded so as to represent specific taxon: Green color represents the Larroussius and Adlerius subgenera; Red color indicates the Euphlebotomus subgenus; Blue color points to proteins of the Phlebotomus and Paraphlebotomus subgenera.

Emergence of specific salivary protein encoding genes

Besides the differences in the rates of gene diversification and gene duplication events among protein families, the molecular evolution of salivary protein encoding gene families in sand flies has also displayed different modes of gene emergence as well as specific signatures of protein structure and divergence. A few salivary gland gene families appear to have arisen upon duplication from a gene expressed in another tissue that subsequently acquired a specific promoter driving expression into the salivary gland, a mechanism called sub-functionalization (neofunctionalization) (Hahn, 2009; Innan and Kondrashov, 2010). Amongst the salivary gene families unique to NW sand flies, the c-type lectin, the mannose-binding lectin, and the spider toxin-like gene families share paralogs expressed in other tissues of sand flies and other unrelated arthropods (Abdeladhim et al., 2016). Similar phenomena were noticed for salivary protein families unique to OW sand flies, such as phospholipase A2 (Tunaz et al., 2003) and pyrophosphatase (Silva et al., 2015), as well as in protein families shared with NW sand flies, like hyaluronidase (Allalouf et al., 1975), endonuclease (Broderick et al., 2014), adenosine deaminase (Dolezelova et al., 2005), and OBPs (Benoit et al., 2017). Hence, such gene families seem to have emerged by sub-functionalization. The presence of an OBP-like domain in the D7 protein C-termini (Hekmat-Scafe et al., 2000) points to the emergence of D7 genes by gene fusion (Kaessmann et al., 2009) between an ancient OBP and an unknown gene. In fact, conserved cysteine signatures (Table 2), as bore by OBPs, can unveil the mechanism of gene birth even among fast evolving genes. Multiple sand fly salivary protein families bear cysteine signatures (≥4 cysteines; Table 2). In the salivary protein families unique to NW sand flies, conserved cysteine residue signatures (62.5%, five out of eight protein families; Table 2) are present at a much higher frequency than in the protein families shared between OW and NW sand flies (50%, 6 out of 12; Table 2). Amongst the protein families unique to the OW sand flies, such proportion is reduced to only 28.5% of the protein families (two out of seven families; Table 2).
Table 2

Cysteine residue signatures of the sand fly salivary protein families.

Protein familyNumber of cysteine residuesNew world sand fliesOld world sand fliesCysteine residues signature
SALO6YNCX14CX26−41CX8−11CX8CX6C
C-Type lectin5YNCX70−91CX8−14CX4CX7C
14.2 kDa4YNCX11−13CX18−21CX12−14C
ML domain6YNCX13CX5−10CX42−47CX5−11CX39−43C
Spider toxin6YNCX6CX4−9CCX4CX5C
RGD0YN
5 kDa0YN
71 kDa0YN
OBPs5YYCX10CX3CX46CX15CX8
D7 protein10YYCX25−27CX3CX44−46CX49−50CX6−12CX3CX13−16CX9CX8C
Antigen-514YYCX4CX9−13CX9−10CX59CX6CX5CX71CX18CX2CX15CX2CX4CX7C
Yellow4YYCX69−72CX132−136CX81−91C
Endonuclease10YYCX39−47CX9−11CX10−13CX14CX24CX192−198CX3CX25CC
Lufaxin5YYCX21−29CX9CX22−35CX118−125C
Silk0YY
ParSP800YY
ParSP170YY
Hyaluronidase0YY
Aden. deaminase0YY
Apyrase0YY
Phospholipase A210NYCX38CX49−51CX20CCX5CX23CX1CX6CX23C
SP166NYCX20−31CX8CX18CCX13C
ParSP150NY
ParSP230NY
ParSP250NY
Pyrophosphatase0NY
SP2.50NY

Y, protein expressed in the salivary glands.

N, lack of expression in the salivary glands.

Cysteine residue signatures of the sand fly salivary protein families. Y, protein expressed in the salivary glands. N, lack of expression in the salivary glands.

Taxon-specific expression of salivary protein encoding genes

In order to adapt to the new ecological niches, sand flies have faced the hemostasis components and immune systems of the mammal and bird species indigenous to their habitat. In order to face such selective pressures, the establishment of new protein variants may have been required for the survival of the sand flies. It is important to mention, nonetheless, that even within OW sand flies, differences in the levels of polymorphisms were noticed. The rates of codons under positive selection varied within subgroups: for instance, higher rates were noticed within the P/P subgenera in ParSP17 (Figure 5). In addition, most of the gene duplication events have taken place in P. duboscqi for the P/P subgenera as well as in P. orientalis and P. perniciosus for the L/A/E subgenera (Figure 6). It is noteworthy that members of the P/P subgenera only express up to three of the salivary proteins unique to OW sand flies (SP2.5, SP16, and/or pyrophosphatase) whereas members of the L/A/E subgenera for the most part secrete salivary proteins of four to five different families (ParSP15, ParSP25, ParSP23, phospholipase A2, SP16, and/or pyrophosphatase; Table 1). Among the later, ParSP15 family is unique to the Larroussius subgenus whereas phospholipase A2 is only expressed in member of Larroussius and Adlerius subgenera. Along the same lines, transcripts for hyaluronidase and endonuclease were missing in the P/P sialotranscriptomes whereas such protein are expressed in NW and L/A/E salivary glands (Table 1).

Discussion

The comparative analysis of salivary protein gene families between OW and NW sand flies reveals that differences in the mode of evolutionary diversification amongst gene families that could have potential implications for the application of such proteins in the selection of markers of vector exposure and vaccines as: (1) At the sequence level, protein families unique to OW sand flies have evolved at a similar pace than those shared between OW and NW species, yet at a slower pace than the families unique to NW sand flies; (2) The evolutionary rates of both the protein/gene families unique to or shared between sand fly groups range from more divergent to more conserved families; (3) Gene families unique to NW sand flies have been diverging for the most part due to positive selection whereas the families shared between NW and OW (except ParSP17) as well as those unique to OW sand flies are less divergent due to purifying selection; (4) Events of gene duplication are more often observed in the gene families unique to NW sand fly species, such as SALO, RGD, mannose-binding lectin, c-type lectin, and spider toxin-like, whereas the emergence of new gene copies in OW sand flies is more evident in the gene families shared with NW sand flies, such as antigen-5, apyrase, D7, Silk, yellow-related, and OBPs; (5) Different from NW sand flies (Abdeladhim et al., 2016), the emergence of salivary gland gene families in OW sand flies relies less often on ancient genes bearing sequences encoding conserved cysteine signatures; and (6) a few salivary protein gene families is only expressed in sand flies belonging to specific sub-genera within OW sand flies; for instance, ParSP15 is unique to the Larrossious sub-genus, and SP2.5 is only expressed in members of the Phlebotomus sub-genus. Overall, these findings highlight the striking differences in the rates of molecular evolution of salivary protein encoding genes in NW and OW sand flies, which are underscored not only by sequence polymorphisms but also by copy number variation. Also, sand fly salivary proteins are recognized by the humoral immune system of humans and animal reservoirs previously bitten by sand flies; therefore, some sand fly salivary proteins have been shown to work as biomarkers of vector exposure. For such proteins to be used as biomarker of vector exposure for the identification of taxon-specific sand fly bites, the candidates are supposed to be not only immunogenic and conserved but also expressed only within sand flies belonging to a specific taxon. For species-specific biomarkers the candidates should be divergent or with some level of homology that allows the selection of specific peptides. For specific salivary proteins to be used as vaccine components across species, we predict that the candidates need not only to be immunogenic but also conserved proteins to some extent. Biomarkers of vector exposure have been identified among the salivary proteins of Lu. longipalpis (Teixeira et al., 2010), Lu. intermedia (Carvalho et al., 2017), P. papatasi (Marzouki et al., 2015), P. perniciosus (Drahota et al., 2014; Kostalova et al., 2017), and P. orientalis (Sima et al., 2016). For the most part, the best salivary protein candidates belong to conserved protein families. Although the less conserved Silk protein (Figure 3) was found to be the best biomarker of P. papatasi exposure (Marzouki et al., 2015), members of the apyrase, ParSP25, and antigen-5 conserved families have been shown as good markers of exposure in P. perniciosus (Drahota et al., 2014; Kostalova et al., 2017), P. orientalis (Sima et al., 2016), and Lu. intermedia (Carvalho et al., 2017), respectively. It is worth noting that proteins of the conserved yellow family (Figure 3) have been found to be the best biomarkers of vector exposure in dogs and humans amongst distantly related sand fly species, such as Lu. longipalpis, P. orientalis, and P. perniciosus (Teixeira et al., 2010). Regarding sand fly salivary protein vaccines, SALO (Gomes et al., 2008) and lufaxin (Collin et al., 2009) from Lu. longipalpis protects hamsters and dogs, respectively, against Leishmania infantum infection. On the other hand, PdSP15 (PduM02 herein; OBP family) from P. duboscqi (and its P. papatasi ortholog) protects mice (Valenzuela et al., 2001) and non-human primates (Oliveira et al., 2015) against Leishmania major infection. As SALO is the most divergent among the unique salivary protein families in NW sand flies (Figure 3), it is more likely to work as a specific vaccine for L. longipalpis transmitted leishmaniasis. Regarding PdSP15 (PduM02), it belongs to the diverse OBP (super-) family, yet it only shares orthologs with P. papatasi and P. sergenti (Supplementary Figure 15). In fact, PdSP15 orthologs are relatively conserved (% divergence = 36.3%; ω = 0.54). Hence, a PdSP15-based vaccine is restricted to protect leishmania transmission from sand flies of the P/P subgenera. On the other hand, lufaxin is likely the best candidate for a pan-specific vaccine, as it is conserved and shared between OW and NW sand flies (Figure 3 and Supplementary Figure 9). The data pointing out yellow proteins as the best biomarkers of vector exposure across sand fly species underscores the idea that conserved salivary proteins from the saliva of different sand fly species are recognized, processed, and presented in similar ways by the host immune systems. Thereby, it gives supports to the possibility that conserved yellow epitopes can be used as a pan-biomarker for sand fly exposure. For the same reasons, it is possible that conserved lufaxin epitopes can be used as pan-specific vaccines against leishmaniasis transmitted by different sand fly species. ParSP25 proteins were shown to work as biomarkers of vector exposure for P. perniciosus and P. orientalis (Drahota et al., 2014; Sima et al., 2016; Kostalova et al., 2017). Such a protein family was among the least divergent families unique to OW sand flies (Figures 2A,B) and was only expressed in member of the L/A/E subgenera (Table 1). Thereby, it is likely that ParSP25 epitopes can be used as a taxon-specific biomarker of vector exposure to recognize bites of sand flies from L/A/E subgenera and distinguish them from bites of sand flies belonging to P/P ones that display overlapping habitats. The evolution of salivary gland gene families unique to NW and to OW sand flies has faced completely different selective pressures, leading to a much faster pace of sequence and copy number variation in NW sand flies and a more constrained divergence in OW species. These evolutionary differences are further highlighted by the fact that the emergence of new gene families unique to NW sand flies relied upon sequences encoding specific cysteine codons, which were less evident in the genes encoding the protein families unique to OW sand flies. Regarding the protein families shared between OW and NW sand flies, the rates of sequence divergence were similar between OW and NW sand flies, yet more gene duplication events were noticed in OW species. Evolutionary differences are noticed even within OW sand flies, as more gene duplication are observed in P/P than in L/A/E, and the expression of some gene families was restricted to either P/P or L/A/E sand fly taxon. As the salivary protein families shared between OW and NW sand flies are inter-specifically conserved, such proteins are suitable for the development of pan-specific biomarker of vector exposure and pan-specific vaccines. Along the same lines, the more conserved protein families unique to OW sand flies are ideal candidates for the development of taxon-specific biomarkers of vector exposure, as even the expression of such proteins can be restricted to a specific taxon. On the other hand, the development of species-specific biomarkers of vector exposure would require the identification of species-specific epitopes amongst the most divergent proteins families unique to either NW or OW sand flies.

Author contributions

IC-A performed the evolutionary analysis and wrote the manuscript. JV wrote and edited the manuscript.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  51 in total

1.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

2.  Identification of the most abundant secreted proteins from the salivary glands of the sand fly Lutzomyia longipalpis, vector of Leishmania chagasi.

Authors:  Jesus G Valenzuela; Mark Garfield; Edgar D Rowton; Van M Pham
Journal:  J Exp Biol       Date:  2004-10       Impact factor: 3.312

3.  From transcriptome to immunome: identification of DTH inducing proteins from a Phlebotomus ariasi salivary gland cDNA library.

Authors:  Fabiano Oliveira; Shaden Kamhawi; Amy E Seitz; Van My Pham; Pierre M Guigal; Laurent Fischer; Jerrold Ward; Jesus G Valenzuela
Journal:  Vaccine       Date:  2005-08-15       Impact factor: 3.641

4.  A sand fly salivary protein vaccine shows efficacy against vector-transmitted cutaneous leishmaniasis in nonhuman primates.

Authors:  Fabiano Oliveira; Edgar Rowton; Hamide Aslan; Regis Gomes; Philip A Castrovinci; Patricia H Alvarenga; Maha Abdeladhim; Clarissa Teixeira; Claudio Meneses; Lindsey T Kleeman; Anderson B Guimarães-Costa; Tobin E Rowland; Dana Gilmore; Seydou Doumbia; Steven G Reed; Phillip G Lawyer; John F Andersen; Shaden Kamhawi; Jesus G Valenzuela
Journal:  Sci Transl Med       Date:  2015-06-03       Impact factor: 17.956

Review 5.  Distinguishing among evolutionary models for the maintenance of gene duplicates.

Authors:  Matthew W Hahn
Journal:  J Hered       Date:  2009-07-13       Impact factor: 2.645

Review 6.  The emerging role of adenosine deaminases in insects.

Authors:  Eva Dolezelova; Michal Zurovec; Tomas Dolezal; Petr Simek; Peter J Bryant
Journal:  Insect Biochem Mol Biol       Date:  2005-05       Impact factor: 4.714

7.  The recombinant protein rSP03B is a valid antigen for screening dog exposure to Phlebotomus perniciosus across foci of canine leishmaniasis.

Authors:  T Kostalova; T Lestinova; C Maia; P Sumova; M Vlkova; L Willen; N Polanska; E Fiorentino; A Scalone; G Oliva; F Veronesi; J M Cristóvão; O Courtenay; L Campino; L Gradoni; M Gramiccia; P Volf
Journal:  Med Vet Entomol       Date:  2016-10-08       Impact factor: 2.739

8.  Recombinant antigens from Phlebotomus perniciosus saliva as markers of canine exposure to visceral leishmaniases vector.

Authors:  Jan Drahota; Ines Martin-Martin; Petra Sumova; Iva Rohousova; Maribel Jimenez; Ricardo Molina; Petr Volf
Journal:  PLoS Negl Trop Dis       Date:  2014-01-02

9.  Recombinant Salivary Proteins of Phlebotomus orientalis are Suitable Antigens to Measure Exposure of Domestic Animals to Sand Fly Bites.

Authors:  Michal Sima; Blanka Ferencova; Alon Warburg; Iva Rohousova; Petr Volf
Journal:  PLoS Negl Trop Dis       Date:  2016-03-17

10.  Lundep, a sand fly salivary endonuclease increases Leishmania parasite survival in neutrophils and inhibits XIIa contact activation in human plasma.

Authors:  Andrezza C Chagas; Fabiano Oliveira; Alain Debrabant; Jesus G Valenzuela; José M C Ribeiro; Eric Calvo
Journal:  PLoS Pathog       Date:  2014-02-06       Impact factor: 6.823

View more
  6 in total

1.  Immunity to vector saliva is compromised by short sand fly seasons in endemic regions with temperate climates.

Authors:  Fabiano Oliveira; Ekaterina Giorgobiani; Anderson B Guimarães-Costa; Maha Abdeladhim; James Oristian; Lamzira Tskhvaradze; Nikoloz Tsertsvadze; Mariam Zakalashvili; Jesus G Valenzuela; Shaden Kamhawi
Journal:  Sci Rep       Date:  2020-05-14       Impact factor: 4.379

2.  Conserved and distinct morphological aspects of the salivary glands of sand fly vectors of leishmaniasis: an anatomical and ultrastructural study.

Authors:  Rafael Nacif-Pimenta; Luciana C Pinto; Vera Volfova; Petr Volf; Paulo F P Pimenta; Nagila F C Secundino
Journal:  Parasit Vectors       Date:  2020-09-03       Impact factor: 3.876

3.  Sergentomyia schwetzi: Salivary gland transcriptome, proteome and enzymatic activities in two lineages adapted to different blood sources.

Authors:  Nikola Polanska; Aygul Ishemgulova; Vera Volfova; Pavel Flegontov; Jan Votypka; Vyacheslav Yurchenko; Petr Volf
Journal:  PLoS One       Date:  2020-03-24       Impact factor: 3.240

4.  RNA-sequencing of the Nyssomyia neivai sialome: a sand fly-vector from a Brazilian endemic area for tegumentary leishmaniasis and pemphigus foliaceus.

Authors:  Sebastian Vernal; Fabiano Oliveira; Wanderson H C Oliveira; Thais M Goulart; James Oristian; Eric Calvo; Mara C Pinto; Ana Maria Roselino; José M C Ribeiro
Journal:  Sci Rep       Date:  2020-10-19       Impact factor: 4.996

5.  Phlebotomus perniciosus Recombinant Salivary Proteins Polarize Murine Macrophages Toward the Anti-Inflammatory Phenotype.

Authors:  Petra Sumova; Nikola Polanska; Tereza Lestinova; Tatiana Spitzova; Barbora Kalouskova; Ondrej Vanek; Petr Volf; Iva Rohousova
Journal:  Front Cell Infect Microbiol       Date:  2020-08-24       Impact factor: 6.073

6.  PpSP32-like protein as a marker of human exposure to Phlebotomus argentipes in Leishmania donovani foci in Bangladesh.

Authors:  Petra Sumova; Chizu Sanjoba; Laura Willen; Nikola Polanska; Yoshitsugu Matsumoto; Eisei Noiri; Shyamal Kumar Paul; Yusuf Ozbel; Petr Volf
Journal:  Int J Parasitol       Date:  2021-07-15       Impact factor: 3.981

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.