| Literature DB >> 35047420 |
Yanna Reis Praça1,2, Paula Beatriz Santiago1, Sébastien Charneau3, Samuel Coelho Mandacaru3, Izabela Marques Dourado Bastos1, Kaio Luís da Silva Bentes1,2, Sofia Marcelino Martins Silva1, Waldeyr Mendes Cordeiro da Silva4, Ionizete Garcia da Silva5, Marcelo Valle de Sousa3, Célia Maria de Almeida Soares6, José Marcos Chaves Ribeiro7, Jaime Martins Santana1,2, Carla Nunes de Araújo1,2,8.
Abstract
Triatomines have evolved salivary glands that produce versatile molecules with various biological functions, including those leading their interactions with vertebrate hosts' hemostatic and immunological systems. Here, using high-throughput transcriptomics and proteomics, we report the first sialome study on the synanthropic triatomine Triatoma sordida. As a result, 57,645,372 reads were assembled into 26,670 coding sequences (CDS). From these, a total of 16,683 were successfully annotated. The sialotranscriptomic profile shows Lipocalin as the most abundant protein family within putative secreted transcripts. Trialysins and Kazal-type protease inhibitors have high transcript levels followed by ubiquitous protein families and enzyme classes. Interestingly, abundant trialysin and Kazal-type members are highlighted in this triatomine sialotranscriptome. Furthermore, we identified 132 proteins in T. sordida salivary gland soluble extract through LC-MS/MS spectrometry. Lipocalins, Hemiptera specific families, CRISP/Antigen-5 and Kazal-type protein inhibitors proteins were identified. Our study provides a comprehensive description of the transcript and protein compositions of the salivary glands of T. sordida. It significantly enhances the information in the Triatominae sialome databanks reported so far, improving the understanding of the vector's biology, the hematophagous behaviour, and the Triatominae subfamily's evolution.Entities:
Keywords: blood-feeding; salivary glands; salivary molecules; sialome; sialoproteome; sialotranscriptome; triatomine
Mesh:
Year: 2022 PMID: 35047420 PMCID: PMC8762107 DOI: 10.3389/fcimb.2021.798924
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Functional classification and expression levels of the coding sequences (CDS) extracted from Triatoma sordida sialotranscriptome.
| Category | Number of CDS | Number of reads | %reads | Average number of reads/CDS | FPKM | %FPKM |
|---|---|---|---|---|---|---|
|
| 735 | 9,340,897 | 77.5 | 12,709 | 2,714,435 | 79.5 |
|
| 6,325 | 2,200,836 | 18.3 | 348 | 414,372 | 12.1 |
|
| 8,092 | 365,839 | 3.0 | 45 | 260,915 | 7.6 |
|
| 501 | 102,188 | 0.9 | 204 | 18,925 | 0.6 |
|
| 126 | 24,319 | 0.2 | 193 | 4,857 | 0.1 |
|
| 894 | 13,458 | 0.1 | 15 | 2,551 | <0.1 |
|
| 10 | 2,474 | <0.1 | 247 | 45 | <0.1 |
|
|
|
|
|
|
|
The values in bold correspond to the total for each class, group or subgroup.
Functional classification and expression levels of the CDS attributed to the Housekeeping category extracted from Triatoma sordida sialotranscriptome.
| Housekeeping Category | Number of CDS | Number of reads | %reads | Average number of reads/CDS | FPKM | %FPKM |
|---|---|---|---|---|---|---|
|
| 462 | 533,374 | 24.2 | 1,154 | 153,782 | 37.1 |
|
| 359 | 365,123 | 16.6 | 1,017 | 38,371 | 9.3 |
|
| 386 | 158,957 | 7.2 | 412 | 32,260 | 7.8 |
|
| 101 | 72,003 | 3.3 | 713 | 27,895 | 6.7 |
|
| 1,562 | 179,296 | 8.2 | 115 | 24.835 | 6.0 |
|
| 171 | 43,294 | 2.0 | 253 | 23,761 | 5.7 |
|
| 415 | 159,587 | 7.3 | 385 | 23,567 | 5.7 |
|
| 432 | 101,734 | 4.6 | 235 | 18,456 | 4.5 |
|
| 773 | 152,434 | 6.9 | 197 | 17,459 | 4.2 |
|
| 268 | 89,681 | 4.1 | 335 | 11,836 | 2.9 |
|
| 30 | 22,510 | 1.0 | 750 | 8,704 | 2.1 |
|
| 304 | 56,943 | 2.6 | 187 | 7,414 | 1.8 |
|
| 299 | 28,771 | 1.3 | 96 | 4,904 | 1.2 |
|
| 166 | 48,878 | 2.2 | 294 | 4,459 | 1.0 |
|
| 140 | 45,558 | 2.1 | 325 | 4,375 | 1.1 |
|
| 26 | 60,704 | 2.8 | 2,334 | 3,078 | <0.1 |
|
| 144 | 17,692 | 0.8 | 123 | 2,820 | <0.1 |
|
| 166 | 20,953 | 1.0 | 126 | 2,530 | <0.1 |
|
| 43 | 18,567 | 0.8 | 431 | 1,842 | <0.1 |
|
| 49 | 21,082 | 1.0 | 430 | 1,750 | <0.1 |
|
| 29 | 3,697 | 0.2 | 127 | 275 | <0.1 |
|
|
|
|
|
|
|
The values in bold correspond to the total for each class, group or subgroup.
Functional classification and expression levels of the CDS attributed to the putative Secreted category extracted from Triatoma sordida sialotranscriptome.
| Putative Secreted Category | Number of CDS | Number of reads | %reads | Average number of reads/CDS | FPKM | %FPKM |
|---|---|---|---|---|---|---|
|
| ||||||
| Triabin | 89 | 3,180,268 | 34.1 | 35,733 | 936,123 | 34.5 |
| Pallidipin | 21 | 1,281,977 | 13.7 | 61,047 | 255,970 | 9.4 |
| Other lipocalin | 18 | 61,350 | 0.7 | 3,408 | 19,885 | 0.7 |
| Triatin | 5 | 32,225 | 0.3 | 6,445 | 11,353 | 0.4 |
| Procalin | 3 | 1,184 | <0.1 | 395 | 1,352 | 0.1 |
| Total | 136 | 4,557,004 | 48.8 | 1,224,683 | 45.1 | |
|
| ||||||
| Short trialysin/trialysin | 10 | 2,763,272 | 29.6 | 276,327 | 821,268 | 30.3 |
| Salivary protein MYS precursor/Hemolysin like | 17 | 217,213 | 2.3 | 12,777 | 45,652 | 1.7 |
| Triatoma and Panstrongylus specific salivary proteins 16kDa | 3 | 1,123 | <0.1 | 374 | 290 | <0.1 |
| Tsor 8kDa basic salivary peptide | 6 | 157 | <0.1 | 26 | 150 | <0.1 |
| Cisteine rich secreted protein | 4 | 16 | <0.1 | 4 | 19 | <0.1 |
| Total | 40 | 2,981,781 | 32.0 | 867,380 | 32.0 | |
|
| ||||||
| Kazal-type | 27 | 229,814 | 2.5 | 8,512 | 347,414 | 12.8 |
| Cystatin | 3 | 1,357 | <0.1 | 452 | 844 | <0.1 |
| Serpin | 6 | 7,385 | <0.1 | 1,231 | 618 | <0.1 |
| Pacifastin | 6 | 1,435. | <0.1 | 239 | 250 | <0.1 |
| Mucins | 20 | 279 | <0.1 | 14 | 383 | <0.1 |
| Kunitz-type | 7 | 208 | <0.1 | 30 | 13 | <0.1 |
| Total | 69 | 240,478 | 2.6 | 349,522 | 12.9 | |
|
| ||||||
| Inositol phosphate phosphatase | 14 | 390,510 | 4.2 | 27,894 | 54,472 | 2.0 |
| Salivary Trypsin | 8 | 232,304 | 2.5 | 29,038 | 31,074 | 1.1 |
| Metalloproteases | 5 | 78,930 | 0.8 | 15,786 | 10,240 | 0.4 |
| 79kDa salivary apyrase | 5 | 30,899 | 0.3 | 6,180 | 7,242 | 0.3 |
| Nucleases | 4 | 2,237 | <0.1 | 559 | 200 | <0.1 |
| Secreted caboxylesterase | 8 | 1,841 | <0.1 | 230 | 106 | <0.1 |
| Lysosomal aspartic protease | 2 | 888 | <0.1 | 444 | 75 | <0.1 |
| Lipase | 5 | 420 | <0.1 | 84 | 58 | <0.1 |
| Total | 51 | 738,028 | 7.9 | 103,467 | 3.8 | |
|
| ||||||
| Antigen 5 | 13 | 438,036 | 4.7 | 33,695 | 77,434 | 2.9 |
| Conserved insect secreted protein | 13 | 15,438 | 0.1 | 1,188 | 6,212 | 0.2 |
| Low density lipoprotein receptor | 5 | 16 | <0.1 | 3 | 18 | <0.1 |
| Total | 31 | 453,490 | 4.6 | 83,664 | 3.1 | |
|
| 362 | 361,649 | 3.9 | 999 | 82,797 | 3.1 |
|
| 9 | 4,918 | <0.1 | 546 | 2,145 | <0.1 |
|
| 25 | 1,608 | <0.1 | 64 | 470 | <0.1 |
|
| 12 | 1,940 | <0.1 | 162 | 307 | <0.1 |
|
|
|
|
|
|
|
The values in bold correspond to the total for each class, group or subgroup.
Classification and abundance of proteins identified in the salivary glands of Triatoma sordida following LC-MS/MS analysis.
| Protein identification | Number of proteins | Spectrum count | % Spectrum count |
|---|---|---|---|
|
| |||
|
| |||
| Triabin | 31 | 532 | 33.9 |
| Pallidipin | 5 | 113 | 7.2 |
| Other lipocalins | 1 | 10 | 0.6 |
| Triatin | 1 | 5 | 0.3 |
| Total | 38 | 660 | 42.0 |
|
| |||
| Inositol phosphaye phosphatse | 8 | 252 | 16.0 |
| Salivary trypsin | 4 | 86 | 5.5 |
| 79kDa salivary apyrase | 2 | 16 | 1.0 |
| Total | 14 | 354 | 22.5 |
|
| |||
| Short trialysin/Trialysin | 2 | 133 | 8.5 |
| Salivary protein MYS precursor/Hemolysin like | 2 | 19 | 1.2 |
| Triatoma and Panstrongylus specific salivary protein of 16kDa | 1 | 7 | 0.5 |
| Total | 5 | 159 | 10.1 |
|
| |||
| Antigen 5 | 4 | 92 | 5.9 |
|
| 7 | 77 | 4.9 |
|
| |||
| Kazal type | 4 | 22 | 1.4 |
|
| 1 | 1 | <0.1 |
|
|
|
|
|
|
|
|
|
|
|
| |||
|
| 7 | 44 | 2.8 |
|
| 7 | 25 | 1.6 |
|
| 5 | 12 | 0.8 |
|
| 4 | 11 | 0.7 |
|
| 5 | 11 | 0.7 |
|
| 3 | 10 | 0.6 |
|
| 3 | 9 | 0.6 |
|
| 1 | 8 | 0.5 |
|
| 3 | 6 | 0.4 |
|
| 1 | 6 | 0.4 |
|
| 1 | 3 | 0.2 |
|
| 1 | 2 | 0.1 |
|
| 1 | 2 | 0.1 |
|
| 1 | 2 | 0.1 |
|
| 1 | 1 | <0.1 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
The values in bold correspond to the total for each class, group or subgroup.
Figure 1Triatoma sordida salivary transcriptome/proteome correlation. The scatterplot illustrates the Pearson’s correlation (r.) between CDS (FPKM) and proteins (spectrum count) from T. sordida salivary gland transcriptome and salivary gland extract proteome. It shows a statistically significant P value (p<0.0001) in the three groups tested: All identified proteins (blue), Secreted identified proteins (pink), and Lipocalin identified proteins (black).
Figure 2Maximum likelihood tree with Triatoma sordida salivary lipocalins. The distance tree among lipocalin sequences is derived from the alignment of T. sordida CDS extracted from the sialotranscriptome analysis and other lipocalin sequences from Triatominae. The amino acid sequences are identified as described in the Methods section. The coloured circles identify the species whose sequences were used: blue, T. sordida; magenta, Triatoma matogrossensis; purple, Triatoma brasiliensis; green, Triatoma infestans; yellow, Triatoma dimidiata; grey, Panstrongylus chinai. The numbers at the nodes indicate the bootstrap values. The scale on the bottom measure evolutionary distance in substitutions per amino acid.
Figure 3The trialysins from Triatoma sordida sialotranscriptome. (A) Multiple sequence alignment of short-trialysin members extracted from T. sordida sialotranscriptome (TRISOR_c10816_g1_i1.p1; TRISOR_c15059_g1_i2.p2; TRISOR_c15059_g1_i3.p1; TRISOR_c14763_g9_i1.p2) and other homologous short-trialysin sequences from Triatoma infestans (TRIINF_ABR27833.1; TRIINF_ABR27839.1; TRIINF_ABR27838.1; TRIINF_ABR27837.1) and Triatoma matogrossensis (TRIMAT_tr/E2J776) (B) Multiple sequence alignment of a r-trialysin member extracted from T. sordida sialotranscriptome (TRISOR-c15366_g14_i1.p1) and two homologous members, a r-trialyisn from Triatoma infestans (TRIINF_ABR27943.1) and a redulysin from P. rhadamanthus (PLARHA_QHB21548.1). The sequences are identified as described in Methods section. The alignments indicate conserved residues in a blue scale background. The grey bar marks the consensus motif of trialysins, red rectangles mark Lys residues. Consensus logo sequence shows the frequency of amino acids in the multiple sequence alignment. The height of symbols within the stack indicates the relative frequency of each amino acid at that position. Secondary structure predictions were inferred using JNet Secondary Structure Prediction. The sequence for which the prediction was made is the first one in the alignment.
Figure 4The phylogenetic tree of trialysins from Triatoma sordida sialotranscriptome. The distance tree among trialysin sequences is derived from the alignment of T. sordida CDS extracted from the sialotranscriptome analysis and other trialysin sequences from Hemiptera (extracted from the non-redundant protein database of the NCBI applying PSI-BLAST algorithm) identified as described in Methods section. The colored circles identify the species whose sequences were used: blue, T. sordida; magenta, Triatoma matogrossensis; green, Triatoma infestans; orange, P. rhadamanthus. Red asterisks indicate gene duplication events. The number at the nodes shows the bootstrap values. The scale on the bottom measure evolutionary distance in substitutions per amino acid.
Figure 5The Kazal-type members from Triatoma sordida. Multiple sequence alignment of Kazal-type members extracted from T. sordida sialotranscriptome (TRISOR_ c15210_g10_i1.p1; TRISOR_ c15210_g10_i2.p1; TRISOR_ c15210_g10_i3.p1; TRISOR_ c15210_g10_i4.p1; TRISOR_ c15210_g10_i6.p1; TRISOR_ c15210_g10_i5.p1; TRISOR_ c11034_g1_i1.p1; TRISOR_ c15210_g10_i7.p1; TRISOR_ c11377_g1_i2.p1; TRISOR_ c10221_g1_i1.p1; TRISOR_ c13467_g1_i1.p1) and other homologous Kazal-type sequences from: Panstrongylus chinai (PANCHI_BBA30649.1); Triatoma infestans (TRIINF_ABR27937.1); Triatoma brasiliensis (TRIBRA_ABJ96351.1); Hybomitra bimaculata (HYBBIM_P84843.1); and Tabanus yao (TABYAO_ABX80080.1). The alignments indicate conserved residues in a blue scale background. The sequences are identified as described in the Methods section. The grey bar above indicates the Kazal domain region, with coloured asterisks showing the arrangement of the six conserved cysteines (C1:C5, C2:C4, C3:C6). Consensus sequence shows the frequency of amino acids in the multiple sequence alignment. The height of aminoacid symbols within the stack indicates the relative frequency of each amino acid at that position.
Figure 6The phylogenetic tree of Kazal-type members from Triatoma sordida sialotranscriptome. The distance tree among trialysin sequences is derived from the alignment of T. sordida CDS extracted from the sialotranscriptome analysis and other Kazal-type sequences (extracted from the non-redundant protein database of the NCBI applying PSI-BLAST algorithm) identified as described in Methods section. The blue circle identifies sequences from T. sordida. The number at the nodes indicates the bootstrap values. The scale on bottom measure evolutionary distance in substitutions per amino acid.
Figure 7Heat map representing the expression profile of Triatominae salivary CDS. The heatmap shows the normalized CDS count of the main secreted families from Triatominae sialotranscriptomes. The differences among the CDS counts were statistically significant when compared by a Chi-square test (P < 0.001) (see ). Rows represent salivary families and columns represent the Triatominae species. The number of CDS frequencies were calculated for each species followed by log10 transformation. Blue-yellow scale: blue represents a high CDS count and yellow low represents low CDS count. The heatmap was created using GraphPad Prism software.