| Literature DB >> 19178717 |
Eric Calvo1, Van M Pham, Osvaldo Marinotti, John F Andersen, José M C Ribeiro.
Abstract
BACKGROUND: Mosquito saliva, consisting of a mixture of dozens of proteins affecting vertebrate hemostasis and having sugar digestive and antimicrobial properties, helps both blood and sugar meal feeding. Culicine and anopheline mosquitoes diverged ~150 MYA, and within the anophelines, the New World species diverged from those of the Old World ~95 MYA. While the sialotranscriptome (from the Greek sialo, saliva) of several species of the Cellia subgenus of Anopheles has been described thoroughly, no detailed analysis of any New World anopheline has been done to date. Here we present and analyze data from a comprehensive salivary gland (SG) transcriptome of the neotropical malaria vector Anopheles darlingi (subgenus Nyssorhynchus).Entities:
Mesh:
Substances:
Year: 2009 PMID: 19178717 PMCID: PMC2644710 DOI: 10.1186/1471-2164-10-57
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Distribution of the transcripts from the salivary gland cDNA library of .
Classification of transcripts associated with housekeeping function
| Protein synthesis machinery | 429 | 53.8 |
| Unknown conserved | 114 | 14.3 |
| Energy metabolism | 79 | 9.9 |
| Conserved secreted proteins | 37 | 4.6 |
| Protein modification machinery | 23 | 2.9 |
| Signal transduction | 22 | 2.8 |
| Proteasome machinery | 20 | 2.5 |
| Protein export machinery | 15 | 1.9 |
| Transcription machinery | 11 | 1.4 |
| Transporter/storage | 10 | 1.3 |
| Carbohydrate metabolism | 8 | 1.0 |
| Cytoskeletal | 8 | 1.0 |
| Nuclear regulation | 6 | 0.8 |
| Secondary products metabolism | 6 | 0.8 |
| Amino acid metabolism | 4 | 0.5 |
| Lipid metabolism | 2 | 0.3 |
| Nucleotide metabolism | 1 | 0.1 |
| Intermediary metabolism | 1 | 0.1 |
| Extracellular matrix and adhesion | 1 | 0.1 |
| 797 |
Classification of transcripts associated with secreted products
| Subclass | ||
| D7/OBP | 269 | 22.6 |
| Aegyptin/30-kDa antigen | 98 | 8.2 |
| Anophelin | 20 | 1.7 |
| gSG8/Kazal | 14 | 1.2 |
| Pattern recognition | 5 | 0.4 |
| Antimicrobials | 92 | 7.7 |
| Glycosidases | 41 | 3.5 |
| Serine proteases | 5 | 0.4 |
| Apyrase/5' Nucleotidase | 22 | 1.9 |
| Peroxidase | 4 | 0.3 |
| gSG3 family | 91 | 7.7 |
| gSG10 | 12 | 1.0 |
| 13.5-kDa family | 40 | 3.4 |
| Other mucins | 39 | 3.3 |
| SG1 family | 63 | 5.3 |
| SG2 family | 74 | 6.2 |
| SG7 family | 16 | 1.3 |
| SG5 family | 9 | 0.8 |
| Antigen-5 family | 42 | 3.5 |
| 56-kDa family | 5 | 0.4 |
| Acidic protein family | 32 | 2.7 |
| Anopheline 6.3-kDa family | 11 | 0.9 |
| Anopheline 8.2-kDa family | 58 | 4.9 |
| Anopheline hyp 15/17 family | 44 | 3.7 |
| Basic tail mosquito family | 32 | 2.7 |
| | 2 | 0.2 |
| Culicidae 23.4-kDa family | 1 | 0.1 |
| Culicine 41.9-kDa family | 17 | 1.4 |
| Other 11 families | 30 | 2.5 |
| 1188 |
Putative secreted proteins deducted from the salivary transcriptome analysis
| | ||
| AD-98 | 208657501 | short form D7 salivary protein |
| AD-82 | 208657481 | SHORT FORM D7 SALIVARY PROTEIN |
| AD-81 | 208657479 | SHORT AD Clade D7 SALIVARY PROTEIN |
| AD-32 | 208657493 | D7 short |
| AD-31 | 208657495 | SHORT AD Clade D7 SALIVARY PROTEIN |
| AD-97 | 208657497 | SHORT AD Clade D7 SALIVARY PROTEIN |
| AD-395 | 208657489 | Short D7 protein |
| AD-394 | 208657487 | Short D7 protein |
| AD-1 | 16798386 | AF427696_1 D7-RELATED 3.2 PROTEIN |
| AD-3 | 16798386 | D7-related 3.2 protein |
| AD-118 | 208657499 | Long form D7 salivary protein |
| AD-560 | 208657485 | Odorant binding protein |
| | ||
| AD-24 | 208657597 | GE rich family salivary gland protein |
| AD-26 | 208657599 | 30 kDa salivary antigen family protein |
| AD-27 | 208657601 | GE rich salivary gland protein |
| AD-21 | 208657603 | GE rich salivary gland protein |
| AD-22 | 208657605 | GE rich salivary gland protein |
| AD-23 | 208657607 | 30 kDa salivary antigen family |
| AD-25 | 208657609 | GE rich salivary gland protein |
| AD-28 | 208657617 | 30 kDa salivary antigen family protein |
| | ||
| AD-99 | 208657573 | salivary anti-thrombin peptide anophelin |
| AD-100 | 208657579 | salivary anti-thrombin peptide anophelin |
| | ||
| AD-133 | 208657683 | gSG7 salivary protein |
| AD-134 | 208657689 | gSG7 salivary protein |
| AD-135 | 208657691 | gSG7 salivary protein |
| | ||
| AD-417 | 208657693 | Kazal domain-containing peptide |
| AD-257 | 208657737 | Kazal domain-containing peptide |
| AD-350 | 208657834 | Kazal domain-containing peptide |
| | ||
| | ||
| AD-10 | 208657477 | SG3 PROTEIN |
| AD-9 | 208657681 | sg3 protein |
| AD-7 | 208657687 | sg3 protein |
| AD-8 | 208657697 | SG3 PROTEIN |
| | ||
| AD-143 | 208657645 | gSG10 salivary mucin |
| AD-146 | 208657647 | gSG10 salivary mucin |
| AD-145 | 208657651 | gSG10 salivary mucin |
| | ||
| AD-45 | 208657695 | mucin-like protein |
| AD-46 | 208657701 | mucin-like protein |
| AD-43 | 208657713 | PUTATIVE 13.5 KDA SALIVARY PROTEIN |
| AD-42 | 208657721 | putative 13.5 kDa salivary protein |
| AD-44 | 208657723 | PUTATIVE 13.5 KDA SALIVARY PROTEIN |
| AD-41 | 208657733 | PUTATIVE 13.5 KDA SALIVARY PROTEIN |
| AD-47 | 208657751 | mucin-like protein |
| | ||
| AD-11 | 208657473 | hypothetical secreted peptide precursor |
| AD-191 | 208657465 | putative salivary secreted mucin 3 – fragment – similar to virus induced protein |
| | ||
| AD-873 | 208657765 | mucin-like peritrophin |
| | ||
| | ||
| IS07-104 | 208657633 | putative 5' nucleotidase/apyrase |
| AD-101 | 208657659 | salivary apyrase – truncated at 5 prime |
| | ||
| AD-573 | 208657575 | salivary peroxidase |
| | ||
| AD-70 | 208657611 | probable salivary maltase precursor |
| | ||
| AD-698 | 208657483 | CLIP-domain serine protease subfamily D – truncated at 5 prime |
| | ||
| | ||
| AD-231 | 208657641 | GAMBICIN PRECURSOR |
| | ||
| AD-124 | 208657731 | defensin |
| | ||
| AD-57 | 208657655 | antimicrobial peptide cecropin |
| AD-236 | 208657739 | antimicrobial peptide cecropin |
| AD-927 | 208657741 | Cecropin precursor |
| | ||
| AD-457 | 208657711 | peptidoglycan recognition protein |
| | ||
| AD-174 | 208657469 | lysozyme |
| AD-175 | 208657471 | lysozyme |
| | ||
| AD-259 | 208657749 | hypothetical secreted protein with GHG repeats |
| | ||
| | ||
| | ||
| AD-38 | 33359651 | Antigen 5-related 2 |
| AD-430 | 208657475 | antigen 5-related 2 protein |
| | ||
| | ||
| AD-196 | 208657685 | conserved secreted mosquito protein |
| | ||
| AD-178 | 208657639 | short gSG8-like protein |
| | ||
| AD-217 | 208657667 | putative salivary secreted peptide |
| AD-216 | 208657679 | putative salivary secreted peptide |
| | ||
| AD-476 | 208657709 | putative 4.3 kDa secreted salivary peptide |
| | ||
| AD-267 | 208657677 | proline rich salivary secreted peptide |
| | ||
| AD-111 | 208657783 | PUTATIVE 41.9 KDA BASIC SALIVARY PROTEIN – truncated at 5 prime |
| AD-112 | 208657807 | putative 41.9 kDa basic salivary protein – truncated at 5 prime |
| AD-114 | 208657821 | 41 kDa family salivary secreted protein |
| | ||
| AD-159 | 208657649 | SG1-like salivary protein |
| AD-160 | 208657653 | SG1-like salivary protein |
| AD-130 | 208657753 | GSG1 PROTEIN |
| AD-85 | 208657767 | PUTATIVE SALIVARY PROTEIN SG1B |
| AD-86 | 208657777 | PUTATIVE SALIVARY PROTEIN SG1 |
| AD-153 | 208657781 | TRIO salivary gland protein precursor – SG1 family |
| | ||
| AD-49 | 208657761 | hypothetical protein |
| AD-51 | 208657819 | hypothetical secreted peptide precursor |
| AD-53 | 208657773 | hypothetical secreted peptide precursor |
| AD-54 | 208657763 | hypothetical secreted peptide precursor |
| AD-90 | 208657779 | putative secreted peptide of the 6 kDa family |
| AD-91 | 208657785 | putative secreted peptide of the 6 kDa family |
| AD-92 | 208657811 | putative secreted peptide of the 6 kDa family |
| AD-89 | 208657747 | putative secreted peptide of the 6 kDa family |
| AD-64 | 208657759 | putative secreted peptide of the 6 kDa family |
| | ||
| AD-37 | 208657719 | hypothetical salivary protein 15 |
| AD-35 | 208657727 | hypothetical salivary protein 15 |
| AD-36 | 208657729 | hypothetical salivary protein 15 |
| | ||
| AD-63 | 208657771 | hypothetical salivary protein 8.2 |
| AD-96 | 208657815 | hypothetical salivary protein 8.2 |
| | ||
| AD-147 | 208657661 | putative secreted salivary basic peptide hyp6.2 |
| | ||
| AD-269 | 208657637 | hyp5.6 salivary basic secreted peptide |
| | ||
| AD-13 | 208657673 | hypothetical secreted protein |
| AD-15 | 208657665 | 30 kDa salivary antigen family protein |
| AD-12 | 208657669 | hypothetical secreted protein |
| AD-14 | 208657671 | hypothetical secreted protein |
| AD-19 | 208657675 | hypothetical secreted protein |
| AD-18 | 208657717 | hypothetical secreted protein |
| | ||
| AD-136 | 208657830 | hypothetical conserved secreted protein |
| AD-138 | 208657848 | hypothetical conserved secreted protein |
| AD-119 | 208657797 | putative secreted peptide |
Figure 2The D7 protein family of . (A) Clustal alignment. (B) Phylogram based on the alignment in (A). The numbers on the tree nodes represent the percent bootstrap support in 10,000 trials. The bar at the bottom indicates 20% amino acid divergence. The An. gambiae sequence names start with D7 followed by s or L for short and long forms; the number following s or L represents the order of the gene in the D7 chromosomal region, following its transcription direction. The An. darlingi sequences start with AD, followed by a number derived from the cluster number, as determined in Supplemental Table S1. For more details, see text.
Figure 3The 30-kD/GE-rich/Aegyptin protein family of mosquitoes. (A) Clustal alignment. (B) Phylogram based on the alignment in (A). The numbers on the tree nodes represent the percent bootstrap support in 10,000 trials (only values above 50% are shown). The bar at the bottom indicates 10% amino acid divergence. The sole An. darlingi sequence is identified by AD-26 and a filled circle symbol. The remaining sequences are named with the first three letters from the genus name followed by two letters from the species name and by their NCBI protein accession number. For more details, see text.
Figure 4The salivary basic tail family of mosquito proteins. (A) Clustal alignment. The sole An. darlingi sequence is identified by AD-217. The remaining sequences are named with the first three letters from the genus name followed by two letters from the species name and by their NCBI protein accession number. Conserved cysteines are shown in black, hydrophobic conserved amino acids (aa) in light blue, conserved Pro and Gly in yellow, conserved bulky non-charged aa (Asn, Gln, Ser, Thr) in grey, conserved Ser + Thr in brown, conserved negatively charged aa in red, identical positively charged aa in violet, conserved charged aa in green. The symbols above the alignment indicate: (*) identical sites; (:) conserved sites; (.) less conserved sites. (B) Phylogram derived from the alignment in (A). The numbers on the tree nodes represent the percent bootstrap support in 10,000 trials (only values above 50% are shown). The bar at the bottom indicates 10% aa divergence.
Figure 5The salivary 4.3-kDa family of mosquito proteins. (A) Clustal alignment. The sole An. darlingi sequence is identified by AD-476. The remaining sequences are named with the first three letters from the genus name followed by two letters from the species name and by their NCBI protein accession number. Conserved cysteines are shown in black, hydrophobic conserved amino acids (aa) in light blue, conserved Pro and Gly in yellow, conserved bulky non-charged aa (Asn, Gln, Ser, Thr) in grey, conserved Ser + Thr in brown, identical negatively charged aa in red, identical positively charged aa in violet, conserved charged aa in green. The symbols above the alignment indicate: (*) identical sites; (:) conserved sites; (.) less conserved sites. (B) Phylogram derived from the alignment in (A). The numbers on the tree nodes represent the percent bootstrap support in 10,000 trials (only values above 50% are shown). The bar at the bottom indicates 5% aa divergence.
Figure 6Clustal alignment of the 41.9-kDa family of mosquito proteins. The sole An. darlingi sequence is identified by AD-114. The remaining sequences are named with the first three letters from the genus name followed by two letters from the species name and by their NCBI protein accession number. For more details, see text. Conserved cysteines are shown in black, hydrophobic conserved amino acids (aa) in light blue, conserved Pro and Gly in yellow, conserved bulky non-charged aa (Asn, Gln, Ser, Thr) in grey, conserved Ser + Thr in brown, conserved negatively charged aa in red, identical positively charged aa in violet, conserved charged aa in green. The symbols above the alignment indicate: (*) identical sites; (:) conserved sites; (.) less conserved sites.
Figure 7The expanded 41.9-kDa family. Phylogram based on the alignment of sequences derived from the use of the PSI-BLAST tool to retrieve sequences on the NR database from the NCBI using as seed the An. darlingi sequence AD-114. The numbers on the tree nodes represent the percent bootstrap support in 10,000 trials (only values above 50% are shown). The bar at the bottom indicates 20% amino acid divergence. Except for the An. darlingi sequence, the remaining sequences are named with the first three letters from the genus name followed by two letters from the species name and by their NCBI protein accession number. For more details, see text.
Figure 8The G1 protein family of anopheline mosquitoes. A) Clustal alignment. (B) Phylogram based on the alignment in (A). The numbers on the tree nodes represent the percent bootstrap support in 10,000 trials (only values above 50% are shown). The bar at the bottom indicates 20% amino acid divergence. The An. darlingi sequences are identified by AD and a filled square symbol. The An. gambiae sequences are identified by a circle and are named as reported before [7]. The remaining sequences are named with the first three letters from the genus name followed by two letters from the species name and by their NCBI protein accession number. For more details, see text.
Figure 9The 2WIRRP family of Anopheline proteins. Clustal alignment of the An. darlingi proteins with the An. gambiae homologue. Background colour follows convention as in Figure 6. Bar labelled I indicates region of Ser [Asp/Glu] [Asp-Glu] repeats. Bar labelled II identifies the WIRRP repeats notable on the An. gambiae sequence.
Identity at amino-acid level between Anopheles darlingi and An. gambiae salivary secreted and housekeeping proteins
| AD-32 | Short D7r4 | 137 | 35 |
| AD-97 | Short D7r4 | 130 | 29 |
| AD-395 | Short D7r5 | 156 | 53 |
| AD-1 | Short D7r3 | 169 | 61 |
| AD-118 | Long D7 1 | 309 | 43 |
| AD-23 | 30-kDa antigen | 252 | 59 |
| AD-133 | gSG7 anophensin | 134 | 47 |
| AD-8 | SG3 mucin | 139 | 34 |
| AD-143 | gSG10 | 188 | 59 |
| AD-47 | 13.5-kDa mucin | 149 | 34 |
| AD-104 | Apyrase | 571 | 66 |
| AD-573 | Peroxidase | 592 | 86 |
| AD-38 | gVAG | 261 | 67 |
| AD-430 | Antigen 5 | 254 | 51 |
| AD-196 | gSG5 | 328 | 46 |
| AD-217 | Basic tail | 116 | 48 |
| AD-159 | SG1-like3 | 376 | 33 |
| AD-130 | gSG1b | 351 | 35 |
| AD-86 | SG1 | 409 | 30 |
| AD-153 | Trio | 383 | 29 |
| AD-138 | Unknown secreted | 241 | 60 |
| AD-191 | Virus-induced mucin | 277 | 71 |
| AD-457 | Peptidoglycan recognition protein | 188 | 94 |
| AD-174 | Lysozyme | 138 | 80 |
| AD-70 | Maltase | 567 | 80 |
| Mean | 272.6 | 53.2 | |
| SE | 29.0 | 3.8 | |
| SD | 144.9 | 19.1 | |
| AD-519 | Tetraspanin | 249 | 85 |
| AD-680 | Unknown conserved | 188 | 90 |
| AD-408 | Tetraspanin | 288 | 74 |
| AD-184 | Unknown conserved | 144 | 86 |
| AD-527 | Unknown conserved | 101 | 87 |
| AD-77 | Unknown conserved | 137 | 45 |
| AD-79 | Unknown conserved | 137 | 45 |
| AD-401 | Unknown conserved | 270 | 56 |
| AD-584 | Ferritin | 231 | 65 |
| AD-94 | Conserved secreted protein | 104 | 84 |
| AD-345 | Conserved secreted protein | 126 | 92 |
| AD-640 | Conserved secreted protein | 126 | 90 |
| AD-189 | Conserved secreted protein | 119 | 79 |
| AD-939 | Conserved secreted protein | 136 | 30 |
| AD-870 | N-methyl-D-aspartate receptor-associated protein | 100 | 87 |
| AD-489 | Phosphatidic acid phosphatase | 298 | 66 |
| AD-205 | 40S ribosomal protein SA (P40)/Laminin receptor 1 | 290 | 82 |
| AD-195 | Ribosomal protein S4 | 262 | 91 |
| AD-220 | 60S ribosomal protein L7 | 258 | 84 |
| AD-398 | Similar to 3-hydroxybutyrate dehydrogenase type 2 | 255 | 90 |
| AD-165 | 60S ribosomal protein L7A – truncated at 5 prime | 253 | 80 |
| AD-224 | 60s ribosomal protein L2/L8 | 252 | 96 |
| AD-167 | 40S ribosomal protein S3A | 247 | 93 |
| AD-295 | emp24/gp25L/p24 family of membrane trafficking protein | 211 | 92 |
| AD-328 | Peptidyl-prolyl cis-trans isomerase | 202 | 89 |
| AD-207 | Ribosomal protein L19 | 190 | 95 |
| AD-225 | 60S ribosomal protein L9 | 190 | 88 |
| AD-201 | 60s ribosomal protein L18 | 189 | 86 |
| AD-193 | 60S ribosomal protein L11 | 188 | 93 |
| AD-222 | 60S ribosomal protein L22 | 187 | 95 |
| AD-710 | Nucleoside diphosphate kinase | 168 | 93 |
| AD-246 | 60S ribosomal protein L21 | 162 | 83 |
| AD-126 | 40S ribosomal protein S19 | 157 | 88 |
| AD-156 | Ribosomal protein L22 | 154 | 84 |
| AD-180 | 60S ribosomal protein L13A | 154 | 79 |
| AD-212 | 40S ribosomal protein S11 | 153 | 92 |
| AD-215 | 60s ribosomal protein L24 | 153 | 89 |
| AD-251 | 40S ribosomal protein S14 | 152 | 99 |
| AD-253 | 60S ribosomal protein L26 | 151 | 94 |
| AD-937 | Hypothetical conserved protein | 150 | 92 |
| AD-235 | 40S ribosomal protein S15 | 149 | 94 |
| AD-151 | Ribosomal protein S16 | 146 | 96 |
| AD-241 | 60S ribosomal protein L14/L17/L23 | 140 | 100 |
| AD-252 | 40S ribosomal protein S12 | 136 | 97 |
| AD-281 | H3 histone, family 3A | 136 | 99 |
| AD-245 | Ribosomal protein L32 | 134 | 93 |
| AD-592 | Mitochondrial ribosomal protein L54 | 134 | 83 |
| AD-230 | 40S ribosomal protein S17 | 131 | 98 |
| AD-239 | Ubiquitin-like/40S ribosomal S30 protein fusion | 131 | 76 |
| AD-240 | 40S ribosomal protein S15/S22 | 130 | 97 |
| AD-229 | Ubiquitin/60s ribosomal protein L40 fusion | 128 | 100 |
| AD-242 | Ribosomal protein S8 | 126 | 84 |
| AD-280 | H2A histone family, member V | 126 | 95 |
| AD-250 | 60S ribosomal protein L31 | 124 | 99 |
| AD-185 | 40S ribosomal protein S20 | 120 | 92 |
| AD-117 | Acidic ribosomal protein P1 | 115 | 89 |
| AD-247 | 60S ribosomal protein L36 | 113 | 95 |
| AD-120 | 60S acidic ribosomal protein P2 | 113 | 82 |
| AD-262 | Mitochondrial F1F0-ATP synthase, subunit Cf6 | 107 | 90 |
| AD-116 | Translation elongation factor EF-1 alpha/Tu | 103 | 97 |
| Mean | 167.1 | 86.1 | |
| SE | 7.1 | 1.8 | |
| SD | 55.0 | 13.8 | |