| Literature DB >> 18088419 |
George F Mayhew1, Lyric C Bartholomay, Hang-Yen Kou, Thomas A Rocheleau, Jeremy F Fuchs, Matthew T Aliota, I-Yu Tsao, Chiung-Yen Huang, Tze-Tze Liu, Kwang-Jen Hsiao, Shih-Feng Tsai, Ueng-Cheng Yang, Nicole T Perna, Wen-Long Cho, Bruce M Christensen, Cheng-Chen Chen.
Abstract
BACKGROUND: The mosquito, Armigeres subalbatus, mounts a distinctively robust innate immune response when infected with the nematode Brugia malayi, a causative agent of lymphatic filariasis. In order to mine the transcriptome for new insight into the cascade of events that takes place in response to infection in this mosquito, 6 cDNA libraries were generated from tissues of adult female mosquitoes subjected to immune-response activation treatments that lead to well-characterized responses, and from aging, naïve mosquitoes. Expressed sequence tags (ESTs) from each library were produced, annotated, and subjected to comparative analyses.Entities:
Mesh:
Year: 2007 PMID: 18088419 PMCID: PMC2262096 DOI: 10.1186/1471-2164-8-462
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of Ar. subalbatus EST and EST cluster production from six cDNA libraries.
| # ESTs | # EST clusters | |
| Bacteria (whole body) | 5,654 | 2,372 |
| Bacteria (hemocytes) | 11,300 | 2,107 |
| 7,014 | 2,510 | |
| 7,192 | 3,096 | |
| Naïve 7 (whole body) | 4,289 | 1,911 |
| Naïve 14 (whole body) | 5,405 | 2,325 |
| Combined | 38,079 | 8,020 |
The condition of mosquitoes/inocula used to generate the material for each cDNA library is listed in the left-hand column. Bacteria = mixture of E. coli and M. luteus.
Figure 1A comparison of EST clusters from . The type of immune response activation for mosquitoes is listed in the primary row and column. At the intersection of each row and column, the number of clusters unique to that combination of libraries is listed in bold, followed by the number of those clusters that are designated as unknown (U) or conserved unknown (CU). Clusters from the 4 immune response activated libraries (Immune activated combined) were queried against the naïve libraries such that: a cluster is represented in at least 1 of the 4 libraries (but not in naive) (top -1), or clusters are represented in all 4 of the libraries (but not in naive) (bottom – 2).
Clusters showing significant differences as determined by Stekel R value and Chi square analysis.
| NADH dehydrogenase subunit 2 | 245 | 23 | 14 | 10 | 21 | 19 | 0 | 0 | |
| NADH dehydrogenase subunit 6 | 172 | 18 | 8 | 1 | 10 | 11 | 0 | 0 | |
| cytochrome c oxidase subunit I | 245 | 67 | 24 | 11 | 59 | 54 | 0 | 0 | |
| unknown | 325 | 52 | 33 | 17 | 26 | 66 | 0 | 0 | |
| cytosolic small ribosomal subunit S26 | 60 | 69 | 57 | 41 | 36 | 67 | 0.000001 | 0.000001 | |
| NADH dehydrogenase | 28 | 2 | 2 | 4 | 6 | 1 | 0.000002 | 0.000001 | |
| ubiquitin | 83 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | |
| cytochrome c oxidase subunit III | 277 | 79 | 83 | 20 | 62 | 52 | 0 | 0 | |
| cytochrome b | 47 | 19 | 14 | 1 | 9 | 8 | 0 | 0 | |
| mitochondrial adenosine triphosphatase subunit 6 | 202 | 60 | 36 | 1 | 51 | 45 | 0 | 0 | |
| ubiquinol-cytochrome c reductase subunit | 62 | 9 | 4 | 12 | 9 | 11 | 0 | 0 | |
| cytochrome c | 74 | 15 | 8 | 12 | 11 | 12 | 0 | 0 | |
| unknown | 243 | 59 | 123 | 79 | 54 | 72 | 0 | 0 | |
| putative: conserved unknown | 32 | 18 | 7 | 3 | 20 | 6 | 0.000001 | 0.000002 | |
| ubiquinol-cytochrome c reductase | 56 | 14 | 5 | 11 | 14 | 9 | 0 | 0 | |
| AT DNA binding | 0 | 7 | 15 | 13 | 6 | 8 | 0.000002 | 0.00033 | |
| questionable: Bombyx mori prophenoloxidase activating factor 3 | 26 | 1 | 3 | 3 | 1 | 1 | 0.000001 | 0 | |
| questionable: conserved unknown | 1 | 6 | 7 | 21 | 8 | 10 | 0.000006 | 0.00008 | |
| ubiquinol-cytochrome-c reductase | 94 | 16 | 8 | 29 | 20 | 26 | 0 | 0 | |
| cytosolic large ribosomal subunit L36 | 27 | 43 | 33 | 24 | 26 | 47 | 0 | 0 | |
| NADH dehydrogenase | 47 | 2 | 8 | 10 | 4 | 7 | 0 | 0 | |
| gelsolin | 129 | 20 | 38 | 33 | 13 | 57 | 0 | 0 | |
| cytosolic large ribosomal subunit L40 | 21 | 5 | 0 | 0 | 5 | 1 | 0 | 0.000005 | |
| cytochrome C oxidase subunit II | 52 | 15 | 12 | 6 | 9 | 27 | 0.000001 | 0.000002 | |
| Myosin alkali light chain 1 | 35 | 3 | 8 | 0 | 3 | 6 | 0 | 0 | |
| questionable: receptor binding | 0 | 0 | 0 | 2 | 6 | 0 | 0.000228 | 0.000002 | |
| questionable: cytochrome c oxidase | 63 | 13 | 6 | 6 | 13 | 3 | 0 | 0 | |
| conserved unknown | 17 | 0 | 4 | 0 | 0 | 4 | 0.000003 | 0.000054 | |
| cytosolic small ribosomal subunit S9 | 39 | 73 | 43 | 68 | 32 | 54 | 0 | 0 | |
| superoxide dismutase | 20 | 2 | 1 | 0 | 1 | 1 | 0.000002 | 0.000001 | |
| ribosomal protein L41 | 276 | 100 | 170 | 94 | 126 | 104 | 0 | 0 | |
| nucleoside diphosphate kinase | 2 | 14 | 1 | 11 | 0 | 5 | 0.000002 | 0.000002 | |
| questionable: protein-tyrosine kinase | 0 | 0 | 0 | 107 | 0 | 0 | 0 | 0 | |
| hydrogen transporting two sector ATPase | 53 | 11 | 11 | 8 | 17 | 9 | 0.000003 | 0.000002 | |
| cecropin | 17 | 1 | 1 | 0 | 0 | 5 | 0.000006 | 0.000023 | |
| serine protease | 1 | 6 | 28 | 9 | 3 | 19 | 0 | 0 | |
| cytosolic small ribosomal subunit S3A | 78 | 26 | 27 | 16 | 20 | 29 | 0.000068 | 0.000132 | |
| defensin | 136 | 14 | 27 | 6 | 5 | 9 | 0 | 0 | |
| putative: heat shock | 0 | 0 | 1 | 56 | 2 | 3 | 0 | 0 | |
| lysozyme | 116 | 17 | 32 | 10 | 11 | 35 | 0 | 0 | |
| cytochrome c oxidase | 130 | 15 | 11 | 10 | 16 | 5 | 0 | 0 | |
| putative: conserved unknown | 0 | 0 | 3 | 41 | 0 | 3 | 0 | 0 | |
| putative: conserved unknown | 0 | 0 | 0 | 9 | 0 | 1 | 0.000047 | 0.000003 | |
| cecropin | 32 | 35 | 17 | 3 | 14 | 11 | 0 | 0 | |
| putative: conserved unknown | 0 | 9 | 13 | 15 | 7 | 7 | 0.000003 | 0.000718 | |
| conserved unknown | 63 | 21 | 11 | 12 | 12 | 16 | 0.000007 | 0.000004 | |
| putative: calcium ion binding protein | 0 | 0 | 13 | 0 | 0 | 0 | 0 | 0 | |
| serine protease | 112 | 18 | 42 | 43 | 7 | 29 | 0 | 0 | |
| unknown | 35 | 0 | 3 | 0 | 2 | 0 | 0 | 0 | |
| proton-transporting ATP synthase complex subunit | 70 | 8 | 9 | 4 | 14 | 16 | 0 | 0 | |
| putative: conserved unknown | 25 | 0 | 3 | 2 | 2 | 2 | 0.000001 | 0 | |
| hydrogen-exporting ATPase | 33 | 2 | 5 | 2 | 1 | 9 | 0 | 0 | |
| putative: odorant-binding | 0 | 0 | 0 | 9 | 0 | 0 | 0.000015 | 0 | |
| questionable: odorant-binding protein 56e | 0 | 0 | 0 | 9 | 0 | 0 | 0.000015 | 0 | |
| trypsin | 0 | 0 | 0 | 25 | 0 | 0 | 0 | 0 | |
| trypsin | 10 | 221 | 98 | 70 | 107 | 70 | 0 | 0 | |
| unknown | 5 | 4 | 9 | 7 | 11 | 39 | 0 | 0 | |
| questionable: odorant-binding | 0 | 0 | 0 | 123 | 0 | 0 | 0 | 0 | |
| questionable: odorant-binding | 0 | 0 | 0 | 28 | 0 | 0 | 0 | 0 | |
| trypsin | 3 | 85 | 10 | 11 | 33 | 2 | 0 | 0 | |
| questionable: apolipophorin | 129 | 0 | 5 | 32 | 2 | 6 | 0 | 0 | |
| vitellogenin C | 0 | 0 | 0 | 18 | 0 | 0 | 0 | 0 | |
| unknown | 130 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | |
| hydrogen transporting two sector ATPase | 41 | 6 | 5 | 2 | 2 | 4 | 0 | 0 | |
| putative: serine protease | 5 | 13 | 28 | 2 | 14 | 6 | 0 | 0 | |
| putative: serine protease | 1 | 15 | 10 | 4 | 12 | 5 | 0.000001 | 0.000002 | |
| unknown | 0 | 3 | 1 | 39 | 2 | 4 | 0 | 0 | |
| ATPase synthase | 16 | 4 | 1 | 0 | 0 | 0 | 0.000003 | 0.000012 | |
| NADH dehydrogenase | 36 | 6 | 3 | 8 | 7 | 1 | 0.000001 | 0.000001 | |
| conserved unknown | 45 | 8 | 11 | 10 | 4 | 6 | 0.000021 | 0.000005 | |
| putative: serine protease | 0 | 0 | 0 | 61 | 0 | 0 | 0 | 0 | |
| putative: serine protease | 0 | 5 | 11 | 28 | 7 | 8 | 0 | 0 | |
| questionable: conserved unknown | 0 | 0 | 0 | 32 | 0 | 0 | 0 | 0 | |
| cytosolic large ribosomal subunit L34a | 1 | 38 | 10 | 9 | 7 | 3 | 0 | 0 | |
| zinc-metalloproteinase precursor | 2 | 14 | 1 | 0 | 12 | 3 | 0 | 0 | |
| putative: cathepsin | 0 | 0 | 0 | 30 | 0 | 0 | 0 | 0 | |
| infection responsive short peptide (gambicin) | 17 | 23 | 40 | 47 | 17 | 24 | 0.000001 | 0.000002 | |
| unknown | 0 | 0 | 0 | 12 | 0 | 0 | 0 | 0 | |
| ATP/ADP antiporter (transporter) | 21 | 3 | 1 | 0 | 1 | 0 | 0 | 0 | |
| beta-globin | 0 | 0 | 0 | 34 | 0 | 0 | 0 | 0 | |
| cathepsin | 0 | 0 | 0 | 19 | 0 | 0 | 0 | 0 | |
| trypsin | 0 | 4 | 0 | 16 | 3 | 2 | 0 | 0 | |
| putative: conserved unknown | 0 | 0 | 0 | 9 | 0 | 0 | 0.000015 | 0 | |
| putative: arrestin 2 | 0 | 13 | 6 | 8 | 9 | 9 | 0.000004 | 0.000307 | |
| questionable: conserved unknown | 0 | 0 | 0 | 32 | 0 | 0 | 0 | 0 | |
| rhodopsin | 0 | 2 | 13 | 11 | 5 | 11 | 0.000001 | 0.000066 | |
| questionable: chitin binding | 0 | 0 | 0 | 12 | 0 | 0 | 0 | 0 | |
| unprocessed 18S ribosomal RNA | 40 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | |
| unknown | 90 | 0 | 0 | 1 | 0 | 2 | 0 | 0 | |
| unknown | 107 | 56 | 65 | 38 | 22 | 89 | 0 | 0 | |
| NADH dehydrogenase subunit | 26 | 2 | 2 | 1 | 8 | 5 | 0.000003 | 0.000006 | |
| questionable: triacylglycerol lipase | 0 | 0 | 0 | 8 | 0 | 0 | 0.000066 | 0.000002 | |
| putative: odorant-binding protein G.1A.F | 0 | 0 | 0 | 22 | 0 | 0 | 0 | 0 | |
| putative: trypsin | 0 | 0 | 0 | 21 | 1 | 0 | 0 | 0 | |
| hydrogen transporting two sector ATPase | 48 | 8 | 8 | 6 | 8 | 13 | 0.000004 | 0.000001 | |
| vitellogenin | 0 | 0 | 0 | 19 | 0 | 0 | 0 | 0 | |
| defensin | 17 | 2 | 5 | 0 | 0 | 0 | 0.000002 | 0.00002 | |
| questionable: threonine-rich salivary mucin | 0 | 0 | 0 | 0 | 6 | 4 | 0.000031 | 0.000003 | |
| NADH dehydrogenase | 36 | 5 | 10 | 0 | 4 | 3 | 0 | 0 |
Clusters and their constituent ESTs were analysed using IDEG6 [20] to find clusters where the number of ESTs were statistically different between the libraries. Of 8,020 clusters analysed, 99 showed a significantly differential number of ESTs collected from one or more of the libraries. Each row represents the Genbank Accession number for cluster, and the columns are the number of ESTs from each of the six libraries that are a member of it. The libraries are: asuhem (bacteria-inoculated, hemocyte), diroinf (whole body, Dirofilaria immitis injected), imacbac (bacteria-inoculated whole body), brumal (blood-fed Brugia malayi), n7 and n14 (naïve, 5–7 and 12–14 days of age). The R value is the inverse log of the Stekel R Score [23], and the Chi value is a general Chi square analysis.
Figure 2Summary of Gene Ontology assignments to 2,793 . A gene list of assigned annotations was processed using WEGO (Web Gene Ontology Annotation Plot). Numbers outside the pie charts represent the number of clusters within each category. (Top) Tier 2 summaries of the three main branches of the gene ontology: Molecular Function, Cellular Component, and Biological Process. The categories of physiological process (GO:0007582) and cellular process (GO:0009987) were removed from the display in order to visualize the categories containing fewer members. (Bottom) Pie charts of Tier 3 and Tier 4, 5, and 6 of the GO subcategory of Biological Processes, response to stimuli (GO:0050896) and defense response.
Armigeres subalbatus EST clusters that represent characterized, published sequences from this mosquito.
| ABC membrane transporter (white) | n/a | |||||||
| aslectin AL-1 | ACN-0180799 | 3 | 1 | 1 | 1 | 0 | 0 | |
| ACN-0183776 | 1 | 0 | 0 | 0 | 0 | 0 | ||
| beta 1,3-glucan recognition protein (GRP) | ACN-0184624 | 0 | 0 | 1 | 0 | 0 | 0 | |
| dopa decarboxylase | ACN-0182608 | 0 | 2 | 1 | 0 | 1 | 0 | |
| dopachrome conversion enzyme | ACN-0181349 | 0 | 1 | 0 | 0 | 0 | 1 | |
| glucose-6-phosphate dehydrogenase (G6pd) | n/a | |||||||
| phenylalanine hydroxylase | ACN-0184263 | 1 | 0 | 0 | 1 | 1 | 1 | |
| prophenoloxidase I | n/a | |||||||
| prophenoloxidase II | n/a | |||||||
| prophenoloxidase III | n/a | |||||||
| prophenoloxidase IV | ACN-0186737 | 0 | 0 | 0 | 1 | 0 | 0 | |
| prophenoloxidase V | ACN-0182951 | 0 | 0 | 1 | 0 | 0 | 0 | |
| prophenoloxidase VI | ACN-0186767 | 0 | 1 | 0 | 0 | 0 | 0 | |
| ACN-0182961 | 1 | 0 | 0 | 0 | 0 | 0 | ||
| serine protease | ACN-0181859 | 0 | 0 | 0 | 1 | 0 | 0 |
Because of the unique, robust melanization response that is elicited when Ar. subalbatus is infected with certain nematode parasites, several molecules related to melanogenesis have been the subject of comprehensive characterization. EST clusters encoding these gene products are presented according to the number of ESTs in the following libraries: asuhem (bacteria-inoculated, hemocyte), diroinf (whole body, Dirofilaria immitis injected), imacbac (bacteria-inoculated whole body), brumal (blood-fed Brugia malayi), n7 and n14 (naïve, 7 and 14 days of age).
Searching Ar. subalbatus EST clusters for immunity-related sequences based on homology to other flies.
| AMP | Cecropin | 98 | 65 | 70 | 56 | 41 | 47 | 377 | 0.070988 |
| AMP | Diptericin | 0 | 2 | 0 | 0 | 0 | 1 | 3 | 0.160764 |
| CASP | CASP | 1 | 1 | 0 | 1 | 1 | 0 | 4 | 0.673041 |
| CAT | CAT | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0.561495 |
| CLIP | CLIP-CLIP-CLIP-SPH | 1 | 1 | 1 | 0 | 0 | 1 | 4 | 0.670142 |
| CLIP | CLIP-SP-CLIP-SP | 1 | 0 | 0 | 1 | 0 | 0 | 2 | 0.668311 |
| CLIP | SP-CLIP | 4 | 0 | 0 | 0 | 0 | 0 | 4 | 0.060745 |
| CLIP | SP-CLIP-SPH | 2 | 0 | 0 | 1 | 0 | 0 | 3 | 0.440816 |
| CLIP | SPH-CLIP | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0.754901 |
| CLIP | SPH-CLIP-SPH | 1 | 1 | 0 | 2 | 0 | 1 | 5 | 0.576541 |
| CLIP | SP-SPH-CLIP | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0.561495 |
| GALE | GALE | 0 | 0 | 2 | 1 | 1 | 2 | 6 | 0.206431 |
| GNBP | GNBP | 0 | 1 | 1 | 0 | 0 | 0 | 2 | 0.420516 |
| IAP | IAP | 1 | 1 | 0 | 3 | 0 | 2 | 7 | 0.260929 |
| IMDPATH | IKKg | 1 | 0 | 0 | 1 | 0 | 0 | 2 | 0.668311 |
| ML | ML | 14 | 10 | 5 | 8 | 9 | 3 | 49 | 0.223609 |
| PGRP | PGRP | 9 | 1 | 4 | 2 | 2 | 5 | 23 | 0.312906 |
| PPO | PPO | 1 | 1 | 0 | 1 | 0 | 0 | 3 | 0.653363 |
| PRDX | GPX | 8 | 2 | 1 | 1 | 0 | 3 | 15 | 0.100553 |
| PRDX | HPX | 1 | 2 | 1 | 3 | 1 | 0 | 8 | 0.446023 |
| PRDX | TPX | 7 | 3 | 5 | 14 | 4 | 8 | 41 | 0.136651 |
| SCR | SCRA | 0 | 7 | 3 | 1 | 2 | 2 | 15 | 0.004735 |
| SCR | SCRB | 1 | 0 | 0 | 1 | 2 | 0 | 4 | 0.26927 |
| SCR | SCRC | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0.754901 |
| SOD | SOD-Cu-Zn | 5 | 3 | 3 | 7 | 2 | 2 | 22 | 0.798952 |
| SPZ | SPZ | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0.575875 |
| SRPN | SRPN-nonINHIB | 4 | 2 | 2 | 0 | 3 | 2 | 13 | 0.281764 |
| TEP | TEP | 1 | 1 | 2 | 1 | 1 | 4 | 10 | 0.376665 |
| TOLLPATH | TUBE | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.562898 |
Peptide sequences were harvested from ImmunoDB [24] for An. gambiae, Ae. aegypti, and D. melanogaster for each of the following gene families: All 8,020 EST clusters were blasted against a database of these peptides using blastx (e-value cutoff of 1e-5). BLAST hits were parsed from output files using tcl_blast_parser_123_V017.tcl [55] with a cut-off of 40% match, 1e-20 normalized e-value, and a minimum match length of 30 residues. Top hits were taken for each significant match and verified manually. Matches then were categorized by family and subfamily according to Waterhouse et al. (2007) (Left column) and the composition (# of compiled sequences) of EST clusters were collated from the assembly according to the library(s) from which ESTs came: asuhem (bacteria-inoculated, hemocyte), diroinf (whole body, Dirofilaria immitis injected), imacbac (bacteria-inoculated whole body), brumal (blood-fed Brugia malayi), n7 and n14 (naïve, 5–7 and 12–14 days of age). Stekel R values [23] were determined for the groupings using IDEG6 [20] and those having a score of 0.001 or better were considered significantly differential, and are flagged with bolded and underlined text for the row.
Figure 3Homologous sequences for . A) Comparative analysis of Ar. subalbatus EST clusters with predicted peptides from 3 other mosquito species with completed genomes: Ae. aegypti, An. gambiae, and C.p. quinquefasciatus. Overlapping regions indicate homologous sequences from blastx searches against the peptide databases. A homolog is defined as having an e-value cutoff of 1e-20, a percent match of 40% (true matches, not conserved), and a minimum match length of 30 for the high-scoring segment pair (HSP). This comparison includes 8,020 possible cluster sequences from Ar. subalbatus (brackets), of which 3,013 had no homolog. Boxes directly adjacent to circles indicate 1) the species being compared to Ar. subalbatus, and 2) the total # of homologous sequences between that species and Ar. subalbatus. B) A gene list of the total of overlapping and non-overlapping Ar. subalbatus homologs to Ae. aegypti, An. gambiae, and C. p. quinquefasciatus was compared to a gene list of homologs found to D. melanogaster. A significant number of genes (2,074) from Ar. subalbatus have no homolog in the fruit fly, but qualify as homologs to genes in other mosquito species.
Figure 4Frequency distribution of the quality of blastx matches (according to e-value) for genes considered to be homologs in . The number of e-values within a range is presented as a percentage of the total number of homologs per species. The graph shows an increasing trend of higher quality matches in more closely related species, while a majority of matches in distant species are lower quality.