| Literature DB >> 16712725 |
Ali N Dana1, Maureen E Hillenmeyer, Neil F Lobo, Marcia K Kern, Patricia A Romans, Frank H Collins.
Abstract
BACKGROUND: Large scale sequencing of cDNA libraries can provide profiles of genes expressed in an organism under defined biological and environmental circumstances. We have analyzed sequences of 4541 Expressed Sequence Tags (ESTs) from 3 different cDNA libraries created from abdomens from Plasmodium infection-susceptible adult female Anopheles gambiae. These libraries were made from sugar fed (S), rat blood fed (RB), and P. berghei-infected (IRB) mosquitoes at 30 hours after the blood meal, when most parasites would be transforming ookinetes or very early oocysts.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16712725 PMCID: PMC1508153 DOI: 10.1186/1471-2164-7-119
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
cDNA LIBRARY INSERT AND EST SIZES*
| 243 | 1003 | 33 | 3526 | 322 | |
| 189 | 1156 | 45 | 3738 | 152 | |
| 272 | 1044 | 28 | 2789 | 317 | |
| 1068 | 35 | 3351 | 264 | ||
| 590 | 404 | 8 | 820 | 100 | |
| 381 | 394 | 8 | 748 | 100 | |
| 437 | 377 | 8 | 785 | 100 | |
| 567 | 595 | 11 | 1585 | 104 | |
| 455 | 9 | 1039 | 101 | ||
*Library abbreviations are defined in the text. N = number of clone inserts, ESTs sequenced †The mean EST length in nucleotides (nt) is the length following end-trimming, short sequence filtering and EST contig assembly.
SE = standard error of the mean.
Figure 1Distributions of ESTs among the S, RB and IRB libraries. A total of 1408 singletons, unique transcripts, and 567 multi-EST contigs was examined. A. Distribution of singletons among libraries relative to total contigs. The % of 1975 singletons and contigs is shown in brackets following the number of singletons within each library. B. Taxonomic distribution of all 1975 assembled sequence homologies following BLASTX and BLASTN searches of GenBank nr and a BLASTN search of dbEST. C. Identification of 1965 transcripts sharing A. gambiae genomic identity, E < 10-4.
BIOLOGICAL PROCESSES OF GENES REPRESENTED IN LIBRARIES*
| Total S | Total RB | Total IRB | |
| Metabolism | |||
| Simple/Complex Carbohydrate Metabolism and Transport | 21 (28)† | 15 (17) | 13 (15) |
| Oxidative Phosphorylation | 41 (64) | 38 (61) | 30 (69) |
| Lysosomal Enzymatic Digestion | 1 (2) | 0 | 4 (4) |
| Protein Digestion | 13 (63) | 7 (49) | 11 (63) |
| Protein Modification, Metabolism, Transport and Localization | 42 (54) | 38 (49) | 39 (63) |
| Amino Acid and Derivative Metabolism and Transport | 10 (13) | 6 (10) | 16 (20) |
| Nucleobase/Nucleoside/Nucleotide/Nucleic acid Metabolism and Transport | 9 (21) | 10 (12) | 13 (19) |
| Fatty Acid/Lipid Metabolism and Transport | 9 (11) | 8 (8) | 5 (6) |
| Vitamin/Vitamin Derivative/Cofactor Metabolism and Transport | 2 (3) | 2 (3) | 2 (2) |
| Xenobiotic Metabolism and Transport | 5 (5) | 7 (8) | 5 (6) |
| Pigment Synthesis and Transport | 1 (1) | 2 (2) | 4 (5) |
| Other | 3 (4) | 6 (6) | 3 (3) |
| Total | 157 (269) | 139 (225) | 145 (275) |
| Transport | |||
| Ion Transport | 10 (11) | 14 (16) | 12 (14) |
| Receptor-mediated Endocytosis | 7 (10) | 4 (4) | 6 (11) |
| Total | 17 (21) | 18 (20) | 18 (25) |
| Protein Synthesis | |||
| Transcription and mRNA Processing | 17 (23) | 19 (20) | 19 (21) |
| Translation | 108 (436) | 97 (325) | 115 (482) |
| Protein Folding | 14 (24) | 19 (21) | 10 (13) |
| Total | 139 (483) | 135 (366) | 144 (516) |
| Cellular Processes | |||
| Cell Cycle | 14 (17) | 10 (13) | 6 (11) |
| Cellular Proliferation | 4 (4) | 2 (2) | 3 (3) |
| Chromatin Assembly/Disassembly | 4 (6) | 5 (7) | 5 (10) |
| Apoptosis | 2 (2) | 2 (2) | 5 (7) |
| Senescence | 2 (3) | 0 | 1 (2) |
| Viral Life Cycle | 1 (1) | 1 (1) | 0 |
| Total | 27 (33) | 20(25) | 20 (33) |
| Egg Production | |||
| Vitellogenesis/Oogenesis/Embryogenesis | 29 (33) | 24 (40) | 27 (60) |
| Melanization | 0 | 1 (1) | 0 |
| Total | 29 (33) | 25 (41) | 27 (60) |
| Cellular Communication | |||
| Signal Transduction | 11 (14) | 5 (7) | 6 (12) |
| Cell-cell Signaling | 4 (7) | 4 (4) | 2 (3) |
| Total | 15 (21) | 9 (11) | 8 (15) |
| Intra-/Extra-cellular Architecture Maintenance | |||
| Structural | 8 (35) | 11 (24) | 6 (30) |
| Muscle-related | 6 (6) | 4 (5) | 7 (8) |
| Cell Adhesion | 3 (3) | 3 (3) | 1 (1) |
| Cytoskeleton Organization and Biogenesis | 4 (6) | 12 (17) | 12 (18) |
| Total | 21 (50) | 30 (49) | 26 (57) |
| Response to Stress/External Stimulus | |||
| Response to Oxidative Stress | 7 (9) | 8 (11) | 14 (29) |
| Immune/Defense Response | 11 (29) | 9 (15) | 14 (20) |
| Chemosensory Perception | 3 (3) | 1 (1) | 1 (1) |
| Total | 21 (41) | 18 (27) | 29 (50) |
| Unknown | |||
| Total | 573 (776) | 326 (415) | 430 (663) |
*This analysis does not take into account that some genes are represented in more than 1 library or that more than one biological process may have been ascribed to particular genes.
†Number of Contigs containing one or more ESTs (total ESTs/genes).
OXIDATION AND STRESS RESPONSE GENES
| 34 | 2 | 1 | 5 | 8 | 8898 | 1665469 | 1665984 | 0 | X | EAA09273 | agCP14153 | 1.00E-111 | |
| 35 | 0 | 4 | 3 | 7 | 8880 | 1407780 | 1408378 | 0 | X | Q93113 | glutathione S-transferase 1–6 | 1.00E-110 | |
| 172 | 2 | 0 | 2 | 4 | 8816 | 4859270 | 4859540 | 1E-130 | X | EAA05108 | agCP6896 | 7E-65 | |
| 307 | 0 | 1 | 0 | 1 | 8986 | 5275996 | 5276383 | 1E-173 | X | EAA00516 | agCP9336 | 8E-46 | |
| 620 | 0 | 0 | 2 | 2 | 8984 | 9812533 | 9812822 | 1E-161 | X | AAL58538 | 1E-73 | ||
| 622 | 0 | 0 | 1 | 1 | 8980 | 538441 | 538745 | 1E-172 | X | AAF68382 | thioredoxin 1 | 5E-49 | |
| 704 | 0 | 0 | 1 | 1 | 8880 | 1406678 | 1407026 | 0 | X | EAA08605 | agCP2490 | 4E-36 | |
| 909 | 0 | 1 | 0 | 1 | 8984 | 9810450 | 9810635 | 1E-101 | X | AAG45163 | 2E-20 | ||
| 910 | 0 | 0 | 1 | 1 | 8984 | 9809004 | 9809209 | 1E-108 | X | AAG45164 | 6E-28 | ||
| 931 | 0 | 1 | 0 | 1 | 8933 | 621123 | 621383 | 1E-145 | X | EAA09899 | agCP11759 | 1E-73 | |
| 1012 | 0 | 0 | 4 | 4 | 8849 | 2536282 | 2536794 | 0 | X | EAA07169 | agCP10692 | 6E-80 | |
| 1041 | 1 | 0 | 2 | 3 | 8849 | 1674668 | 1675054 | 0 | X | EAA07207 | agCP10713 | 1E-115 | |
| 1042 | 0 | 1 | 2 | 3 | 8986 | 8102109 | 8102313 | 1E-105 | X | EAA00332 | agCP9864 | 2E-90 | |
| 1065 | 0 | 0 | 3 | 3 | 8804 | 83740 | 83869 | 6E-45 | N | AJ284424 | 4A3B-AAW-E-09-F A. gambiae immune competent 4A3B | 2E-51 | |
| 1482 | 0 | 0 | 1 | 1 | 8984 | 9812211 | 9812365 | 2E-74 | X | AAL58538 | 8E-06 | ||
| 1684 | 0 | 1 | 0 | 1 | 8804 | 84431 | 84661 | 1E-117 | X | EAA03855 | agCP1990 | 2E-35 | |
| 1909 | 1 | 0 | 0 | 1 | 8980 | 539114 | 539324 | 1E-109 | N | AJ283949 | 4A3B-AAH-C-12-F A. gambiae immune competent 4A3B | 1E-107 | |
| 2033 | 1 | 0 | 0 | 1 | 8898 | 1672566 | 1673171 | 0 | X | EAA09147 | agCP14131 | 1E-120 | |
| 2078 | 1 | 0 | 0 | 1 | 8984 | 9807310 | 9807705 | 0 | X | AAL59653 | 4E-71 | ||
| 2127 | 0 | 1 | 0 | 1 | 8986 | 8102593 | 8102854 | 1E-13i6 | X | EAA00332 | agCP9864 | 6E-31 | |
| 2156 | 0 | 0 | 1 | 1 | 8880 | 3392447 | 3392665 | 1E-120 | X | EAA08586 | agCP2356 | 2E-43 | |
| 2296 | 0 | 0 | 1 | 1 | 8807 | 1924679 | 1924904 | 1E-110 | X | EAA03983 | agCP3166 | 5E-75 | |
| 2334 | 1 | 0 | 0 | 1 | 8880 | 3614672 | 3614912 | 1E-124 | X | EAA08535 | agCP2389 | 3E-58 |
IMMUNE-RELATED/DEFENSE RESPONSE GENES
| 28 | 17 | 3 | 0 | 20 | 8987 | 14943052 | 14943642 | 0 | X | EAA01687 | agCP11956 | 1E-142 | serine protease |
| 32 | 2 | 3 | 4 | 9 | 8807 | 3383097 | 3383307 | 1E-116 | X | Q17005 | Lysozyme precursor (1,4-beta-N-acetylmuramidase) | 5E-67 | lysozyme precursor (1,4-beta-N-acetylmuramidase) |
| 68 | 1 | 0 | 0 | 1 | 8794 | 217313 | 217497 | 1E-100 | X | EAA02509 | agCP11665 | 3E-13 | serine protease 14D2 |
| 126 | 1 | 0 | 0 | 1 | 8964 | 2322618 | 2322792 | 2E-86 | X | EAA12171 | agCP10937 | 1E-25 | TEP3 |
| 153 | 2 | 0 | 0 | 2 | 8986 | 8953596 | 8953709 | 4E-45 | N | BM621296 | 17000687446469 A.Gam.ad.cDNA1 | 2E-63 | fat-spondin |
| 180 | 1 | 0 | 1 | 2 | 8859 | 3227518 | 3227913 | 0 | X | EAA07758 | agCP1111 | 4E-70 | signal transducer |
| 221 | 0 | 0 | 1 | 1 | 8807 | 3388356 | 3388647 | 1E-158 | X | EAA04406 | agCP3675 | 8E-79 | lysozyme c-8 |
| 418 | 1 | 0 | 3 | 4 | 8944 | 1827235 | 1827637 | 0 | X | EAA10153 | agCP15402 | 3E-64 | gambicin |
| 659 | 0 | 0 | 1 | 1 | 8807 | 3140140 | 3140290 | 5E-80 | X | EAA04458 | agCP3967 | 3E-56 | lysozyme c-4 |
| 804 | 0 | 1 | 0 | 1 | 8960 | 17845546 | 17845795 | 1E-139 | X | CAA09389 | ICHIT protein | 5E-31 | ICHIT |
| 832 | 1 | 0 | 1 | 2 | 8811 | 1718220 | 1718532 | 1E-151 | N | BM635649 | 17000687559053 A.Gam.ad.cDNA1 | 3E-64 | AgToll |
| 995 | 0 | 2 | 2 | 4 | 8960 | 23616 | 23997 | 0 | X | EAA11001 | agCP5701 | 1E-30 | serine protease |
| 1049 | 0 | 2 | 0 | 2 | 8944 | 1995046 | 1995300 | 1E-138 | X | EAA10138 | agCP15205 | 1E-78 | TEP12 |
| 1095 | 1 | 1 | 0 | 2 | 8960 | 16595875 | 16596267 | 0 | X | EAA11334 | agCP6381 | 7E-30 | signal transducer |
| 1120 | 0 | 1 | 1 | 2 | 8986 | 8676449 | 8676732 | 1E-159 | X | EAA00414 | agCP9557 | 5E-31 | serine protease |
| 1197 | 0 | 0 | 1 | 1 | 8847 | 1123621 | 1123927 | 1E-173 | X | EAA06859 | agCP7366 | 3E-25 | Cecropin B |
| 1286 | 0 | 0 | 1 | 1 | 8898 | 2844394 | 2844678 | 1E-149 | X | EAA09116 | agCP14093 | 4E-28 | GNBPB1 |
| 1364 | 0 | 0 | 1 | 1 | 8948 | 918161 | 918491 | 1E-166 | X | EAA10406 | agCP2049 | 2E-54 | ficolin |
| 1616 | 0 | 1 | 0 | 1 | 8980 | 7957050 | 7957284 | 1E-128 | X | CAB90818 | serine protease | 7E-79 | serine protease |
| 1701 | 0 | 1 | 0 | 1 | 8794 | 212210 | 212531 | 0 | X | AAB62929 | serine protease 14D | 2E-56 | serine protease 14D |
| 1922 | 0 | 0 | 1 | 1 | 8816 | 1069890 | 1070396 | 0 | X | EAA05160 | agCP6864 | 2E-55 | ficolin |
| 1997 | 1 | 0 | 0 | 1 | 8859 | 9665976 | 9666551 | 0 | N | AF444782 | AgToll9 | 4E-06 | AgToll9 |
| 2030 | 0 | 0 | 1 | 1 | 8807 | 3137120 | 3137454 | 0 | X | EAA04667 | agCP3164 | 3E-82 | Lysozyme c-7 |
| 2038 | 1 | 0 | 0 | 1 | 8975 | 72415 | 72691 | 1E-155 | N | AJ420785 | spi21F gene | 1E-71 | serpin spi21F |
| 2178 | 0 | 0 | 1 | 1 | 8847 | 1121909 | 1122047 | 6E-73 | X | EAA06858 | agCP7503 | 6E-21 | Cecropin A |
Figure 2Transcript abundance distribution within the 567 multi-EST contigs of the S, RB and IRB libraries. The 1408 singletons are not shown. Contig size = the number of ESTs contained within it.
MOLECULAR FUNCTIONS OF CONTIGS CONTAINING > 20 ESTS.
| 7 | 3 | 8 | 9 | 20 | agCP11398 | structural constituent of ribosome | |
| 14 | 11 | 6 | 3 | 20 | agCP1538 | structural constituent of ribosome | |
| 140 | 5 | 6 | 10 | 21 | agCP7468 | structural constituent of ribosome | |
| 167 | 11 | 4 | 6 | 21 | RpS27A | structural constituent of ribosome | |
| 20 | 13 | 5 | 4 | 22 | agCP10687 | structural constituent of ribosome | |
| 8 | 15 | 5 | 3 | 23 | agCP8133 | structural constituent of ribosome | |
| 532 | 7 | 6 | 10 | 23 | agCP1729 | structural constituent of ribosome | |
| 165 | 10 | 7 | 7 | 24 | agCP7766 | structural constituent of ribosome | |
| 199 | 9 | 5 | 10 | 24 | agCP4384 | structural constituent of ribosome | |
| 17 | 12 | 3 | 10 | 25 | agCP8340 | structural constituent of ribosome | |
| 528 | 6 | 7 | 13 | 26 | agCP4228 | structural constituent of ribosome | |
| 139 | 6 | 5 | 17 | 28 | ebiP415 | structural constituent of ribosome | |
| 229 | 8 | 9 | 11 | 28 | agCP8207 | structural constituent of ribosome | |
| 269 | 10 | 6 | 18 | 34 | agCP9994 | structural constituent of ribosome | |
| 4 | 17 | 5 | 13 | 35 | agCP9893 | structural constituent of ribosome | |
| 521 | 7 | 14 | 15 | 36 | agCP8317 | structural constituent of ribosome | |
| 91 | 13 | 12 | 26 | 51 | agCP11873 | structural constituent of ribosome | |
| 228 | 19 | 16 | 30 | 65 | agCP9509 | structural constituent of ribosome | |
ASSEMBLED SEQUENCES WITH STATISTICALLY SIGNIFICANT DIGITAL NORTHERN VALUES
| 996 | agCP14019; similar to | 0 | 0 | 5 | 0 | 0 | |||||
| 24 | no known predicted protein; shares 96% nucleotide identity with transcript ENSANG00000021567, a gene product belonging to the vitelline membrane protein family | 0 | 1 | 12 | 0 | 0.240 | |||||
| 139 | ebiP415; Ribosomal protein L44 | 6 | 5 | 17 | 0.132 | ||||||
| 475 | agCP12050; AgMuc1 | 2 | 3 | 12 | 0.138 | ||||||
| 145 | agCP6338; protein of unknown function containing a HMW kininogen protein domain | 1 | 2 | 8 | 0.172 | ||||||
| 313 | CG14235; Cytochrome c oxidase | 1 | 2 | 8 | 0.172 | ||||||
| 270 | no known predicted protein; shares sequence identitiy with transcript 17000687051196 A.Gam.ad.cDNA.blood 1 of unknown function | 2 | 3 | 10 | 0.138 | ||||||
| 230 | agCP2518; Vitellogenin 1 | 1 | 7 | 19 | |||||||
| 98 | agCP4445; hydrogen-transporting two-sector ATPase | 0 | 3 | 4 | 0 | 0.119 | |||||
| 521 | agCP8317; Ribosomal protein L13 | 7 | 14 | 15 | |||||||
| 447 | agCP2518; Vitellogenin 1 | 0 | 10 | 8 | 0 | 0.034 | |||||
| 35 | Glutathione S-transferase D1-6 | 0 | 4 | 3 | 0 | 0.060 | 0.081 | ||||
| 537 | agCP5224; Glycine hydroxymethyltransferase | 0 | 5 | 2 | 0 | 0.123 | |||||
| 273 | agCP12503; ATP synthase B chain mitochondrial precursor (FO-ATP synthase subunit B) | 1 | 6 | 3 | 0.123 | ||||||
| 516 | agCP3409; AgPer1 | 8 | 10 | 3 | |||||||
| 8 | agCP8133; Ribosomal protein L38e | 14 | 6 | 3 | 0.076 | ||||||
| 208 | agCP7801; ATP dependent RNA helicase | 5 | 1 | 0 | 0 | 0.113 | |||||
| 28 | agCP11956; Serine protease G13 | 17 | 3 | 0 | 0 | ||||||
| 591 | agCP1095; peptide with no known function, is part of the protein Family mucin 4 tracheobronchial mucin fragment, shares weak identity to | 6 | 1 | 0 | 0 | 0.079 | 0.17 | ||||
| 113 | agCP10139; 65 aa peptide of unknown function containing no known protein domains except signal peptide and transmembrane regions. | 8 | 0 | 0 | 0 | 0 | |||||
| 18 | agCP11264; Trypsin 1 | 25 | 2 | 4 | 0.125 | ||||||
| 408 | 17000687367332 A.Gam.ad.cDNA1; EST of unknown function | 5 | 0 | 0 | 0 | 0 | |||||
| 13 | gene ENSANGG0000002077 6; gene located 4 kb 3' to the Peritrophin 1 that encodes a protein with a chitin binding domain | 16 | 2 | 5 | 0.104 | ||||||
| 99 | agCP3123; Chymotrypsin 2 | 16 | 3 | 8 | 0.069 | ||||||
| 170 | agCP12309; putative homolog of | 7 | 0 | 4 | 0 | 0.086 |
Actual EST counts (S, RB and IRB) and the normalized number of ESTs (S (norm), RB (norm), and IRB (norm)) are given for each cDNA library. Assembled sequences were identified as differentially expressed among the three libraries if the R statistic was less than 0.05 (Stekel et al. 2000). The Audic and Claverie statistic (AC < 0.05) was used to identify libraries in which the assembled sequence was statistically significantly differentially expressed (Audic and Claverie, 1997). Pairwise comparisons between libraries are indicated by AC S-RB, AC S-IRB, and AC RB-IRB, respectively. All statistics were calculated with the IDEG6 Tool (Romualdi et al., 2003). Cells with characters in bold (underlined), bold (italics) and bold indicate statistical values equal to zero, less than 0.01, and less than 0.05, respectively. Cells with characters in italics (only), represent different levels of gene expression ranging from no detectable transcripts (smallest) to 145 transcripts (largest) detected in a library.
Figure 3qRT-PCR of selected genes identified as differentially expressed in A. gambiae abdomens by digital Northern analysis. A. AS 996, a cathepsin B-like protease; B. AS 447, Vitellogenin; C. AS 475, Mucin; D. AS 145, a putative high molecular weight kininogen; and E. AS 99, Chymotrypsin 2. Transcript abundances were normalized against the abundance of RP S7 under the same condition and shown as genomic equivalents * 10,000 for S, RB, and P. berghei-infected (IRB) adult female mosquitoes 30 h post treatment. The arithmetic means of three biological replicates SEM are shown. ANOVA was used to compare the means, and a post hoc Tukey test was used to make pairwise comparisons. Statistically significant differences, P < 0.05 are discussed in the text.