| Literature DB >> 22185595 |
Monica F Poelchau1, Julie A Reynolds, David L Denlinger, Christine G Elsik, Peter A Armbruster.
Abstract
BACKGROUND: Many temperate insects survive the harsh conditions of winter by undergoing photoperiodic diapause, a pre-programmed developmental arrest initiated by short day lengths. Despite the well-established ecological significance of photoperiodic diapause, the molecular basis of this crucial adaptation remains largely unresolved. The Asian tiger mosquito, Aedes albopictus (Skuse), represents an outstanding emerging model to investigate the molecular basis of photoperiodic diapause in a well-defined ecological and evolutionary context. Ae. albopictus is a medically significant vector and is currently considered the most invasive mosquito in the world. Traits related to diapause appear to be important factors contributing to the rapid spread of this mosquito. To generate novel sequence information for this species, as well as to discover transcripts involved in diapause preparation, we sequenced the transcriptome of Ae. albopictus oocytes destined to become diapausing or non-diapausing pharate larvae.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22185595 PMCID: PMC3258294 DOI: 10.1186/1471-2164-12-619
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Flowchart of the main analysis steps outlined in this paper.
Figure 2Distribution of read and contig lengths from . Reads lengths from the Ae. albopictus transcriptome are show in red (N = 1,111,941), Ae. albopictus contig lengths in blue (N = 69,474), and Ae. aegypti transcript lengths (v1.2, http://www.vectorbase.org) in green (N = 18,760).
Summary statistics for reads from 454 GS-FLX sequencing
| Total # of reads | % reads removed | # of reads remaining after filtering | Average filtered read length | Average filtered %GC | % of reads mapped to contig set | % of mapped, discarded reads | |
|---|---|---|---|---|---|---|---|
| DI | 668,269 | 13.59 | 570,807 | 404.11 | 50.76 | 61.19 | 2.62 |
| NDI | 656,977 | 14.91 | 541,134 | 432.62 | 50.75 | 62.38 | 3.01 |
| Combined | 1,325,246 | 14.25 | 1,111,941 | 417.98 | 50.76 | 61.77 | 2.81 |
Summary statistics of reads from two cDNA libraries derived from oocytes from DI (diapause inducing) and NDI (non-diapause inducing) photoperiod treatments are shown.
N50 values, and mean and median lengths, for reads and EST sets
| N50 | Mean read length | Median read length | ||
|---|---|---|---|---|
| Trimmed reads | 482 | 417.98 | 467 | |
| redundant EST set | Contigs only | 1009 | 898.25 | 692 |
| redundant EST set | Contigs and singletons | 641 | 601.75 | 493 |
| trimmed EST set | Contigs only | 1120 | 801.25 | 647 |
| trimmed EST set | Contigs and singletons | 974 | 688.48 | 508 |
Databases used for BLAST homology searches with relevant details for each search
| Reference organism | Sequence type | BLAST algorithm | Minimum e-value | Version | Source | Minimum % identity used in final EST set selection |
|---|---|---|---|---|---|---|
| mRNA | BLASTN | 1.00E-10 | N/A | 85 | ||
| transcripts | BLASTN | 1.00E-10 | AaegL1.2 | 85 | ||
| genomic | BLASTN | 1.00E-10 | AaegL1 | 85 | ||
| peptides | BLASTX | 1.00E-10 | AaegL1.2 | 70 | ||
| peptides | BLASTX | 1.00E-05 | CpipJ1.2 | 0 | ||
| peptides | BLASTX | 1.00E-05 | AgamP3.6 | 0 | ||
| translation | BLASTX | 1.00E-05 | r5.29 | 0 | ||
| peptides | BLASTX | 1.00E-04 | WS218 | 0 | ||
| Swiss-Prot | proteins | BLASTX | 1.00E-03 | 2010_6 | 0 |
The minimum e-value used for the initial EST set, and the minimum % identity used for the final EST set, varied based on phylogenetic distance of the organism/database and Ae. albopictus.
Number of ESTs assigned to each genomic database
| Reference organism | Number of matches, initial EST set | Number of matches, final EST set |
|---|---|---|
| 1,011 | 52 | |
| 91,461 | 10,005 | |
| 1,422 | 142 | |
| 10,736 | N/A | |
| 2,993 | 556 | |
| 1,167 | 550 | |
| 103 | 79 | |
| 21 | 16 | |
| 25 | 19 | |
| Swiss-Prot | 4,331 | 86 |
| total number of matches | 113,270 | 11,505 |
| no hit (contigs) | 6,775 | N/A |
| no hit (singletons) | 37,077 | N/A |
| total number of contigs | 69,474 | N/A |
| total number of singletons | 87,648 | N/A |
Both the preliminary EST set and final, trimmed EST set are listed (see Methods).
Figure 3Box-plot of percent identities from EST BLAST alignments to each reference organism. The solid horizontal line represents the median, the box encompasses the lower and upper quartiles, and the boxplot whiskers encompasses the data extremes.
Figure 4Log fold-change expression (M) versus log abundance (A) of TMM-normalized expression from the 454 dataset. The log fold-change and log abundance for genes that are unique to a treatment is undefined; therefore, these genes are shown at an arbitrarily low abundance on the left of the plot in orange. Genes with higher expression under diapausing (DI) conditions have positive M values, and genes with higher expression under non-diapausing (NDI) conditions have negative M values. Genes that qualified as significantly differentially expressed (corrected p < 0.001) are in red.
Figure 5RPKM values (454 sequence data) versus mean qPCR mRNA abundance for NDI and DI treatments. NDI treatments are shown in red, DI in blue. Lines from the linear regressions for each photoperiod treatment, after removal of the outlier histone2A, are plotted as solid lines (r2(NDI)= 0.70; r2(DI)= 0.90).
Correlation coefficients (r2) between qPCR mRNA abundance and 454 gene expression
| normalization method | NDI | DI |
|---|---|---|
| RPKM | 0.722 | 0.911 |
| TMM | 0.548 | 0.727 |
Correlation coefficients are derived from linear regressions between qPCR mRNA abundance and both TMM and RPKM gene expression measures from 454 read counts for each gene verified by qPCR.
Sensitivity and specificity information calculated from 48 candidate genes
| Normalization method | ||
|---|---|---|
| RPKM | TMM | |
| TP (true positives) | 4 | 6 |
| TN (true negatives) | 19 | 12 |
| FP (false positives) | 19 | 26 |
| FN (false negatives) | 5 | 3 |
| TPR (sensitivity) | 0.44 | 0.67 |
| TNR (specificity) | 0.50 | 0.32 |
Values for both RPKM and TMM expression measures from the 454 data are shown. Sensitivity is calculated as the number of true positives over the number of all positives; Specificity is the number of true negatives divided by all negatives.
Genes verified as DE from qPCR analyses
| Putative homolog | 454 category | Fold-change | corrected p-value | Functional category | |
|---|---|---|---|---|---|
| Aalb_oocyte_rep_c42113_trimmed | phosphoenolpyruvate carboxykinase ( | DI | 0.82 | 0.004 | metabolism |
| Aalb_oocyte_rep_c2808_trimmed | Ecdysone inducible protein L2, putative ( | NDI | -0.85 | 0.005 | endocrine signaling |
| Aalb_oocyte_rep_c41764_trimmed | Inhibitor of growth protein ( | 1.05 | 0.002 | DNA replication/ transcription | |
| Aalb_oocyte_rep_c38864_trimmed | Conserved hypothetical protein (AAEL004873) | NDI | 0.48 | 0.004 | unknown |
| Aalb_oocyte_rep_c36433_trimmed | Conserved hypothetical protein (AAEL008645) | DI | 1.33 | 0.000 | unknown |
| Aalb_oocyte_GH79BIP01BENVH_trimmed | Bhlhzip transcription factor bigmax | DI | 0.87 | 0.010 | DNA replication/ transcription |
| Aalb_oocyte_rep_c3449_trimmed | GPCR Methuselah Family | 0.88 | 0.004 | stress response | |
| Aalb_oocyte_rep_c40438_trimmed | Receptor for activated C kinase, putative ( | DI | 0.86 | 0.003 | endocrine signaling |
| Aalb_oocyte_rep_c18194_trimmed | Conserved hypothetical protein (AAEL012019) | DI | 0.92 | 0.010 | unknown |
| Aalb_oocyte_rep_c431_trimmed | phosphatidylethanolamine-binding protein ( | 0.88 | 0.000 | morphogenesis |
Information on each gene verified as differentially expressed is shown. Putative homologs were derived from BLAST searches (see Methods). "454 category" refers to the differential expression status assigned to the EST based on read counts: DI or NDI, significantly over-expressed under DI or NDI conditions, respectively; and a priori, not DE based on 454 read counts, but of a priori interest based on previous diapause research. Fold-change calculations are based on qPCR results, and were calculated as log2(DI-NDI), where DI and NDI are the average mRNA abundance for DI and NDI treatments, respectively (see Methods). The corrected p-value is derived from Student''s t-test of mRNA abundances derived from qPCR reactions.