| Literature DB >> 21085471 |
Kimberly A Bishop-Lilly1, Michael J Turell, Kristin M Willner, Amy Butani, Nichole M E Nolan, Shannon M Lentz, Arya Akmal, Al Mateczun, Trupti N Brahmbhatt, Shanmuga Sozhamannan, Chris A Whitehouse, Timothy D Read.
Abstract
BACKGROUND: Despite the global threat caused by arthropod-borne viruses, there is not an efficient method for screening vector populations to detect novel viral sequences. Current viral detection and surveillance methods based on culture can be costly and time consuming and are predicated on prior knowledge of the etiologic agent, as they rely on specific oligonucleotide primers or antibodies. Therefore, these techniques may be unsuitable for situations when the causative agent of an outbreak is unknown. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2010 PMID: 21085471 PMCID: PMC2976685 DOI: 10.1371/journal.pntd.0000878
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Major constituents of Ae. aegypti metagenome are listed below, with number of reads corresponding to each category as well as percentage of the total reads.
| 5 infected, F | 1/5 infected, F | 5 noninfected, F | 5 infected, M | 5 infected, M | ||||||
| N2173 | N2175 | N2187 | N2473 | N2474 | ||||||
| reads | % reads | reads | % reads | reads | % reads | reads | % reads | reads | % reads | |
|
| 268,015 | 92.60 | 182,788 | 84.56 | 246,080 | 72.03 | 239,991 | 61.38 | 183,687 | 54.54 |
|
| 2,353 | 0.81 | 2,268 | 1.05 | 15,559 | 4.55 | 1,915 | 0.49 | 4,574 | 1.36 |
|
| 8,567 | 2.96 | 12,663 | 5.86 | 14,656 | 4.29 | 39,615 | 10.13 | 22,119 | 6.57 |
|
| 1,559 | 0.54 | 2,988 | 1.38 | 2,041 | 0.60 | 8,897 | 2.28 | 2,862 | 0.85 |
|
| 8,942 | 3.09 | 15,457 | 7.15 | 63,314 | 18.53 | 100,553 | 25.72 | 123,580 | 36.69 |
|
| 289,436 | 216,164 | 341,650 | 390,971 | 336,822 | |||||
Number of reads corresponding to Ae. aegypti rRNA and Ae. aegypti transcripts were determined by BLAST and CLC workbench. Number of reads derived from bacteria and fungi were determined using MR-RAST RDP and MG-RAST SEED analysis, respectively.
Figure 1Coverage of Dengue genome.
A) Reads resulting from sequencing library N2173 were assembled de novo using the Newbler Assembler (454/Roche) and then the 23 large contigs (greater than 500 nucleotides) were subjected to BLAST analysis to determine their best hits. The 7 Dengue-specific large contigs are shown here as horizontal lines below regions of the Dengue genome they correspond to. Scale is approximate. B) Reads resulting from sequencing of all 5 libraries were mapped to the DENV-1 genome using GS Reference Mapper (454/Roche) and then viewed using Geneious software. The control noninfected mosquito dataset had no reads that mapped to the DENV-1 genome and therefore is not shown here.
Results of reference-guided assembly.
| N2473 | N2474 | N2173 | N2175 | |
| 5 infected, M | 5 infected, F | 1 of 5 infected, F | ||
| rRNA depletion | − | + | − | − |
|
| 243 | 419 | 227 | 46 |
|
| 10,412 | 10,106 | 9,699 | 6,282 |
|
| 97.0% | 94.1% | 90.3% | 58.5% |
|
| 7 | 3 | 7 | 3 |
|
| 7 | 5 | 8 | 15 |
|
| 1,635 | 3,197 | 2,050 | 758 |
|
| 2,272 | 5,590 | 2,366 | 1,149 |
|
| 4.5 | 8.3 | 4.9 | 1 |
Each of the 5 datasets were assembled with GS Reference Mapper software using the DENV-1 genome as a reference. Statistics obtained for each of the 4 datasets containing DENV-1 specific reads are shown here. The N2187 control noninfected mosquito dataset was also mapped to the DENV-1 genome, but resulted in no assembled reads.
Selected transcript profiles in DENV-1 infected and noninfected mosquitoes.
| 5 noninfected | 1 of 5 infected | 5 infected | ||||||||
| N2187 | N2175 | N2173 | ||||||||
| transcript | Vectorbase putative product | # reads | RPKM | rank | # reads | RPKM | rank | # reads | RPKM | rank |
| AAEL017413-RA | no description | 14 | 2,243.73 | 24 | 125 | 137,786.60 | 1 | 643 | 683,170.42 | 1 |
| AAEL017647-RA | 5.8S ribosomal RNA | 29 | 12,150.94 | 4 | 8 | 23,054.49 | 4 | 18 | 49,998.75 | 2 |
| AAEL017811-RA | RNase MRP | 227 | 48,669.62 | 2 | 27 | 39,815.26 | 2 | 29 | 41,219.71 | 3 |
| AAEL017868-RA | eukaryotic type signal | 190 | 41,289.04 | 3 | 13 | 19,430.25 | 5 | 17 | 24,490.91 | 4 |
| recognition particle RNA | ||||||||||
| AAEL017654-RA | small subunit ribosomal RNA, | 42 | 4,833.90 | 10 | 21 | 16,623.45 | 6 | 21 | 16,022.94 | 5 |
| 5′ domain | ||||||||||
| AAEL001702-RA | conserved hypothetical protein | 95 | 11,871.61 | 5 | 15 | 12,892.31 | 7 | 15 | 12,426.59 | 6 |
| AAEL017571-RA | no description | 3617 | 122,231.86 | 1 | 109 | 25,334.72 | 3 | 35 | 7,841.13 | 7 |
| AAEL017247-RA | no description | 10 | 3,391.89 | 13 | 1 | 2,332.89 | 30 | 3 | 6,745.86 | 8 |
| AAEL016633-RA | tRNAGly | 0 | NA | NA | 0 | NA | NA | 1 | 5,902.63 | 9 |
| AAEL016690-RA | tRNAGly | 0 | NA | NA | 0 | NA | NA | 1 | 5,902.63 | 10 |
| AAEL017898-RA | nuclear RNAse P | 38 | 6,619.71 | 8 | 2 | 1,198.14 | 88 | 1 | 1,154.86 | 71 |
| AAEL007818-RB | no description | 38 | 9,957.61 | 6 | 4 | 1,999.62 | 41 | 2 | 963.69 | 91 |
| AAEL007818-RA | trypsin 3A1 | 38 | 2,777.71 | 17 | 2 | 1,005.51 | NA | 2 | 969.19 | 89 |
| AAEL017805-RA | U5 splicesomal RNA | 18 | 9,015.00 | 7 | 0 | NA | NA | 0 | NA | NA |
| AAEL017646-RA | U1 splicesomal RNA | 14 | 5,472.52 | 9 | 1 | 2,688.52 | 24 | 1 | 2,591.40 | 26 |
| AAEL017841-RA | U1 splicesomal RNA | 0 | NA | NA | 0 | NA | NA | 2 | 5,182.80 | 12 |
Selected Ae. aegypti transcripts that were detected are listed along with their putative gene functions as reported in VectorBase, the number of total reads that correspond to each transcript, RPKM value (a normalized measure of expression level reported by CLC Workbench and which represents the number of reads that map per kilobase of exon model per million mapped reads), and the rank order of each transcript per library.
Sequence read depth required for hypothetical sequencing based Dengue virus-like screening experiments.
| Np | pIP = 1.08E-7 | pIP = 1.08E-7 | pIP = 2.75E-8 | pIP = 2.75E-8 |
| P = 0.99 | P = 0.95 | P = 0.99 | P = 0.95 | |
| 10 | 4.26E+08 | 2.77E+08 | 1.67E+09 | 1.09E+09 |
| 50 | 8.53E+07 | 5.55E+07 | 3.35E+08 | 2.18E+08 |
| 100 | 4.26E+07 | 2.77E+07 | 1.67E+08 | 1.09E+08 |
| 1,000 | 4.26E06 | 2.77E+06 | 1.67E+07 | 1.09E+07 |
| 10,000 | 4.26E+05 | 2.77E+05 | 1.67E+06 | 1.09E+06 |
Values in the table are number of reads required (under the assumption that only one read is sufficient to identify the virus) to detect one infected mosquito in 100 (Nh), under different assumptions of the number of infecting viruses (Np). Four different scenarios are presented, based on the high and low values on pIP calculated in this study (N2474, the rRNA depleted sample and N2173) and power to detect of 95% and 99%. Values were calculated using equation 4.