| Literature DB >> 22069519 |
Sijun Liu1, Diveena Vijayendran, Bryony C Bonning.
Abstract
Insects are commonly infected with multiple viruses including those that cause sublethal, asymptomatic, and latent infections. Traditional methods for virus isolation typically lack the sensitivity required for detection of such viruses that are present at low abundance. In this respect, next generation sequencing technologies have revolutionized methods for the discovery and identification of new viruses from insects. Here we review both traditional and modern methods for virus discovery, and outline analysis of transcriptome and small RNA data for identification of viral sequences. We will introduce methods for de novo assembly of viral sequences, identification of potential viral sequences from BLAST data, and bioinformatics for generating full-length or near full-length viral genome sequences. We will also discuss implications of the ubiquity of viruses in insects and in insect cell lines. All of the methods described in this article can also apply to the discovery of viruses in other organisms.Entities:
Keywords: insect virus; next generation sequencing; small RNA; transcriptome; virus discovery
Mesh:
Substances:
Year: 2011 PMID: 22069519 PMCID: PMC3205385 DOI: 10.3390/v3101849
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1.Transmission electron micrographs of the enveloped nucleocapsids of a baculovirus (Autographa californica nucleopolyhedrovirus; Baculoviridae). Inset: virions of White spot syndrome virus (WSSV; Whispoviridae) of shrimp. Based on morphology, WSSV was initially thought to be a baculovirus. TEM courtesy of Hailin Tang.
Comparison of the most commonly used next generation sequencing platforms. (Modified from [38]).
| Fragment / emulsion PCR | Fragment / polony | Fragment / emulsion PCR | ||
| Pyrosequencing | Sequencing by synthesis | Sequencing by ligation | ||
| 700–1000 | 150 | 100 | 75 | |
| 0.7 | 95 | 600 | 300 | |
| Long reads improve mapping in repetitive regions, fast run time | Currently the most widely used platform in the field | Two-base encoding provides inherent error correction | ||
| High reagent cost, high error rate in homopolymer repeats | Low multiplexing capability of samples | Long run time | ||
| Bacterial and insect genome | Variant discovery by whole— genome resequencing or whole—exome capture, virus discovery and gene discovery in metagenomics | Variant discovery by whole—genome resequencing or whole— exome capture, gene discovery in metagenomics | ||
Bioinformatics methods used for virus discovery by Next Generation Sequencing (NGS) data mining [53].
| Libraries are prepared from DNA isolated from the infected host or from purified viruses. | Libraries are prepared from RNA isolated from the infected host or purified viruses. | Libraries are prepared by isolation of small RNA from host total RNA (∼17–30 nt). | |
| Base calling, trim adaptors and remove low quality reads. Cluster reads (optional). | |||
| BLAST analysis/mapping followed by assembly of the reads that have significant hits to viral sequences; or assembly of reads and BLAST analysis of the resulting assembled contigs. | Assemble reads followed by BLAST analysis/mapping. | ||
| Separate contigs with significant hits (e-value: ≤ 1 × 10−3) to viruses from non-virus hits. | |||
| Re-assemble the contigs that hit viral sequences by using various assembly programs (for example software used for Sanger sequence assembly) to generate longer contigs. | |||
| BLAST the assembled contigs against non-redundant (nr) databases and virus databases. | |||
| Identify contigs with hits to viruses [e-value: ≤ 1 × 10−5]. | |||
| Fill the sequence gaps by PCR (RT-PCR, RACE-PCR) and Sanger sequencing. | |||
| Further characterization of virus (classification, localization, transmission, host range). Refer to polythetic criteria for virus group for parameters needed to facilitate virus classification [ | |||
Insect viruses detected/discovered by use of Next Generation Sequencing technologies.
| Birnaviridae (dsRNA) | ||
| Drosophila X virus (DXV) | [ | |
| Drosophila birnavirus (DBV) | [ | |
| Totiviridae (dsRNA) | ||
| Drosophila totivirus (DTV) | [ | |
| Dicistroviridae (+ssRNA) | ||
| Drosophila C virus (DCV) | [ | |
| Black queen cell virus (BQCV) | [ | |
| Kashmir bee virus (KBV) | [ | |
| Acute bee paralysis virus (ABPV) | [ | |
| Isreali acute paralysis virus (IAPV) | [ | |
| Aphid lethal paralysis virus-AP (ALPV-AP) | [ | |
| ALPV-AG | [ | |
| ALPV-Brookings strain (ALPV-Brookings) | [ | |
| Big Sioux river virus (BSRV) | [ | |
| Nodaviridae (+ssRNA) | ||
| American nodavirus (ANV) | [ | |
| Mosquito nodavirus (MNV) | [ | |
| Nidovirales (+ssRNA) | ||
| Cavally virus (CAVV) | Mosquito heads (multiple species) | [ |
| Tetraviridae (+ssRNA) | ||
| Drosophila tetravirus (DTrV) | [ | |
| Togaviridae (+ssRNA) | ||
| Sindbis virus (SINV) | [ | |
| Picornaviridae (+ssRNA) | ||
| Deformed wing virus (DWV) | [ | |
| Sacbrood virus (SBV) | [ | |
| Polydnaviridae | ||
| Costesia vestalis bracovirus (CvBV) | [ | |
| Parvoviridae (ssDNA) | ||
| Myzus persicae densovirus (MpDNV) | [ | |
| Unclassified | ||
| Noravirus (+ssRNA) | [ | |
| Chronic bee paralysis virus (CBPV; +ssRNA) | [ | |
| Glossina pallidipes salivary gland hypertrophy virus (GpSGHV;dsDNA) | [ | |
| Lake Sinai Virus 1 (LSV1;+ssRNA) | [ | |
| Lake Sinai Virus 2 (LSV2;+ssRNA) | [ | |
| Aphis glycines virus (AGV;+ssRNA) | [ | |
| Others | ||
| Many DNA viruses (known and novel) from animal, plant, insect | Various species of female mosquitoes | [ |
| Many known DNA and RNA viruses | [ | |
indicates novel viruses;
Based on the sequence, DTrV is actually Drosophila A virus.
Figure 2.Alignment of short interfering RNAs (siRNA) derived from viral RNA can be used to delineate viral genomic sequences.
Figure 3.Strategies for Insect Virus Discovery. When viral sequences are discovered in EST libraries or by NGS, frozen material or an insect colony established from field caught specimens is valuable for subsequent virus purification for further analyses.