| Literature DB >> 24036074 |
Richard J Hall1, Jing Wang, Angela K Todd, Ange B Bissielo, Seiha Yen, Hugo Strydom, Nicole E Moore, Xiaoyun Ren, Q Sue Huang, Philip E Carter, Matthew Peacey.
Abstract
The discovery of new or divergent viruses using metagenomics and high-throughput sequencing has become more commonplace. The preparation of a sample is known to have an effect on the representation of virus sequences within the metagenomic dataset yet comparatively little attention has been given to this. Physical enrichment techniques are often applied to samples to increase the number of viral sequences and therefore enhance the probability of detection. With the exception of virus ecology studies, there is a paucity of information available to researchers on the type of sample preparation required for a viral metagenomic study that seeks to identify an aetiological virus in an animal or human diagnostic sample. A review of published virus discovery studies revealed the most commonly used enrichment methods, that were usually quick and simple to implement, namely low-speed centrifugation, filtration, nuclease-treatment (or combinations of these) which have been routinely used but often without justification. These were applied to a simple and well-characterised artificial sample composed of bacterial and human cells, as well as DNA (adenovirus) and RNA viruses (influenza A and human enterovirus), being either non-enveloped capsid or enveloped viruses. The effect of the enrichment method was assessed by both quantitative real-time PCR and metagenomic analysis that incorporated an amplification step. Reductions in the absolute quantities of bacteria and human cells were observed for each method as determined by qPCR, but the relative abundance of viral sequences in the metagenomic dataset remained largely unchanged. A 3-step method of centrifugation, filtration and nuclease-treatment showed the greatest increase in the proportion of viral sequences. This study provides a starting point for the selection of a purification method in future virus discovery studies, and highlights the need for more data to validate the effect of enrichment methods on different sample types, amplification, bioinformatics approaches and sequencing platforms. This study also highlights the potential risks that may attend selection of a virus enrichment method without any consideration for the sample type being investigated.Entities:
Keywords: Enrichment; Metagenomic; Purification; Virus
Mesh:
Year: 2013 PMID: 24036074 PMCID: PMC7113663 DOI: 10.1016/j.jviromet.2013.08.035
Source DB: PubMed Journal: J Virol Methods ISSN: 0166-0934 Impact factor: 2.014
Virus enrichment process prior to sequencing in metagenomic studies on human and animal samples.
| Paper title | Author | Year | Journal | Aim of study | Sample | Step 1 | Step 2 | Step 3 | Step 4 | Step 5 | Step 6 | Step 7 | Amplification | Sequencing |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A virus discovery method incorporating DNase treatment and its application to the identification of two bovine parvovirus species | Allander et al. | 2001 | PNAS | Development of a method for discovery of unknown viruses and elimination of contaminating host DNA. Allowed the discovery of novel bovine parvoviruses. | Serum | – | – | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | ||
| Metagenomic analyses of an uncultured viral community from human faeces | Breitbart et al. | 2003 | Journal of Bacteriology | Metagenomic analyses of an uncultured viral community from human faeces | Faecal suspension | – | – | – | – | Sequence-independent Amplification | Cloning and sanger-method | |||
| Identification of a new human coronavirus | van der Hoek et al. | 2004 | Nature Medicine | Identification of a new human coronavirus | Suspension of LLC-MK2 cells | – | – | – | – | – | VIDISCA | cDNA-AFLP, cloning and sanger-method | ||
| Viral genome sequencing by random priming methods | Djikeng et al. | 2008 | BMC Genomics | Development of a method for rapid sequencing of whole genomes from new viruses | Bacterial growth media, plasma, cell culture supernatant, faecal suspension | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | ||||
| A highly divergent Picornavirus in a marine mammal | Kapoor et al. | 2008 | Journal of Virology | Unidentified virus cultured from a seal | Supernatant from infected Vero cell culture | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | ||||
| Rapid identification of known and new RNA viruses from animal tissues | Victoria et al. | 2008 | PLOS Pathogens | Unidentified viruses cultured in suckling mouse brains | Brain tissue homogenate from mice | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | ||||
| Discovery of a novel single-stranded DNA virus from a Sea Turtle Fibropapilloma by using viral metagenomics | Ng et al. | 2009 | Journal of Virology | To investigate and purifiy the viruses associated with sea turtle fibropapillomatosis (FP) | External fibropapilloma homogenate | – | – | Sequence-Independent Amplification | Cloning and sanger-method | |||||
| Laboratory procedures to generate viral metagenomes | Thurber et al. | 2009 | Nature Protocols | Development of laboratory procedure for making viral metagenomes | Various (i.e. soil, animal tissues, clinical samples) | – | – | – | As required. Genomiphi or Transplex | High-throughput sequencing | ||||
| Identification and characterisation of deer astroviruses | Smits et al. | 2010 | Journal of General Virology | Detection of novel astroviruses in deer using a metagenomic approach | Faecal suspension | – | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | |||
| Metagenomic sequencing for virus identification in a public-health setting | Svraka et al. | 2010 | Journal of General Virology | Unidentified viruses cultured in | Cell culture suspension | – | – | – | – | Whole Transcriptome Amplification Kit (Qiagen) or GenomiPhi V2 (GE Healthcare) | Cloning and sanger-method | |||
| Human Picobirnaviruses identified by molecular screening of diarrhoea samples | Van Leeuwen et al. | 2010 | Journal of Clinical Microbiology | Viral metagenomic survey of human diarrhoea of unknown origin | Faecal suspension | – | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | |||
| A viral discovery methodology for clinical biopsy samples utilising massively parallel next generation sequencing | Daly et al. | 2011 | PLOS ONE | Development of a method for discovery of new viruses from clinical biopsy samples | Liver tissue homogenate from human and canine | – | – | – | – | Sequence-Independent Amplification | 454 sequencing (Roche) and GAII sequencing (Illumina) | |||
| Identification and molecular characterisation of a new nonsegmented double-stranded RNA virus isolated from Culex mosquitoes in Japan | Isawa et al. | 2011 | Virus Research | Identification of 2 infectious agents from the mosquitoes | Mosquito homogenate | – | – | – | Sequence-Independent Amplification (Single Primer Amplification Technique – SPAT) | Sanger-method | ||||
| Diversity and abundance of single-stranded DNA viruses in human faeces | Kim et al. | 2011 | Applied and Environmental Microbiology | Investigation of single-stranded DNA viruses in human faeces | Faecal suspension | – | – | – | GenomiPhi V2 (GE Healthcare) | 454FLX sequencing (Roche) | ||||
| Novel DNA virus isolated from samples showing endothelial cell necrosis in the Japanese eel, | Mizutani et al. | 2011 | Virology | Identification of novel eel virus | Supernantant from infected JEE culture cells | – | – | – | – | – | GenomePlex Whole Genome Amplification or GenomiPhi (GE Healthcare) | Sanger-method or 454FLX sequencing (Roche) | ||
| Broad surveys of DNA viral diversity obtained through viral metagenomics of mosquitoes | Ng et al. | 2011 | PLOS ONE | Viral diversity study in mosquito | Whole mosquito homogenate | – | GenomiPhi (GE Healthcare) | 454 GS20 and 454FLX sequencing (Roche) | ||||||
| Exploring the diversity of plant DNA viruses and their satellites using vector-enabled metagenomics on whiteflies | Ng et al. | 2011 | PLOS ONE | To investigate the diversity of DNA viruses in whiteflies collected from different crops in 2 agriculturally important sites in Florida using vector-enabled metagenomics (VEM). | Whitefly homogenate | – | Genomiphi (GE Healthcare) and GenomePlex (Sigma–Aldrich) | Cloning and sanger-method | ||||||
| Random PCR and ultracentrifugation increases sensitivity and throughput of VIDISCA for screening of pathogens in clinical specimens | Tan et al. | 2011 | Journal of Infection in Developing Countries | Screening for unknown pathogens in clinical specimens | Clinical samples of plasma, throat swab, nasal pharyngeal aspirate. | – | – | – | – | VIDISCA | cDNA-AFLP, cloning and sanger-method | |||
| Metagenomic analysis of fever, thrombocytopenia and leukopenia syndrome (FTLS) in Henan Province, China: Discovery of a new bunyavirus | Xu et al. | 2011 | PLOS Pathogens | Discovery of a new bunyavirus in cases of fever, thrombocytopenia and leukopenia syndrome (FTLS) | Supernatant from infected Vero cell culture | – | – | – | – | Sequence-Independent Amplification | Cloning and sanger-method | |||
| Simultaneous identification of DNA and RNA viruses present in pig faeces using process-controlled deep sequencing | Sachsenröder et al. | 2012 | PLOS ONE | To establish a protocol for the simultaneous analysis of DNA and RNA viruses present in pig faeces | Faecal suspension | Transplex (WTA2, Sigma–Aldrich) | 454FLX sequencing (Roche) | |||||||
| Sequence-independent VIDISCA-454 technique to discover new viruses in canine livers | van der Heijden et al. | 2012 | Journal of Virological Methods | Test the feasibility of VIDISCA-454 to obtain viral sequence information from idiopathic canine hepatitis liver biopsy. | Canine BDE cell culture supernatant | – | – | – | – | – | VIDISCA | 454FLX sequencing (Roche) | ||
| Identification of a novel bat papillomavirus by metagenomics | Tse et al. | 2012 | PLOS ONE | Discovery and characterisation of a novel bat papillomavirus from rectal swabs of asymptomatic wild, food and pet animals using metagenomics | Animal rectal swabs | – | – | – | – | Rapisome pWGA (Biohelix) | 454 GS FLX sequencing (Roche) | |||
| Complete genome sequence of an astrovirus identified in a domestic rabbit ( | Stenglein et al. | 2012 | Virology Journal | To screen samples from a gastroenteritis outbreak in a commercial rabbit colony | Faecal suspension | – | – | – | – | PCR | Cloning and sanger-method |
Publications referred to include: Allander et al. (2001), Breitbart et al. (2003), Daly et al. (2011), Djikeng et al. (2008), Isawa et al. (2011), Kapoor et al. (2008), Mizutani et al. (2011), Ng et al. (2011a), Ng et al. (2009), Ng et al. (2011b), Sachsenroder et al. (2012), Smits et al. (2010), Svraka et al. (2010), Tan le et al. (2011), Thurber et al. (2009), van der Hoek et al. (2004), van Leeuwen et al. (2010), Victoria et al. (2008), Xu et al. (2011).
Low-speed centrifugation.
Filtration (excludes tangential flow).
Ultracentrifugation.
Nuclease treatment.
Unclassified method.
Fig. 1Amount of model organisms detected using quantitative real-time PCR, when different virus enrichment methods are applied. Amounts are measured as copy number of a target gene, represented as log10 values. The average of two independent replicates are shown, with two PCR tests performed per replicate. Error bars show the 95% confidence interval. The grey line represents the copy number of the target gene when no enrichment method is applied.
Fig. 2Relative abundance of taxonomically assigned sequence reads in the viral metagenomic sequence dataset, shown for different enrichment methods. Average values are shown for two independent replicates, and error bars represent the 95% confidence interval.
Quantity and proportion of sequence reads with a positive BLASTN hit against the model organism groups used in the virus discovery metagenomic dataset, comparing the effect of different virus enrichment methods.
| Treatment | Total number of sequence reads | Metazoa | Adenovirus | Influenza | Enterovirus | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| BLASTN hits | % total | BLASTN hits | % total | BLASTN hits | % total | BLASTN hits | % total | BLASTN hits | % total | ||
| No treatment | 1,980,878 | 39,481 | 1.99% | 1,523,514 | 76.9% | 40 | 0.002% | 20 | 0.001% | 3,221 | 0.16% |
| Centrifuge | 2,010,717 | 48,719 | 2.42% | 1,486,125 | 73.9% | 0 | 0.000% | 77 | 0.004% | 14,805 | 0.74% |
| Filtration | 1,941,626 | 65,746 | 3.39% | 1,334,433 | 68.7% | 6 | 0.000% | 110 | 0.006% | 22,731 | 1.17% |
| Nuclease | 1,821,828 | 5,148 | 0.28% | 1,421,268 | 78.0% | 17 | 0.001% | 14 | 0.001% | 2,532 | 0.14% |
| 2-step treatment | 1,730,569 | 53,421 | 3.09% | 1,199,232 | 69.3% | 14 | 0.001% | 57 | 0.003% | 18,712 | 1.08% |
| 3-step treatment | 1,417,803 | 26,856 | 1.89% | 857,873 | 60.5% | 16 | 0.001% | 161 | 0.011% | 67,227 | 4.74% |
Combined total number of sequence reads for two independent physical replicates which were also run on different Illumina MiSeq flowcells. This figure represents the collapsed sequencing data, therefore redundant reads are not represented more than once.
Serial applications of treatment methods. The 2-step method consisted of centrifugation then filtration. The 3-step method consisted of centrifugation, filtration then nuclease-treatment.