| Literature DB >> 19156205 |
Shota Nakamura1, Cheng-Song Yang, Naomi Sakon, Mayo Ueda, Takahiro Tougan, Akifumi Yamashita, Naohisa Goto, Kazuo Takahashi, Teruo Yasunaga, Kazuyoshi Ikuta, Tetsuya Mizutani, Yoshiko Okamoto, Michihira Tagami, Ryoji Morita, Norihiro Maeda, Jun Kawai, Yoshihide Hayashizaki, Yoshiyuki Nagai, Toshihiro Horii, Tetsuya Iida, Takaaki Nakaya.
Abstract
With the severe acute respiratory syndrome epidemic of 2003 and renewed attention on avian influenza viral pandemics, new surveillance systems are needed for the earlier detection of emerging infectious diseases. We applied a "next-generation" parallel sequencing platform for viral detection in nasopharyngeal and fecal samples collected during seasonal influenza virus (Flu) infections and norovirus outbreaks from 2005 to 2007 in Osaka, Japan. Random RT-PCR was performed to amplify RNA extracted from 0.1-0.25 ml of nasopharyngeal aspirates (N = 3) and fecal specimens (N = 5), and more than 10 microg of cDNA was synthesized. Unbiased high-throughput sequencing of these 8 samples yielded 15,298-32,335 (average 24,738) reads in a single 7.5 h run. In nasopharyngeal samples, although whole genome analysis was not available because the majority (>90%) of reads were host genome-derived, 20-460 Flu-reads were detected, which was sufficient for subtype identification. In fecal samples, bacteria and host cells were removed by centrifugation, resulting in gain of 484-15,260 reads of norovirus sequence (78-98% of the whole genome was covered), except for one specimen that was under-detectable by RT-PCR. These results suggest that our unbiased high-throughput sequencing approach is useful for directly detecting pathogenic viruses without advance genetic information. Although its cost and technological availability make it unlikely that this system will very soon be the diagnostic standard worldwide, this system could be useful for the earlier discovery of novel emerging viruses and bioterrorism, which are difficult to detect with conventional procedures.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19156205 PMCID: PMC2625441 DOI: 10.1371/journal.pone.0004219
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Random RT-PCR amplification of cDNA from clinical specimens and quantitative RT-PCR with virus-specific primers.
The samples were nasopharyngeal aspirates and stools (n = 3 and 5, respectively) isolated during 2005–2007 in Osaka, Japan. Influenza A virus and norovirus were detected in the nasopharyngeal aspirates and stool samples, respectively, with other diagnostic methods. (A) RNA extracted from clinical specimens was reverse-transcribed and random-PCR amplified to prepare template DNA for pyrosequencing. One microgram of amplified PCR products in each sample was loaded onto a 1% agarose gel. M indicates 100-bp DNA ladder (NEB). (B) Flu-specific semi-quantitative PCR was performed with 10-fold serial dilutions of the random-PCR products. (C) Norovirus (GII)-specific semi-quantitative PCR was performed with 10-fold serial dilutions of the random-PCR products. Quantitative real-time RT-PCR using a norovirus-specific primer set was also performed, and the estimated copy numbers of norovirus in samples #N1–#N5 are shown in the panel on the right. As a control, cDNA from sample #N3 without random PCR amplification was used (left panel).
Figure 2Pyrosequencing using the GS FLX system.
Amplified cDNA was used as a template for GS FLX analysis. A 70×75 PicoTiterPlate device (gasket for 16 regions) was divided into 2 regions each for 8 samples. Obtained data were then subjected to a data analysis pipeline, as described in the Methods section. Comparison of the organisms from which the best matches for the sequences was shown.
Summary of detected viruses.
| Sample | Age | Read | Virus |
| #F1 | 3 | 460 | Influenza A virus (H3N2) |
| 3 | Human endogenous retrovirus HCML-ARV | ||
| #F2 | 7 | 20 | Influenza A virus (H1N1) |
| #F3 | 5 | 107 | Influenza A virus (H3N2) |
| 7 | WU Polyomavirus | ||
| #N1 | 62 | 7 | Norovirus (GII/4) |
| #N2 | 82 | 7,304 | Norovirus (GII/4) |
| #N3 | 92 | 15,272 | Norovirus (GII/4) |
| 813 | Kyuri green mottle mosaic virus | ||
| 7 | Citrus tristeza virus | ||
| 3 | Enterobacteria phage phiK | ||
| #N4 | 3 | 484 | Norovirus (GII/4) |
| 14 | Human coronavirus HKU1 | ||
| 3 | Phage phiV10 | ||
| 3 | Human endogenous retrovirus K | ||
| #N5 | 44 | 762 | Pepper mild mottle virus |
| 611 | Norovirus (GII/4) | ||
| 17 | Crucifer tobamovirus | ||
| 2 | Tobacco mosaic virus |
Hospitalized patient.
Patients in welfare facilities.
Kindergarten student.
Putative food.
Summary of gene analysis in nasal samples.
| Sample | #F1 | #F2 | #F3 |
|
| 30,958 (100%) | 25,119 (100%) | 21,858 (100%) |
|
| 27,849 (90.0%) | 23,760 (94.6%) | 20,296 (92.9%) |
|
| 572 (1.85%) | 230 (1.23%) | 272 (1.24%) |
|
| 506 (1.63%) | 21(0.10%) | 121 (0.55%) |
|
| 2,031 (6.56%) | 1,108 (1.63%) | 1,169 (5.35%) |
|
| 460 (1.49%) | 20 (0.08%) | 103 (0.49%) |
| Mapping to influenza A virus genome. | |||
|
| 58.30% | 8.10% | 25.60% |
|
| 3.67 | 0.23 | 0.65 |
The E value threshold was set to 1E+05 for taxonomy classification and 1E+40 for virus detection, respectively. Reference sequences used for mapping are CY026275-82, A/Texas/UR06-0566/2007(H3N2).
Figure 3BLASTN (A) and PCR (B) analyses of WUV in Flu-positive nasopharyngeal aspirates.
(A) Alignment of the WUV VP1 nucleotide sequence. The read obtained with the GS FLX sequencer (query) was compared with that of the WUV strain CLFF (subject; NCBI accession number: EU296475). (B) The WUV VP2 gene was detected by PCR using cDNA, which was amplified with random RT-PCR, as a template. The cDNA was diluted 10- and 100-fold and the PCR product was loaded on a 1% agarose gel. M indicates 100-bp DNA ladder.
Summary of gene analysis in fecal samples.
| Sample | #N1 | #N2 | #N3 | #N4 | #N5 |
|
| 15,298 (100%) | 32,335 (100%) | 25,500 (100%) | 18,014 (100%) | 28,823 (100%) |
|
| 400 (2.61%) | 1,031 (22.7%) | 147 (0.58%) | 2,574 (14.3%) | 948 (3.29%) |
|
| 10,963 (71.7%) | 14,423 (48.0%) | 3,039 (11.9%) | 9,180 (51.0%) | 23,955 (83.1%) |
|
| 11 (0.07%) | 8,742 (27.4%) | 20,775 (81.5%) | 546 (3.03%) | 1,571 (5.45%) |
|
| 3,924 (25.7%) | 8,139 (25.2%) | 1,539 (6.04%) | 5,714 (31.7%) | 2,349 (8.15%) |
|
| 7 (0.05%) | 7,302(22.6%) | 15,260 (59.8%) | 484 (2.69%) | 611 (2.12%) |
| Mapping to Norovirus genome | |||||
|
| 2.10% | 97.00% | 98.00% | 77.50% | 84.50% |
|
| 0 | 141.5 | 258.7 | 9.3 | 12.5 |
Reference sequence used for mapping is AY587989, Norovirus Hu/NLV/Oxford/B2S16/2002/UK.
Detected regions in HCoV-HKU1 genome.
| Region (nt) | Genome |
| 3,854–4,083 | Orf 1ab (Replicase) |
| 15,956–16,147 | Orf 1ab (Replicase) |
| 24,506–24,653 | Spike glycoprotein |
| 28,082–28,310 | Membrane glycoprotein - Nucleocapsid phosphoprotein |
Reference sequence: HCoV HKU1 strain N15 genotype B, complete genome (NCBI accession number: DQ415911).
Summary of bacterial and human-derived gene analysis.
| Sample | #F1 | #F2 | #F3 |
|
| 30,958 | 25,119 | 21,858 |
|
| 26,957 (100%) | 23,029 (100%) | 19,612 (100%) |
| rRNA | 252 (0.93%) | 31 (0.13%) | 203 (1.04%) |
| coding region | 637 (2.36%) | 298 (1.29%) | 449 (2.29%) |
| non-coding region | 21,208 (78.7%) | 18,035 (78.3%) | 15,226 (77.6%) |
Hit reads with exon region.
Hit reads with intron and intergenic regions.
Figure 4Process diagram for the viral diagnosis of nasopharyngeal aspirates and fecal samples.