| Literature DB >> 28932815 |
Arifa S Khan1, Siemon H S Ng2, Olivier Vandeputte3, Aisha Aljanahi1, Avisek Deyati4, Jean-Pol Cassart5, Robert L Charlebois2, Lanyn P Taliaferro1.
Abstract
The capability of high-throughput sequencing (HTS) for detection of known and unknown viruses makes it a powerful tool for broad microbial investigations, such as evaluation of novel cell substrates that may be used for the development of new biological products. However, like any new assay, regulatory applications of HTS need method standardization. Therefore, our three laboratories initiated a study to evaluate performance of HTS for potential detection of viral adventitious agents by spiking model viruses in different cellular matrices to mimic putative materials for manufacturing of biologics. Four model viruses were selected based upon different physical and biochemical properties and commercial availability: human respiratory syncytial virus (RSV), Epstein-Barr virus (EBV), feline leukemia virus (FeLV), and human reovirus (REO). Additionally, porcine circovirus (PCV) was tested by one laboratory. Independent samples were prepared for HTS by spiking intact viruses or extracted viral nucleic acids, singly or mixed, into different HeLa cell matrices (resuspended whole cells, cell lysate, or total cellular RNA). Data were obtained using different sequencing platforms (Roche 454, Illumina HiSeq1500 or HiSeq2500). Bioinformatic analyses were performed independently by each laboratory using available tools, pipelines, and databases. The results showed that comparable virus detection was obtained in the three laboratories regardless of sample processing, library preparation, sequencing platform, and bioinformatic analysis: between 0.1 and 3 viral genome copies per cell were detected for all of the model viruses used. This study highlights the potential for using HTS for sensitive detection of adventitious viruses in complex biological samples containing cellular background. IMPORTANCE Recent high-throughput sequencing (HTS) investigations have resulted in unexpected discoveries of known and novel viruses in a variety of sample types, including research materials, clinical materials, and biological products. Therefore, HTS can be a powerful tool for supplementing current methods for demonstrating the absence of adventitious or unwanted viruses in biological products, particularly when using a new cell line. However, HTS is a complex technology with different platforms, which needs standardization for evaluation of biologics. This collaborative study was undertaken to investigate detection of different virus types using two different HTS platforms. The results of the independently performed studies demonstrated a similar sensitivity of virus detection, regardless of the different sample preparation and processing procedures and bioinformatic analyses done in the three laboratories. Comparable HTS detection of different virus types supports future development of reference virus materials for standardization and validation of different HTS platforms.Entities:
Keywords: 454 Roche; Illumina; adventitious viruses; high-throughput sequencing; spiking study; virus detection
Year: 2017 PMID: 28932815 PMCID: PMC5597969 DOI: 10.1128/mSphere.00307-17
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Lab A spiking study: analysis of individual viral genomes spiked in HeLa cell RNA
| Virus | Total no. | Reference virus genome | |||
|---|---|---|---|---|---|
| Spiked virus genome | HPV18 | ||||
| No. of reads | % genome | No. of reads | % genome | ||
| EBV | 2,224,621 | 57 (2,047) | 1.19 | 2,423 (5,222) | 66.46 |
| RSV | 3,033,912 | 332 (10,992) | 72.52 | 3,114 (5,258) | 66.92 |
| FeLV | 2,631,399 | 15 (3,039) | 35.97 | 2,856 (5,222) | 66.46 |
| REO1 | 2,706,289 | 727 (20,237) | 85.75 | 1,163 (5,216) | 66.39 |
| PCV1 | 2,719,689 | 0 | 0 | 3,011 (5,218) | 66.41 |
| No spike (control) | 2,601,870 | ST | ST | 2,802 (5,252) | 66.84 |
Calculated based upon the total number of reference bases covered by the mapped reads.
The total reference genome was calculated based upon combining all 10 REO1 genes.
ST, see text. Results for the individual viruses are reported in the text.
Lab A spiking study: analysis of individual viruses spiked in HeLa cell lysate
| Virus | Total no. | Reference virus genome | |||
|---|---|---|---|---|---|
| Spiked virus | HPV18 | ||||
| No. of reads | % genome | No. of reads | % genome | ||
| EBV | 1,920,259 | 3,502 (107,223) | 62.24 | 57 (1,770) | 22.53 |
| RSV | 1,605,408 | 1,920 (14,870) | 98.10 | 35 (1,851) | 23.56 |
| FeLV | 2,217,454 | 172 (5,467) | 64.71 | 250 (1,509) | 19.21 |
| REO1 | 1,961,767 | 23 (1,530) | 6.48 | 48 (833) | 10.60 |
| PCV1 | 1,956,346 | 195 (1,759) | 100.06 | 43 (855) | 10.88 |
Calculated based upon the total number of reference bases covered by the mapped reads.
The total reference genome was calculated based upon combining all 10 REO1 genes.
Lab A: mapping summary of individual REO1 genes
| REO1 gene | Virus RNA spiked into HeLa cell RNA | Virus particles spiked into HeLa cell lysate | ||
|---|---|---|---|---|
| No. of reads mapped | % gene covered | No. of reads mapped | % gene covered | |
| L1 | 9 | 52.56 | 3 | 9.78 |
| L2 | 59 | 92.92 | 4 | 2.12 |
| L3 | 41 | 83.18 | 11 | 10.48 |
| M1 | 60 | 91.57 | 0 | 0 |
| M2 | 20 | 90.51 | 4 | 12.43 |
| M3 | 43 | 98.57 | 3 | 7.27 |
| S1 | 243 | 99.72 | 2 | 13.99 |
| S2 | 15 | 97.82 | 3 | 10.36 |
| S3 | 202 | 100.00 | 1 | 1.75 |
| S4 | 38 | 88.71 | 3 | 14.63 |
Calculated based upon the total number of reference bases covered by the mapped reads.
Lab A spiking study: analysis of mixed virus spiked in HeLa cell lysate
| Virus | Total no. of | Relevant reference virus genome | |
|---|---|---|---|
| No. of reads | % genome covered | ||
| EBV | 2,050,654 | 3,224 (87,974) | 51.06 |
| RSV | 816 (12,087) | 79.74 | |
| FeLV | 38 (2,585) | 30.60 | |
| REO1 | 20 (1,785) | 7.56 | |
| PCV1 | 202 (1,600) | 91.01 | |
| HPV18 | 60 (722) | 9.19 | |
Calculated based upon the total number of reference bases covered by the mapped reads.
The total reference genome was calculated based upon combining all 10 REO1 genes.
Reference genome coverage
| Type of spike and reference virus genome | Genome coverage ratio | |
|---|---|---|
| Virus spike | HPV18 | |
| Virus nucleic acids in cell RNA | ||
| EBV | 2.4 | 191.2 |
| RSV | 12.6 | 256.1 |
| FeLV | 2.3 | 215.9 |
| REO1 | 12.2 | 77.7 |
| PCV1 | 0.0 | 216.7 |
| Virus in cell lysate | ||
| EBV | 6.4 | 3.4 |
| RSV | 25.8 | 6.5 |
| FeLV | 5.5 | 11.6 |
| REO1 | 1.8 | 4.1 |
| PCV1 | 17.9 | 3.5 |
| Mixed virus in cell lysate | 5.6 | |
| EBV | 4.9 | |
| RSV | 9.5 | |
| FeLV | 2.6 | |
| REO1 | 1.5 | |
| PCV1 | 15.3 | |
| Control (no spike) | 233.8 | |
Determined as a ratio of the total number of mapped bases of the reads and the number of bases covered in the reference genome.
Lab B spiking study: analysis of mixed virus spiked into cell lysate
| Reference virus | High spike | Medium spike | Low spike | |||
|---|---|---|---|---|---|---|
| No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | |
| EBV | 11,625,702 | 92.6 | 1,132,729 | 83.0 | 35,535 | 77.9 |
| RSV | 4,421 | 95.4 | 1,961 | 92.6 | 70 | 20.4 |
| FeLV | 164 | 35.6 | 21 | 6.5 | 0 | 0.0 |
| REO1 | 31 | 5.3 | 0 | 0.0 | 0 | 0.0 |
| HPV18 | 1,131 | 60.6 | 1,988 | 41.3 | 1,645 | 59.8 |
High spike, 100 copies per cell.
Medium spike, 3 copies per cell.
Low spike, 0.1 copy per cell.
The total number of reads is shown in parentheses.
Percentage of viral genome covered by the mapped reads.
Lab B spiking study: analysis of mixed virus spiked into whole cells
| Reference virus | High spike | Medium spike | Low spike | |||
|---|---|---|---|---|---|---|
| No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | |
| EBV | 4,310,100 | 79.6 | 1,176,557 | 84.3 | 20,892 | 74.8 |
| RSV | 8,255 | 97.6 | 3,325 | 94.7 | 35 | 11.1 |
| FeLV | 165 | 47.1 | 12 | 10.4 | 0 | 0.0 |
| REO1 | 56 | 9.5 | 13 | 2.9 | 0 | 0.0 |
| HPV18 | 1,516 | 52.8 | 3,800 | 64.4 | 2,709 | 65.5 |
High spike, 100 copies per cell.
Medium spike, 3 copies per cell.
Low spike, 0.1 copy per cell.
The total number of reads is shown in parentheses.
Percentage of viral genome covered by the mapped reads.
Lab C spiking study: analysis of DNA (cell pellet plus mixed-virus spike)
| Reference virus | High spike | Medium spike | Low spike | No spike | ||||
|---|---|---|---|---|---|---|---|---|
| No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | |
| EBV | 1,195,993 | 100.0 | 23,931 | 99.6 | 730 | 33.6 | 0 | 0 |
| RSV | 0 | 0.0 | 2 | 1.3 | 0 | 0.0 | 0 | 0 |
| FeLV | 2 | 2.2 | 0 | 0.0 | 0 | 0.0 | 0 | 0 |
| REO1 | 2 | ND | 0 | 0.0 | 4 | ND | 0 | 0 |
| HPV18 | 1,762 | 66.7 | 2,765 | 64.4 | 2,709 | 65.5 | 1,527 | 66.7 |
High spike, 100 copies per cell.
Medium spike, 3 copies per cell.
Low spike, 0.1 copy per cell.
The total read number is shown in parentheses.
Percentage of viral genome covered by the mapped reads.
ND, not determined because reads map to different segments.
Lab C spiking study: analysis of rRNA-depleted RNA (cell pellet plus mixed-virus spike)
| Reference | High spike | Medium spike | Low spike | No spike (366,849,590) | ||||
|---|---|---|---|---|---|---|---|---|
| No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | |
| EBV | 433,638 | 99.9 | 10,985 | 98.7 | 159 | 6.3 | 58 | ND |
| RSV | 4,468,472 | 100.0 | 7,809 | 100.0 | 632 | 76.3 | 2 | ND |
| FeLV | 1,080,732 | 99.9 | 571 | 90.0 | 51 | 31.0 | 10 | ND |
| REO1 | 1,222,694 | — | 12,453 | — | 123 | — | 2 | ND |
| HPV18 | 842,420 | 66.7 | 173,739 | 66.7 | 381,291 | 66.7 | 341,199 | 66.7 |
High spike, 100 copies per cell.
Medium spike, 3 copies per cell.
Low spike, 0.1 copy per cell.
The total read number is shown in parentheses.
Percentage of viral genome covered by the mapped reads.
ND, not determined.
—, see Table 10 for percent genome coverage of each REO1 segment.
Lab C spiking study: analysis of REO1 genes in rRNA-depleted samples (cell pellet plus mixed-virus spike)
| REO1 gene | High spike | Medium spike | Low spike | |||
|---|---|---|---|---|---|---|
| No. of reads | % genome | No. of reads | % genome | No. of reads | % genome | |
| L1 | 239,032 | 100.0 | 2,265 | 99.5 | 22 | 39.6 |
| L2 | 34,346 | 100.0 | 354 | 98.2 | 4 | 10.1 |
| L3 | 264,642 | 100.0 | 2,762 | 99.9 | 23 | 47.7 |
| M1 | 156,172 | 100.0 | 1,434 | 99.5 | 10 | 40.6 |
| M2 | 151,276 | 100.0 | 1,813 | 99.8 | 22 | 57.6 |
| M3 | 141,230 | 100.0 | 1,384 | 99.6 | 4 | 13.6 |
| S1 | 83,878 | 100.0 | 801 | 99.5 | 8 | 49.9 |
| S2 | 64,488 | 100.0 | 675 | 99.5 | 10 | 42.2 |
| S3 | 38,088 | 100.0 | 442 | 98.7 | 10 | 43.1 |
| S4 | 49,542 | 100.0 | 523 | 99.5 | 10 | 53.4 |
High spike, 100 copies per cell.
Medium spike, 3 copies per cell.
Low spike, 0.1 copy per cell.
The total read number is shown in parentheses.
Percentage of viral genome covered by the mapped reads.