| Literature DB >> 31231504 |
Jason R Miller1,2, Kari A Dilley1, Derek M Harkins1, Timothy B Stockwell1,3, Reed S Shabman1,4, Granger G Sutton1.
Abstract
The human cell lines HepG2, HuH-7, and Jurkat are commonly used for amplification of the RNA viruses present in environmental samples. To assist with assays by RNAseq, we sequenced these cell lines and developed a subtraction database that contains sequences expected in sequence data from uninfected cells. RNAseq data from cell lines infected with Sendai virus were analyzed to test host subtraction. The process of mapping RNAseq reads to our subtraction database vastly reduced the number non-viral reads in the dataset to allow for efficient secondary analyses.Entities:
Keywords: HepG2; HuH-7; Jurkat; RNAseq; host subtraction; human cell lines
Mesh:
Year: 2018 PMID: 31231504 PMCID: PMC6556987 DOI: 10.12688/f1000research.13580.3
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. The subtraction database construction and application pipelines.
During construction ( a), cells are grown without infection. Contigs from cell line gDNA are combined with host and contaminant references to form the subtraction database. During application ( b), cells are exposed to an infected sample. RNA virus replication accompanies cell growth. Subsequent RNAseq data is filtered to remove close matches to the subtraction database. Remaining data are characterized by the more costly process of aligning to a reference database.
Enrichment of Sendai virus.
Three cell lines were grown with Sendai virus (SeV) infection or mock infection. Some samples were treated with Ribo-Zero (Illumina) to deplete rRNA. (A) All cDNA libraries were sequenced on the Illumina NextSeq platform to generate over 19 million RNAseq reads per sample. (B) At least 89% of reads from every sample mapped to the subtraction database named SDB1. (C) Non-SDB reads were mapped to SeV references. Expressed as a fraction of initial reads, the SeV complement was 5% to 10% in infected samples and less than 1% in the controls. (D) Expressed as a percentage of non-SDB reads, the SeV complement was 78% to 95% in infected-but-not-depleted samples. The complement was larger in the infected-and-depleted samples, and smaller in the mock-infected samples. This suggests that analysis of non-SDB reads could support the detection of known viruses in an uncharacterized sample. The apparent false positive enrichments (e.g. 24.24% SeV in HEP/none/none) can be discounted by applying a minimum requirement for 0.10% viral reads out of initial reads.
| (A) | (B) | (C) | (D) | |||
|---|---|---|---|---|---|---|
| Cell
| Virus
| rRNA
| Initial
| Portion of
| Portion of
| Portion of
|
| HEP | SeV | depletion | 27.20 | 89.06% | 10.38% | 96.74% |
| HEP | SeV | none | 22.06 | 98.96% | 0.96% | 95.27% |
| HEP | none | none | 24.44 | 99.94% | 0.01% | 24.24% |
| HEP | none | depletion | 26.70 | 99.89% | 0.00% | 4.01% |
| HUH | SeV | depletion | 23.44 | 91.44% | 7.03% | 84.14% |
| HUH | SeV | none | 25.79 | 99.30% | 0.51% | 77.59% |
| HUH | none | none | 24.72 | 99.95% | 0.00% | 2.24% |
| HUH | none | depletion | 20.99 | 99.91% | 0.00% | 0.72% |
| JUR | SeV | depletion | 26.59 | 94.18% | 5.65% | 98.66% |
| JUR | SeV | none | 19.28 | 99.42% | 0.47% | 85.77% |
| JUR | none | none | 19.06 | 99.94% | 0.00% | 1.68% |
| JUR | none | depletion | 25.10 | 99.92% | 0.00% | 1.33% |
Enrichment of Ebola and Marburg viruses.
Data from an independent study ( Hölzer ) are derived from RNAseq of Huh-7 cells infected with Ebola virus (EBOV), Marburg virus (MARV), or none (Mock). The data were re-analyzed here using SDB1. Subtraction enriched the EBOV complement to at least 57% in the EBOV-infected samples. Subtraction enriched the MARV complement to at least 39% in the MARV-infected samples. The apparent false positive enrichments can be discounted by applying a minimum requirement for 0.10% viral reads out of initial reads.
| (A) | (B) | (C) | (D) | (E) | (F) | ||
|---|---|---|---|---|---|---|---|
| Virus | Time | Initial
| Portion of
| Portion of
| Portion of
| Portion of
| Portion of
|
| EBOV | 03h | 32.87 | 99.70% | 0.16% | 57.17% | 0.00% | 0.12% |
| EBOV | 07h | 46.62 | 97.47% | 2.34% | 94.41% | 0.00% | 0.01% |
| EBOV | 23h | 50.41 | 62.18% | 37.29% | 99.56% | 0.00% | 0.00% |
| MARV | 03h | 41.81 | 99.76% | 0.01% | 3.62% | 0.08% | 38.96% |
| MARV | 07h | 45.81 | 99.06% | 0.03% | 3.03% | 0.63% | 69.49% |
| MARV | 23h | 34.37 | 97.61% | 0.02% | 0.87% | 1.71% | 75.45% |
| Mock | 03h | 38.55 | 99.82% | 0.02% | 10.79% | 0.00% | 0.20% |
| Mock | 07h | 36.03 | 99.86% | 0.01% | 7.59% | 0.00% | 0.18% |
| Mock | 23h | 36.99 | 99.80% | 0.02% | 9.84% | 0.00% | 0.16% |