| Literature DB >> 31167947 |
Justine Cheval1, Erika Muth1, Gaëlle Gonzalez2, Muriel Coulpier2, Pascale Beurdeley1, Stéphane Cruveiller1, Marc Eloit3,4,5.
Abstract
The use of high-throughput sequencing (HTS) to identify viruses in biologicals differs from current molecular approaches, since its use enables an unbiased approach to detection without the need to design specific primers to preamplify target sequences. Its broad range of detection and analytical sensitivity make it an important tool to ensure that biologicals are free from adventitious viruses. Similar to other molecular methods, however, identification of viral sequences in cells by HTS does not prove viral infection, since this could reflect carryover of inert viral sequences from reagents or other sources or the presence of transcriptionally inactive cellular sequences. Due to the broad range of detection associated with HTS, the above can potentially be perceived as a drawback for the testing of pharmaceutical biological products using this method. In order to avoid the identification of inert viral sequences, we present a methodology based on metabolic RNA labeling and sequencing, which enables the specific identification of newly synthesized viral RNAs in infected cells, resulting in the ability to unambiguously distinguish active infection by DNA or RNA viruses from inert nucleic acids. In the present study, we report the ability to differentiate Vero cells acutely infected by a single-stranded positive-sense RNA virus (tick-borne encephalitis virus) from cells which have been in contact with nonreplicating virus particles. Additionally, we also found a laboratory contamination by the squirrel monkey retrovirus of our Vero cell line, which was derived from an Old World (African green) monkey, a type of contamination which until now has been identified only in cells derived from primates from the New World.IMPORTANCE The use of high-throughput sequencing (HTS) to identify viral contamination of biological products is extremely sensitive and provides a broad range of detection. Nevertheless, viral sequences identified can also be inert. Examples include contamination resulting from reagents or the presence of inactivated viruses in animal-derived components of the cell culture medium. We therefore developed a method that relies on the sequencing of newly synthesized RNAs, an unequivocal sign of the presence of a transcriptionally active virus. This improvement in the specificity of viral testing increases the acceptability of HTS as a standard test for cells used in manufacturing biologicals and in biotherapies.Entities:
Keywords: 4-thiouridine; RNA-seq; Vero; biologicals; high-throughput sequencing; metagenomics; squirrel monkey retrovirus; tick-borne encephalitis virus; virus safety
Mesh:
Substances:
Year: 2019 PMID: 31167947 PMCID: PMC6553555 DOI: 10.1128/mSphere.00298-19
Source DB: PubMed Journal: mSphere ISSN: 2379-5042 Impact factor: 4.389
Number of reads on TBEV and SMRV genomes and horizontal coverage of the genome
| Virus and parameter | Value for parameter at the following time and treatment | |||
|---|---|---|---|---|
| Day 0, no 4sU | Day 1, no 4sU | Day 1, 4sU + alkylation | Day 1, 4sU, no alkylation | |
| TBEV | ||||
| No. of reads | 160,085 (0.27) | 6,408,291 (0.35) | 5,240,211 (0.36) | 5,722,338 (0.32) |
| Horizontal coverage (%) | 100 | 100 | 100 | 100 |
| SMRV | ||||
| No. of reads | 1,807,960 (0.39) | 1,601,090 (0.40) | 1,816,788 (0.48) | 2,479,933 (0.37) |
| Horizontal coverage (%) | 100 | 100 | 100 | 100 |
Reads were mapped on the genomes of TBEV and SMRV found by the agnostic procedure (see Table S1 in the supplemental material).
The number of reads shows the number of reads of the virus on the genome. The values in parentheses are the percentages of negative-sense/total number of reads.
Horizontal coverage of the genome is shown as a percentage of the genome.
Conversion rate of T nucleotides and conversion index
| Sample and parameter | Value for parameter for sample at the following time and treatment | |||
|---|---|---|---|---|
| Day 0, no 4sU | Day 1, no 4sU | Day 1, 4sU + alkylation | Day 1, 4sU, no alkylation | |
| TBEV | ||||
| T→C rate (%) | 0.15 | 0.16/0.16 | 0.87/0.82 | 0.16/0.14 |
| T→A rate (%) | 0.05 | 0.05/0.05 | 0.10/0.10 | 0.05/0.05 |
| T→G (%) | 0.11 | 0.13/0.13 | 0.18/0.18 | 0.12/0.12 |
| Conversion index | 2.12 | 1.78/1.76 | 6.09/5.83 | 1.81/1.60 |
| SMRV | ||||
| T→C rate (%) | 0.15 | 0.15/0.11 | 3.49/3.45 | 0.15/0.11 |
| T→A rate (%) | 0.05 | 0.05/0.05 | 0.10/0.10 | 0.05/0.05 |
| T→G rate (%) | 0.07 | 0.08/0.08 | 0.11/0.12 | 0.07/0.08 |
| Conversion index | 2.41 | 2.42/1.80 | 32.95/31.67 | 2.46/1.80 |
| Cellular transcripts (internal control) | ||||
| T→C rate (%) | 0.10 | 0.06 | 1.83 | 0.05 |
| T→A rate (%) | 0.03 | 0.02 | 0.04 | 0.01 |
| T→G rate (%) | 0.06 | 0.05 | 0.08 | 0.03 |
| Conversion index | 2.12 | 1.60 | 29.67 | 2.34 |
In each TBEV and SMRV cell, the first value (before the slash) and the second value (after the slash) correspond to the conversion rate or index based on the external consensus sequence derived from the D0 reads (“TBEV REFERENCE” and “SMRV REFERENCE”) or from internal consensus sequences directly derived from the reads of the experimental condition (“INTERNAL REFERENCE”), respectively.
FIG 1Distribution of conversion rates along the genomes of SMRV and TBEV. Conversion rates were calculated in reference to the reference sequence derived from day 0 (“TBEV REFERENCE” and “SMRV REFERENCE”) or from an internal consensus sequence based on the reads from each experimental condition (“INTERNAL REFFERENCE”). For the D0 time point, the two modes of calculation are therefore equivalent. (T to C, G, or A, in blue, orange, or green lines, respectively). Abbreviations: D0 and DO, day 0; 4Su, 4-thiouridine; alkyl, alkylation.
FIG 2Conversion indexes expressed as the ratio of the T→C conversion rate to the average of the (T→A, T→G) conversion rates. Conversion rates for viruses were calculated according to the two types of reference sequence as in Fig. 1. Conversion rates for cellular RNA controls are calculated according the CONTROL REFERENCE sequence. Abbreviations: N, no; Y, yes.
Comparison of SMRV-H and SMRV-Vero proteins
| Protein | Length of protein (no. of aa) in: | Amino acid identity (%) | |
|---|---|---|---|
| SMRV-H | SMRV-V | ||
| Gag | 740 | 742 | 93 |
| Protease | 90 | 90 | 98 |
| Pol | 798 | 798 | 99 |
| Env | 575 | 575 | 98 |