| Literature DB >> 23822119 |
Denise A Marston, Lorraine M McElhinney, Richard J Ellis, Daniel L Horton, Emma L Wise, Stacey L Leech, Dan David, Xavier de Lamballerie, Anthony R Fooks.
Abstract
BACKGROUND: With the advent of Next Generation Sequencing (NGS) technologies, the ability to generate large amounts of sequence data has revolutionized the genomics field. Most RNA viruses have relatively small genomes in comparison to other organisms and as such, would appear to be an obvious success story for the use of NGS technologies. However, due to the relatively low abundance of viral RNA in relation to host RNA, RNA viruses have proved relatively difficult to sequence using NGS technologies. Here we detail a simple, robust methodology, without the use of ultra-centrifugation, filtration or viral enrichment protocols, to prepare RNA from diagnostic clinical tissue samples, cell monolayers and tissue culture supernatant, for subsequent sequencing on the Roche 454 platform.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23822119 PMCID: PMC3708773 DOI: 10.1186/1471-2164-14-444
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Details of virus isolates analysed in this study
| | | | | | |
| RV2324 | ISR-50 | Dog | Israel | 1950 | KF154998 |
| RV50 | AZBAT 7453 | Bat ( | US, Az | 1975 | JQ685956* |
| RV61 | - | Human (canine strain) | UK (ex-India) | 1987 | KF154996 |
| RV437 | 269 | Raccoon Dog | Estonia | - | KF154997 |
| RV2417 | R16/08 | Dog | UK (ex Sri Lanka) | 2008 | KF154999 |
| RV2627 | CASA 09/08 | Cow (Canine strain) | Morocco | 2009 | KF155001 |
| RV2772 | Rab91/ D10-867 | Dog | Tanzania | 2010 | KF155002 |
| RV2516 | 23 | Cow (Canine strain) | Iraq | 2010 | KF155000 |
| | | | | | |
| RV20 | RA552 | Bat ( | Denmark | 1986 | KF155003 |
| | | | | | |
| RV1787 | 603-04† | Bat ( | UK (Staines) | 2004 | KF155004 |
| | | | | | |
| RV2508 | snp0971# | African Civet | Tanzania | 2011 | JX193798 |
* Published after sequence was obtained prior to this publication.
† Sequence confirmed by Sanger sequencing from the bat brain material.
#Both brain and TCSN were sequenced. For analysis see Table 2.
- Data not available.
Details of brain and BHK cultured samples, relating to the extraction method used and RNA concentration during preparation stages resulting in number of reads obtained and outcome of obtaining full viral genome sequence
| RV437 | mouse | TRIzol® | 5,404.4 | 410.77 | 121.77 | 282,370 | 1,489 | 0.53 | Yesa | Yes |
| RV61 | mouse | TRIzol® | 3,362.8 | ND | 58.9 | 191,508# | 1,810 | 0.95 | Yesc | Yes |
| RV2417 | Dog | TRIzol® | 5,431.3 | 1799.21 | 57.78 | 96,014 | 1,488 | 1.55 | Yesb | Yes |
| RV2508 | African civet^ | TRIzol® | 6,537.4 | 66.55 | 28.18 | 139,841 | 626 | 0.45 | Yesa | Yes |
| RV2627 | Cow | TRIzol® | 2,385.8 | 111.81 | 26.60 | 11,470 | 682 | 5.95 | Yes | Yes |
| RNeasy mini | N/A | 4.99 | 0.95 | 239 | 2 | 0.84 | No | No | ||
| RV2516 | Cow | TRIzol® | 1,891.2 | 45.84 | 11.09 | 154,068# | 550 | 0.36 | Yesa | Yes |
| RNeasy mini | N/A | 11.12 | 2.14 | 7,991 | 17 | 0.21 | No | No | ||
| RV50 | mouse | TRIzol® | 2,186.2 | 18.76 | 7.64 | 76,568 | 293 | 0.38 | Yes | Yes |
| RV2772 | Dog | TRIzol® | 1,833.9 | 3.27 | 1.50 | 305,110 | 391 | 0.13 | Yes | Yes |
| RNeasy mini | N/A | 7.19 | 1.08 | 48,849 | 180 | 0.37 | No | Yes | ||
| RV2324 | cell pellet | TRIzol® | 10,232.9 | 870.72 | 75.63 | 101,599 | 2,027 | 1.99 | Yes | Yes |
| RV20 depleted | TCSN 100 ml | TRIzol® LS | 3,732.6 | 2.8 | 0.52 | 2,537 | 1,054 | 41.55 | Yes | Yes |
| RV20 | Diluted to 1,000 | ND | 23.47 | 10,332 | 4,369 | 42.29 | Yes | Yes | ||
| RV1787 depleted | TCSN 100 ml | TRIzol® LS | 4,603 | 37.87 | 11.56 | 8,162 | 53 | 0.65 | No | Yes |
| RV1787 | Diluted to 1,000 | ND | 119.47 | 6,861 | 87 | 1.27 | No | Yes | ||
| RV2508 cells | TCSN 250 μl | TRIzol® LS | 129.44 | ND | 11.45 | 14,587# | 223 | 1.529 | No* | Yes |
| RV2508 no cells | 127.05 | ND | 0.46 | 326 | 42 | 12.88 | No* | Yes | ||
| RV2508 high titre | 112.5 | ND | 1.31 | 1,132 | 12 | 1.06 | No* | Yes | ||
TCSN tissue culture supernatant; N/A – Not Applicable as DNAse treatment is incorporated in the RNA extraction protocol, therefore no concentration before gDNA depletion is available; ND - Not Done as gDNA was not depleted from these samples; 1This column contains the RNA concentration after rRNA depletion for all samples, except RV20, RV1787 and RV2508 TCSN where no rRNA depletion was undertaken. †brain either from host, or from one passage in a mouse, *combining these reads resulted in a single genome contig; # combined results from two 454 runs of same library; ^ Stored in RNAlater; a 3’UTR not represented; b 5’UTR not represented; c 3’ and 5’ UTR not represented.
Brain tissue samples ordered by concentration after the depletion process. Where rRNA depletion was not undertaken (RV20, RV1787 and RV2508 TCSN), the concentration values in the Ribogreen column are directly comparable to the RNA extraction concentration.
Figure 1Depth of sequence obtained from (a) infected brain samples and (b) from cultured samples.
Details of single nucleotide polymorphisms (SNPs) detected within the 454 data
| | | | | | |||||||
| RV437 | 44 | 103 | 60 | 8 | 2768 | M (275) | T | C | Leu | Ser | 92 |
| 63 | 3 | 2781 | M (288) | T | C | Val | Val | 96 | |||
| 55 | 51 | 3049 | M (556) | T | C | Leu | Phe | 186 | |||
| 28 | 18 | 4403 | G (1090) | T | C | Leu | Leu | 364 | |||
| 28 | 7 | 7855 | L (2451) | C | T | Asn | Asn | 817 | |||
| RV2417 | 35.5 | 81 | NA | | | | | | | | |
| RV61 | 53.2 | 165 | 34 | 18 | 3503 | G (209) | C | T | Ile | Thr | 70 |
| RV2508 | 17 | 53 | 17 | 23 | 9523 | L (4128) | A | G | Ser | Ser | 1376 |
| RV2627 | 17.7 | 46 | NA | | | | | | | | |
| RV2516 | 16.5 | 43 | NA | | | | | | | | |
| RV50 | 8.6 | 16 | 11 | 27 | 11349 | L (5944) | C | G | Asn | Asp | 1982 |
| RV2772 | 14.5 | 31 | 13 | 38 | 1293 | N (1223) | A | G | Lys | Arg | 408 |
| 22 | 5 | 8179 | L (2771) | T | A | Phe | Tyr | 924 | |||
| | | | | | |||||||
| RV2324 | 65 | 272 | 47 | 11 | 3490 | G (175) | G | A | Glu | Lys | 59 |
| 47 | 21 | 3491 | G (176) | A | G | Glu | Gly | 59 | |||
| 42 | 31 | 3711 | G (396) | C | A | His | Gln | 132 | |||
| 44 | 25 | 4103 | G (788) | A | G | Gln | Arg | 263 | |||
| 36 | 11 | 8376 | L (2967) | C | T | Asp | Asp | 989 | |||
| RV1787 combined | 4.3 | 11 | 4 | 25 | 2679 | M (170) | G | A | Gly | Glu | 57 |
| RV20 combined | 156 | 277 | 107 | 2 | 346 | N (276) | C | A/T | Tyr | */Tyr | 92 |
| 87 | 2 | 1621 | P (105) | T | C | Ser | Ser | 35 | |||
| 81 | 2 | 1669 | P (153) | G | A | Glu | Glu | 51 | |||
| 77 | 3 | 1710 | P (194) | A | T | Gln | Leu | 65 | |||
| 93 | 39 | 3143 | M-G (NA) | C | T | NA | NA | NA | |||
| 103 | 2 | 3671 | G (356) | G | A | Trp | * | 119 | |||
| 96 | 2 | 4403 | G (1087) | T | C | Cys | Arg | 363 | |||
| 94 | 37 | 4584 | G (1268) | T | C | Met | Thr | 423 | |||
| 95 | 2 | 6217 | L (766) | T | C | Ser | Pro | 256 | |||
| 134 | 1 | 7910 | L (2459) | T | C | Ile | Thr | 820 | |||
| 107 | 2 | 8244 | L (2793) | C | A | Ser | Arg | 931 | |||
| 106 | 2 | 8473 | L (3022) | A | G | Thr | Ala | 1008 | |||
NA = not applicable as no SNPs were observed. * stop codon.
Figure 2Alignment of 454 reads for (a) RV20 consensus sequence and the reads with variations, (b) RV61 consensus sequence and published RV61 glycoprotein gene sequences and (c) RV50 published genome sequence; * indicating single nucleotide polymorphisms which are detailed at the top of each figure.
Figure 3Typical variation of the reads over a homopolymeric repeat at the termination signal of the M-gene displayed as (a) an alignment of individual 454 reads with the consensus sequence (contig00001) and a reference sequence (RV437) and (b) a flowgram with reference sequence (RV437, top panel), read G3FY5P04JF0R0 (middle panel) and the difference between the reference sequence and read (bottom panel).