| Literature DB >> 21483482 |
Trevelyan J McKinley1, Pablo R Murcia, Julia R Gog, Mariana Varela, James L N Wood.
Abstract
The development of modern and affordable sequencing technologies has allowed the study of viral populations to an unprecedented depth. This is of particular interest for the study of within-host RNA viral populations, where variation due to error-prone polymerases can lead to immune escape, antiviral resistance and adaptation to new host species. Methods to sequence RNA virus genomes include reverse transcription (RT) and polymerase chain reaction (PCR). RT-PCR is a molecular biology technique widely used to amplify DNA from an RNA template. The method itself relies on the in vitro synthesis of copy DNA from RNA followed by multiple cycles of DNA amplification. However, this method introduces artefactual errors that can act as confounding factors when the sequence data are analysed. Although there are a growing number of published studies exploring the intra- and inter-host evolutionary dynamics of RNA viruses, the complexity of the methods used to generate sequences makes it difficult to produce probabilistic statements about the likely sources of observed sequence variants. This complexity is further compounded as both the depth of sequencing and the length of the genome segment of interest increase. Here we develop a bayesian method to characterise and differentiate between likely structures for the background viral population. This approach can then be used to identify nucleotide sites that show evidence of change in the within-host viral population structure, either over time or relative to a reference sequence (e.g. an inoculum or another source of infection), or both, without having to build complex evolutionary models. Identification of these sites can help to inform the design of more focussed experiments using molecular biology tools, such as site-directed mutagenesis, to assess the function of specific amino acids. We illustrate the method by applying to datasets from experimental transmission of equine influenza, and a pre-clinical vaccine trial for HIV-1.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21483482 PMCID: PMC3068928 DOI: 10.1371/journal.pcbi.1002027
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Schematic comparison of clonal vs. SGA sequencing.
Figure 2Schematic diagram of sampling from the same or different populations exhibiting the same population structure (here based on
).
Figure 3Experimental designs for A: EIV, and B: HIV-1 studies.
Animal clipart reproduced from www.openclipart.org under the CC0 1.0 Universal Public Domain Dedication license.
Number of sequences obtained by animal and date from EIV study [6].
| Horse | Day |
|
|
| 3 | 152 |
| 5 | 82 | |
|
| 5 | 154 |
| 6 | 65 | |
| 7 | 154 | |
| 8 | 62 | |
|
| 5 | 67 |
| 6 | 52 | |
| 7 | 72 | |
| 8 | 81 | |
|
| 8 | 69 |
| 9 | 44 | |
| 10 | 50 | |
| 11 | 51 | |
|
| 7 | 83 |
| 8 | 75 | |
| 9 | 52 | |
| 11 | 54 | |
|
| 7 | 112 |
| 8 | 127 | |
|
| 11 | 49 |
| 12 | 54 | |
| 13 | 80 | |
| 15 | 46 | |
|
| 11 | 107 |
|
| 11 | 71 |
|
| 12 | 54 |
| 13 | 81 | |
| 14 | 79 |
*indicates the first naturally challenged animal.
Posterior probability of association, , for different sites from the EIV study.
| Horse | Position |
|
|
|
| 5447 | 478 | 1.0 | 1.0 | 1.0 |
| 6005 | 49 | 1.0 | 1.0 | 1.0 |
| 6005 | 884 | 0.98 | 1.0 | 1.0 |
| 2F50 | 231 | 0.18 | 0.69 | 0.92 |
| 7248 | 134 | 0.09 | 0.50 | 0.84 |
| 6005 | 61 | 0.08 | 0.47 | 0.82 |
Parentheses show the prior probability of association across all models of interest at a site. Sites shown are those in which the for the smallest prior (0.001) is >0.05.
Frequency of bases for site 478 in horse 5447 in the EIV study.
| Base | Challenge horse | Day 8 | Day 9 | Day 10 | Day 11 |
|
| 152 | 69 | 44 | 50 | 37 |
|
| 0 | 0 | 0 | 0 | 8 |
|
| 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 6 |
*Consensus base.
Summary of models and PPAs for site 478 in horse 5447 from the EIV study.
| Model | PPA |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Number of sequences obtained by animal and date from HIV-1 study (M. Varela and J. L. Heeney, in preparation).
| Animal | Week |
|
|
| 22 | |
|
| 2 | 40 |
| 4 | 17 | |
|
| 2 | 22 |
| 4 | 8 | |
|
| 2 | 27 |
| 4 | 6 | |
|
| 2 | 29 |
| 4 | 19 | |
|
| 2 | 42 |
| 4 | 10 | |
|
| 2 | 24 |
| 4 | 35 | |
|
| 2 | 19 |
| 4 | 41 | |
|
| 2 | 9 |
| 4 | 10 | |
|
| 2 | 15 |
| 4 | 17 | |
|
| 2 | 12 |
| 4 | 20 |
*indicates inoculum sequence.
Posterior probability of association, , for different sites from the HIV-1 study.
| Unvaccinated | Vaccinated | |||||
| Position | (0.001) | (0.01) | (0.05) | (0.001) | (0.01) | (0.05) |
| 1518 | 1.0 | 1.0 | 1.0 | |||
| 1518 | 0.27 | 0.79 | 0.95 | |||
| 1449 | 1.0 | 1.0 | 1.0 | |||
| 491 | 1.0 | 1.0 | 1.0 | |||
| 2387 | 1.0 | 1.0 | 1.0 | |||
| 994 | 1.0 | 1.0 | 1.0 | |||
| 994 | 0.99 | 1.0 | 1.0 | |||
| 994 | 0.15 | 0.64 | 0.90 | |||
| 1006 | 1.0 | 1.0 | 1.0 | |||
| 1006 | 0.99 | 1.0 | 1.0 | |||
| 1006 | 0.15 | 0.64 | 0.90 | |||
| 1285 | 1.0 | 1.0 | 1.0 | |||
| 1744 | 1.0 | 1.0 | 1.0 | |||
| 1752 | 1.0 | 1.0 | 1.0 | |||
| 1752 | 1.0 | 1.0 | 1.0 | |||
| 1752 | 1.0 | 1.0 | 1.0 | |||
| 2470 | 1.0 | 1.0 | 1.0 | |||
| 2470 | 1.0 | 1.0 | 1.0 | |||
| 449 | 1.0 | 1.0 | 1.0 | |||
| 836 | 1.0 | 1.0 | 1.0 | |||
| 2219 | 1.0 | 1.0 | 1.0 | |||
| 393 | 1.0 | 1.0 | 1.0 | |||
| 393 | 0.37 | 0.86 | 0.97 | |||
| 756 | 1.0 | 1.0 | 1.0 | |||
| 756 | 0.06 | 0.39 | 0.77 | |||
| 433 | 0.99 | 1.0 | 1.0 | |||
| 433 | 0.14 | 0.62 | 0.90 | |||
| 771 | 0.99 | 1.0 | 1.0 | |||
| 771 | 0.07 | 0.44 | 0.80 | |||
| 942 | 0.99 | 1.0 | 1.0 | |||
| 273 | 0.99 | 1.0 | 1.0 | |||
| 2290 | 0.97 | 1.0 | 1.0 | |||
| 2446 | 0.97 | 1.0 | 1.0 | |||
| 750 | 0.94 | 0.99 | 1.0 | |||
| 138 | 0.89 | 0.99 | 1.0 | |||
| 138 | 0.05 | 0.35 | 0.74 | |||
| 1644 | 0.79 | 0.97 | 1.0 | |||
| 418 | 0.77 | 0.97 | 0.99 | |||
| 7 | 0.74 | 0.97 | 0.99 | |||
| 406 | 0.70 | 0.96 | 0.99 | |||
| 406 | 0.15 | 0.65 | 0.90 | |||
| 1305 | 0.60 | 0.94 | 0.99 | |||
| 1305 | 0.56 | 0.93 | 0.99 | |||
| 504 | 0.39 | 0.86 | 0.97 | |||
| 1512 | 0.39 | 0.86 | 0.97 | |||
| 1792 | 0.31 | 0.82 | 0.96 | |||
| 1525 | 0.31 | 0.82 | 0.96 | |||
| 2492 | 0.26 | 0.78 | 0.95 | |||
| 680 | 0.25 | 0.77 | 0.95 | |||
| 1347 | 0.25 | 0.77 | 0.95 | |||
| 1479 | 0.15 | 0.64 | 0.90 | |||
| 2007 | 0.13 | 0.61 | 0.89 | |||
| 777 | 0.11 | 0.56 | 0.87 | |||
| 270 | 0.11 | 0.54 | 0.86 | |||
| 386 | 0.09 | 0.51 | 0.85 | |||
| 1668 | 0.09 | 0.51 | 0.85 | |||
| 1134 | 0.09 | 0.50 | 0.84 | |||
| 475 | 0.09 | 0.50 | 0.84 | |||
| 2340 | 0.09 | 0.50 | 0.84 | |||
| 426 | 0.08 | 0.48 | 0.83 | |||
| 445 | 0.08 | 0.48 | 0.83 | |||
| 46 | 0.06 | 0.39 | 0.77 | |||
Each line corresponds to a different animal. Values in parentheses show the prior probability of association across all models of interest at a site and reported sites are those for which the with the smallest prior (0.001) is >0.05.
Frequency of bases for site 994 in animals R01093, BB204 and R99004 in the HIV-1 study.
| Inoculum | R01093 | BB204 | R99004 | ||||
| Wk 2 | Wk 4 | Wk 2 | Wk 4 | Wk 2 | Wk 4 | ||
|
| 15 | 0 | 0 | 0 | 0 | 1 | 8 |
|
| 7 | 24 | 33 | 20 | 8 | 13 | 31 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 |
*Consensus base.
Summary of models and PPAs for site 994 in animals R01093, BB204 and R99004 in the HIV-1 study.
| R01093 | BB204 | R99004 | |||
| Model | PPA | Model | PPA | PPA | PPA |
|
| 0.78 |
| 0.73 |
| 0.82 |
|
| 0.17 |
| 0.16 |
| 0.12 |
|
| 0.03 |
| 0.05 |
| 0.03 |
|
| 0.01 |
| 0.02 |
| 0.02 |
|
| 0.01 |
| 0.02 |
| 0.01 |
|
|
|
|
|
|
|
Posterior probability of association, , for different sites using a more stringent criterion from the HIV-1 study.
| Unvaccinated | Vaccinated | |||||
| Position | (0.001) | (0.01) | (0.05) | (0.001) | (0.01) | (0.05) |
| 1449 | 1.0 | 1.0 | 1.0 | |||
| 1518 | 1.0 | 1.0 | 1.0 | |||
| 1518 | 0.66 | 0.95 | 0.99 | |||
| 491 | 1.0 | 1.0 | 1.0 | |||
| 994 | 1.0 | 1.0 | 1.0 | |||
| 994 | 0.82 | 0.98 | 1.0 | |||
| 994 | 0.43 | 0.89 | 0.98 | |||
| 1006 | 1.0 | 1.0 | 1.0 | |||
| 1006 | 0.82 | 0.98 | 1.0 | |||
| 1006 | 0.43 | 0.89 | 0.98 | |||
| 1752 | 1.0 | 1.0 | 1.0 | |||
| 1752 | 0.97 | 1.0 | 1.0 | |||
| 1752 | 0.90 | 0.99 | 1.0 | |||
| 2470 | 1.0 | 1.0 | 1.0 | |||
| 2470 | 1.0 | 1.0 | 1.0 | |||
| 1285 | 1.0 | 1.0 | 1.0 | |||
| 836 | 1.0 | 1.0 | 1.0 | |||
| 756 | 1.0 | 1.0 | 1.0 | |||
| 756 | 0.23 | 0.75 | 0.94 | |||
| 393 | 0.99 | 1.0 | 1.0 | |||
| 393 | 0.61 | 0.94 | 0.99 | |||
| 2290 | 0.98 | 1.0 | 1.0 | |||
| 2446 | 0.98 | 1.0 | 1.0 | |||
| 449 | 0.94 | 0.99 | 1.0 | |||
| 138 | 0.85 | 0.98 | 1.0 | |||
| 1305 | 0.79 | 0.97 | 1.0 | |||
| 1305 | 0.36 | 0.85 | 0.97 | |||
| 2219 | 0.75 | 0.97 | 0.99 | |||
| 504 | 0.46 | 0.89 | 0.98 | |||
| 1512 | 0.46 | 0.89 | 0.98 | |||
| 771 | 0.39 | 0.87 | 0.97 | |||
| 1134 | 0.32 | 0.82 | 0.96 | |||
| 406 | 0.28 | 0.80 | 0.95 | |||
| 750 | 0.27 | 0.79 | 0.95 | |||
| 750 | 0.09 | 0.51 | 0.84 | |||
| 1479 | 0.15 | 0.65 | 0.90 | |||
| 261 | 0.10 | 0.54 | 0.86 | |||
Each line corresponds to a different animal. Values in parentheses shown prior probabilities of association and reported sites are those for which the with the smallest prior (0.001) is >0.05.