| Literature DB >> 24698958 |
Manreetpal Singh Brar1, Mang Shi2, Raymond Kin-Hi Hui1, Frederick Chi-Ching Leung1.
Abstract
Most studies on PRRSV evolution have been limited to a particular region of the viral genome. A thorough genome-wide understanding of the impact of different mechanisms on shaping PRRSV genetic diversity is still lacking. To this end, deep sequencing was used to obtain genomic sequences of a diverse set of 16 isolates from a region of Hong Kong with a complex PRRSV epidemiological record. Genome assemblies and phylogenetic typing indicated the co-circulation of strains of both genotypes (type 1 and type 2) with varying Nsp2 deletion patterns and distinct evolutionary lineages ("High Fever"-like and local endemic type). Recombination analyses revealed genomic breakpoints in structural and non-structural regions of genomes of both genotypes with evidence of many recombination events originating from common ancestors. Additionally, the high fold of coverage per nucleotide allowed the characterization of minor variants arising from the quasispecies of each strain. Overall, 0.56-2.83% of sites were found to be polymorphic with respect to cognate consensus genomes. The distribution of minor variants across each genome was not uniform indicating the influence of selective forces. Proportion of variants capable of causing an amino acid change in their respective codons ranged between 25-67% with many predicted to be non-deleterious. Low frequency deletion variants were also detected providing one possible mechanism for their sudden emergence as cited in previous reports.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24698958 PMCID: PMC3974674 DOI: 10.1371/journal.pone.0088807
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of sample information, assembly-related statistics, and genomic features of sequenced PRRSV strains.
| Sample ID | Isolation date | Sample nature | Genotype | No of reads | No of contigs | Reads captured in contig(s) | Coverage | Average sequence depth | Genomic divergence from reference strains (p-distance) | Nsp2 deletions |
| #1 | 2003 | CCS | Type 2 | 15939 | 1 | 15898 | 96.46 | 312 | 0.098 | Δ314–439,482–486, 832–848 |
| #2 | 2003 | CCS | Type 2 | 19083 | 1 | 18993 | 96.46 | 389 | 0.119 | Δ469–518, 813–848 |
| #3 | 2003 | CCS | Type 1 | 15320 | 1 | 15275 | 98.16 | 319 | 0.086 | Δ180,417–420 |
| #4 | 2003 | CCS | Type 2 | 17121 | 1 | 17043 | 96.46 | 349 | 0.101 | None |
| #5 | 2004 | CCS | Type 1 | 14700 | 1 | 14671 | 98.16 | 306 | 0.089 | Δ180, 417–420 |
| #6 | 2004 | CCS | Type 2 | 22731 | 1 | 22696 | 96.46 | 462 | 0.126 | Δ481, 813–848 |
| #7 | 2004 | CCS | Type 2 | 15650 | 1 | 15625 | 96.46 | 333 | 0.100 | Δ314–439, 482–486, 832–848 |
| #8 | 2004 | CCS | Type 1 | 290 | 7 | 290 | 92.11 | 7 | 0.087 | Δ180, 288–294, 348–376, 417–420 |
| #9 | 2004 | CCS | Type 2 | 11864 | 1 | 11796 | 96.46 | 223 | 0.127 | Δ813–848 |
| #10 | 2004 | CCS | Type 1 | 17943 | 2 | 17902 | 98.11 | 368 | 0.088 | Δ180, 417–420 |
| #11 | 2004 | CCS | Type 2 | 15929 | 1 | 15892 | 96.46 | 328 | 0.123 | Δ813–848 |
| #12 | 2004 | CCS | Type 2 | 20636 | 1 | 20583 | 96.46 | 418 | 0.103 | None |
| #13 | 2005 | CCS | Type 2 | 7335 | 1 | 7311 | 96.46 | 141 | 0.096 | Δ813–848 |
| #14 | 2004 | CCS | Type 2 | 5595 | 1 | 5584 | 96.46 | 108 | 0.003 | None |
| #15 | 2004 | SLT | Type 2 | 5400 | 1 | 5391 | 96.46 | 110 | 0.118 | Δ813–848 |
| #16 | 2004 | SLT | Type 2 | 4205 | 1 | 4140 | 96.46 | 81 | 0.127 | Δ813–848 |
: cell culture supernatant (CCS); swine lung tissue (SLT).
: coverage estimated using prototype isolates for type 1 (LV) and type 2 (ATCC VR2332) PRRSV as references.
*: positions of amino acid deletions given are based on the Nsp2 protein from prototype isolates of type 1 (LV) and type 2 (ATCC VR2332) PRRSV.
: number obtained after quality filtering procedures detailed in the methods section.
: depth calculation takes into account duplicate reads as well.
Figure 1Deep sequencing of HK PRRSV strains.
Shown from the periphery to the centre of the figure for each strain is strain ID, genome ideogram (blue for type 1; red for type 2), sequencing depth across the genome, minor variants detected across the genome with at least 1% frequency (<10% in green, > = 10% in yellow, > = 30% in red), amino acid comparisons (green for type 1; blue for type 2) with reference strains (LV for type 1; ATCC VR2332 for type 2) for individual ORFs across the genome (non-conformity represented by white streaks), and pairwise aligned amino acid differences between sequenced strains for individual ORFs (non-structural ones in light shade; structural ones in dark shade) across the genome (blue for type 1; red for type 2).
Figure 2Phylogenomic and recombination characterization of HK PRRSV strains.
(A) Genotype-specific (type 1 left; type 2 right) phylogenies of HK and globally sequenced PRRSV genomes. Type 1 strains form a monophyletic clade separate from isolates of any other country or region. Type 2 strains classified with either previously known local as known as “endemic” diversity or the “High-Fever”-like variants isolated during virulent outbreaks of PRRS in China. (B) Mapped recombination breakpoints and potential minor parental-like strains of mosaic HK PRRSV strains. Unknown designation (for a missing parental-like strain) was assigned when a recombination event was characterized from an alignment involving only one parental-like strain and the recombinant.
Summary on the nature, incidence, and impact on codons of minor variants across consensus genomes of HK PRRSV strains.
| Strain ID | Non-coding region variants | Coding region variants | ||||||||||
| Transitions | Transversions | InDels | Overall codon diversity (%) | Nonsynonymous variants (%) | Nonsynonymous codon variants (%) | Mean | ||||||
| A↔G | C↔T | A↔T | C↔G | A↔C | G↔T | |||||||
| #1 | 0 | 108 | 143 | 4 | 1 | 2 | 4 | 0 | 5.04 | 57.25 | 56.46 | 4.03 |
| #2 | 0 | 121 | 170 | 13 | 5 | 8 | 20 | 0 | 6.35 | 45.99 | 45.13 | 3.62 |
| #3 | 0 | 68 | 98 | 1 | 3 | 3 | 4 | 0 | 3.40 | 63.84 | 63.33 | 1.74 |
| #4 | 0 | 101 | 152 | 5 | 1 | 2 | 3 | 0 | 4.98 | 66.67 | 64.03 | 3.62 |
| #5 | 1 | 64 | 83 | 4 | 1 | 2 | 6 | 0 | 3.16 | 46.25 | 45.93 | 4.11 |
| #6 | 0 | 165 | 235 | 11 | 2 | 7 | 4 | 0 | 7.81 | 58.02 | 60.10 | 5.70 |
| #7 | 0 | 87 | 93 | 4 | 0 | 2 | 6 | 0 | 3.70 | 53.13 | 53.23 | 2.22 |
| #8 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0.02 | 0 | 0 | / |
| #9 | 1 | 140 | 180 | 16 | 6 | 9 | 12 | 1 | 6.47 | 52.75 | 51.67 | 8.81 |
| #10 | 1 | 94 | 165 | 7 | 2 | 5 | 3 | 0 | 5.23 | 48.19 | 48.91 | 5.36 |
| #11 | 0 | 103 | 143 | 9 | 1 | 5 | 10 | 0 | 5.39 | 53.51 | 53.96 | 4.65 |
| #12 | 1 | 134 | 162 | 4 | 2 | 8 | 7 | 0 | 6.14 | 48.26 | 47.35 | 3.34 |
| #13 | 0 | 53 | 77 | 3 | 2 | 3 | 1 | 1 | 2.70 | 51.43 | 51.06 | 3.04 |
| #14 | 0 | 129 | 187 | 15 | 3 | 12 | 18 | 1 | 6.89 | 25.14 | 26.55 | 12.72 |
| #15 | 0 | 30 | 46 | 1 | 1 | 0 | 3 | 0 | 1.59 | 48.15 | 48.15 | 2.25 |
| #16 | 1 | 31 | 65 | 5 | 0 | 4 | 5 | 0 | 2.01 | 43.64 | 46.15 | 8.16 |
*: percentage of codons (out of the total number of codons) in the coding region with at least one minor variant.
: percentage of minor variants (out of the total number of variants) capable of causing an amino acid change in their respective consensus codons.
: percentage of codons with variants (out of the total number of codons with variants) capable of causing an amino acid change in their respective consensus codons.
Sn: Shannon entropy.
Figure 3Genetic complexity arising from nucleotide variants across ORFs of different PRRSV strains.
The quantitative measure of mean normalized Shannon (Sn) entropy accommodated for differences in sequencing depth and variable ORF sizes. Open (○) and closed (•) circles represented ORF and genomic mean Sn entropies respectively.
Figure 4Predicted effect of minor nonsynonymous variants on the protein structure of different PRRSV ORFs.
Predicted effects were classified as being either neutral (○) or non-neutral (•) only when such effects were independently confirmed by PROVEAN and SNAP tools. Variants which had differing predicted effects by the two software tools were excluded. For PROVEAN, “neutral” designation was assigned when a score higher than −2.5 was obtained and “non-neutral” when lower. Confidence of predictions made by SNAP is indicated by the expected accuracy score generated by the tool. Results are summarized based on individual ORFs (first eight panels) and viral strains (color legend, panel 8). The distribution of consensus predictions (out of total predictions) by the two programs across ORFs of different strains is summarized in the last panel as boxplots (bottom right).
Summary features of deletions detected in minor variants of a subset of HK PRRSV strains.
| Strain | Genomic position | Region | Affected sequence | Variant Frequency (%) | |
| Nucleotide | Amino Acid | ||||
| #9 | 12807–12809 | ORF2a_ORF3 | ATT/− | I/− | 6.17 |
| #13 | 3799–3801 | ORF1a | GGT | G/− | 28.57 |
| #14 | 2713–2715 | ORF1a |
| PVSLGGDVP/− | 7.89 |
*: Deletion falls in the overlapping region of adjacent ORFs affecting both.