| Literature DB >> 29693634 |
Luca Ferretti1, Antonello Di Nardo2, Benjamin Singer3, Lidia Lasecka-Dykes4, Grace Logan5, Caroline F Wright6, Eva Pérez-Martín7, Donald P King8, Tobias J Tuthill9, Paolo Ribeca10.
Abstract
Recombination is one of the determinants of genetic diversity in the foot-and-mouth disease virus (FMDV). FMDV sequences have a mosaic structure caused by extensive intra- and inter-serotype recombination, with the exception of the capsid-encoding region. While these genome-wide patterns of broad-scale recombination are well studied, not much is known about the patterns of recombination that may exist within infected hosts. In addition, detection of recombination among viruses evolving at the within-host level is challenging due to the similarity of the sequences and the limitations in differentiating recombination from point mutations. Here, we present the first analysis of recombination events between closely related FMDV sequences occurring within buffalo hosts. The detection of these events was made possible by the occurrence of co-infection of two viral swarms with about 1% nucleotide divergence. We found more than 15 recombination events, unequally distributed across eight samples from different animals. The distribution of these events along the FMDV genome was neither uniform nor related to the phylogenetic distribution of recombination breakpoints, suggesting a mismatch between within-host evolutionary pressures and long-term selection for infectivity and transmissibility.Entities:
Keywords: intra-host diversity; linkage disequilibrium; quasi-species; recombination
Mesh:
Substances:
Year: 2018 PMID: 29693634 PMCID: PMC5977214 DOI: 10.3390/v10050221
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Average length for mapped reads (Avg Read Len), average coverage (Avg Cov), number of selected SNPs, and their average minor allele frequency (Avg Freq) for each sample. SAT: South African Territories.
| Sample | Serotype | Host | Avg Read Len | Avg Cov | # SNPs | Avg Freq |
|---|---|---|---|---|---|---|
| SAT3/BOT-BUFF/13/70 | SAT3 | African buffalo | 122 | 10,970 | 58 | 0.36 |
| SAT1/UGA-BUFF/10/70 | SAT1 | African buffalo | 120 | 7843 | 36 | 0.17 |
| SAT2/BOT-BUFF/2/69 | SAT2 | African buffalo | 114 | 9330 | 67 | 0.34 |
| SAT2/BOT-BUFF/17/69 | SAT2 | African buffalo | 119 | 9255 | 115 | 0.19 |
| SAT2/BOT-BUFF/107/72 | SAT2 | African buffalo | 128 | 4735 | 108 | 0.15 |
| SAT1/TAN/22/2012 | SAT1 | cattle | 160 | 11,639 | 31 | 0.17 |
| SAT3/ZAM/P2/96 (NAN-11) | SAT3 | African buffalo | 165 | 3098 | 144 | 0.31 |
| SAT3/ZIM/P6/83 BUFF-16 | SAT3 | African buffalo | 100 | 1413 | 34 | 0.38 |
| Samples from Ramirez et al. [ | A, O, Asia1 | Asian buffalo | see [ | see [ | see [ | see [ |
Figure 1(A) Schematic illustration of the effect of a recombination event on sequences and SNPs. Each mutation occurs at most once along the tree. The left part of the sequence comes from a single genealogical tree and all variants are either strongly associated to each other or completely disjoint, hence there is complete linkage disequilibrium (). The SNPs in the example have frequencies 0.5 and 0.75 and overlap in 50% of the sequences, hence D = 0.5 − 0.5 × 0.75 = 0.125. Since the overlap is also the largest possible one for these frequencies, = 0.125 and . The recombination event in the middle of the sequence changes the genealogy (as can be be seen in the genealogical tree on the right, the event prunes the dashed branch and regrafts it elsewhere). Variants in the right part of the sequence are unrelated to the ones across the recombination event, i.e., the presence of one variant in a sequence is not informative about the presence of the other, and linkage disequilibrium is much reduced (). In the example, the two variants both have frequency 0.5 but their frequency of overlapping is 0.25, hence D = 0.25 − 0.5 × 0.5 = 0. The maximum overlapping frequency would be 0.5, hence = 0.5 − 0.5 × 0.5 = 0.25 and . (B) Schematic illustration of the two extreme scenarios for recombination, starting with a mixture of two strains. In the case of freely recombining sequences, the number of recombinants increases steadily with time. The inferred recombination rate corresponds to where r is the recombination rates and t is the sampling time elapsed after the strains were mixed. In the epistatic case, if the fitness of the recombinants is times the fitness of the original strains, their number increases slightly until it reaches a stationary value determined by the recombination-selection balance and proportional to the ratio . The inferred recombination rate at equilibrium corresponds to .
Figure 2Foot-and-mouth disease virus (FMDV) recombination rates inferred from each sample. Samples are coloured according to their serotype and the corresponding rates are smoothed via a Gaussian kernel with a width of 200 bases. The grey intervals at the bottom illustrate the minimal recombination intervals (top) and the full set of recombining pairs of Single Nucleotide Polymorphisms (SNPs) (bottom).
Average recombination rates per site in each sample, normalized with respect to the highest average recombination rate. Background values represent an estimate of the spurious signal due to sequencing errors or multiple mutations.
| Sample | Recombination Rate | (“Background”) |
|---|---|---|
| SAT3/BOT-BUFF/13/70 | 0.1 | (0.008) |
| SAT1/UGA-BUFF/10/70 | 0.06 | (0) |
| SAT2/BOT-BUFF/2/69 | 0.011 | (0.002) |
| SAT2/BOT-BUFF/17/69 | 1 | (0.005) |
| SAT2/BOT-BUFF/107/72 | 0.018 | (0.001) |
Figure 3Phylogenetic signal of recombination along the nucleotide sequence of the polyprotein: Number of recombination breakpoints over a 200-bp window from RDP4 analysis of the real data (violet) and of simulated data (gold) obtained by adding 50 simulated recombination events per 200-bp window.
Figure 4Comparison of within-host and phylogenetic recombination signal: (Top) Location of intervals between minimal recombining SNP pairs (grey) and expected distribution of recombination events corrected for the SNP distribution (red) along the FMDV genome. The expected distribution is based on the number of phylogenetic breakpoints inferred by RDP4 over a 200-bp window (blue) corrected for the probability of detecting a recombination events depending on the genomic location. (Bottom) Same plot for all recombining SNP pairs and the corresponding expected distribution.