| Literature DB >> 34756074 |
Soledad Delgado1, Celia Perales2,3,4, Carlos García-Crespo3,4, María Eugenia Soria2,3,4, Isabel Gallego3,4, Ana Isabel de Ávila3,4, Brenda Martínez-González2,4, Lucía Vázquez-Sirvent2,4, Cecilio López-Galíndez5, Federico Morán6, Esteban Domingo3,4.
Abstract
RNA viruses replicate as complex mutant spectra termed viral quasispecies. The frequency of each individual genome in a mutant spectrum depends on its rate of generation and its relative fitness in the replicating population ensemble. The advent of deep sequencing methodologies allows for the first-time quantification of haplotype abundances within mutant spectra. There is no information on the haplotype profile of the resident genomes and how the landscape evolves when a virus replicates in a controlled cell culture environment. Here, we report the construction of intramutant spectrum haplotype landscapes of three amplicons of the NS5A-NS5B coding region of hepatitis C virus (HCV). Two-dimensional (2D) neural networks were constructed for 44 related HCV populations derived from a common clonal ancestor that was passaged up to 210 times in human hepatoma Huh-7.5 cells in the absence of external selective pressures. The haplotype profiles consisted of an extended dense basal platform, from which a lower number of protruding higher peaks emerged. As HCV increased its adaptation to the cells, the number of haplotype peaks within each mutant spectrum expanded, and their distribution shifted in the 2D network. The results show that extensive HCV replication in a monotonous cell culture environment does not limit HCV exploration of sequence space through haplotype peak movements. The landscapes reflect dynamic variation in the intramutant spectrum haplotype profile and may serve as a reference to interpret the modifications produced by external selective pressures or to compare with the landscapes of mutant spectra in complex in vivo environments. IMPORTANCE The study provides for the first time the haplotype profile and its variation in the course of virus adaptation to a cell culture environment in the absence of external selective constraints. The deep sequencing-based self-organized maps document a two-layer haplotype distribution with an ample basal platform and a lower number of protruding peaks. The results suggest an inferred intramutant spectrum fitness landscape structure that offers potential benefits for virus resilience to mutational inputs.Entities:
Keywords: NS5A-NS5B amplicons; SOM network; fitness platform; genome diversification; haplotype frequency; prototype vector; viral quasispecies
Mesh:
Substances:
Year: 2021 PMID: 34756074 PMCID: PMC8579923 DOI: 10.1128/Spectrum.01459-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Experimental design and HCV amplicon analysis. (A) Schematic representation of the passages underwent by HCV p0 (derived from HCVcc (initial cell culture HCV population) [12]; Materials and Methods) in Huh-7.5 reporter cells. Populations are depicted as empty circles, and passage number is indicated by p (HCV p100, p3 means population HCV p100 subjected to three passages in Huh-7.5 cells). Experiment 1 (upper part) and experiment 2 (lower part) were performed starting with samples of the same HCV p0, HCV p100, and HCV p200 populations. In experiment 2, (a), (b), and (c) indicate triplicate passage series performed in parallel. A total of 44 HCV populations (corresponding to the empty circles) were analyzed by deep sequencing; Groups HCV p0, HCV p100, and HCV p200 (shaded in gray) gather the ensemble of populations more closely related to populations HCV p0, HCV p100, and HCV p200, respectively. The mutations (and deduced amino acid substitutions) identified in the populations from experiment 1 were reported in reference 18 and those in the populations from experiment 2 in reference 11. (B) HCV genomic residues 7,649 (NS5A-coding region) to 8,653 (NS5B-coding region) (genome numbering according to reference isolate JFH-1) and length in base pairs (bp) of amplicons A1, A2, and A3 analyzed by Illumina MiSeq sequencing. Note that the 21 most 3′-terminal nucleotides of A1 are redundant with the 21 most 5′-terminal nucleotides of A2 and that the 27 most 3′-terminal nucleotides of A2 are redundant with the most 5′-terminal nucleotides of A3. Further details on virus origin, GenBank accession numbers, and sequencing procedures are given in Materials and Methods.
Number of reads and haplotypes derived from MiSeq Illumina sequencing of HCV amplicons A1, A2, and A3
| No. of reads (no. of haplotypes | |||||
|---|---|---|---|---|---|
| Expt | Virus | Passage | Amplicon 1 (7,649 to 7,960) | Amplicon 2 (7,940 to 8,257) | Amplicon 3 (8,231 to 8,653) |
| Expt 1 | HCV p0 | Initial | 240,376 (8) | 189,190 (4) | 122,836 (3) |
| p3 | 243,755 (6) | 273,783 (2) | 119,705 (4) | ||
| HCV p100 | Initial | 225,977 (18) | 251,949 (10) | 108,548 (14) | |
| p3 | 201,355 (14) | 282,139 (10) | 87,596 (15) | ||
| p10 | 188,215 (8) | 197,078 (7) | 79,412 (9) | ||
| HCV p200 | Initial | 50,502 (15) | 166,060 (11) | 50,111 (11) | |
| p3 | 53,462 (13) | 160,179 (10) | 62,872 (12) | ||
| p10 | 57,730 (16) | 149,758 (12) | 51,910 (10) | ||
| Expt 2a | HCV p0 | p1 | 18,817 (4) | 45,378 (4) | 6,759 (3) |
| p2 | 25,866 (4) | 61,599 (3) | 9,693 (3) | ||
| p3 | 33,964 (5) | 112,247 (4) | 6,690 (4) | ||
| p4 | 46,180 (3) | 159,699 (3) | 6,695 (5) | ||
| HCV p100 | p1 | 32,729 (15) | 119,433 (5) | 6,698 (15) | |
| p2 | 34,670 (14) | 138,215 (5) | 8,040 (15) | ||
| p3 | 29,787 (16) | 135,219 (5) | 7,621 (17) | ||
| p4 | 35,007 (13) | 115,195 (5) | 7,101 (16) | ||
| HCV p200 | p1 | 24,630 (13) | 52,247 (15) | 8,862 (18) | |
| p2 | 31,553 (19) | 136,972 (13) | 8,917 (14) | ||
| p3 | 32,166 (20) | 102,391 (13) | 7,137 (15) | ||
| p4 | 54,605 (19) | 179,432 (15) | 13,687 (14) | ||
| Expt 2b | HCV p0 | p1 | 148,212 (9) | 149,731 (2) | 64,912 (2) |
| p2 | 195,318 (5) | 137,365 (3) | 74,195 (3) | ||
| p3 | 138,832 (6) | 120,688 (4) | 60,089 (4) | ||
| p4 | 122,525 (4) | 177,166 (5) | 83,941 (4) | ||
| HCV p100 | p1 | 128,168 (15) | 149,817 (8) | 52,351 (15) | |
| p2 | 116,120 (16) | 84,408 (6) | 75,494 (17) | ||
| p3 | 120,016 (17) | 117,116 (5) | 53,153 (17) | ||
| p4 | 117,114 (16) | 120,914 (6) | 85,711 (17) | ||
| HCV p200 | p1 | 155,347 (17) | 124,204 (15) | 53,455 (16) | |
| p2 | 152,250 (18) | 116,427 (14) | 62,190 (11) | ||
| p3 | 122,481 (12) | 95,088 (14) | 44,088 (13) | ||
| p4 | 85,906 (19) | 135,373 (14) | 73,119 (13) | ||
| Expt 2c | HCV p0 | p1 | 104,149 (8) | 90,216 (3) | 134,903 (2) |
| p2 | 209,396 (8) | 130,509 (3) | 54,018 (2) | ||
| p3 | 86,558 (5) | 69,588 (3) | 124,363 (3) | ||
| p4 | 155,428 (5) | 144,968 (6) | 44,764 (3) | ||
| HCV p100 | p1 | 73,080 (18) | 72,919 (9) | 123,388 (16) | |
| p2 | 186,831 (17) | 79,384 (8) | 55,436 (18) | ||
| p3 | 98,930 (18) | 75,155 (6) | 125,049 (18) | ||
| p4 | 137,944 (17) | 115,719 (7) | 61,177 (18) | ||
| HCV p200 | p1 | 85,384 (20) | 106,672 (16) | 117,945 (14) | |
| p2 | 141,307 (17) | 107,340 (16) | 75,365 (14) | ||
| p3 | 106,261 (21) | 138,379 (12) | 94,545 (15) | ||
| p4 | 141,033 (21) | 159,476 (17) | 58,184 (15) | ||
The experiments, HCV populations, and amplicon numbers are those described in Fig. 1. Mutations were counted relative to the HCV sequence encoded in plasmid Jc1FLAG2(p7-nsGluc2A), as previously described (11, 18). The total number of reads and haplotypes (in parenthesis) were derived as detailed in Materials and Methods.
The HCV genomic residues spanned by each amplicon are 7,649 to 7,960 (amplicon 1), 7,940 to 8,257 (amplicon 2), and 8,231 to 8,653 (amplicon 3) (numbering according to isolate JFH-1, accession number AB047639).
FIG 2SOM-derived haplotype maps and number of peaks distributed according to haplotype abundance. The 15 × 15 neuron grids are displayed for Group HCV p0, Group HCV p100, or Group HCV p200 (defined in Fig. 1A). The number of sequences, haplotypes, and peaks is indicated inside each panel. Peak height (z axis) is determined by haplotype abundance, quantified from UDS reads. Normalized peak heights have been visualized with a color code with a scale included at the right of each haplotype graph. The distribution of number of haplotype peaks (ordinate) versus peak height (sequence abundance in unit range given in abscissa) is displayed below the haplotype maps of each amplicon. The relationship is described by the following functions: (A) amplicon 1: Group HCV p0, y = 3.7934x−0.403 (R2 = 0.7672); Group HCV p100, y = 8.2657x−0.77 (R2 = 0.6463); Group HCV p200, y = 6.6996x−0.709 (R2 = 0.5974); amplicon 2: Group HCV p0, y = 3.1334x−0.281 (R2 = 0.3804); Group HCV p100, y = 3.4527x−0.395 (R2 = 0.3649); Group HCV p200, y = 7.0358x−0.638 (R2 = 0.5818); (B) amplicon 3: Group HCV p0, y = 2.5728x−0.233 (R2 = 0.3807); Group HCV p100, y = 7.453x−0.723 (R2 = 0.7755); Group HCV p200, y = 6.97x−0.629 (R2 = 0.5886). The origin of the sequences, derived haplotypes, and procedures are described in Materials and Methods.
FIG 3Distribution of haplotype peaks among HCV groups. Venn diagrams indicating for each amplicon the number of peaks unique to one HCV group and those shared by two or more HCV groups. Populations are color coded. Peak identity was determined according to data summarized in Table 1, Fig. 2, and in the supplemental material; statistical significances are given in Table S3 (https://saco.csic.es/index.php/s/sFQnRii4dC94LRN).
FIG 4Haplotype maps constructed with the fused NS5B amplicons. The HCV group and number of sequences and haplotypes used for the 25 × 25 neuron graphic and the resulting number of haplotype peaks are indicated inside each map. Peak height is determined by sequence abundance, which is color coded with a scale included at the right of each map. The distribution of number of peaks (ordinate) versus peak height (sequence abundance in unit range displayed in abscissa) is indicated in the plots on the right. The relationship is described by the following functions: Group HCV p0, y = 15.659x−1.001 (R2 = 0.6519); Group HCV p100, y = 93.588x2.441 (R2 = 0.9931); Group HCV p200, −94.031ln(x) + 108.16 (R2 = 0.9931). Procedures are described in Materials and Methods.