| Literature DB >> 28362878 |
Thibault Guinoiseau1, Alain Moreau1, Guillaume Hohnadel1, Nicole Ngo-Giang-Huong2, Celine Brulard3, Patrick Vourc'h3, Alain Goudeau1, Catherine Gaudy-Graffin1.
Abstract
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28362878 PMCID: PMC5376297 DOI: 10.1371/journal.pone.0174852
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Workflow of analysis on Galaxy platform.
Fig 2Overlapping primers for Sanger sequencing.
Primers for Sanger sequencing.
| Primer | Location | Sequence (5'-3') | Samples |
|---|---|---|---|
| P1bE1E2sens | 697–718 | All except c14, c16, c17 | |
| HCV1bextsens | 821–841 | All except c3, c14, c15, c16, c17, c37 | |
| HCV1bintsens | 843–873 | c6, c12, c20, c27, c37, c42 | |
| VDA8505sens | 1178–1196 | All except c14, c16, c17 | |
| seq12sens | 1290–1309 | c34, c36 | |
| HVR1extsens | 1290–1310 | c15 | |
| VDA8505antisens | 1435–1454 | All except c14, c16, c17 | |
| IICantisens | 1605–1620 | All except c14, c15, c16, c17, c37, c42 | |
| GY16N1550sens | 2247–2266 | All except c14, c16, c17, c37, c41 | |
| HCV1bintantisens | 2559–2584 | c12, c20, c27, c37, c42 | |
| P1bE1E2antisens | 2775–2794 | All except c14, c16, c17 |
* Location indicate nucleotide postion relative to the H77 genome
† Samples sequencing with each primer
Comparaison of clonality sample using two different sequencing technology.
| Sample | MiSeq sequencing | Sanger sequencing | Agreement |
|---|---|---|---|
| c1 | C | C | |
| c2 | A (0 | C | |
| c3 | NC (1; 1) | NC (1 | |
| c4 | NC (3; 0) | A (0; 3) | |
| c5 | NC (3; 2) | NC (4; 0) | |
| c6 | C | C | |
| c7 | A (0; 1) | C | |
| c8 | C | C | |
| c9 | C | C | |
| c10 | C | C | |
| c11 | C | C | |
| c12 | A (0; 3) | NC (2; 0) | |
| c13 | C | C | |
| c14 | NC (2; 0) | /// | |
| c15 | NC (2; 2) | A (0; 1) | |
| c16 | NC (1; 0) | /// | |
| c17 | NC (7; 0) | /// | |
| c18 | C | C | |
| c19 | C | C | |
| c20 | C | C | |
| c21 | C | C | |
| c22 | C | C | |
| c23 | C | C | |
| c24 | C | C | |
| c25 | C | C | |
| c26 | NC (1; 0) | NC (1; 0) | |
| c27 | C | C | |
| c28 | C | C | |
| c29 | C | C | |
| c30 | C | C | |
| c31 | NC (1; 0) | C | |
| c32 | C | C | |
| c33 | C | C | |
| c34 | C | C | |
| c35 | C | C | |
| c36 | C | C | |
| c37 | C | C | |
| c38 | NC (1; 0) | C | |
| c39 | C | C | |
| c40 | C | C | |
| c41 | NC (2; 0) | NC (2; 0) | |
| c42 | C | C |
* number of non clonal position with MiSeq sequencing analysis.
† number of ambiguous postion with MiSeq sequencing analysis.
‡ number of non clonal position with Sanger sequencing analysis.
§ number of ambiguous postion with Sanger sequencing analysis.
● MiSeq and Sanger are in agreement.
○ MiSeq and Sanger are in desagreement.
◊Failure of Sanger sequencing.
Comparaison of non clonal position with two sequencing techniques according to the percentage of MiSeq nucleotide variation.
| MiSeq sequencing | Sanger sequencing | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Sample | Location | Depth | A | C | G | T | % variation | Double peak | Primers |
| c41 | 1028 | 1759 | 1 | 819 | 0 | 878 | 48,3 | "+" | 3 |
| c5 | 1380 | 3085 | 1821 | 0 | 1264 | 0 | 41 | "+" | 3 |
| c5 | 916 | 1937 | 1147 | 0 | 790 | 0 | 40,8 | "+" | 3 |
| c5 | 2048 | 3202 | 0 | 1992 | 0 | 1210 | 37,8 | "+" | 2 |
| c3 | 1431 | 2523 | 910 | 0 | 1612 | 1 | 36,1 | "+" | 2 |
| c26 | 1973 | 1803 | 626 | 0 | 1177 | 0 | 34,7 | "+" | 2 |
| c4 | 1707 | 2745 | 2217 | 0 | 528 | 0 | 19,2 | "+/-" | 2 |
| c4 | 1670 | 2302 | 0 | 421 | 0 | 1881 | 18,3 | "+/-" | 2 |
| c15 | 905 | 1479 | 1237 | 0 | 0 | 242 | 16,4 | "+/-" | 2 |
| c4 | 2168 | 2630 | 400 | 0 | 2230 | 0 | 15,2 | "+/-" | 2 |
| c31 | 2379 | 2387 | 2069 | 0 | 318 | 0 | 13,3 | "-" | 2 |
| c38 | 2012 | 1571 | 0 | 1406 | 0 | 165 | 10,5 | "-" | 2 |
| c15 | 1615 | 1343 | 0 | 1236 | 0 | 107 | 8 | "-" | 1 |
| c41 | 2479 | 2952 | 0 | 2726 | 0 | 226 | 7,7 | "-" | 1 |
* Location indicate nucleotide postion relative to the H77 genome.
† % nucleotide variation at this postion after MiSeq analysis.
‡ Number of nucleotide read at this position according to the numbers of primers used.
"+" Double peak clearly visible on electrophoregram at this position.
"+/-" Double peak no clearly visible on electrophoregram at this position.
"-" No double peak on electrophoregram at this position.
Fig 3Examples of superimposed electropherograms.
(A) in agreement with NGS (sample c41; 48,3% nucleotide variation; double peak seen on electropherograms) (B) in disagreement with NGS (sample c38, 10,5% nucleotide variation; absence of double peak on electropherograms).
Comparaison of ambiguous position with two sequencing techniques according to the percentage of MiSeq nucleotide variation.
| MiSeq sequencing | Sanger sequencing | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Sample | Location | Depth | A | C | G | T | % variation | Double peak | Primers |
| c15 | 1008 | 1114 | 1032 | 0 | 82 | 0 | 7,4 | "-" | 1 |
| c12 | 2082 | 1736 | 0 | 0 | 80 | 1656 | 4,6 | "+" | 2 |
| c2 | 826 | 1615 | 0 | 68 | 0 | 1547 | 4,2 | "-" | 3 |
| c15 | 1322 | 1681 | 1617 | 0 | 64 | 0 | 3,8 | "-" | 1 |
| c12 | 1858 | 1723 | 0 | 62 | 0 | 1661 | 3,6 | "-" | 1 |
| c12 | 2291 | 2104 | 70 | 0 | 2034 | 0 | 3,3 | "+" | 1 |
| c5 | 1274 | 1856 | 0 | 1798 | 0 | 58 | 3,1 | "+" | 4 |
| c3 | 2191 | 3735 | 3636 | 0 | 99 | 0 | 2,7 | "-" | 2 |
| c5 | 1811 | 3209 | 3129 | 0 | 80 | 0 | 2,5 | "-" | 2 |
| c10 | 1603 | 2531 | 2472 | 0 | 59 | 0 | 2,3 | "-" | 2 |
| c2 | 2558 | 3174 | 72 | 0 | 3102 | 0 | 2,3 | "-" | 3 |
* Location indicate nucleotide postion relative to the H77 genome.
† % nucleotide variation at this postion after MiSeq analysis.
‡ Number of nucleotide read at this position according to the numbers of primers used.
"+" Double peak clearly visible on electrophoregram at this position.
"+/-" Double peak no clearly visible on electrophoregram at this position.
"-" No double peak on electrophoregram at this position.