Literature DB >> 28746207

Integrating nested PCR with high-throughput sequencing to characterize mutations of HBV genome in low viral load samples.

Xianjun Wang1, Lihui Xu, Yueming Chen, Anbing Liu, Liqian Wang, Peisong Xu, Yunhui Liu, Lei Li, Fei Meng.   

Abstract

Due to the low viral load of hepatitis B virus (HBV) in plasma samples, conventional techniques have limitations to the detection of antiviral resistance mutations. To solve the problem, we developed a fast, highly sensitive, and accurate method to sequence the HBV whole-genome sequencing in plasma samples which had various viral loads from very low to high.Twenty-one plasma samples were collected from patients who were carriers of HBV from the Hangzhou First People's Hospital. Two pairs of conserved, overlapping, nested primers were used to amplify and sequence the whole HBV genome in 8 plasma samples with different viral loads. High-throughput sequencing was performed on Illumina MiSeq platform. Concomitantly, 3 samples were directly sequenced without PCR amplification. We compared amplicon-sequencing with direct sequencing to develop a method for amplifying and characterizing the whole genome of HBV.HBV genome was amplified from all samples and verified by Sanger sequencing, regardless of the viral loads. Sequencing results revealed that only a few reads were mapped to the HBV genome following direct sequencing, while the amplicon-sequencing reads had a good coverage and depth. We identified 50 intrahost single nucleotide variations (iSNVs), 14 of which were low frequency mutations. Interestingly, iSNVs were more common in low viral load samples than in high viral load samples, and mutations in the reverse transcriptase (RT) region were most prevalent.We conclude that amplicon-sequencing is not only a practical method to detect HBV infection with a high sensitivity and accuracy but also enables to detect mutations in the HBV genome in low viral load samples from HBV-infected patients. Thus, our findings provide a new diagnosis method of HBV infection, which is capable of detection of low frequent mutations in low viral load samples.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28746207      PMCID: PMC5627833          DOI: 10.1097/MD.0000000000007588

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.889


Introduction

Hepatitis B virus (HBV) infection is a severe global health problem, and there is a high prevalence of HBV infection in China with approximately 93 million HBV carriers, which results in approximately 330,000 deaths annually.[ Early and timely diagnosis of HBV infection is of great importance in the treatment of HBV infections. HBV infection consists of viral attachment, entry and release of the nucleocapsid into the cytoplasm, nucleus import of the viral genome, followed by genome replication in the nucleus.[ Progression of liver diseases caused by HBV infection is fostered by HBV DNA replication.[ HBV genome is a partially double-stranded DNA, consisting of 4 overlapping open reading frames encoding viral polymerase (P), surface antigen (S), core (C), and X protein.[ Due to the lack of polymerase proofreading activity of the viral polymerase and a high replication rate, HBV genome replication has a higher mutation rate than other DNA viruses. Many studies have reported that mutations in the HBV genome are associated with drug resistance, liver disease progression, and immune escape.[ As a result, sequencing of the whole HBV genome is critical for understanding the correlation between mutations and liver disease progression. To date, several conventional techniques have been used to sequence viral genomes and detect the mutations, such as DNA microarray, Sanger sequencing, and qPCR. However, none of these can fully satisfy the requirements for genetic diagnosis in terms of their specificity, sensitivity, or accuracy. This is in particular important when these methods are applied to low viral load samples and to detection of low frequent mutations. The mutation frequencies of Sanger sequencing and qPCR are >20% and >10% to 15%, respectively.[ In recent years, high-throughput sequencing (HTS) has proven to be a powerful tool in diagnosis of virus infections. HTS has been used to sequence the complete HBV genome for identification of virulence factors, drug resistance genes and characterization of single nucleotide variations (SNVs) of the HBV genome.[ Additionally, previous studies have demonstrated that HTS is not sensitive enough to detect low levels of viral genome presented in individual plasma samples.[ Increasing the copy numbers of the viral genome using PCR is thus a key step in sample preparation. However, it is very difficult to amplify the complete HBV genome, because the HBV genome involves a partially double-stranded DNA molecule with 2 breaks. In 1995, Gunther et al[ proposed a 1-step PCR method with a single pair of primers at the termini of the breaks to amplify the HBV genome; however, it is not sensitive enough to detect low viral load of HBV in plasma samples. Subsequently, a highly sensitive amplification method, nested PCR, was proposed in a follow up study, especially for low viral load samples.[ In this study, we successfully integrated nested PCR using 2 pairs of conserved, overlapping nested primers with HTS to develop a fast, highly sensitive, and accurate method for whole-genome sequencing of HBV in plasma samples with various viral loads.

Materials and methods

Patients and samples

We collected 21 plasma samples from patients who were HBV carriers in the First People's Hospital of Hangzhou from March 2015 to December 2015. The hospital ethics committee agreed to conduct the study and the ethical number was No. 051-01. All samples were conducted with conventional detection of HBV infection during preparation of plasma, and the concentration of HBV was initially determined using ELISA (Table 1).
Table 1

The results of clinical information and quantification of HBV DNA or viral titer.

The results of clinical information and quantification of HBV DNA or viral titer.

DNA extraction and quantification

HBV DNA was extracted from 200 μL of plasma sample using the PureLink Viral RNA/DNA Mini Kit (Invitrogen, Carlsbad, CA), according to the manufacturer's instructions (https://tools.thermofisher.com/content/sfs/manuals/purelink_viral_rna_dna_man.pdf) with modification of using 25 μL of sterile water to elute DNA. DNA concentration was quantified using the Qubit dsDNA HS Assay Kit (Invitrogen) (https://tools.thermofisher.com/content/sfs/manuals/Qubit_dsDNA_HS_Assay_UG.pdf). HBV viral load was determined by the Diagnostic Kit for Quantification of Hepatitis B Virus DNA (Puruikang, Beijing, China) using ABI7500 Fast Real-Time PCR system (Applied Biosystems, Foster City, CA), according to the manufacturer's instructions.

Primers and amplification

The complete HBV genome was amplified using nested primers (Table 2) and validated with Sanger sequencing using specific primers.[ The HBV genome was divided into 2 fragments (Fig. 1) and both rounds of nested PCR were performed in 25 μL reaction system. The reaction contained 2 μL of template DNA, 0.5 μL of each 10 μM primer, 2.5 μL of 10× LA Taq Buffer (MgCl2-free), 4 μL of dNTP mixture (2.5 mM each), 2.5 μL of 25 mM MgCl2, and 0.25 μL of TaKaRa LA Taq (5 units/μL). Thermal cycle parameters included predenaturing at 94°C for 5 minutes, followed by 35 cycles (first round of PCR) or 20 cycles (second round of PCR) of denaturation at 94°C for 30 seconds, annealing at 58°C for 30 seconds, extension at 72°C for 3 minutes, and finally extension for 10 minutes at 72°C. PCR products were analyzed in 1.5% agarose gel electrophoresis and cleaned up with Qiagen PCR Purification Kit (Qiagen, Valencia, CA). To validate the sequences of HBV genome, Sanger sequencing was performed with the ABI 3730XL DNA Analyzer (Applied Biosystems) using BigDye Terminators V3.1 according to the manufacturer's instructions.
Table 2

The HBV-specific primers were used for amplifying and sequencing in 2 fragments.

Figure 1

The structure of HBV genome and nested PCR. The blue arrow indicates the first round of amplification and the red arrow indicates the second round of amplification. The primers are depicted along with the HBV genome.

The HBV-specific primers were used for amplifying and sequencing in 2 fragments. The structure of HBV genome and nested PCR. The blue arrow indicates the first round of amplification and the red arrow indicates the second round of amplification. The primers are depicted along with the HBV genome.

Library construction and HTS

Nested PCR products were purified, and the concentrations were determined using the Qubit dsDNA HS Assay Kit (Invitrogen). Two nested PCR fragments from the same sample were mixed with equimolar quantities. Mixed PCR products were used to construct libraries using the Nextera XT DNA Sample Prep Kit (Illumina, San Diego, CA), according to the manufacturer's instructions. The quality of the libraries was determined using the Agilent 2100 Bioanalyzer (Agilent Technologies, Palo Alto, CA) with a high-sensitivity DNA chip. Libraries were subjected to HTS on the Illumina MiSeq (Illumina) platform to generate 2 × 250-bp paired-end reads. To evaluate the efficiency of amplicon-sequencing, 3 different viral loads (high, mediate, and low) of HBV plasma samples were directly sequenced without PCR[ on the Illumina MiSeq platform. In order to eliminate biases in sequencing to the maximum, we also used Nextera XT DNA Sample Prep Kit to construct libraries for direct sequencing.

Data analysis

FASTQ-format raw reads were generated on the Illumina MiSeq sequencer, and a quality control was processed simultaneously. To reduce the false discovery rate of mutations, we trimmed the first 10 bases and the last 30 bases from every >Q20 read. The remaining properly paired reads were mapped against the HBV B-type reference genome (AB981583) using Bowtie2.[ Intrahost single nucleotide variations (iSNVs) were called using the default parameters in the mapping algorithm.

Results

HBV detection and selection of sequencing samples

Primary HBV concentrations of the 21 plasma samples were determined by ELISA. Subsequently, viral genomic DNA was extracted, and the genome copy number was quantified using Qubit and qPCR. The concentrations and viral loads (genome copies) ranged from 0.366 ng/μL to 4.6 ng/μL and 103 to 107 (Table 1), respectively. Given the bias of HTS, we selected 8 samples of various viral loads, according to the results of ELISA and qPCR. Three low viral load samples (H1, H2, H10), 3 moderate viral load samples (H20, H21, H22), and 2 high viral load samples (H11, H14) were selected.

Nested PCR and Sanger sequencing

LA Taq (Takara, Kyoto, Japan) and Phusion High-Fidelity PCR Master Mix with HF Buffer (New England BioLabs, Allschwil, Switzerland) were used to amplify HBV genome. The whole HBV genome was successfully amplified with 2 pairs of nested primers using LA Taq in the aforementioned 8 plasma samples, while the efficiency of the Phusion polymerase was lower than that of the LA Taq (Fig. 2). After 2 rounds PCR products were amplified at 2092 and 1320 bp, respectively. The results of the Sanger sequencing confirmed that the sequences of the PCR products belonged to HBV. Thus, these results suggest that the nested PCR is an appropriate method to obtain HBV whole-genome sequence, regardless of the initial HBV loads.
Figure 2

The results of HBV genomic amplification. (A) The results of first amplification. (B) The results of second amplification. M: marker (2000 bp).

The results of HBV genomic amplification. (A) The results of first amplification. (B) The results of second amplification. M: marker (2000 bp).

Overview of HBV amplicon-sequencing

HTS resulted in an average of 5092 clean reads (range from 2362 to 6792, Table 3) per sample after the low quality reads were removed. The clean paired-end reads were aligned to the reference sequence of the HBV genome, and the alignment rate approached 80% except for samples H1 and H10. The degree of coverage and depth for each sample were high and similar in magnitude (Fig. 3). However, positions 1823 to 1841 had an insufficient depth, which was caused by the gap in the HBV genome structure. Additionally, positions of 500 to 750 also had a lower depth because this DNA segment represents the overlap between adjacent amplicons, but this was determined to be inevitable and acceptable. Therefore, the nested PCR products obtained from different viral load samples were suitable for HTS based on account of its uniformity and unbias.
Table 3

The results of amplicon-sequencing for each sample.

Figure 3

The degree of coverage and depth for all samples. The value of horizontal axis indicates the position of HBV genome. The value of vertical axis indicates the site depth of HBV genome by HTS.

The results of amplicon-sequencing for each sample. The degree of coverage and depth for all samples. The value of horizontal axis indicates the position of HBV genome. The value of vertical axis indicates the site depth of HBV genome by HTS.

HTS without PCR

Sample H1 (low viral load sample), H20 (mediate viral load sample), and H11 (high viral load sample) were directly sequenced without amplification using nested PCR. After trimming the low quality reads, a mean of 1.77 million clean reads (H1: 1.6 M reads, H20: 2.0 M reads, H11: 1.7 M reads) were used for further bioinformatics analysis. Human genomes were wiped out from clean reads and the rest of reads were mapped to the database of the viral genome. Unexpectedly, the number of reads mapped to HBV genome was very low, with only 10 and 4 reads mapped in the high viral load sample H11 and the mediate viral load sample H20, respectively. Furthermore, for the low viral load sample H1, there were no reads that could be mapped to the HBV genome.

Analysis of iSNVs

Based on the results of the alignment with Bowtie2, iSNVs were identified with SAMTOOLS[ (version 1.1). As shown in Table 3, 6 of the 8 samples contained iSNVs, with the number per sample ranging from 2 to 14. The sample H14 and H20 were failed to identify iSNVs. The average depth of mutation was 44.7 (range 18–92) (Table 4). To further characterize the HBV mutations, we annotated a total of 50 iSNVs. The lowest frequency of mutation was 0.02 and the highest frequency was 0.75. In some cases, the reference sequence mutated to 2 or more different bases, though this occurred in a relatively low frequency.
Table 4

Overview of the iSNV detected in 6 samples, including the frequency and depth of mutations, and the alteration of amino acid position.

Overview of the iSNV detected in 6 samples, including the frequency and depth of mutations, and the alteration of amino acid position.

Discussion

Although strategies used to detect HBV infection have been improved with new approaches such as direct sequencing and qPCR,[ it remains a critical issue to detect HBV genome in serum or plasma samples that contain low viral loads. To this end, we developed a method, integrating 2-stage nested PCR with HTS to overcome this difficulty. HBV genome copies were determined using qPCR and nested PCR products were extracted from agarose gel of electrophoresis and subjected to Sanger sequencing. In this study, to sequence the whole genome of HBV in samples with various virus loads, we designed 2 strategies, including 1-step PCR using a single pair of primers[ and nested PCR using 2 pairs of nested primers. Unfortunately, using a single pair of primers resulted in a high failure rate, especially in the low viral load sample. In our preliminary experiments, HBV genome was failed to amplify in 6 out of 7 plasma samples with low viral load HBV (<103) using a single pair of primers. Surprisingly, regardless of the viral loads, the HBV genome was successfully amplified in all the samples using 2 pairs of nested primers. In agreement with other studies,[ although amplification of the whole HBV genome using a single pair of primers is convenient and cost-effective, 2 pairs of nested primers turned out to be an effective approach to amplify the whole genome of HBV, especially in low viral load sample. In other studies, however, the HBV genome was divided into 4 or 6 fragments for amplification.[ Comparison of 2 pairs of nested PCR with 4- or 6-pair PCR, we found that the HBV whole genome could be amplified by all these methods. However, the latter approaches are in convenient and cost-ineffective, and may introduce more false-positive mutations due to the use of more primers. In comparison with the results of amplicon-sequencing and direct sequencing without PCR, we found that amplicon-sequencing offered an enormous advantage in terms of the amount of required data. Direct sequencing resulted in 1.76 million of clean reads but with only a few reads mapped to the HBV genome. In contrast, approximately 80% of amplicon sequences are mapped to the HBV genome, requiring fewer sequences to obtain depth coverage. We speculated that the samples may be contaminated with the human genome DNA in the step of nucleic acid extraction or hemolysis during blood sample preparation. Of importance, although the viral loads of HBV were significantly different in the 8 plasma samples, there was little difference in the numbers of reads obtained nested PCR sequencing. We believe that although the reads that were generated from the nested PCR did not map to the HBV genome at a very high rate, nested PCR appears a remarkable improvement and an acceptable mapping rate, compared with the direct sequencing.[ Furthermore, the number of effective reads is likely enhanced by increasing the loading quantity during nested PCR sequencing. Therefore, we conclude that integrating nested PCR with HTS is a more appropriate method for sequencing of the HBV whole genome in low viral load samples than direct sequencing without PCR. During antiviral therapy with nucleoside analogues (NAs), the emergence of genomic variation is a critical issue. Although it remains unclear whether genomic variation influences the efficiency of antiviral therapy, reports have suggested that viral genomic alterations contribute to the progression of disease status.[ In our study, a total of 50 iSNVs were identified in 6 samples and 21 out of these 50 iSNVs which occurred in the reverse transcriptase (RT)-coding region. We did not identify any known mutations associated with drug resistance to antiviral therapy with NAs in the present study. Nevertheless, mutations in the RT region should be considered when administering NAs. Furthermore, 14 iSNVs were identified with a mutation frequency <20%, indicating that low frequency mutations were widely prevalent in patients. Notably, 30 iSNVs were found in these low viral load samples, whereas only a few mutations were found in high viral load samples. Importantly, mutations in the RT region appeared in all low viral load samples, whereas there were few mutations in this region among high viral load samples. Previous studies have been reported that the isolates with low HBV loads (<105 copies/mL) were more frequently mutated in core protein (HBc) than the isolates with higher HBV loads (>105 copies/mL).[ Furthermore, similar results were found in HIV-1 infections that patients with low viral load (<1000 copies/mL) showed an increased resistance mutation.[ Our findings indicate that the genome mutations are more prevalent in the low viral load samples than the high viral load samples, and that mutations in the RT region may play an enigmatic role during the treatment of HBV infection. Indeed, because the source of the samples was collected from the Southern China, where the HBV genotype is mainly B-type, we did not analyze the results for different HBV genotypes. In fact, we aimed to improve the low frequency mutation detection rate of resistant gene in samples of low HBV loads. These findings should be validated by additional studies, and we plan to collect a large number of low HBV load samples in the future to further investigate the effect of low frequency mutations on the treatment of patients with HBV infection. In conclusion, this paper presents a convenient and cost-effective method to reveal the full landscape of HBV genomes and mutations, which provides an effective way to amplify and sequence the whole HBV genome, especially in samples with low viral load or in low quality samples (e.g., those ones contaminated with human genome), which are usually not appropriate for constructing libraries and direct HTS without PCR amplification. Moreover, our findings suggest that low frequency mutations of the HBV genome should not be neglected, especially among low viral load samples.
  31 in total

Review 1.  Naturally occurring variants of hepatitis B virus.

Authors:  S Günther; L Fischer; I Pult; M Sterneck; H Will
Journal:  Adv Virus Res       Date:  1999       Impact factor: 9.937

2.  Digital PCR provides absolute quantitation of viral load for an occult RNA virus.

Authors:  Richard Allen White; Stephen R Quake; Kenneth Curr
Journal:  J Virol Methods       Date:  2011-09-29       Impact factor: 2.014

3.  A sensitive direct sequencing assay based on nested PCR for the detection of HBV polymerase and surface glycoprotein mutations.

Authors:  Donatella Vincenti; Mariacarmela Solmone; Anna Rosa Garbuglia; Fabio Iacomi; Maria Rosaria Capobianchi
Journal:  J Virol Methods       Date:  2009-03-09       Impact factor: 2.014

Review 4.  Next-generation sequencing technologies in diagnostic virology.

Authors:  Luisa Barzon; Enrico Lavezzo; Giulia Costanzi; Elisa Franchin; Stefano Toppo; Giorgio Palù
Journal:  J Clin Virol       Date:  2013-03-21       Impact factor: 3.168

5.  Biological impact of natural COOH-terminal deletions of hepatitis B virus X protein in hepatocellular carcinoma tissues.

Authors:  H Tu; C Bonura; C Giannini; H Mouly; P Soussan; M Kew; P Paterlini-Bréchot; C Bréchot; D Kremsdorf
Journal:  Cancer Res       Date:  2001-11-01       Impact factor: 12.701

6.  Whole genome characterization of hepatitis B virus quasispecies with massively parallel pyrosequencing.

Authors:  F Li; D Zhang; Y Li; D Jiang; S Luo; N Du; W Chen; L Deng; C Zeng
Journal:  Clin Microbiol Infect       Date:  2014-10-29       Impact factor: 8.067

7.  High association of T1858-G1896 precore mutations with impaired base pairing and high hepatitis B virus DNA levels in HBeAg-negative chronically infected patients.

Authors:  Sandrine Castelain; Véronique Descamps; Etienne Brochot; François Helle; Gilles Duverlie; Eric Nguyen-Khac; Catherine François
Journal:  Arch Virol       Date:  2017-03-13       Impact factor: 2.574

8.  Genome replication, virion secretion, and e antigen expression of naturally occurring hepatitis B virus core promoter mutants.

Authors:  Sameer Parekh; Fabien Zoulim; Sang Hoon Ahn; Adrienne Tsai; Jisu Li; Shigenobu Kawai; Nasser Khan; Christian Trépo; Jack Wands; Shuping Tong
Journal:  J Virol       Date:  2003-06       Impact factor: 5.103

9.  An epidemiological serosurvey of hepatitis B virus shows evidence of declining prevalence due to hepatitis B vaccination in central China.

Authors:  Guo Yonghao; Xu Jin; Li Jun; Dong Pumei; Ye Ying; Feng Xiuhong; Zhang Yanyang; Guo Wanshen
Journal:  Int J Infect Dis       Date:  2015-10-09       Impact factor: 3.623

10.  HBV life cycle: entry and morphogenesis.

Authors:  Stephanie Schädler; Eberhard Hildt
Journal:  Viruses       Date:  2009-09-01       Impact factor: 5.048

View more
  1 in total

1.  Development of a reverse transcription-loop-mediated isothermal amplification (RT-LAMP) assay for the detection of porcine pegivirus.

Authors:  Hao Li; Kai Li; Zhen Bi; Jun Gu; Deping Song; Dan Lei; Suoxian Luo; Dongyan Huang; Qiong Wu; Zhen Ding; Leyi Wang; Yu Ye; Yuxin Tang
Journal:  J Virol Methods       Date:  2019-04-22       Impact factor: 2.014

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.