| Literature DB >> 24917876 |
Chang Sun1, Thomas McAndrew1, Benjamin C Smith1, Zigui Chen1, Marina Frimer2, Robert D Burk3.
Abstract
Invasive cervix cancer (ICC) is the third most common malignant tumor in women and human papillomavirus 16 (HPV16) causes more than 50% of ICC. DNA methylation is a covalent modification predominantly occurring at CpG dinucleotides and increased methylation across the HPV16 genome is strongly associated with ICC development. Next generation (Next Gen) sequencing has been proposed as a novel approach to determine DNA methylation. However, utilization of this method to survey CpG methylation in the HPV16 genome is not well described. Moreover, it provides additional information on methylation "haplotypes." In the current study, we chose 12 random samples, amplified multiple segments in the HPV16 bisulfite treated genome with specific barcodes, inspected the methylation ratio at 31 CpG sites for all samples using Illumina sequencing, and compared the results with quantitative pyrosequencing. Most of the CpG sites were highly consistent between the two approaches (overall correlation, r = 0.92), thus verifying that Next Gen sequencing is an accurate and convenient method to survey HPV16 methylation and thus can be used in clinical samples for risk assessment. Moreover, the CpG methylation patterns (methylation haplotypes) in single molecules identified an excess of complete-and non-methylated molecules and a substantial amount of partial-methylated ones, thus indicating a complex dynamic for the mechanisms of HPV16 CpG methylation. In summary, the advantages of Next Gen sequencing compared to pyrosequencing for HPV genome methylation analyses include higher throughput, increased resolution, and improved efficiency of time and resources.Entities:
Keywords: CpG methylation; human papillomavirus; methylation; methylation haplotypes; next generation sequencing
Year: 2014 PMID: 24917876 PMCID: PMC4042685 DOI: 10.3389/fgene.2014.00150
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Next Gen sequencing assays of bisulfite treated HPV16 DNA in clinical samples.
| L1_1 | 4 | 5602, 5608, 5611, 5617 | 114 |
| L1_2 | 4 | 7034, 7091, 7136, 7145 | 172 |
| L1_7 | 2 | 6650, 6581 | 167 |
| L2_1 | 5 | 4240, 4249, 4261, 4270, 4277 | 130 |
| L2_2 | 3 | 4427, 4437, 4441 | 89 |
| L2_4 | 3 | 5128, 5173, 5179 | 123 |
| L2_5 | 1 | 5378 | 166 |
| E2_1 | 5 | 3412, 3415, 3417, 3433, 3436 ( | 169 |
( ), These sites were present in the NGS data but not PSQ.
Description of barcoded primers.
| 16E2_1F | ACT | GCAG | TTAGGTAGTATTTGGTTAATTATTT |
| 16E2_1R | ACT | GCAG | ATTAAAACACTATCCACTAAATCTCTATAC |
| 16L1_1F | TAC | GTAC | TAATATATAATTATTGTTGATGTAGGTGAT |
| 16L1_1R | TAC | GTAC | AACAACCAAAAAAACATCTAAAAAA |
| 16L1_2F | ACT | GACG | TTTGTAGATTTAGATTAGTTTTTTTTAGGA |
| 16L1_2R | ACT | GACG | TTCAACATACATACAATACTTACAACTTAC |
| 16L1_7F | TAC | GATG | ATGTAGTTTTTGAAGTAGATATGGTAGTA |
| 16L1_7R | TAC | GATG | AATTACCTCTAATACCCAAATATTCAA |
| 16L2_1F | ATC | GACG | TTTTTGTTTGTTTGTTTGTTTTT |
| 16L2_1R | ATC | GACG | ACATATACCTACCTATTTACATATTTTATA |
| 16L2_2F | ATC | GACG | TATGGAAGTATGGGTGTATTTTT |
| 16L2_2R | ATC | GACG | ATTCCCAATAAAATATACCCAATAC |
| 16L2_4F | ATC | GTAC | TTTTGGATATAGTTGTTTTATATAGGTTAG |
| 16L2_4R | ATC | GTAC | CCTTAACACCTATAAATTTTCCACTAC |
| 16L2_5F | ATC | GTCA | TTGTAGAAGAAATAGAATTATAAATTATAA |
| 16L2_5R | ATC | GTCA | AAAAATATAAAAAATACAAATAATACC |
Each primer consisted of 5′ to 3′: 3 bp (LP) – 8 bp Barcode – 4 bp (RP) – Primer Target Sequence.
Figure 1The summary correlation of 27 CpG methylation sites between pyrosequencing and Next Gen sequencing. The x-axis is percent methylation by pyrosequencing and the y-axis is percent methylation by Next Gen sequencing. Each CpG site is plotted for each subject.
Figure 2The correlation of 27 individual CpG methylation sites between pyrosequencing and Next Gen sequencing. Each plot displays one CpG site and the location is indicated at the top. The x-axis indicates Illumina sequencing, the y-axis pyrosequencing for percent methylation.
Figure 3The distribution of the differences between observed and expected frequencies for each “methylation haplotype” in assay L1_1 (A), L1_7 (B), L2_1 (C), L2_2 (D), L2_4 (E), and E2_1 (F). Each boxplot indicate one “methylation haplotype” combination. The green and red bars denote positive and negative value, respectively. (A) “Methylation haplotype” frequency differences in assay L1_1. (B) “Methylation haplotype” frequency differences in assay L1_7. (C) “Methylation haplotype” frequency differences in assay L2_1. (D) “Methylation haplotype” frequency differences in assay L2_2. (E) “Methylation haplotype” frequency differences in assay L2_4. (F) “Methylation haplotype” frequency differences in assay E2_1.