| Literature DB >> 30352621 |
Nikolay Kondratyev1, Arkady Golov2, Margarita Alfimova2, Tatiana Lezheiko2, Vera Golimbet2.
Abstract
BACKGROUND: Methylation of DNA is associated with a variety of biological processes. With whole-genome studies of DNA methylation, it became possible to determine a set of genomic sites where DNA methylation is associated with a specific phenotype. A method is needed that allows detailed follow-up studies of the sites, including taking into account genetic information. Bisulfite PCR is a natural choice for this kind of task, but multiplexing is one of the most important problems impeding its implementation. To address this task, we took advantage of a recently published method based on Pacbio sequencing of long bisulfite PCR products (single-molecule real-time bisulfite sequencing, SMRT-BS) and tested the validity of the improved methodology with a smoking phenotype.Entities:
Keywords: Allele-specific methylation; Clinical sequencing; DNA methylation; Schizophrenia; Single-molecule sequencing; Smoking; Targeted sequencing
Mesh:
Substances:
Year: 2018 PMID: 30352621 PMCID: PMC6199807 DOI: 10.1186/s13148-018-0565-1
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Amplicons utilised in the study
| Reference (index) CpG, Illumina ID | The closest gene to the reference CpG | Genome coordinate (hg19) of the amplicon | DNA strand of the amplicon | Length of the amplicon, bp | Amount of CpGs in the amplicon | Reference |
|---|---|---|---|---|---|---|
| cg05575921 |
| chr5:372478-373819 | + | 1342 | 44 | [ |
| cg21566642 |
| chr2:233283630-233284930 | − | 1301 | 114 | [ |
| cg06126421 |
| chr6:30719327-30720645 | + | 1319 | 19 | [ |
| cg25189904 |
| chr1:68298855-68300158 | − | 1304 | 85 | [ |
| cg09935388 |
| chr1:92947265-92948622 | + | 1358 | 73 | [ |
| cg15417641 |
| chr3:53699512-53700811 | − | 1300 | 17 | [ |
Fig. 1“Panhandle” SMRT-BS. a Computer simulation of primer performance in conventional and bisulfite PCR. The left panel shows the ability of primers to form primer-dimers with P1 = PAIR_ANY_TH and P2 = SELF_ANY_TH parameters of the primer3 output. The right panel demonstrates a tendency of the non-specific annealing by the log10 number of BLAST hits at 3′-ends of the primers. Further details are available in Additional file 1: Supplementary Note 1. b Photography of agarose gel electrophoresis of different long bisulfite PCR products. The lanes are numbered as follows: 1–4 —conventional bisulfite PCR (with specific primers without the “panhandle”), 5–9—“panhandle” bisulfite PCR with the “panhandle” U1 primer as described in the “Methods” section; 1–8 with bisulfite-converted DNA, 9—without the template; 1, 5—without specific primers; 2, 6—the AHRR target; 3, 7—the ALPPL2 target; 4, 8—six-target multiplex PCR. M—DNA ladder marker. c Scheme of the “panhandle” SMRT-BS approach
Fig. 2Methylation and ASM effects in the AHRR amplicon. Top panel. Mean methylation signal is shown in red (smokers) and blue (non-smokers) curves. Shaded areas of respective colour represent standard error. The symbols above the curves signify the reference CpG cg05575921 (star symbol) or important CpGs, selected by the Boruta algorithm (circles). Empty circles relate to the “boruta” model and black to the “boruta.adjusted” model. The size of a circle corresponds to the Boruta importance parameter. Bottom panel. Negative log10-transformed p-levels of “two-tailed” t test of different covariates (Benjamini-Hochberg adjusted) for the individual CpGs are portrayed. The p-levels are shown on the same genomic scale as in the top panel. Vertical dotted red lines on both panels indicate the location of the CpG-SNPs
Fig. 3Selected ASM effects in the AHRR and IER3 target regions. Boxplots for methylation level are shown, boxplot whiskers represent the 25th and 75th percentiles. On the left are the reference CpGs, and on the right are the most important CpGs, selected by the “boruta.adjusted” model. The stars above the brackets denote significant ASM effect (p < 0.05, “two-tailed” t test)
Fig. 4ROC plot for smoking prediction. The ROC curves for each of model based on combined five targets (“index”, “boruta” and “boruta.adjusted”) with true positive rate (TPR) plotted against false positive rate (FPR)