Literature DB >> 36033501

Design and experimental validation of OPERA_MET-A panel for deep methylation analysis by next generation sequencing.

Federico Pio Fabrizio1, Stefano Castellana2, Flavia Centra1, Angelo Sparaneo1, Mario Mastroianno3, Tommaso Mazza2, Michelina Coco1, Domenico Trombetta1, Nicola Cingolani4, Antonella Centonza5, Paolo Graziano4, Evaristo Maiello5, Vito Michele Fazio1,6,7, Lucia Anna Muscarella1.   

Abstract

DNA methylation is the most recognized epigenetic mark that leads to a massive distortion in cancer cells. It has been observed that a large number of DNA aberrant methylation events occur simultaneously in a group of genes, thus providing a growth advantage to the cell in promoting cell differentiation and neoplastic transformation. Due to this reason, methylation profiles have been suggested as promising cancer biomarkers. Here, we designed and performed a first step of validation of a novel targeted next generation sequencing (NGS) panel for methylation analysis, which can simultaneously evaluate the methylation levels at CpG sites of multiple cancer-related genes. The OPERA_MET-A methylation panel was designed using the Ion AmpliSeq™ technology to amplify 155 regions with 125-175 bp mean length and covers a total of 1107 CpGs of 18 cancer-related genes. The performance of the panel was assessed by running commercially available fully methylated and unmethylated control human genomic DNA (gDNA) samples and a variable mixture of them. The libraries were run on Ion Torrent platform and the sequencing output was analyzed using the "methylation_analysis" plugin. DNA methylation calls on both Watson (W) and Crick (C) strands and methylated:unmethylated ratio for each CpG site were obtained. Cell lines, fresh frozen and formalin-fixed paraffin-embedded (FFPE) lung cancer tissues were tested. The OPERA_MET-A panel allows to run a minimum of 6 samples/530 chip to reach an observed mean target depth ≥2,500X (W and C strands) and an average number of mapped reads >750,000/sample. The conversion efficiency, determined by spiking-in unmethylated Lambda DNA into each sample before the bisulfite conversion process, was >97% for all samples. The observed percentage of global methylation for all CpGs was >95% and <5% for fully methylated and unmethylated gDNA samples, respectively, and the observed results for the variable mixtures were in agreement with what was expected. Methylation-specific NGS analysis represents a feasible method for a fast and multiplexed screening of cancer patients by a high-throughput approach. Moreover, it offers the opportunity to construct a more robust algorithm for disease prediction in cancer patients having a low quantity of biological material available.
Copyright © 2022 Fabrizio, Castellana, Centra, Sparaneo, Mastroianno, Mazza, Coco, Trombetta, Cingolani, Centonza, Graziano, Maiello, Fazio and Muscarella.

Entities:  

Keywords:  biomarker; cancer; driver gene; methylation; next generation sequencing

Year:  2022        PMID: 36033501      PMCID: PMC9404304          DOI: 10.3389/fonc.2022.968804

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   5.738


1 Introduction

DNA methylation is one of the most largely investigated epigenetic footprints, due to its link with several diseases as well as cancers and autoimmune or genetic disorders (1, 2). Of particular interest is the role of DNA methylation at cytosine residues by the addition of a methyl group (5-mC), most frequently at the CpG (cytosine-phosphate-guanine) dinucleotide motif of mammalian genomes. The fluctuation of 5-mC level is generally associated with variation in the genes expression levels and contributes in many cases to the definition of clinical phenotypes, as well as representing in some clinical contexts a useful diagnostic biomarker in guiding therapeutical choices (3, 4). In tumors, the epigenome alteration is linked to the neoplastic transformation, cancer progression and invasion, and encouraged a large number of studies focused on discovering predictive and prognostic power of the methylation status at CpG sites (5). To date, the CpG methylation changes represent an attractive source of biomarkers that could have a significant impact on both early and advanced tumors management. Moreover, methyl CpG sites could arise during therapy resistance in patients, so they could represent a new option for the longitudinal monitoring of the neoplastic evolution. Many scientific findings originate from managing large long-existing available datasets and, in this context, the aberrant methylation of the CpGs located at the promoter regions of tumor suppressor genes is gaining prominence (6–8). By contrast, only few large-scale studies have been focused on the role and prognostic impact on cancer of different methylation patterns across the genes, such as intragenic or gene body methylation, that may have a different role in the transcriptional regulation and efficiency of genes machinery (9, 10). The translation of all this epigenetic knowledge in clinical practice is not complete, mainly due to the lack of high-throughput and quantitatively accurate approaches that can rapidly profile poor quality and quantity of DNA obtained by tumor tissue biopsies. Pyrosequencing and bisulfite-cloning/sequencing are the most widely used methods for low-cost analyses to measure the methylation level at single CpGs of genes in daily practice with reasonable quantitative accuracy. Although highly useful, the limitations of these techniques include samples consumption, short-read length, and low sample throughput (11). In this methodology paper, we designed and experimentally validated a customized methylation panel for NGS analysis (OPERA_MET-A panel), to scan relevant CpG sites in 155 regions of 18 cancer-related genes mainly involved in the NRF2/KEAP1 pathway and immunotherapy. gDNA from FFPE, frozen tissues and cell lines were tested to simultaneously evaluate their density and average methylation levels for all or single targeted CpG sites. Finally, using this NGS approach, we obtained details about the strand-specific specificity of CpG methylation in targeted regions of the selected genes.

2 Material and methods

2.1 Biological samples selection

Three different types of biological samples were used to validate the OPERA_MET-A panel: n.3 lung paired tumor/non-neoplastic FFPE tissues (830T/N, 881T/N, 889T/N), n.3 lung tumors optimal cutting temperature compound (OCT) embedded (435T, 475T, 495T), n.2 lung cell lines (tumor A459 and normal MRC5) were used. Tissues were collected from anonymous patients, according to the guidelines of the Local Ethical Committee of IRCCS Casa Sollievo della Sofferenza Hospital, Italy, whereas cell lines were purchased from the American Type Culture Collection (ATCC, Manassas, VA, USA). A commercially available fully methylated (>95%, CpGenome Universal Methylated DNA, Millipore, Chemicon) and unmethylated (<5%, CpGenome Universal Unmethylated DNA, Chemicon) genomic gDNA with four mixtures (~25%, ~50%, ~75% and ~90%) were used as positive and negative controls to the optimal DNA conversion and library preparation.

2.2 gDNA extraction and sodium bisulfite conversion

gDNA was extracted from cell lines and fresh OCT embedded tissues by using the standard phenol-chloroform procedure (12), whereas 10 μm FFPE sections were extracted using the GeneRead DNA FFPE kit (Qiagen, MD, USA), following the manufacturer’s instruction. Before starting the bisulfite conversion, 0,33% of Unmethylated Lambda DNA (Promega) was added to each single gDNA in order to estimate bisulfite conversion efficiency during samples treatment. For each sample and control mixture (methylated/unmethylated DNA), a minimum of 250ng of gDNA extracted was treated by using Epitect Bisulfite kit (Qiagen) (13), to ensure a minimum of 100ng of converted gDNA recovery for libraries preparation. The quantification of both extracted gDNA and bisulfite treated gDNA was performed using Qubit® ssDNA Assay Kit on Qubit™ 3.0 Fluorometer (Thermo Fisher, Life Scientific).

2.3 NGS workflow

2.3.1 Ion AmpliSeq™ methylation panel design

The OPERA_MET-A panel was designed in collaboration with the Ion AmpliSeq custom design team through the Ion AmpliSeq White Glove Service (Thermo Fisher Scientific); it includes multiple informative and challenging genes regions also tracked by the Infinium Human Methylation 450 BeadChip (Illumina Inc.) or annotated as having a prognostic value in peer-reviewed scientific papers. Specifically, the OPERA_MET-A panel allows the simultaneous quantification and analysis of 155 genomic regions, each of them covered by amplicons with a size range of 125-175bp length (amplicons designed for both Watson and Crick strands) and covers 1107 CpGs located in 18 cancer-related genes (KEAP1, SPARC, PD-L1/CD274, PD-1/CD279, HAR1B, TMPRSS4, RASSF1A, HOXA9, NFE2L2, GPX2, PGD, TXNRD1, GCLC, AKR1C2, SRXN1, ABCC2, PTEN, CDKN2A/P16, GRCh37/hg19 release). Details about the CpG sites included in the methylation NGS panel were fully listed in .

2.3.2 Library preparation and sequencing

Targeted amplifications were performed starting from a standard quantity of bisulfite converted gDNA for each multiplex PCR amplification. Amplicons were generated using the two primer pools of the OPERA_MET-A panel, following the manufacturer’s recommendation for the Ion AmpliSeq Library Kit Plus for Bisulfite methylation library production (Thermo Fisher Scientific). Briefly, 2 μL of 5X Ion AmpliSeq™ HiFi mix, 2 μL of 5X Ion AmpliSeq™ of primers pools and 100ng of bisulfite converted gDNA were mixed and amplified following temperature conditions to achieve DNA target amplification. Then, primer sequences were partially digested by adding 2 μL of FuPa Reagent and loaded in a thermal cycler under user guide conditions. Each library was labeled with a single Ion Xpress™ barcode and Ion P1 adapter (Thermo Fisher Scientific) by adding 4 μL of Switch Solution, 2 μL of diluted barcode and 2 μL of DNA Ligase to the reaction mixture, following the temperature conditions recommended by the manufacturer. Finally, the Agencourt AMPure XP bead (Beckman Coulter, Brea, CA, USA) was used to complete three rounds of purifications on a magnet rack and 50 µL of Low TE was added to elute the library. A 100-fold dilution of purified and amplified libraries was quantified using the Ion Library TaqMan quantitation kit (Thermo Fisher Scientific) in a 7900 Real-Time PCR System (Thermo Fisher Scientific), following standard qPCR cycling. Library profiles were checked by capillary electrophoresis using the High Sensitivity D1000 ScreenTape on Tape Station 2200 (Agilent Technologies), and equimolar concentrations of samples libraries were pooled. 40pM of manually pooled normalized libraries were used for template preparation and Ion 530™ chip loading on the Ion Chef Instrument (Thermo Fisher Scientific) using the Ion 520™&530™ ExT Kit-Chef (Thermo Fisher Scientific). Sequencing was performed on Ion GeneStudio S5 (Thermo Fisher Scientific). A maximum of 6 samples were loaded on a single chip per sequencing run.

2.3.3 Bioinformatic analysis

The methylation analysis was performed using the previously described outline (14, 15) and the Ion Torrent Suite™ Software (version 5.10.1) running on the Torrent Server (Thermo Fisher Scientific) was used to process the sequencing data. The “methylation_analysis” Torrent Suite plugin (Thermo Fisher Scientific) was used to analyze the sequencing output of the OPERA_MET-A panel and annotate the percentage of each targeted CpG site. This analysis plugin performs sequencing read alignment onto the W and C strands of the GRCh37/hg19 reference genome and then assesses the methylation status in a strand-specific manner. Reports and text files were generated for each amplicon, containing the number of methylated and unmethylated reads as well as the percentage of methylation per amplicon in relation to the targeted region/CpG sites (14, 15). A summary report was created for each sample that includes the barcode name, the assigned sample name, the total number of reads that cover the target CpGs, and the percentage of methylated reads. In addition, for each barcode, text files with the number of methylated reads, unmethylated reads, and percent methylation for each amplicon were generated. Separate text files were created for (i) the target CpGs, (ii) all CpGs in the amplicon insert, and (iii) all non-CpG cytosines in the amplicon insert. Each amplicon may contain zero, one, or more CpG targets (hotspots) of interest.

3 Statistical analysis

A simple linear regression model (Observed methylation ~ Expected methylation), was used to model the relationship between observed and expected global methylation results using the OPERA MET-A panel. The “Expected values” were % of methylation level of different control mixtures obtained from fully methylated and unmethylated commercial gDNA samples (>95%, ~90%, ~75%, ~50%, ~25%, ~10%, <5%). The Wilcoxon signed-rank test was used to assess significant differences in CpG methylation levels between the W and C strands, as defined by genes and patients. All results were deemed statistically significant when p is <0.05. R Foundation for Statistical Computing was used to perform all statistical analyses and plots (version 4.0, packages: ggplot2, dplyr, devtools, PairedData).

4 Results

4.1 Targeted regions selection

The Ion AmpliSeq™ Design Pipeline considered theoretical CpG genome conversion to perform primer design and selection, amplicon tiling and generation of optimal amplicon pooling. The input targets were a list of both single hotspot CpGs (CpG interspersed) and genomic regions containing CpG sites (CpG Island). The target regions of selected genes ranged from 125 to 175bp. They were chosen for validation among those having a potential translational impact in peer-reviewed scientific papers related to immunotherapy in various cancer types (PD-1, PD-L1) (16–27), oxidative stress (KEAP1, NFE2L2, GPX2, PGD, TXNRD1, GCLC, AKR1C2, SRXN1, ABCC2) (28–35), and other cancer/early-stage prognostic biomarkers (SPARC, HAR1B, TMPRSS4, RASSF1A, HOXA9, PTEN, CDKN2A/P16) mainly related to lung cancer (36–59), ( ). For each gene included in the NGS panel, both island-located and interspersed CpGs were tacked ( ).
Table 1

Full list of targeted CpGs by OPERA_MET-A panel.

Target IDGeneAccession NumberChromosomeStart (hg19)End (hg19)CpG location (Island/interspersed)Gene location
cg04909257 PGD NM_001304451.2chr11046249710462499interspersedintron
CpG_Island NFE2L2 NM_001145412.3chr2178128273178129847Islandintron/exon
cg11532131 PD-1/CD279 NM_005018.3chr2242792224242792226interspersedexon
cg10057601 242793077242793079interspersedexon
cg22235901 242793206242793208interspersedexon
CpG_Island-1 242794853242795083Islandintron/exon
cg25798782 242795281242795283interspersedintron
cg01632474 242799311242799313interspersedintron
cg21670983 242799459242799461interspersedintron
CpG_Island-2 242799488242799696Islandintron
cg18096388 242800972242800974interspersedexon
cg25890838 242801045242801047interspersedexon
cg02122525 242801251242801253interspersed5’ upstream
cg14453145 242801895242801897interspersed5’ upstream
cg17322655 242802126242802128interspersed5’ upstream
cg20805133 242802191242802193interspersed5’ upstream
cg19811994 RASSF1A NM_170713.3chr35037364050373642interspersedintron
CpG_Island-1 5037426450375629Islandintron/exon
cg24049629 5037647450376476interspersedintron
CpG_Island-2 5037780350378540Islandintron/exon/5’ upstream
cg10505630 SPARC NM_003118.4chr5151051213151051215interspersedintron
cg23174201 151054255151054257interspersedintron
cg27128761 151055649151055651interspersedintron
cg26389330 151057859151057861interspersedintron
cg14518209 151066267151066269interspersedintron
CpG_Island 151066456151066695Islandexon
cg25913233 151066682151066684interspersed5’ upstream
cg22116670 151066729151066731interspersed5’ upstream
cg07539983 151067340151067342interspersed5’ upstream
cg02731193 GCLC NM_001498.4chr65340718553407187interspersedintron
CpG_Island HOXA9 NM_152739.4chr72720391527206462Islandintron/exon/5’ upstream
cg15837913 PD-L1/CD274 NM_014143.4chr954498895449891interspersed5’ upstream
CpG_Island 54504095450629Islandintron/exon/5’ upstream
cg13474877 54507235450725interspersedintron
cg19724470 54509355450937interspersedintron
cg12840719 CDKN2A NM_000077.5chr92196823221968234interspersedexon
CpG_Island-1 2196835821968728Islandintron
CpG_Island-2 2197091321971190Islandexon
CpG_Island-3 2197457821975306Islandintron/exon
cg19648686 AKR1C2 NM_001354.6chr1050449915044993interspersedintron
CpG_Island PTEN NM_000314.8chr108962177289624128Islandintron/exon*
cg02307823 8967590089675902interspersedintron
cg19378330 ABCC2 NM_000392.5chr10101605987101605989interspersedintron
cg05775918 TMPRSS4 NM_019894.4chr11117947554117947556interspersed5’ upstream**
cg03634928 117947610117947612interspersed5’ upstream**
cg27300950 117947627117947629interspersed5’ upstream**
cg25116503 117947656117947658interspersed5’ upstream**
cg22957898 117947876117947878interspersedexon**
cg03331715 TXNRD1 NM_182729.3chr12104689086104689088interspersedintron
cg10880599 GPX2 NM_002083.4chr146540847965408481interspersedintron***
cg09643186 6540945165409453interspersedexon***
cg26155983 6541014465410146interspersed5’ upstream
cg01586432 KEAP1 NM_203500chr191059701510597017interspersedexon
cg02337283 1059997510599977interspersedexon
cg22779878 1060044510600447interspersedexon
CpG_Island 1060228010602878Islandexon
cg20226327 1060295910602961interspersedintron
cg24892871 1061104210611044interspersedintron
cg26988016 1061280110612803interspersedintron
cg15204119 1061317910613181interspersedintron
cg06911149 1061345510613457interspersedintron
cg15676203 1061348710613489interspersedintron
cg03890664 1061349110613493interspersedintron
cg26500801 1061385410613856interspersedintron
cg02428100 1061402110614023interspersedexon
cg25801292 1061427110614273interspersed5’ upstream
cg12095186 1061509610615098interspersed5’ upstream
cg03754063 1061519810615200interspersed5’ upstream
cg18484212 SRXN1 NM_080725.3chr20631460631462interspersedintron
CpG_Island HAR1B NR_003245.1chr206173327561734521Islandintron/exon

*(“intron/exon” for KLLN, NM_001126049.2, C strand);

**(“intron” for SMIM35, NM_001394164.1, C strand);

*** (“intron” for FTNB, NM_001202559.1, W strand).

Full list of targeted CpGs by OPERA_MET-A panel. *(“intron/exon” for KLLN, NM_001126049.2, C strand); **(“intron” for SMIM35, NM_001394164.1, C strand); *** (“intron” for FTNB, NM_001202559.1, W strand).

4.2 Samples type and libraries profile – pre-sequencing quality control

To experimentally validate the performance of our NGS methylation panel, we decided to test three different types of biological samples having different grades of fragmentation and verify whether and how the density of global and single methylation status for each CpG site/region at gene Island occurs. The different epi/methyl-print is based on a set of 3 paired FFPE lung tumors/matched non-neoplastic tissues, 3 OCT embedded lung tumors samples and adenocarcinoma lung cancer cell A549 and non-neoplastic cell MRC5 lines. Upon libraries amplification, the quality and molar concentration of each library were determined using Tape Station 2200 (Agilent) and no substantial differences in the libraries quality and length were observed among converted gDNA templates from cell lines, OCT embedded and FFPE Tissues ( ). By contrast, differences in quantity and library profiles were observed when an input of converted gDNA amounts of 30 ng and 100 ng were used to construct libraries ( ). Considering the concentration and size distribution of amplicons, the higher libraries quality profile was observed for 100 ng of converted gDNA, which was therefore adopted for all the above reported experiments.
Figure 1

Quality control steps of library construction using the OPERA_MET-A panel for targeted next generation bisulfite sequencing. (A) Representative picture of chromatograms showing a high quality library profile of methylated and unmethylated DNA control samples and eight gDNA samples from cell lines, FFPE and OCT embedded tissues. (B) Representative images from Agilent Tape Station 2200 NGS libraries for all bisulfite treated DNA samples.

Quality control steps of library construction using the OPERA_MET-A panel for targeted next generation bisulfite sequencing. (A) Representative picture of chromatograms showing a high quality library profile of methylated and unmethylated DNA control samples and eight gDNA samples from cell lines, FFPE and OCT embedded tissues. (B) Representative images from Agilent Tape Station 2200 NGS libraries for all bisulfite treated DNA samples.

4.3 Evaluation of analytical performance of OPERA_MET-A panel

Converted gDNA from cell lines, OCT embedded and FFPE tissues were successfully analyzed. Results about metrics per sample was summarized in . The mean number of reads for samples was 1034910 ± 101511 (ranging from 329889 to 2077231), the mean number of read length was 76 ± 2 (ranging from 57 to 85 bp); the mean number of mapped reads was 769384 ± 74468 (ranging from 244244 to 1397720), the mean percentage of read on target was 55 ± 3% for W strand (ranging from 33% to 79%), and 53 ± 4% for C strand (ranging from 23% to 81%). A difference in % of unmapped reads was observed between tested mixes of commercially bisulfite treated gDNA samples and bisulfite treated gDNA obtained from biological sample biopsies (≤12% vs ≥26%). Anyway, a high intra-samples variability was observed among gDNA from all samples type ( ). The mean number of read per amplicon was 2862 ± 304 for W strand (ranging from 1035 to 5875), and 3081 ± 345 for C strand (ranging from 333 to 6433), in line with the data previously reported by Luo et al. for the Ion AmpliSeq™ Methylation Panel for Cancer Research (15).
Table 2

Metrics per sample obtained using the OPERA_MET-A panel for targeted bisulfite NGS.

MetricMean valueMinimum observedMaximum observed
Number of total reads10349103298892077231
Number of mapped reads7693842442441397720
Read length (base pair)76bp57bp85bp
Percentage of read on target for W strand55%33%79%
Percentage of read on target for C strand53%23%81%
Number of read per amplicon for W strand286210355875
Percentage of read on target for C strand30813336433
Metrics per sample obtained using the OPERA_MET-A panel for targeted bisulfite NGS.

4.4 Estimation of optimal samples per chip number

The empirical calculation of cut-off of minimum required mapped reads per target was made according to the following criteria: >20 methylated read for testing CpG site, >40% of tumor cell content and >10% methylation level per site and was also related to an expected conversion efficiency >99%. We can therefore assume a minimum of 500 total mapped reads per site to achieve a successful run/site. Based on the experimental sequencing performance of samples, we assume that a minimum of 6 samples/530 chip can be loaded.

4.5 Performance of bisulfite conversion rate and global methylation detection by OPERA_MET-A panel

Bisulfite conversion of gDNA was employed in order to discriminate methylated versus unmethylated cytosines, where unmethylated cytosines were deaminated to uracil nucleotide; converted DNA template generated thymines during PCR amplification. The bisulfite conversion rate was calculated by evaluating the Lambda control DNA. Once bisulfite conversion is performed, theoretically every C residue in the unmethylated Lambda control DNA should be converted to a T. The methylation_analysis plugin counted the number of C residues that are present in the sequence to determine the percentage of the sequence that is methylated, assuming that after the bisulfite conversion reaction, the only C residues that remain in the sequence were methylated in the original sample. The value is shown in the percent.ME (percent methylation) column for each sample. The optimal bisulfite conversion rate should be >99%, calculated as the difference from 100% of the average of the observed W and C percent.ME values for the unmethylated Lambda DNA in each sample and control (60). To estimate the conversion efficiency of our samples, replicate commercially gDNA samples of average methylation states across all CpGs of approximately >95% and <5%, and different mixtures of the two methylation states (~90%, ~75%, ~50%, ~25% and ~10%) were used to construct libraries using the custom panel. As per standard operative procedure, unmethylated Lambda DNA was spiked into each sample prior to bisulfite conversion and primers exist in the panel to determine conversion efficiency using the sequencing output files. A high conversion efficiency was obtained for all treated control samples, with a mean of 98,5% ± 0,2 of conversion efficiency (ranging from 97,5% to 99,1%), ( ). The methylation_analysis plugin allowed both the alignment and the methylation status calling. As a result, a high concordance between average percent methylation across all amplicons (target_CpG) of OPERA_MET-A panel between observed and expected % of methylation was obtained for each methylation state (p value<0.0001, Adjusted R2 = 0.95), ( ).
Figure 2

Global methylation levels across all amplicons (target_CpGs) of OPERA_MET-A panel. gDNA samples of average methylation states across all CpGs of approximately >95% and <5%, and different mixtures of methylation states (~90%, ~75%, ~50%, ~25% and ~10%) were used. (A) Distribution of observed global methylation levels for each gDNA control mixture in three replicates (replicate 1=red, replicate2=green, replicate3=blue), the expected ones are in black. (B) Linear regression analysis of global methylation levels for each gDNA control mixtures (replicate1=red, replicate2=green, replicate3=blue). The filled color areas represent the standard error of each replicate.

Global methylation levels across all amplicons (target_CpGs) of OPERA_MET-A panel. gDNA samples of average methylation states across all CpGs of approximately >95% and <5%, and different mixtures of methylation states (~90%, ~75%, ~50%, ~25% and ~10%) were used. (A) Distribution of observed global methylation levels for each gDNA control mixture in three replicates (replicate 1=red, replicate2=green, replicate3=blue), the expected ones are in black. (B) Linear regression analysis of global methylation levels for each gDNA control mixtures (replicate1=red, replicate2=green, replicate3=blue). The filled color areas represent the standard error of each replicate. Methylation data obtained by NGS from FFPE samples (paired tumor/non-neoplastic samples) were used to evaluate the utility of OPERA_MET-A panel to visualize the CpG methylation distribution along different gene regions (Island and interspersed CpGs). As expected, variations of methylation level of CpGs were observed between paired samples from the same patients and among patients for PD-1/CD279, that was shown as representative model for this specific application ( ). Finally, methylation data obtained by NGS from paired FFPE samples to investigate the CpG methylation distribution between W and C strands. A subset of CpGs mapped in KEAP1, PD-1/CD279 and RASSF1A genes ( ) was chosen as pilot for this analysis. For each targeted CpG located on both W and C strand, the total number of reads that cover the target CpGs, the number of methylated reads, unmethylated reads, and percent methylation for each amplicon were generated for each targeted CpG site on both W and C strand of the genes. Interestingly, we observed a striking change in global CpG methylation across the RASSF1A gene on both W and C strands in all samples (p<0.01, Wilcoxon signed-rank test), whereas no substantial differences in global CpG methylation distribution between W and C strands were observed in CD279/PD-1 and KEAP1 genes ( and , ).
Figure 3

(A) Schematic epigrams showing the CpGs methylation levels along different PD-1/CD279 gene regions in paired tumor/normal FFPE samples. (B) Schematic representation of the PD-1/CD279 gene structure within the human hg19 genome sequence. From top to bottom: NCBI RefSeq and Consensus CDS tracks for PD-1/CD279 exon/intron structure; predicted CpG islands (“Regulation” >> “CpG Island” track); CpG methylation sites map targeted by OPERA_MET-A panel are mapped.

Figure 4

Strand specific distributions of CpG methylation in samples using OPERA_MET-A panel. The average percent of global CpG methylation at RASSF1A gene (target_CpGs) in paired non-neoplastic and tumor tissues of FFPE samples 830, 881, 889. Global % CpG methylation at W strands is in light blue boxplots, at C strands in red boxplot (Wilcoxon signed-rank test).

(A) Schematic epigrams showing the CpGs methylation levels along different PD-1/CD279 gene regions in paired tumor/normal FFPE samples. (B) Schematic representation of the PD-1/CD279 gene structure within the human hg19 genome sequence. From top to bottom: NCBI RefSeq and Consensus CDS tracks for PD-1/CD279 exon/intron structure; predicted CpG islands (“Regulation” >> “CpG Island” track); CpG methylation sites map targeted by OPERA_MET-A panel are mapped. Strand specific distributions of CpG methylation in samples using OPERA_MET-A panel. The average percent of global CpG methylation at RASSF1A gene (target_CpGs) in paired non-neoplastic and tumor tissues of FFPE samples 830, 881, 889. Global % CpG methylation at W strands is in light blue boxplots, at C strands in red boxplot (Wilcoxon signed-rank test).

5 Discussion

With an increasing number of clinically actionable targets, the optimization of NGS technology ensured a high sensitivity, specificity and time-saving of analysis by easily assessing every nucleotide change in multiple targets gene/or regions at single-base resolution. More recently, the NGS technology is also enhancing the methylome analysis, thus contributing to expand the knowledge and characterization of differentially methylated gDNA regions in many cancer-related human genes (61). Starting from a relatively low quantity of biological materials, the NGS approach offers in this specific field the great advantage to successfully quantify DNA methylation density and its differences at specific CpG sites of both promoter or intragenic regions (62, 63). The great main translational advantage of this feasible high-throughput approach in a more specific cancer context is the opportunity to construct a more robust and integrated algorithm to predict the disease evolution of patients. Moreover, it could support the selection and a better stratification of surgically-resected patients for follow-up and enhance the development of novel therapeutic approaches, not yet based on an epigenetic profile in daily practice (64, 65). The design and validation of the OPERA_MET-A panel for NGS analysis were based on this assumption. This panel allows to obtain libraries starting from gDNA extracted from different matrices, such as cell lines, OCT embedded frozen and FFPE tissues. Considering the recently published data on NGS for methylation analysis using the gene-targeted AmpliSeq technology, very promising and comparable results to those already been published were obtained (15) in terms of mean target depth ≥2,500X (W and C strand), average number of mapped reads >750,000/sample and concordance results between expected and observed % of global methylation for all CpGs. More specifically, the OPERA_MET-A panel primer design pipeline include more amplicons (155 amplicons) than those generated by Ion Ampliseq™ Methylation Panel for Cancer Research (40 amplicons) that covers 18 non-overlapping genes and can be used to analyze gDNA from cell lines, human FFPE and OCT tissues to reach a comparable performance in terms of time consuming, observed/expected methylation rate concordance and average mapped reads. The same “methylation_analysis” plugin was used to successfully perform alignment and methylation calling for amplicons on both W and C strands. Moreover, the values of the obtained run metric parameters confirm that our customized panel allows us to obtain targeted and quantitative information on tumor markers with a high resolution starting from a relatively low DNA quantity and quality input ( ).
Figure 5

Schematic representation of methylation NGS analysis workflow using OPERA_MET-A panel.

Schematic representation of methylation NGS analysis workflow using OPERA_MET-A panel. The OPERA_MET-A panel covers multiple regions of 18 cancer-driver genes whose methylation profile was previously proven to have a translational impact on lung cancer progression but also in other solid tumors. Among these, the panel allows the epigenetic scanning of poorly investigated CpGs located at the intragenic exon 3 island of the KEAP1 gene, that were linked to KRAS mutant status in NSCLC patients (29) and at the SPARC gene promoter, which has been proposed as an interesting prognostic biomarker in NSCLC with useful application in the squamous early-stage group (40). Moreover, CpGs located at the CD274/PD-L1 promoter region and CD279/PD-1 were also included in the panel design, since they have recently emerged as an independent prognostic factor associated with shorter overall survival in triple negative breast, colorectal, prostate, gastric, papillary thyroid, melanoma and head and neck cancer patients (16–27). Many increasing literature evidences suggest that, not only CpG Islands, but also CpG shores methylation correlate with gene expression (66, 67). Furthermore, additional evidence would suggest methylation regulatory regions can extent into exon1/intron 1 of a gene and are outside the promoter/CpG island locus (68–70). In the presented work, we mainly focused on CpG island loci and selected the candidate gene list and loci based on an existing literature evidences and own research field of interest results ( ). Further planned analyses will evaluate differences in prognostic impact related to CpGs in single gene or among genes of the presented panel in cancer patients, thus allowing an upgrade of OPERA_MET-A panel. While the main focus of this study was to investigate methylation on multiple cancer-related genes by using a multigene NGS panel starting from low DNA quality and quantity, our analysis also offers new insight into the heterogeneity of CpG methylation patterns among genes and patients. Using the “methylation_analysis” plugin to manage data obtained from the OPERA_MET-A panel, differentially methylated cytosines can be found along the same gene (i.e. PD-1/CD279) and/or in each gene in both double-stranded (i.e., symmetric, as for PD-1 and KEAP1 genes) and single-stranded (i.e., asymmetric, as for RASSF1A) contexts. While single stranded DNA methylation can be more frequently detected in non-CG methylation contexts (71), CpG positions are usually expected to be either fully methylated or fully unmethylated in both DNA strands. Therefore, there should not be a “right” or “wrong” strand when choosing to study conventional methylation patterns and data obtained from one strand can be safely assumed to apply also to the second strand. Double stranded DNA methylation primarily occurs in the case of nucleobase symmetry between sense (CG) and antisense (GC) DNA strands. However, methylation in a CG context is not always double-stranded; it can also occur on just a single strand, as described in solid tumors (72) and should be related to genes function and transcription. By consequence, the ability to determine the exact locations and status of CpG methylation in W and C strands separately could provide interesting experimental evidence for innovative clinical applications. The main limitation of the OPERA_MET-A panel relies on the limited number of genes and samples to be analyzed in a single run to achieve an acceptable coverage to obtain epigenetic information from degraded samples having variable % tumor cell content. Further improvements are required, such as extending the number of clinically relevant genes whose methylation has a prognostic value in tumors and additional orthogonal evaluation of the methylation density using conventional approaches to study methylation of genes at single CpG level. It should also keep in mind that, when we worked on different matrices, including fixed tissues, an optimized pre-analytical and analytical workflows are demanded in order to obtain an optimal quality and quantity of DNA and decrease the risk of suffering from a critical bisulfite conversion. As also reported in our small subsets of samples from different matrices (commercial gDNA, cell lines, OCT and FFPE tissues) degraded DNA samples (by cross-linking, deamination and fragmentation) could in fact affect the efficiency of NGS analysis, so their pre-analytical and analytical manipulation represents a critical aspect to evaluate in order to improve the homogeneity and efficiency of bisulfite conversion and high throughput NGS sequencing (73, 74). In conclusion, considering all together, the obtained NGS performance for OPERA_MET-A panel corroborates the utility NGS approach for methylation pattern analysis among groups, validation of whole approaches, identification of gDNA methylation for different regions of single/multiple genes (promoter and other regulatory regions) or CpG islands in multiple samples aimed at the discovery of biomarkers having clinical relevance.

Data availability statement

The original contributions presented in the study are included in the article/ . Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving human participants were reviewed and approved by Local Ethical Committee of IRCCS Casa Sollievo della Sofferenza Hospital, Italy. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

Conceptualization, supervision, writing—original draft preparation, LAM. Methodology and validation FF, FC and MC. Visualization, formal analysis and software SC, TM and MM. Data curation, FF, AS, LAM. Resources, NC and PG, and Investigations, FF, SC, FC. Writing—review and editing: FF, SC, MM, DT, AC, PG, EM, VF, LAM. Funding acquisition, LAM and EM. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Italian Ministry of Health, Ricerca Corrente 2021-22, by the “5 x1000” voluntary contributions to Fondazione IRCCS Casa Sollievo della Sofferenza.

Acknowledgments

The authors thank Dr. Giorgio Pea for his technical and logistic assistance in the OPERA_MET-A panel design; NGS run parameters and Methylation_analysis plugin optimization. The authors also thank Prof. Andreina Guarnieri for the professional English editing of the manuscript, Teresa Balsamo for her technical assistance in performing NGS analysis and Federica Russo for tissue samples handling.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  72 in total

1.  Identification of functional methylated CpG loci in PD-L1 promoter as the novel epigenetic biomarkers for primary gastric cancer.

Authors:  Mohammad Amini; Maryam Hejazi; Khodayar Ghorban; Ahad Mokhtarzadeh; Behzad Baradaran
Journal:  Gene       Date:  2020-12-23       Impact factor: 3.688

2.  Promoter Methylation of PTEN Is a Significant Prognostic Factor in Melanoma Survival.

Authors:  Mi Ryung Roh; Sameer Gupta; Kyu-Hyun Park; Kee Yang Chung; Martin Lauss; Keith T Flaherty; Göran Jönsson; Sun Young Rha; Hensin Tsao
Journal:  J Invest Dermatol       Date:  2016-02-15       Impact factor: 8.551

3.  Characterization of the Clinical Significance of PD-1/PD-Ls Expression and Methylation in Patients With Low-Grade Glioma.

Authors:  Jie Mei; Yun Cai; Rui Xu; Xuejing Yang; Weijian Zhou; Huiyu Wang; Chaoying Liu
Journal:  Technol Cancer Res Treat       Date:  2021 Jan-Dec

4.  Epigenetic alterations leading to TMPRSS4 promoter hypomethylation and protein overexpression predict poor prognosis in squamous lung cancer patients.

Authors:  Maria Villalba; Angel Diaz-Lagares; Miriam Redrado; Arrate L de Aberasturi; Victor Segura; Maria Elena Bodegas; Maria J Pajares; Ruben Pio; Javier Freire; Javier Gomez-Roman; Luis M Montuenga; Manel Esteller; Juan Sandoval; Alfonso Calvo
Journal:  Oncotarget       Date:  2016-04-19

Review 5.  Methylation Dynamics of RASSF1A and Its Impact on Cancer.

Authors:  Giorgio Malpeli; Giulio Innamorati; Ilaria Decimo; Maria Bencivenga; Armel Herve Nwabo Kamdje; Roberto Perris; Claudio Bassi
Journal:  Cancers (Basel)       Date:  2019-07-09       Impact factor: 6.639

Review 6.  The emerging role of epigenetic therapeutics in immuno-oncology.

Authors:  Michael J Topper; Michelle Vaz; Kristen A Marrone; Julie R Brahmer; Stephen B Baylin
Journal:  Nat Rev Clin Oncol       Date:  2019-09-23       Impact factor: 66.675

7.  Clinical validation of a targeted methylation-based multi-cancer early detection test using an independent validation set.

Authors:  E A Klein; D Richards; A Cohn; M Tummala; R Lapham; D Cosgrove; G Chung; J Clement; J Gao; N Hunkapiller; A Jamshidi; K N Kurtzman; M V Seiden; C Swanton; M C Liu
Journal:  Ann Oncol       Date:  2021-06-24       Impact factor: 32.976

8.  A prognostic DNA methylation signature for stage I non-small-cell lung cancer.

Authors:  Juan Sandoval; Jesus Mendez-Gonzalez; Ernest Nadal; Guoan Chen; F Javier Carmona; Sergi Sayols; Sebastian Moran; Holger Heyn; Miguel Vizoso; Antonio Gomez; Montse Sanchez-Cespedes; Yassen Assenov; Fabian Müller; Christoph Bock; Miquel Taron; Josefina Mora; Lucia A Muscarella; Triantafillos Liloglou; Michael Davies; Marina Pollan; Maria J Pajares; Wenceslao Torre; Luis M Montuenga; Elisabeth Brambilla; John K Field; Luca Roz; Marco Lo Iacono; Giorgio V Scagliotti; Rafael Rosell; David G Beer; Manel Esteller
Journal:  J Clin Oncol       Date:  2013-09-30       Impact factor: 44.544

9.  The prognostic value of homeobox A9 (HOXA9) methylation in solid tumors: a systematic review and meta-analysis.

Authors:  Hai Cai; Zhi-Bin Ke; Ru-Nan Dong; Hang Chen; Fei Lin; Wen-Cai Zheng; Shao-Hao Chen; Jun-Ming Zhu; Shao-Ming Chen; Qing-Shui Zheng; Yong Wei; Xue-Yi Xue; Ning Xu
Journal:  Transl Cancer Res       Date:  2021-10       Impact factor: 1.241

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.