| Literature DB >> 26425700 |
Matthias Wielscher1, Klemens Vierlinger1, Ulrike Kegler1, Rolf Ziesche2, Andrea Gsur3, Andreas Weinhäusel1.
Abstract
Disease-specific alterations of the cell-free DNA methylation status are frequently found in serum samples and are currently considered to be suitable biomarkers. Candidate markers were identified by bisulfite conversion-based genome-wide methylation screening of lung tissue from lung cancer, fibrotic ILD, and COPD. cfDNA from 400 μl serum (n = 204) served to test the diagnostic performance of these markers. Following methylation-sensitive restriction enzyme digestion and enrichment of methylated DNA via targeted amplification (multiplexed MSRE enrichment), a total of 96 markers were addressed by highly parallel qPCR. Lung cancer was efficiently separated from non-cancer and controls with a sensitivity of 87.8%, (95%CI: 0.67-0.97) and specificity 90.2%, (95%CI: 0.65-0.98). Cancer was distinguished from ILD with a specificity of 88%, (95%CI: 0.57-1), and COPD from cancer with a specificity of 88% (95%CI: 0.64-0.97). Separation of ILD from COPD and controls was possible with a sensitivity of 63.1% (95%CI: 0.4-0.78) and a specificity of 70% (95%CI: 0.54-0.81). The results were confirmed using an independent sample set (n = 46) by use of the four top markers discovered in the study (HOXD10, PAX9, PTPRN2, and STAG3) yielding an AUC of 0.85 (95%CI: 0.72-0.95). This technique was capable of distinguishing interrelated complex pulmonary diseases suggesting that multiplexed MSRE enrichment might be useful for simple and reliable diagnosis of diverse multifactorial disease states.Entities:
Keywords: AUC, area under curve; Biomarker; COPD, chronic obstructive pulmonary disease; Ct-value, cycle threshold; HOXD10; HP, hypersensitivity pneumonitis; ILD, interstitial lung disease, IPF, idiopathic pulmonary fibrosis; Liquid biopsy; MSP, methyl specific priming; MSRE, methyl sensitive restriction enzyme; NSIP, non-specific interstitial pneumonitis; PAX9; ROC, receiver operating characteristics; UIP, usual interstitial pneumonia; cfDNA, cell-free DNA; methyl-sensitive restriction enzyme; multiplex PCR
Mesh:
Substances:
Year: 2015 PMID: 26425700 PMCID: PMC4563135 DOI: 10.1016/j.ebiom.2015.06.025
Source DB: PubMed Journal: EBioMedicine ISSN: 2352-3964 Impact factor: 8.143
Clinical Characteristics of patient serum/plasma samples.
250 samples were analyzed. 204 within the original set and 46 within the PoP (proof of principle) set. Type indicates whether a serum or plasma was available from the patients. Age indicates the mean age of each patient group. S indicates serum; P Plasma; female reflects the portion of female patients in percent. Information on the characteristic histopathologic pattern of ILD was not available for 12 fibrotic ILD patients and cancer staging information was not available for 7 sera of the original set and 3 sera of the PoP set. AdenoCa, lung adenocarcinoma; SqCC: squamous cell carcinoma, SCLC: small cell lung cancer, LCLC: large cell lung cancer.
| Original set (n = 204) | PoP set (n = 46) | |||||||
|---|---|---|---|---|---|---|---|---|
| Patients | Type | Female | Age | Patients | Type | Female | Age | |
| Healthy | n = 27 (13.2%) | S&P | 27.5% | 57.5 | n = 23 (50%) | P | 17.4% | 63.3 |
| COPD 0 | n = 34 (16.6%) | S | 26.5% | 48 | – | |||
| TNM I&II | n = 9 (4.4%) | P | 44.4% | 65.6 | n = 8 (17.4%) | P | 37.5% | 60.4 |
| TNM III&IV | n = 15 (7.3%) | P | 40% | 65 | n = 12 (26%) | P | 58.3% | 63.7 |
| AdenoCa | n = 11 (5.4%) | P | 54.5% | 63.9 | n = 9 (19.5%) | P | 77.7% | 58.3 |
| SqCC | n = 8 (3.9%) | P | 12.5% | 60.1 | n = 4 (8.6%) | P | 0% | 65.3 |
| SCLC | n = 7 (3.4%) | P | 57.1% | 70.1 | n = 6 (13%) | P | 66.6% | 69.3 |
| LCLC | n = 7 (3.4%) | P | 0% | 70.7 | n = 4 (8.7%) | P | 0% | 58.2 |
| – | ||||||||
| GOLD I–II | n = 31 (15.1%) | S | 19.4% | 52.6 | – | |||
| GOLD III–IV | n = 11 (5.4%) | S | 18.2% | 62.4 | – | |||
| – | ||||||||
| IPF, limited UIP | n = 15 (7.3%) | S | 20% | 65.3 | – | |||
| IPF, advanced UIP | n = 10 (4.9%) | S | 50% | 56.6 | – | |||
| NSIP | n = 11 (5.4%) | S | 27.3% | 68 | – | |||
| HP | n = 22 (10.7%) | S | 27.3% | 51.7 | – | |||
Fig. 1Study flow diagram. The discovery was performed on bisulfite converted DNA derived from lung tissue of lung cancer patients, fibrotic ILD, COPD patients and healthy controls. Candidate markers revealed by Illumina 450K arrays were validated on the same sample material via MSRE digestion based qPCR. The best performing assays in this procedure were then used for minimal invasive cfDNA methylation detection. Thus a 4-gene model could be established, which was validated on an independent sample set of 46 patients.
Fig. 2Overview of cfDNA analysis. (A) Log2-transformed amount of cfDNA per ml serum/plasma is shown. Error bars indicate standard deviation of the mean. (B) The colored strings represent the cfDNAs. Those with CH3 groups represent methylated DNA, the purple strings demonstrate cfDNA from healthy tissue (purple) and lung cancer (red), respectively.
cfDNA processing workflow: Each reaction was based on one serum sample (400 μl). During enzymatic digestion, methylation protected the methylated cfDNA strings which then served as templates for targeted amplification (Pre-amp). One multiplexed preamplification was performed per sample using 96 primer pairs as indicated by different colors. Amplification results are shown in black color. Specific methylation markers were detected by individual qPCR reactions. The lower panel shows the prediction and resampling approach (see Method section). (C) Fisher discriminant analysis was performed using the top 30 markers subsequently, the data were projected to 2 most informative projection directions (discriminant scores). The plot shows separation of patient samples based on the transformed data, which may be interpreted similar to a Principal Component Analysis. (D) ROC-curve analysis shows quality of separation of each analyzed disease versus healthy controls.
Fig. 3Differential diagnosis approach. (A) The classification scheme starts with the separation of cancer samples, which are subsequently subdivided into TNMI&II and TNMIII&IV. The non-cancer samples are classified into healthy, fibrosis and COPD. This scheme implicates three prediction rounds, with results given in (B) and (C). The result for the separation of cancer TNMI&II and cancer TNMIII&IV is given in Supplemental Fig. S9. Percent values indicate the correct classification rate applying the determined cut off value given in sections B and C for each disease. (B and C) Bar plots indicate the relative variable importance for each model. The relative variable importance reflects the contribution of each variable to the prediction success. ROC curves indicate the quality of group separation including the chosen cut off value given as black dot. The rightmost panel summarizes the ROC curve analysis starting with the values (Area under curve, Sensitivity and Specificity) derived from conventional ROC curve analysis. The section below, starting with “CutOff”, lists the values gained by the application of the specific cut off values. (B) Distinguishes between cancer and residual samples(C) classify residual samples into healthy, fibrotic ILD and COPD.
Fig. 4Representative markers for differential diagnosis. Upper panel sections A and B demonstrate the effect of each variable on class probability. Class probability is given on the y-axis, while delta Ct-values are shown on the x-axis. Dependence of each predictor variable is averaged over the distribution of all modeled variables. The upper panel demonstrates the change of class probability (healthy, cancer, ILD, and COPD) as a function of Ct-value changes for the 4 top markers identified. The lower panels display boxplots of delta Ct-values for each marker. Due to the applied PCR methodology, lower delta Ct-values indicate increased marker methylation.
Fig. 5Results of simulated prospective sample prediction. Simulation was achieved via an adjusted resampling strategy (Supplemental Fig. S1). (A) The upper panel shows pie diagrams of classification results derived from the simulation of prospective samples. The samples arranged according to their clinical diagnosis. Each pie represents one patient group. Each section of the pie represents the predicted sample memberships in percent. No Diagn. reflects 11 samples, which could not be classified to a specific disease or as healthy, because probabilities were below all cut off values. (B) The lower panel shows the classification of 4 representative patients. Patient 1 suffers from lung cancer; patient 2 was diagnosed with a limited UIP, patient 3 was diagnosed with COPD GOLDII and patient 4 is a healthy control. The x-axis represents the class dependence probability for each patient. The error bar indicates the range from the cut off value to a 100% probability.
Fig. 6Proof of Principle: Prospective sample prediction. ROC curve analysis of 46 patient samples. Prediction is based on coefficients derived from the model presented Fig. 3. The dashed line represents the separation of cancer and non-cancer patients, applying a weighted model of all 64 variables. The solid line represents the prediction based on the top 4 markers. Panel below gives boxplots of the delta Ct-values for each marker out of top four marker model. Due to the applied PCR methodology lower delta Ct-values indicate an increased methylation of the marker.