Lennart Raman1, Malaïka Van der Linden1, Ciel De Vriendt2, Bliede Van den Broeck3, Kristoff Muylle4, Dries Deeren5, Franceska Dedeurwaerdere6, Sofie Verbeke7, Amélie Dendooven8, Katrien De Grove2, Saskia Baert2, Kathleen Claes9, Björn Menten9, Fritz Offner2, Jo Van Dorpe10. 1. Department of Pathology, Ghent University, Ghent University Hospital, Ghent, Belgium; Center for Medical Genetics, Department of Biomolecular Medicine, Ghent University, Ghent University Hospital, Ghent. 2. Department of Clinical Hematology, Ghent University, Ghent University Hospital, Ghent. 3. Department of Nuclear Medicine, Ghent University, Ghent University Hospital, Ghent. 4. Department of Nuclear Medicine, AZ Delta, Roeselare. 5. Department of Hematology, AZ Delta, Roeselare. 6. Deparment of Pathology, AZ Delta, Roeselare. 7. Department of Pathology, Ghent University, Ghent University Hospital, Ghent. 8. Department of Pathology, Ghent University, Ghent University Hospital, Ghent, Belgium; Faculty of Medicine and Health Sciences, University of Antwerp, Wilrijk. 9. Center for Medical Genetics, Department of Biomolecular Medicine, Ghent University, Ghent University Hospital, Ghent. 10. Department of Pathology, Ghent University, Ghent University Hospital, Ghent. Jo.VanDorpe@uzgent.be.
Abstract
Shallow-depth sequencing of cell-free DNA, a cheap and standardized approach to obtain molecular information on tumors non-invasively, is insufficiently explored for lymphoma diagnosis and disease follow-up. This study collected 318 samples, including longitudinal liquid and paired solid biopsies, from a prospectively recruited cohort of 38 Hodgkin lymphoma (HL) and 85 aggressive B-cell non- HL patients, represented by 81 diffuse large B-cell lymphoma (DLBCL) cases. Following sequencing, copy number alterations and viral read fractions were derived and analyzed. At diagnosis, liquid biopsies showed detectable copy number alterations in 84.2% of HL (88.6% for classical HL) and 74.1% of DLBCL patients. Copy number profiles between liquid-solid pairs were highly concordant within DLBCL (r=0.815±0.043); and, compared to tissue, HL liquid biopsies had abnormalities with higher amplitudes (P=.010), implying that tumor DNA is more abundant in plasma. Additionally, 39.5% of HL and 13.6% of DLBCL cases had a significantly elevated number of plasmatic Epstein-Barr virus DNA fragments, achieving a sensitivity of 100% compared to current standard. Longitudinal analysis determined that, when detectable, copy number patterns were similar across (re)staging moments in refractory/relapsed patients. Moreover, the overall profile anomaly highly correlated with the total metabolic tumor volume (P.
Shallow-depth sequencing of cell-free DNA, a cheap and standardized approach to obtain molecular information on tumors non-invasively, is insufficiently explored for lymphoma diagnosis and disease follow-up. This study collected 318 samples, including longitudinal liquid and paired solid biopsies, from a prospectively recruited cohort of 38 Hodgkin lymphoma (HL) and 85 aggressive B-cell non- HL patients, represented by 81 diffuse large B-cell lymphoma (DLBCL) cases. Following sequencing, copy number alterations and viral read fractions were derived and analyzed. At diagnosis, liquid biopsies showed detectable copy number alterations in 84.2% of HL (88.6% for classical HL) and 74.1% of DLBCL patients. Copy number profiles between liquid-solid pairs were highly concordant within DLBCL (r=0.815±0.043); and, compared to tissue, HL liquid biopsies had abnormalities with higher amplitudes (P=.010), implying that tumor DNA is more abundant in plasma. Additionally, 39.5% of HL and 13.6% of DLBCL cases had a significantly elevated number of plasmatic Epstein-Barr virus DNA fragments, achieving a sensitivity of 100% compared to current standard. Longitudinal analysis determined that, when detectable, copy number patterns were similar across (re)staging moments in refractory/relapsed patients. Moreover, the overall profile anomaly highly correlated with the total metabolic tumor volume (P.
B-cell-derived lymphomas can develop at different stages along the B-lymphocyte differentiation pathways. This generates a range of tumor entities[1] including Hodgkin lymphoma (HL) as well as diverse non-HL (nHL) varieties, with diffuse large B-cell lymphoma (DLBCL) the most common nHL.Both HL and DLBCL typically display distinct morphological, immunohistochemical and (epi)genetic features.[2-5] Their high cell turnover makes them excellent candidates for ‘liquid biopsy’-based research. Necrotic and apoptotic tumor cells have been shown to shed DNA[6] into the peripheral circulation and this is informative with regard to diagnosis, prognosis and therapy.[7] In theory, this would enable real-time (serial) sampling of the entire (often heterogeneous)[8] genomic lymphoma architecture in a convenient and non-invasive manner through traditional blood sampling.Recent studies[9-11] have focused on plasma cell-free DNA (cfDNA) analysis using ultra-deep targeted sequencing[12] for single nucleotide variant (SNV) and translocation detection. Overall, these studies have provided two main insights: firstly, that variant allele frequencies often correlate with disease status; and, secondly, that surveilling circulating tumor DNA (ctDNA) might outperform [18]F-fluoro- 2-deoxyglucose positron emission tomography/computed tomography (PET/CT) scans in terms of sensitivity. This latter finding holds a great deal of potential for relapse risk assessment by quantifying minimal residual disease.Despite their potential, ultra-deep sequencing techniques are still expensive and require targeted panels. In contrast, shallow (coverage ~0.25x) whole-genome sequencing (sWGS) for primarily copy number detection is cheaper and fully operative in hospitals that offer noninvasive prenatal testing.[13] One of the hallmarks of sWGS is its high specificity in identifying malignant cells: in comparison to SNV, large (i.e., >5 Mb) somatic copy number alterations are rarely detected in unaffected subjects[14] whereas SNV accumulate over time. Worth mentioning is that this also occurs in hematopoietic stem cells, a phenomenon referred to as clonal hematopoiesis,[15] potentially resulting in incorrect driver identifications.sWGS of cfDNA may therefore have clinical potential, especially for patients with lesions that are difficult to biopsy (e.g. those in brain; deep lymph nodes in the thoracic cavity or abdomen); or when dealing with ambiguous PET/CT scans, as both imaging and clinical symptoms of lymphoma are often non-specific. Nevertheless, this approach has only been superficially investigated.[16] We, therefore, evaluated whether sWGS could serve as a molecular test in addition to established diagnostic methods, using a diverse set of 123 prospectively recruited lymphoma patients, comprising baseline and longitudinal blood samples, supplemented with paired formalin-fixed paraffin-embedded (FFPE) biopsies.
Methods
Ethics statement
This study was approved by the institutional ethics committee at Ghent University Hospital (EC/2016/0307). Written informed consent was obtained from all patients.
Study population
Between January 2016 and November 2019, 123 lymphoma patients were recruited at Ghent University Hospital and AZ Delta Roeselare. The subtypes studied include 38 HL (3 nodular lymphocyte predominant HL; 23 nodular sclerosis classical HL [cHL]; 5 mixed cellularity cHL; 5 lymphocyte-rich cHL; 1 lymphocyte- depleted cHL; and 1 not otherwise specified [NOS]); 81 DLBCL (61 NOS; 4 Epstein-Barr virus [EBV]-positive; 10 primary mediastinal large B-cell lymphomas [PMBCL]; 2 high-grade Bcell lymphomas; 2 T-cell/histiocyte-rich large B-cell lymphomas; 1 intravascular large B-cell lymphoma; and 1 plasmablastic lymphoma) and four grey-zone lymphomas (GZL) (Online Supplementary File S2: Tables S1 and S2). Patients were included at the start, ‘baseline’, of a new line of therapy (i.e., baseline represents initial diagnosis for 115 cases; after ineffective treatment in 6 other cases; and at relapse for the 2 remaining cases). Although patient recruitment was random, the final cohort was enriched for HL patients.
Sample series
Liquid biopsies were collected at baseline for all patients (n=123) and a random selection of paired FFPE tissue was made (n=33); this was obtained from 15 HL and 18 DLBCL patients. Longitudinal liquid biopsies were taken following milestones in treatment with 93 sequenced for 31 patients. These were intentionally enriched for refractory or relapsed disease: interim evaluations after two and four cycles of ABVD in HL; interim evaluations after four cycles of R-CHOP in nHL; at restaging after ineffective treatment or relapse; following successful treatment; and every 6 months for patients in maintained complete remission (CR). For HL (n=9), six patients had obtained CR within a year of inclusion whereas three were refractory. For nHL (n=22), 11 patients reached CR within a year; six were refractory; and five relapsed.Negative liquid biopsies were included from non-invasive prenatal assays (n=60), supplemented with benign FFPE tissue (n=9) as a control for the solid biopsies. In total, 318 samples were thus analyzed.
Clinical and pathological characteristics
Patient demographics and clinical variables were available through routine clinical practice. The total metabolic tumor volume (MTV) was measured by PET/CT using AW workstation semi-automated segmentation software (GE medical systems, Waukesha, WI, USA), with a standard uptake value cutoff of 2.5.Pathology diagnosis included standard histological examination of FFPE biopsy material in combination with immunohistochemistry (IHC). Double/triple hit (MYC, BCL2 and/or BCL6) and double expressor (MYC and BCL2) samples were detected using fluorescence in situ hybridization and IHC, respectively. The germinal center B-cell (GCB) cell of origin (COO) status was assigned through the Hans algorithm.[17]When using chromogenic in situ hybridization (CISH), the presence of EBV-encoded RNA (EBER) was evaluated via INFORM EBER probes (Ventana Medical Systems, Tucson, AZ, USA). Selected liquid biopsies were additionally tested using the quantitative EBV-polymerase chain reaction (PCR) method, as established by Bordon et al.[18]
Sequencing and bioinformatic analysis
Each laboratory step and subsequent computational analysis can be found in Online Supplementary File S1: Supplemental methods. These include: copy number profiling;[19] defining the tumor burden using the copy number profile abnormality (CPA) score[20] and estimated tumor fraction;[21] methods used to derive viral read fractions of HIV-1, HIV-2, EBV, JC polyomavirus, human Tlymphotropic virus 1, human herpesvirus 8 and hepatitis C virus; random forest modeling[22] of copy number profiles to predict tumor subtype; detection of copy number driver peaks;[23] and general statistical testing.
Results
Shallow-depth sequencing of cell-free DNA enables accurate copy number profiling in lymphoma
For healthy individuals, copy number profiles are assumed to be approximately ‘flat’, implying that large alterations are absent. In a previous study,[20] in order to statistically distinguish cancerous from control profiles, we developed a score that increases with rising deviance from the naturally-occurring healthy background variance: the CPA score. Based on this variable, 95/123 baseline liquid biopsies were detected as cancerous (Figure 1A; Online Supplementary File S2: Table S3). As expected, the score reached significantly higher (P<0.001; Welch test) values in the lymphoma group compared to the control group. There was no significant difference (P=0.250; Fisher exact test) in the fraction of abnormal observations between HL (32/38; 84.2%) and DLBCL (60/81; 74.1%), nor when specifically considering cHL (31/35; 88.6%; P=0.091). At Ann Arbor stage I, only one of six liquid biopsies showed deviations whereas stage II to IV presented comparable detection rates (Online Supplementary File S1: Figure S1).In order to confirm whether the copy number profiles derived from blood samples represent those from tissue biopsies, we executed a concordance analysis using 33 liquid-solid pairs (Online Supplementary File 1: Liquid-solid pairs). One representative DLBCL case, patient 59, demonstrated very similar deviations in both samples (Figure 1B), and the Pearson correlation between this patient’s smoothened profiles amounted to 0.916. We can note, from left to right, four clusters distinguished in a corresponding scatter plot (Figure 1C): deletions, copy neutral bins, duplications and amplifications. It should be noted that the amplitudes of alterations, a concept defined as the absolute value of a segment’s log2 ratio,[19] are more extreme in liquid than in tissue biopsies (Figure 1B), caused by a greater ctDNA fraction in patient 59’s liquid biopsy. This is also indicated by a steeper identity line than least squares fit in the scatter plot (Figure 1C). Overall, when the CPA score in the liquid biopsy was abnormal, DLBCL pairs showed high concordance (r=0.815±0.043; 95% confidence interval). However, this was not the case for HL (r=0.260±0.055; 95% confidence interval) patients (Figure 1D).
Figure 1.
Feasibility of copy number profiling through sWGS of cfDNA. (A) CPA score comparison between common HL and DLBCL subtypes. Dots represent blood samples at baseline; default box plots indicate underlying distributions. The bottom grey box delineates the abnormality cutoff defined by the controls. According to this limit, the numbers at the top define which samples have abnormal copy number profiles. The overlapping histograms on the left display the overall CPA distribution. (B) Copy number profile comparison between the liquid and solid biopsy of patient 59. Dots are bins for which copy number is inferred whereas graph lines represent smoothened profiles. The ‘ratio’ equals the observed over the expected number of reads. (C) Scatter plot comparison of bins (grey dots) and smoothened values (solid black graph line) between the liquid and solid biopsy of patient 59. Identity line (black dotted) and least squares fit (red dotted), with corresponding Pearson correlation coefficient (r), are shown. (D) Scatter plot of all Pearson correlations between solid and liquid biopsy pairs, defined as in C, in relation to the CPA of the liquid biopsy. The grey box on the left matches the abnormality cutoff from A. Horizontal colored lines embedded in colored boxes represent means and their uncertainty (95% confidence interval), respectively. (E) Control-normalized amplitude comparison between the liquid and solid biopsies of HL and DLBCL patients. Samples are represented by dots; same-patient biopsies are connected; the underlying distribution is clarified by violin plots. P-values result from paired t-tests. sWGS: shallow whole-genome sequencing; cfDNA: cell-free DNA; CPA: copy number profile abnormality; HL: Hodgkin lymphoma; nHL: non-HL; DLBCL: diffuse large B-cell lymphoma; cHL: classical HL; ncHL: non-cHL; NLPHL: nodular lymphocyte predominant HL; NSCHL: nodular sclerosis cHL; MCCHL: mixed cellularity cHL; LRCHL: lymphocyte-rich cHL; GZL: grey zone lymphoma; NOS: not otherwise specified; GCB: germinal center B-cell; unc: unclassified; EBV: Epstein-Barr virus; PMBCL: primary mediastinal large B-cell lymphoma; HGBCL: high-grade B-cell lymphoma; THRLBCL: T-cell/histiocyte-rich large B-cell lymphoma; c, control.
Next, the amplitudes were compared between paired liquid and solid biopsies (Figure 1E). These numbers were averaged per sample and additionally normalized by subtracting the mean across the control samples (0.005 for liquid; 0.026 for solid), which results in a fairer comparison, as FFPE material tends to produce profiles with higher levels of noise,[24] naturally increasing the overall deviance. This variable was found to be higher in liquid than in solid biopsies for HL (P=0.010; paired ttest), but not for DLBCL (P=0.316; paired t-test). Worthy of note is that, within HL, the mean of the solid biopsies barely varied from zero, meaning that most of these profiles are hard to differentiate from normal controls, and, therefore, the low liquid-solid correlation observed in HL patients (Figure 1D), seems to be caused by a scarcity of tumor cells in the FFPE samples rather than a low ctDNA fraction. From a biological perspective, this is reasonable: neoplastic Hodgkin and Reed-Sternberg cells are always embedded in an inflammatory background of non-affected cells,[25] causing resemblance to healthy tissue.Feasibility of copy number profiling through sWGS of cfDNA. (A) CPA score comparison between common HL and DLBCL subtypes. Dots represent blood samples at baseline; default box plots indicate underlying distributions. The bottom grey box delineates the abnormality cutoff defined by the controls. According to this limit, the numbers at the top define which samples have abnormal copy number profiles. The overlapping histograms on the left display the overall CPA distribution. (B) Copy number profile comparison between the liquid and solid biopsy of patient 59. Dots are bins for which copy number is inferred whereas graph lines represent smoothened profiles. The ‘ratio’ equals the observed over the expected number of reads. (C) Scatter plot comparison of bins (grey dots) and smoothened values (solid black graph line) between the liquid and solid biopsy of patient 59. Identity line (black dotted) and least squares fit (red dotted), with corresponding Pearson correlation coefficient (r), are shown. (D) Scatter plot of all Pearson correlations between solid and liquid biopsy pairs, defined as in C, in relation to the CPA of the liquid biopsy. The grey box on the left matches the abnormality cutoff from A. Horizontal colored lines embedded in colored boxes represent means and their uncertainty (95% confidence interval), respectively. (E) Control-normalized amplitude comparison between the liquid and solid biopsies of HL and DLBCL patients. Samples are represented by dots; same-patient biopsies are connected; the underlying distribution is clarified by violin plots. P-values result from paired t-tests. sWGS: shallow whole-genome sequencing; cfDNA: cell-free DNA; CPA: copy number profile abnormality; HL: Hodgkin lymphoma; nHL: non-HL; DLBCL: diffuse large B-cell lymphoma; cHL: classical HL; ncHL: non-cHL; NLPHL: nodular lymphocyte predominant HL; NSCHL: nodular sclerosis cHL; MCCHL: mixed cellularity cHL; LRCHL: lymphocyte-rich cHL; GZL: grey zone lymphoma; NOS: not otherwise specified; GCB: germinal center B-cell; unc: unclassified; EBV: Epstein-Barr virus; PMBCL: primary mediastinal large B-cell lymphoma; HGBCL: high-grade B-cell lymphoma; THRLBCL: T-cell/histiocyte-rich large B-cell lymphoma; c, control.
Shallow-depth sequencing of cell-free DNA enables sensitive Epstein-Barr virus detection in lymphoma
Sequencing reads were mapped to seven different lymphoma- associated viruses. Of the lymphoma liquid biopsies, 39/216 had abnormally-elevated EBV read fractions compared to the negative controls (Online Supplementary File S2: Table S3); whereas one patient tested positive for JC polyomavirus (13 reads) at surveillance (patient 79). In accordance with clinical records, no other viruses were detected. Within control samples, no or exclusively low amounts of EBV reads were noted. The latter was the case for 15/60 liquid (read fraction range 0.032-0.059 parts per million [ppm]) and 6/9 solid controls (0.029-0.08 ppm).The EBV read fraction was evaluated for all baseline liquid biopsies, with 28/123 patients testing positive (Figure 2A). For GZL, 2/4 patients were positive, and, as expected, more HL samples were EBV-positive (15/38; 39.5%) than DLBCL samples (11/81; 13.6%) (P=0.004; Fisher exact test). All but one EBV-positive sample had an abnormal CPA score in addition (Figure 2B), which significantly associates the latter categorical variables (P=0.004; Fisher exact test).
Figure 2.
Feasibility of EBV detection through sWGS of cfDNA. (A) EBV read fraction comparison between common HL and DLBCL subtypes. Dots represent blood samples at baseline. The bottom grey box delineates the abnormality cutoff defined by the controls. According to this limit, the numbers at the top define which samples have abnormally-elevated EBV fractions. The overlapping histograms on the left display the overall EBV read fraction distribution. (B) Scatter plot between the CPA score and the EBV read fraction. Symbols represent blood samples at baseline; color shows whether CPA is abnormal; the + or – symbol is assigned in accordance to A, thereby specifying EBV positivity or negativity, respectively. Pie charts visualize contingency table; the P-value results from Fisher’s exact test. Swarm and box plots on the right-hand side indicate the EBV read fraction, grouped by CISH for EBER detection outcome. Dots represent blood samples; numbers at the top are assigned as in A. Colored dots are additionally tested by PCR in blood samples. (C) Control-normalized EBV comparison between the liquid and solid biopsies. Samples are represented by dots; same-patient biopsies are connected. Patients without elevated an EBV level in both their solid and liquid biopsy were omitted. The P-value results from a paired Wilcoxon rank-sum test. (D) ROC analyses executed using ranked viral-read fractions and three different gold standards: CISH, overridden by PCR; CISH only; solid biopsy EBV read fraction. EBV: Epstein-Barr virus; sWGS: shallow whole-genome sequencing; cfDNA: cell-free DNA; HL: Hodgkin lymphoma; nHL: non-HL; DLBCL: diffuse large B-cell lymphoma; CPA: copy number profile abnormality; CISH: chromogenic in situ hybridization; EBER: EBV-encoded RNA; PCR: polymerase chain reaction; cHL: classical HL; ncH:, non-cHL; ROC: receiver operating characteristic; AUC: area under the curve; NLPHL: nodular lymphocyte predominant HL; NSCHL: nodular sclerosis cHL; MCCHL: mixed cellularity cHL; LRCHL: lymphocyte-rich cHL; GZL: grey zone lymphoma; NOS: not otherwise specified; GCB: germinal center B-cell; unc: unclassified; PMBCL: primary mediastinal large B-cell lymphoma; HGBCL: high-grade B-cell lymphoma; THRLBCL: T-cell/histiocyte-rich large B-cell lymphoma; c: control.
Feasibility of EBV detection through sWGS of cfDNA. (A) EBV read fraction comparison between common HL and DLBCL subtypes. Dots represent blood samples at baseline. The bottom grey box delineates the abnormality cutoff defined by the controls. According to this limit, the numbers at the top define which samples have abnormally-elevated EBV fractions. The overlapping histograms on the left display the overall EBV read fraction distribution. (B) Scatter plot between the CPA score and the EBV read fraction. Symbols represent blood samples at baseline; color shows whether CPA is abnormal; the + or – symbol is assigned in accordance to A, thereby specifying EBV positivity or negativity, respectively. Pie charts visualize contingency table; the P-value results from Fisher’s exact test. Swarm and box plots on the right-hand side indicate the EBV read fraction, grouped by CISH for EBER detection outcome. Dots represent blood samples; numbers at the top are assigned as in A. Colored dots are additionally tested by PCR in blood samples. (C) Control-normalized EBV comparison between the liquid and solid biopsies. Samples are represented by dots; same-patient biopsies are connected. Patients without elevated an EBV level in both their solid and liquid biopsy were omitted. The P-value results from a paired Wilcoxon rank-sum test. (D) ROC analyses executed using ranked viral-read fractions and three different gold standards: CISH, overridden by PCR; CISH only; solid biopsy EBV read fraction. EBV: Epstein-Barr virus; sWGS: shallow whole-genome sequencing; cfDNA: cell-free DNA; HL: Hodgkin lymphoma; nHL: non-HL; DLBCL: diffuse large B-cell lymphoma; CPA: copy number profile abnormality; CISH: chromogenic in situ hybridization; EBER: EBV-encoded RNA; PCR: polymerase chain reaction; cHL: classical HL; ncH:, non-cHL; ROC: receiver operating characteristic; AUC: area under the curve; NLPHL: nodular lymphocyte predominant HL; NSCHL: nodular sclerosis cHL; MCCHL: mixed cellularity cHL; LRCHL: lymphocyte-rich cHL; GZL: grey zone lymphoma; NOS: not otherwise specified; GCB: germinal center B-cell; unc: unclassified; PMBCL: primary mediastinal large B-cell lymphoma; HGBCL: high-grade B-cell lymphoma; THRLBCL: T-cell/histiocyte-rich large B-cell lymphoma; c: control.The sensitivity and specificity of EBV detection through sWGS of cfDNA was established by a comparison with routine CISH for EBER detection results, performed on FFPE tissue. All 18 samples that tested positive for EBER, as well as 8/46 EBER-negative samples, had abnormally-increased EBV levels in plasma, achieving a sensitivity and specificity of 100% and 82.6%, respectively (Figure 2B; Online Supplementary File S2: Tables S3 and S4). The liquid biopsies with the highest viral read fractions of the EBER-negative cases did, however, accommodate increased levels of EBV fragments in blood, as confirmed by a subsequent PCR analysis (Figure 2B). This indicates that routine CISH might have inadequate sensitivity or that viral fragments may have been absent in the tumor cells studied. When considering the latter five as true positives, specificity increases to 93.5%.As for the copy number alteration amplitudes, liquid biopsies had higher (P=0.039; paired Wilcoxon signed-rank test) EBV read fractions than solid biopsies (Figure 2C). To conclude, three receiver operating characteristic (ROC) analyses (Figure 2D) were included to further demonstrate the performance of this EBV detection approach. Ranked plasma EBV viral read fractions and three different gold standards were used: CISH overridden by PCR (area under the curve [AUC]=0.994); CISH only (AUC=0.948); and solid biopsy reads (AUC=0.868).
Copy numbers derived from liquid biopsies can aid histological classification
When visualizing fractions of aberrant samples across genomic loci per histological subtype, both general (e.g., gains at 2p, holding REL) and subtype-specific fingerprints can be distinguished (Figure 3). Unsupervised clustering[26] applied to these summarizing profiles results in the anticipated histological hierarchy. Indeed, nHL and HL are separated as two definite entities, with the exception of GZL and PMBCL, which cluster alongside the HL group. Notwithstanding their non-Hodgkin histology, both have been described as being molecularly related to HL,[27,28] hence the structure of the dendrogram. GZL in particular demonstrates overlapping features with cHL and PMBCL.[28]
Figure 3.
Overview of aberrations detected using liquid biopsies across lymphoma subtypes. Colored waves represent fractions of aberrant samples across genomic loci. Patterns above the y=0 lines indicate gains whilst opposite contours represent losses (e.g., when the top of the wave is at 50%, 50% of samples have a gain at the corresponding locus; when the bottom of the wave is at 10%, 10% of samples have a loss at the corresponding locus). The dendrogram (left) results from hierarchical clustering applied to the Pearson distances (i.e., d = [1 – r]/2) between the means of these waves. Graph lines represent smoothened mean log2 ratios. The number of used baseline samples (n) is indicated on the right-hand side. Subtypes represented by fewer than three patients were excluded. DLBCL: diffuse large Bcell lymphoma; GCB: germinal center B-cell; unc: unclassified; EBV: Epstein-Barr virus; GZL: grey zone lymphoma; PMBCL: primary mediastinal large B-cell lymphoma; HL: Hodgkin lymphoma; cHL: classical HL; ncHL: non-cHL; NSCHL: nodular sclerosis cHL; MCCHL: mixed cellularity cHL; LRCHL: lymphocyte-rich cHL; NLPHL: nodular lymphocyte predominant HL.
To mimic how this translates as a potential clinical application for samples that are difficult to classify morphologically, or in cases in which obtaining tumor tissue is complicated, random forest predictive modeling, in combination with cross-validation, was applied to four differential diagnoses (Figure 4A): HL versus DLBCL; cHL versus PMBCL; DLBCL, NOS versus PMBCL; and within DLBCL, NOS, GCB versus non-GCB. For these analyses, obviously only baseline liquid biopsies were employed. Two ROC AUC can be discussed here: one for all samples and one disregarding flat copy number profiles. The latter statistic is considered in the next paragraph as it represents the performance of a rational application: profiles without aberrations will never be subjected to machine learning algorithms since predictions would be random.
Figure 4.
Feasibility of copy number profiles as a clinical lymphoma subtyping tool. (A) ROC analysis following out of bag random forest modeling to differentially diagnose HL versus DLBCL; cHL versus PMBCL; DLBCL, NOS versus PMBCL; and within DLBCL, NOS, GCB versus non-GCB. Exclusively, blood samples at baseline were included. ROC curves when omitting flat profiles (i.e., only CPA abnormal cases) are shown in addition. (B) The top plot is generated as in Figure 3. The black graph line shows the mean absolute difference between these profiles. The four horizontal grey bars are colored at significantly-enriched aberration peaks, assumed to hold driver genes, detected by GISTIC (gains on top, losses below; yellow for DLBCL, green for HL; emphasized by (inverted) ‘V’ symbols). The plot below visualizes the importance of regions for the differential diagnosis according to a HL vs. DLBCL random forest. (C) Similar to (B) but for the cHL versus PMBCL and DLBCL, NOS versus PMBCL classification problems. ROC: receiver operating characteristic; AUC: area under the curve; CPA: copy number profile abnormality; HL: Hodgkin lymphoma; cHL: classical HL; DLBCL: diffuse large B-cell lymphoma; PMBCL: primary mediastinal large B-cell lymphoma; NOS: not otherwise specified; GCB: germinal center B-cell.
Overview of aberrations detected using liquid biopsies across lymphoma subtypes. Colored waves represent fractions of aberrant samples across genomic loci. Patterns above the y=0 lines indicate gains whilst opposite contours represent losses (e.g., when the top of the wave is at 50%, 50% of samples have a gain at the corresponding locus; when the bottom of the wave is at 10%, 10% of samples have a loss at the corresponding locus). The dendrogram (left) results from hierarchical clustering applied to the Pearson distances (i.e., d = [1 – r]/2) between the means of these waves. Graph lines represent smoothened mean log2 ratios. The number of used baseline samples (n) is indicated on the right-hand side. Subtypes represented by fewer than three patients were excluded. DLBCL: diffuse large Bcell lymphoma; GCB: germinal center B-cell; unc: unclassified; EBV: Epstein-Barr virus; GZL: grey zone lymphoma; PMBCL: primary mediastinal large B-cell lymphoma; HL: Hodgkin lymphoma; cHL: classical HL; ncHL: non-cHL; NSCHL: nodular sclerosis cHL; MCCHL: mixed cellularity cHL; LRCHL: lymphocyte-rich cHL; NLPHL: nodular lymphocyte predominant HL.The differential diagnosis of HL and DLBCL, using computationally- processed copy number profiles, was found to be accurate (AUC of 0.967). Out of bag principles (i.e., cross-validation for bootstrapping methods, to overcome train/test bias) were applied to a set of 119 samples, meaning that, at each iteration, sufficient data were available for model training; and, biologically, HL and DLBCL express divergent profiles (Figure 3). PMBCL has features between those of cHL and DLBCL.[27] Nonetheless, copy number profiles can aid differential diagnoses, indicated by AUC values of 0.931 and 0.981, respectively. Finally, the GCB versus non-GCB COO status does not seem to trigger specific copy number alterations, as shown by the resulting AUC of 0.450. We could not test whether the activated B-cell versus GCB distinction by gene expression profiling[29] would perform better. This should be explored in future work.During training, learning algorithms assign higher weights to features that improve classification. For the HL versus DLBCL problem, the top three discriminative features - or loci, in our case - are located at 2p, 9p and 11q (Figure 4B, bottom). In addition, the GISTIC software aims to define driver genes in pattern peaks across a set of samples. Of note, there were three interesting peaks in these regions that were significantly enriched (Figure 4B, top), matching several anticipated genes: in both HL and DLBCL, 2p16.1 (REL) gains;[30,31] 9p24.1 (JAK2 and PD-L1) gains in HL;[32] 9p21.3 (CDKN2A) losses in DLBCL;[33] and to conclude, 11q23.3 (CEP164) gains in DLBCL[34] (Online Supplementary File S2: Table S5). Although 2p16.1 was found to be amplified, or duplicated, in both HL and DLBCL, the model gives importance to the 2p arm, as these aberrations are more frequently detected in HL compared to DLBCL (Figure 4B, top).[35]Finally, locus importance was examined for the two other well-performing models: to a certain extent, 5p, 9q, and chromosomes 14 and 19 allow separation of cHL and PMBCL, while more specific 7p and, once more, 9p24.1 (JAK2 and PD-L1)[ cytobands are decisive in differentiating DLBCL from PMBCL (Figure 4C).
Longitudinal liquid biopsies correlate with overall disease status
Serial blood samples were analyzed to observe copy number alteration dynamics over time and to study longitudinal changes in CPA and EBV read fractions. One example of an intra-patient evolution is demonstrated by patient 32, who had refractory DLBCL (Figure 5A). In this case, alterations disappeared during treatment (MTV on PET/CT decreased from 915 cm[3] at baseline to 4 cm[3] between both interim moments), and reappeared when progression was diagnosed after 212 days (MTV increased to 79 cm[3]). The smoothened profiles at initial staging and subsequent restaging at progression were very similar (r= 0.946; Figure 5B), notwithstanding the long time interval, suggesting that a portion of ‘remnant’ tumor had never been completely eliminated and that minimal residual disease had caused recidivism. Despite the significant correlation, evidence of tumor evolution was present, as indicated by several changes along the copy number alteration pattern, e.g.,on chromosome arms 2q and 6p; and on chromosome 14 Figure 5A). These zones are represented by irregularities in the scatter plot trend between the profiles (Figure 5B).
Figure 5.
Copy number profile dynamics and EBV read fractions across serial liquid biopsies. (A) Four smoothened profiles corresponding to longitudinal samples of refractory patient 32, colored in - ordered chronologically - black (diagnosis), green (response), yellow (response) and brown (progression). Arrow on top indicates relative point in time of each sample collection moment. The grey background represents the 95% confidence interval of healthy controls. (B) Scatter plot of smoothened copy number profiles between sample one and four from (A). Scatter symbols are tiny characters, representing chromosomes; for perceptibility, each chromosome character is uniquely colored. (C) Dot plot of pairwise Pearson correlations between copy number profiles at longitudinal staging moment. Patients are represented by solid (refractory) or dotted lines (relapsed). Each dot shows the correlation between two staging moments. Horizontal lines indicate means. (D) Swarm plot CPA score comparison of the most recent follow-up liquid biopsy between patients in CR and patients that are not. The grey box shows the abnormality cutoff. (E) Longitudinal EBV read fraction evaluation. Exclusively EBV-positive patients, according to sWGS data, are included. Same-patient samples are connected with lines. The timescale is relative, where 0 corresponds to baseline. The grey box shows the abnormality cutoff. Colors represent a patient’s current disease status. Thicker trend lines result from loess fitting. (F) The same as (E) but evaluates CPA score rather than EBV read fraction. All HL patients with followup are included. Swarm and box plots on the righthand side additionally indicate the CPA score distribution, grouped according to whether the samples were drawn at staging, interim or surveillance. (G) Same as (F), but for nHL patients. EBV: Epstein-Barr virus; CPA: copy number profile abnormality; CR: complete remission; sWGS: shallow whole-genome sequencing; HL: Hodgkin lymphoma; nHL: non-HL.
Feasibility of copy number profiles as a clinical lymphoma subtyping tool. (A) ROC analysis following out of bag random forest modeling to differentially diagnose HL versus DLBCL; cHL versus PMBCL; DLBCL, NOS versus PMBCL; and within DLBCL, NOS, GCB versus non-GCB. Exclusively, blood samples at baseline were included. ROC curves when omitting flat profiles (i.e., only CPA abnormal cases) are shown in addition. (B) The top plot is generated as in Figure 3. The black graph line shows the mean absolute difference between these profiles. The four horizontal grey bars are colored at significantly-enriched aberration peaks, assumed to hold driver genes, detected by GISTIC (gains on top, losses below; yellow for DLBCL, green for HL; emphasized by (inverted) ‘V’ symbols). The plot below visualizes the importance of regions for the differential diagnosis according to a HL vs. DLBCL random forest. (C) Similar to (B) but for the cHL versus PMBCL and DLBCL, NOS versus PMBCL classification problems. ROC: receiver operating characteristic; AUC: area under the curve; CPA: copy number profile abnormality; HL: Hodgkin lymphoma; cHL: classical HL; DLBCL: diffuse large B-cell lymphoma; PMBCL: primary mediastinal large B-cell lymphoma; NOS: not otherwise specified; GCB: germinal center B-cell.Copy number profile dynamics and EBV read fractions across serial liquid biopsies. (A) Four smoothened profiles corresponding to longitudinal samples of refractory patient 32, colored in - ordered chronologically - black (diagnosis), green (response), yellow (response) and brown (progression). Arrow on top indicates relative point in time of each sample collection moment. The grey background represents the 95% confidence interval of healthy controls. (B) Scatter plot of smoothened copy number profiles between sample one and four from (A). Scatter symbols are tiny characters, representing chromosomes; for perceptibility, each chromosome character is uniquely colored. (C) Dot plot of pairwise Pearson correlations between copy number profiles at longitudinal staging moment. Patients are represented by solid (refractory) or dotted lines (relapsed). Each dot shows the correlation between two staging moments. Horizontal lines indicate means. (D) Swarm plot CPA score comparison of the most recent follow-up liquid biopsy between patients in CR and patients that are not. The grey box shows the abnormality cutoff. (E) Longitudinal EBV read fraction evaluation. Exclusively EBV-positive patients, according to sWGS data, are included. Same-patient samples are connected with lines. The timescale is relative, where 0 corresponds to baseline. The grey box shows the abnormality cutoff. Colors represent a patient’s current disease status. Thicker trend lines result from loess fitting. (F) The same as (E) but evaluates CPA score rather than EBV read fraction. All HL patients with followup are included. Swarm and box plots on the righthand side additionally indicate the CPA score distribution, grouped according to whether the samples were drawn at staging, interim or surveillance. (G) Same as (F), but for nHL patients. EBV: Epstein-Barr virus; CPA: copy number profile abnormality; CR: complete remission; sWGS: shallow whole-genome sequencing; HL: Hodgkin lymphoma; nHL: non-HL.Overall, observations were consistent for refractory and relapsed patients: when detectable, copy number patterns were very similar across different longitudinal staging moments (Figure 5C). One exception was found in patient 73 (rightmost patient in Figure 5C), who appeared to relapse with a divergent copy number profile (Online Supplementary File S1: Longitudinal samples). This case concerned an aggressive EBV-positive plasmablastic lymphoma that is known to present complex karyotypic abnormalities and, presumably, an instable genome.[36]The most recent follow-up of all 17 patients in CR showed that none had remaining copy number alterations; in contrast, this was the case in only 5/14 patients in whom CR was thought not to be reached (Figure 5D). In total, five EBV-positive patients, according to sWGS of cfDNA, were detected at diagnosis of the 31 patients with follow-up. Of these, three went into CR and, consequently, no longer had abnormally elevated EBV levels; one refractory patient (patient 6) was noted to have recurrently high EBV levels and one relapsed patient (patient 73) was detected with an unusually high EBV read fraction, following an earlier drop to 0 ppm at first response evaluation (Figure 5E). It should be noted that, for this patient, no tumor was detected under PET/CT at the fourth time point (Online Supplementary File 2: Table S4), however, the steadily increasing EBV fraction anticipated actual relapse at the fifth time point, after which the MTV increased rapidly along with the plasma concentration of viral fragments (Online Supplementary File S1: Longitudinal samples).A similar analysis, concerning the CPA score, was performed for all patients with HL (Figure 5F) and nHL (Figure 5G) with follow-up. Similar conclusions were made according to the trend lines, which largely summarize the longitudinal analyses: a prompt drop for patients in CR; varying values for refractory patients; and a drop, followed by a newly-triggered rise, for relapsed patients. It is significant how the CPA decreased rapidly when treatment was initiated: interim samples had normal CPA scores unless they represented a refractory patient. Finally, 6/31 patients never transcended the CPA abnormality cutoff, meaning that these, probably, have insufficient ctDNA. In two cases (patients 66 and 71), however, the MTV reached extreme levels (>1000 cm[3]) while the CPA score stayed low (Online Supplementary File S1: Longitudinal samples). For these patients, it is more likely that copy number alterations really were absent.Of the above cases, additional clinical information from four interesting patients further illustrates the potential of shallow-depth sequencing for disease monitoring - e.g., when dealing with ambiguous PET/CT images or tumor and viral entities beyond initial suspicion. These are discussed as short case-reports in Online Supplementary File S1: Case reports.
Copy number anomalies detected in liquid biopsies correlate with clinical parameters
Five lymphoma-related clinical variables were included in an extensive concordance analysis of the baseline and longitudinal liquid biopsies: Ann Arbor stage; International Prognostic Index/Score; lactate dehydrogenase concentration; b2 microglobulin concentration; and MTV (Online Supplementary File S1: Figure S2). The estimated tumor fraction, according to copy number profiles, positively and significantly correlates with all of these clinical variables, rendering the former estimate relevant. The CPA score is intrinsically correlated to the tumor fraction, so similar associations as for the tumor fraction arise. Since all these variables are naturally cross-related, the derived relations are obviously not claimed to be independently significant.
Discussion
To date, molecular profiling of lymphoma has largely been focused on the characterization of actionable targets, such as point mutations and translocations. This profiling requires panel sequencing, currently using tissue biopsies as the DNA source and targeting a substantial number of genes. However, until now, the availability of this precise information has contributed little to actual progress in patient care, despite its tempting rationale.[37] In addition, applying these concepts in a minimally-invasive manner requires ultra-deep sequencing (~2000x),[38] of which the implementation in routine practice remains challenging.In this study, we evaluated sWGS of cfDNA for the diagnosis, differential diagnosis, and disease monitoring of HL and DLBCL. In contrast to targeted sequencing, this approach provides genomic insight by copy number profiling in an accessible and standardized manner. Indeed, noninvasive prenatal testing, which involves the same laboratory and computational steps, has evolved into an application performed daily in molecular diagnostic laboratories. The short turnaround time (approximately, 4 days) and lowprice tag (around $200, including processing costs) renders sWGS a fortiori feasible.Copy number profiling is not a surrogate for targeted sequencing, nor does it enable the characterization of phenotypic traits, such as CD20 expression. However, to a limited extent, it does enable the prediction of targetable genes, such as JAK2, PD-L1 and REL. Further, regarding tumor fraction and specificity, it has the advantage of easier and more stable interpretation: once a copy number alteration is visible, the presence of sufficient ctDNA is most likely ascertained given that large somatic copy number alterations are rarely detected in unaffected subjects.[14]For DLBCL, the sensitivity of copy number profiling is a shortcoming: only 74.1% of baseline liquid biopsies were found to have detectable aberrations, which is lower than that reported from targeted ultra-deep duplex sequencing studies (e.g., 98% in Kurtz et al.[39]). On the other hand, the sensitivity was surprisingly high for HL (84.2% for HL overall and 88.6% for cHL in this study; 88.9% in a smaller proof-of-concept study[16]) and compares to that of SNVbased studies (e.g., 81.2% in Spina et al.[11]). It ought to be noted that the lower limit of detection for CNA is often considered to be 3% tumor fraction.[21] It is worth mentioning that this limit could be lowered as paired-end sequencing at greater depths (1x-1.5x) is expected to become available for routine diagnosis at a similar cost. This would empower computational tumor-read enrichment by insertsize filtering, as shorter cfDNA fragments are more likely be tumor-derived, thus increasing sensitivity.[20,40,41]The signs and symptoms of aggressive lymphoma are often non-specific, having much in common with a broad range of different disorders, including autoimmune, immunodeficiency and systemic diseases, infections and other malignancies. Therefore, currently, accurate lymphoma diagnosis is essentially only possible through invasive tissue biopsy. However, affected sites are sometimes difficult to reach and tissue-based subtyping may be complicated when dealing with inconclusive or small biopsies, or partially-involved lymph nodes. Since several cases of differential diagnosis by computationally-processed copy number profiles have been found to be accurate, copy number profiling of cfDNA might enter clinical practice as a test for patients with: suspected lymphoma with a mediastinal mass (to guide the need for mediastinoscopy); deep-seated pathologic lymph nodes in the abdomen, especially following surgery; a remaining suspicion of lymphoma after a negative biopsy, caused by ambiguous PET/CT images; probable central nervous system lymphoma (to avoid radical surgery); probable intravascular localization;[42] and, generally, lymphoma in children (to avoid unnecessary surgery). For these indications, the approach presented here could be easily integrated with other tests in order to decide whether invasive procedures are still necessary.sWGS of cfDNA was found to be 100% sensitive for the detection of EBV. However, one fact that could preclude its application in lymphoma diagnosis is that all possible causes of elevated EBV levels (e.g., ongoing infectious mononucleosis) are detected simultaneously through blood samples irrespectively of whether these are related to lymphoma. CISH does not have this shortcoming and can be used to assess a neoplastic association. Nonetheless, the combination of an increased EBV level and typical HL or DLBCL alterations strongly indicates an EBV-driven lymphoma. This is especially relevant for the diagnosis of post-transplant or immunodeficiency-associated lymphoma. sWGS could be employed, in part, to detect EBV prior to performing a CISH test (around $140), when required. The latter test should only be performed when EBV is positive.Future in-house research will focus on the inclusion of other (rarer) lymphoma subtypes, including T-cell lymphomas, to obtain a broader image of the alterations’ specificity compared to subtype. Studies will also concentrate on novel methods to study cfDNA (epi)genetics, including cellfree reduced-representation bisulfite sequencing for methylation detection, which are likely to improve specificity. Since this study was concluded, new liquid biopsies have been analyzed by sWGS at our institutions, mostly in cases of diagnostic ambiguity. While negative results currently never exclude the presence of malignancies, positive results ascertain disease.
Authors: Peter Vandenberghe; Iwona Wlodarska; Thomas Tousseyn; Luc Dehaspe; Daan Dierickx; Magali Verheecke; Anne Uyttebroeck; Oliver Bechter; Michel Delforge; Vincent Vandecaveye; Nathalie Brison; Gregor E G Verhoef; Eric Legius; Frederic Amant; Joris R Vermeesch Journal: Lancet Haematol Date: 2015-01-20 Impact factor: 18.959
Authors: Franziska C Eberle; Jaime Rodriguez-Canales; Lai Wei; Jeffrey C Hanson; J Keith Killian; Hong-Wei Sun; Lisa G Adams; Stephen M Hewitt; Wyndham H Wilson; Stefania Pittaluga; Paul S Meltzer; Louis M Staudt; Michael R Emmert-Buck; Elaine S Jaffe Journal: Haematologica Date: 2011-04 Impact factor: 9.941
Authors: Florent Mouliere; Richard Mair; Dineika Chandrananda; Francesco Marass; Christopher G Smith; Jing Su; James Morris; Colin Watts; Kevin M Brindle; Nitzan Rosenfeld Journal: EMBO Mol Med Date: 2018-12 Impact factor: 12.137
Authors: Lennart Raman; Machteld Baetens; Matthias De Smet; Annelies Dheedene; Jo Van Dorpe; Björn Menten Journal: Prenat Diagn Date: 2019-07-11 Impact factor: 3.050
Authors: Lennart Raman; Malaïka Van der Linden; Kim Van der Eecken; Karim Vermaelen; Ingel Demedts; Veerle Surmont; Ulrike Himpe; Franceska Dedeurwaerdere; Liesbeth Ferdinande; Yolande Lievens; Kathleen Claes; Björn Menten; Jo Van Dorpe Journal: Genome Med Date: 2020-04-21 Impact factor: 11.117
Authors: Malaïka Van der Linden; Bram Van Gaever; Lennart Raman; Karim Vermaelen; Ingel Demedts; Veerle Surmont; Ulrike Himpe; Yolande Lievens; Liesbeth Ferdinande; Franceska Dedeurwaerdere; Joni Van der Meulen; Kathleen Claes; Björn Menten; Jo Van Dorpe Journal: Cancers (Basel) Date: 2022-04-18 Impact factor: 6.575