| Literature DB >> 34018157 |
Hunter R Underhill1,2,3.
Abstract
Circulating cell-free DNA (ccfDNA) has emerged as a promising diagnostic tool in oncology. Identification of tumour-derived ccfDNA (i.e. circulating tumour DNA [ctDNA]) provides non-invasive access to a malignancy's molecular landscape to diagnose, inform therapeutic strategies, and monitor treatment efficacy. Current applications of ccfDNA to detect somatic mutations, however, have been largely constrained to tumour-informed searches and identification of common mutations because of the interaction between ctDNA signal and next-generation sequencing (NGS) noise. Specifically, the low allele frequency of ctDNA associated with non-metastatic and early-stage lesions may be indistinguishable from artifacts that accrue during sample preparation and NGS. Thus, using ccfDNA to achieve non-invasive and personalized molecular profiling to optimize individual patient care is a highly sought goal that remains limited in clinical practice. There is growing evidence, however, that further advances in the field of ccfDNA diagnostics may be achieved by improving detection of somatic mutations through leveraging the inherently shorter fragment lengths of ctDNA compared to non-neoplastic ccfDNA. Here, the origins and rationale for seeking to improve the mutation-based detection of ctDNA by using ccfDNA size profiling are reviewed. Subsequently, in vitro and in silico methods to enrich for a target ccfDNA fragment length are detailed to identify current practices and provide perspective into the potential of using ccfDNA size profiling to impact clinical applications in oncology.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34018157 PMCID: PMC8249304 DOI: 10.1007/s40291-021-00534-6
Source DB: PubMed Journal: Mol Diagn Ther ISSN: 1177-1062 Impact factor: 4.074
Fig. 1Acquisition and characteristics of circulating cell-free DNA (ccfDNA). Whole blood acquired through venipuncture is centrifuged to separate plasma from buffy coat and erythrocytes (a). ccfDNA derived from apoptosis is present in plasma as various multiples of nucleosomes—DNA wrapped around a histone core with a linker fragment of DNA (~10 bp) joining adjacent nucleosomes (a). The relative quantity and fragment length distribution of ccfDNA is shown in (b), where the most abundant fragment length corresponds to the length of the mononucleosome. The fragment lengths of circulating tumour DNA (ctDNA) tend to be shorter than ccfDNA (c); however, there is substantial overlap. Enrichment of ctDNA has generally focused on isolation of fragment lengths < 150 bp to improve the ratio between ctDNA and ccfDNA
Fig. 2Profile of publications from 2016 to 2020 using NGS for mutation-based ctDNA detection. A PubMed querya was used to initiate a search in each year for publications with a total sample size ≥ 10. Publications that sought to detect ctDNA associated with minimal residual disease were excluded. The list is first grouped by year (Yr), then disease severity, and finally by whether or not the mutation-based search for ctDNA was tumour informed. The reference (Ref) column identifies the citation. When feasible, data associated with different disease severities are presented separately and the reference number is non-black to support matching of data from the same source. In one study (Ref. [112]), data obtained with and without a tumour-informed search were merged and separation was not possible (marked ‘Both’). Sensitivity (Sens) represents a study value corresponding to the mutation-based detection of ctDNA to determine presence/absence of a malignancy. The associated sample size is presented in the adjacent column (Patients). Specificity (Spec) is reported only if obtained from healthy control data. Sensitivity and specificity values should be interpreted cautiously as calculations can vary substantially within and between publications. For example, in Ref. [131] the authors report detection of ctDNA in 55% of patients. However, no mutations identified in solid tumour DNA were present in ccfDNA, so sensitivity is also shown at 0% for a tumour-informed search. To gain adequate contextual understanding of values, reviewing the publication’s supplemental data may be necessary. ccfDNA circulating cell-free DNA, ctDNA circulating tumour DNA, mixed = cancers from different organs, NB neuroblastoma. *Specificity not calculated, controls used to determine error rate (< 3.3 × 10−7 false positive mutation calls per base); †controls used for error modelling; **specificity not calculated, median error rate of 0.03 non-silent single nucleotide variants per Mb; ††specificity not calculated, 307 of 342 targeted positions error free. aPubMed query: (cell-free DNA[Title]) AND (cancer) AND (("YEAR/01/01"[Date - Publication] : "YEAR/12/31"[Date - Publication]) AND (next-generation sequencing) AND (circulating))
Fig. 3Characteristics of publications from 2016 to 2020 using NGS for mutation-based ctDNA detection. In a, the violin plots show the sensitivity of mutation-based ctDNA detection relative to disease severity. In general, there was a trend towards increased detection with more severe disease. In b, sensitivity is relatively similar across years likely because more difficult to detect tumours are being included in later years, which may adversely affect sensitivity but also indicates detection strategies are working towards inclusion of more challenging cancers. In c, the total number of publications profiled for each year from Fig. 2 is shown (dark gray) and the number of those publications that include early/stage I–II disease is also depicted (light gray). In more recent years, a larger proportion of publications are seeking to detect ctDNA in early-stage and non-metastatic cancers. ctDNA circulating tumour DNA, NGS next-generation sequencing
In vitro size selection for ctDNA detection
| Publication | Methods | Cancer (sample size) | Baseline detection characteristics | Results |
|---|---|---|---|---|
| Underhill et al., 2016 [ | PAGE with manual excision to extract six fractions from the mononucleosome after library generation and PCR amplification ddPCR used to measure MAF | Lung ( | 1.1- to 9.1-fold enrichment of The sample with a MAF of 0.33% showed strongest enrichment at 9.1-fold | |
| Mouliere et al., 2018 [ | 3.0% agarose cassette targeting a single ccfDNA fraction (90–150 bp) prior to library generation and PCR amplification sWGS to calculate tMAD scores | HGSOC ( Healthy controls ( | Median tMAD score of 0.015 in patients (mean of 0.05; range 0.005–0.30) Median tMAD score of 0.01 in controls (range 0.004–0.015) 50% of patients had an increased tMAD score compared to controls | 0.9- to 6.4-fold enrichment of tMAD scores (mean of 2.5-fold and median of 2.1-fold enrichment) After size selection in 48 patient samples and 18 control samples, 82% of samples had an increased tMAD score compared to controls tMAD scores increased ≥ 2-fold after size selection in 54.3% (19/35) and 22.2% (4/18) of pre-treatment patients and controls, respectively |
3.0% agarose cassette targeting a single ccfDNA fraction (90–150 bp) prior to library generation and PCR amplification WES for point mutations | HGSOC ( | 821 mutations identified in 6 patients, with mean MAF of 12.0% (median of 9.5%, range 0.53–100%) | Mean increase in MAF of 4.19-fold (median 4.27-fold increase) In 5 of 6 patients, size selection identified an additional 171 mutations with mean MAF of 27.9% (median of 25%, range 8.3–75%) | |
| HGSOC ( | 202 mutations identified in 6 patients, with mean MAF of 8.1% (median of 6.5%, range 1.2–47.6%) | Mean increase of 2.1-fold (median 1.5-fold, range 0.9- to 11-fold) In 4 of 6 patients, size selection identified an additional 89 mutations, with mean MAF of 39.3% (median 37.5%, range 15.4–100%) | ||
| Hellwig et al., 2018 [ | Automated liquid handler using a 3.0% agarose matrix in a 12-channel cassette to extract 3 fractions after library generation and PCR amplification ddPCR Capture-enrichment panel | Melanoma ( | All samples were positive for point mutations in | At a median insert size of 141 bp, MAFs were enriched on average by 2.9-fold (median 2.5-fold, range 0.2–10.3) as measured by ddPCR At a median insert size of 141 bp, MAFs were enriched on average by 2.0-fold (median of 2.0-fold, range 0.8–3.3) as measured by NGS At median insert size of 167 bp, MAFs were reduced by a median of 0.8 and 0.7-fold using ddPCR and NGS, respectively MAF measured by ddPCR and NGS were strongly correlated (Pearson’s |
| Ishida et al., 2020 [ | SPRIselect beads to obtain fractionated small ccfDNA (100–400 bp) dPCR Hotspot amplicon-based sequencing in 50 genes | Colorectal cancer ( | Size selection increased mean MAFs of driver genes by dPCR from 6.8% to 10.7% Size selection increased mean MAFs by NGS from 16.3% to 18.8% For MAFs > 1%, NGS detected a higher average number of mutations in size-selected small ccfDNA compared to unselected ccfDNA (1.8 vs. 1.0 per case, respectively) |
ccfDNA circulating cell-free DNA, ctDNA circulating tumour DNA, dPCR digital PCR, ddPCR droplet dPCR, HGSOC high-grade serous ovarian carcinoma, MAF mutant allele frequency, NGS next-generation sequencing, PAGE polyacrylamide gel electrophoresis, PCR polymerase chain reaction, sWGS shallow whole genome sequencing, tMAD trimmed median absolute deviation from copy number neutrality, WES whole exome sequencing
In silico size-based filtering of insert size for ctDNA detection
| Publication | Methods | Cancer (sample size) | Baseline detection characteristics | Results |
|---|---|---|---|---|
| Mouliere et al., 2018 [ | sWGS to calculate tMAD scores Size selection for inserts with lengths of 90–150 bp | “High ctDNA” cancers from melanoma, ovarian, lung, colorectal, cholangiocarcinoma, and other ( Healthy controls ( | AUC = 0.69 | After size selection, AUC increase to 0.90 |
WES for point mutations Size selection for inserts with lengths of 90–150 bp | HGSOC ( | 821 mutations identified in 6 patients, with mean MAF of 12.0% (median of 9.5%, range 0.53–100%) | Mean increase in MAF of 2.2-fold (median 2.25-fold increase) In 6 of 6 patients, size selection identified an additional 188 mutations, with mean MAF of 21.5% (median of 16.9%, range 3.0–88.9%) | |
| HGSOC ( | 202 mutations identified in 6 patients, with mean MAF of 8.1% (median of 6.5%, range 1.2–47.6%) | In 6 of 6 patients, size selection identified an additional 122 mutations, with mean MAF of 30.3% (median of 25.0%, range 7.3–85.7%) | ||
| Colorectal, cholangiocarcinoma pancreatic, and prostate ( | 2133 mutations in plasma with matched mutations in tumour DNA (MAF range ~1% to ~70%) | Size selection increased mean MAF by 1.7-fold in 97% of mutations In 13 of 16 patients, size selection identified additional mutations | ||
| Smith et al., 2020 [ | sWGS to detect SCNAs based on tMAD score Size selection for inserts with lengths of 90–150 bp | Renal tumours (benign to metastatic; | SCNA detected in 4 of 48 (6.3%) samples | After size selection, 41 of 48 samples met criteria for tMAD analysis (> 2 million reads) Average tMAD score increased 2.2-fold (range 1.25–4.83) SCNA-based ctDNA detected in 8 additional patients (11/48, 22.9%) |
| RCC, | SCNA detected in 8 of 43 samples (18.6%), with median MAF of 7% (range 4–17%) | SCNA detection increased to 14 of 43 samples (32.6%) MAF increased by a mean of 2.2-fold (range 0.9–5.7) to a median of 8% (range 4–23%) | ||
| Nygard et al., 2020 [ | sWGS to detect SCNAs based on tMAD score Size selection for inserts with lengths of 90–150 bp | Inoperable, stage III NSCLC ( | SCNA detection in 5 (22%) samples from 3 of 6 patients | SCNA detection increased to 16 of 23 samples (70%) from 6 of 6 patients |
AUC area under the curve, ctDNA circulating tumour DNA, HGSOC high-grade serous ovarian carcinoma, MAF mutant allele frequency, NSCLC non-small cell lung cancer, RCC renal cell carcinoma, SCNA somatic copy number alteration, sWGS shallow whole genome sequencing, tMAD trimmed median absolute deviation from copy number neutrality, WES whole exome sequencing
In silico size-based weighting of potential somatic mutations for ctDNA and tumour detection
| Publication | Methods | Cancer (sample size) | Results |
|---|---|---|---|
| Mouliere et al., 2018 [ | sWGS RF algorithm that included proportion of fragments in defined size ranges and tMAD score | “High ctDNA” cancers from melanoma, ovarian, lung, colorectal, cholangiocarcinoma, and other ( “Low ctDNA” cancers from renal, brain, bladder, and pancreas ( Healthy controls ( | In high ctDNA cancers, an RF model yielded an AUC of 0.994 for distinguishing cancer patients from controls In low ctDNA cancers, an RF model yielded an AUC of 0.914 for distinguishing cancer patients from controls An RF model using fragmentation features alone (leaving out tMAD score) yielded AUCs of 0.989 and 0.891 for cancer types with a high and low amount of ctDNA, respectively |
| Wan et al., 2020 [ | Tumour-informed, patient-specific, custom-capture panels INVAR weights mutant reads across all patient-specific mutation loci based on the empirical distribution of mutant fragments in all other samples in the cohort being studied to give a size range enriched in cancer greater weight | Stage II–III melanoma after complete resection ( Stage IV melanoma ( | ctDNA detected in 11 stage II–III patients (28.9%, specificity at 98.6%), and the integrated MAF in 9 of the 11 patients was below the 95% LOD for a “perfect” single-locus assay based on ccfDNA input amount (AUC = 0.64) ctDNA detected in the baseline samples of 9 stage IV patients (100%) ctDNA detected in 50 of 52 treatment and follow-up samples from 9 stage IV patients where the integrated MAF in 15 of the 50 samples was below the 95% LOD for a “perfect” single-locus assay based on ccfDNA input amount |
Stage I–IV breast cancer ( NSCLC ( Renal tumours ( Brain tumours ( | 16 samples from stage I–II breast cancer, sensitivity of 62.5% at specificity of 90% (AUC = 0.81) 19 samples from stage IV breast cancer, sensitivity of 100% at specificity of 100% (AUC = 1.00) 8 patients with grade II–IV brain tumours, sensitivity of 75% at specificity of 90% (AUC = 0.92) 24 patients with stage I–IV renal tumours, sensitivity of 41.7% at specificity of 90% (AUC = 0.66) 19 patients with stage I–III NSCLC, sensitivity of 63.1% at specificity of 98% (AUC = 0.80) | ||
| Smith et al., 2020 [ | Custom-panel with patient-specific mutations detected in tumour DNA by WES and 109 genes commonly mutated in RCC INVAR | Renal tumours (benign to metastatic; | ctDNA was detected in 12 patients (54.5%) |
Custom-panel targeting 10 genes in renal cancers INVAR | RCC ( | ctDNA was detected in 8 patients (18.6%) Mean MAF was 8.3% (range 3.5–18%) | |
| Chabon et al., 2020 [ | Tumour-informed search using a 355-kb panel of 255 genes recurrently mutated in NSCLC Lung-CLiP – a multi-tiered machine-learning approach that includes fragment size to estimate the probability that a ccfDNA mutation is tumour derived | Stage I–II NSCLC ( Stage III NSCLC ( Risk-matched controls ( | For stage I–II patients: Sensitivity of ~30% at 98% specificity Sensitivity of ~56% at 80% specificity For stage III patients: Sensitivity of ~60% at 98% specificity Sensitivity of ~100% at 80% specificity |
AUC area under the curve, ccfDNA circulating cell-free DNA, ctDNA circulating tumour DNA, INVAR INtegration of VAriant Reads, LOD limit of detection, Lung-CLiP lung cancer likelihood in plasma, MAF mutant allele frequency, NSCLC non-small cell lung cancer, RCC renal cell carcinoma, RF random forest, sWGS shallow whole genome sequencing, tMAD trimmed median absolute deviation from copy number neutrality, WES whole exome sequencing
| Application of next-generation sequencing for the detection of somatic mutations in circulating cell-free DNA derived from solid tumour malignancies seeks to revolutionize precision medicine by using a simple blood draw to detect cancers, monitor response to therapies, and personalize treatment strategies. |
| Advancing cell-free DNA diagnostics to early-stage and non-metastatic cancers has been limited by challenges associated with distinguishing the true signal of low-frequency tumour-derived cell-free DNA from noise generated during next-generation sequencing. |
| Differences in the fragment length between tumour-derived cell-free DNA (circulating tumour DNA [ctDNA]) and cell-free DNA originating from healthy cells is a biologic phenomenon that can be leveraged to improve detection of ctDNA. |
| Both laboratory-based (in vitro) and computer-based (in silico) methods are being developed to use the fragment length profile of ctDNA to improve cell-free DNA diagnostics in cancer. |
| Further investigations into using size-based analyses in difficult to detect cancers are necessary to expand the role of cell-free DNA in non-invasive applications of precision oncology. |