| Literature DB >> 35121756 |
Havell Markus1,2,3, Dineika Chandrananda1,2, Elizabeth Moore1,2, Florent Mouliere1,2,4, James Morris1,2, James D Brenton1,2, Christopher G Smith1,2, Nitzan Rosenfeld5,6.
Abstract
Circulating tumor DNA (ctDNA) in blood plasma is present at very low concentrations compared to cell-free DNA (cfDNA) of non-tumor origin. To enhance ctDNA detection, recent studies have been focused on understanding the non-random fragmentation pattern of cfDNA. These studies have investigated fragment sizes, genomic position of fragment end points, and fragment end motifs. Although these features have been described and shown to be aberrant in cancer patients, there is a lack of understanding of how the individual and integrated analysis of these features enrich ctDNA fraction and enhance ctDNA detection. Using whole genome sequencing and copy number analysis of plasma samples from 5 high grade serious ovarian cancer patients, we observed that (1) ctDNA is enriched not only in fragments shorter than mono-nucleosomes (~ 167 bp), but also in those shorter than di-nucleosomes (~ 240-330 bp) (28-159% enrichment). (2) fragments that start and end at the border or within the nucleosome core are enriched in ctDNA (5-46% enrichment). (3) certain DNA motifs conserved in regions 10 bp up- and down- stream of fragment ends (i.e. cleavage sites) could be used to detect tumor-derived fragments (10-44% enrichment). We further show that the integrated analysis of these three features resulted in a higher enrichment of ctDNA when compared to using fragment size alone (additional 7-25% enrichment after fragment size selection). We believe these genome wide features, which are independent of genetic mutational changes, could allow new ways to analyze and interpret cfDNA data, as significant aberrations of these features from a healthy state could improve its utility as a diagnostic biomarker.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35121756 PMCID: PMC8816939 DOI: 10.1038/s41598-022-05606-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1(A) In this study we performed paired-end high coverage whole genome sequencing on plasma samples from 5 high grade serous ovarian cancer patients and (B) paired-end shallow whole genome sequencing (sWGS) on 49 healthy plasma controls. We created a control panel by merging reads across all healthy samples (C) Three features were calculated for each plasma DNA fragment: fragment size, relative position of fragment start and end sites with respect to the closest nucleosome dyad, and a nucleotide frequency score based on the 10 bp region spanning either side of fragment start and end. For both the control panel and the patient high-coverage datasets, fragments were selected based on these features and repeated random down-sampling was carried out in order to create 10 technical replicates (0.1 × coverage, 2.2 million reads each). These shallow whole-genome datasets underwent copy number analysis and the cancer-cell fraction (CCF) was calculated using ichorCNA. This allowed us to determine features that are enriched in circulating tumor DNA versus circulating cell-free DNA.
Figure 2Fragment size analysis. (A) Fragment size distribution of 5 HGSOC patients and panel of healthy controls. The vertical dashed lines are placed on the fragment sizes between 52 and 172 bp where 10 bp periodicity is observed. The vertical lines at 240 and 324 bp show the range at which a shift in the di-nucleosomal peak occurs between HGSOC patients and healthy controls. The inset plot enlarges the density profile in the di-nucleosomal fragment length range. (B) Copy number aberration analysis of all technical replicates comprising of different fragment length ranges as indicated by figure legend. The cancer cell fraction (CCF) was measured for each dataset with and without fragment selection. The relative CCF on the y-axis is the ratio of these two estimates. Above each plot, the TP53 MAF, as determined by TAm-Seq is shown for each patient (C) The average relative copy number values from the 10 replicates of patient 2. The different colors indicate different size selection ranges as well as no size selection. Corresponding data from the other patients are shown in supplementary Fig. 3.
Figure 3Location of DNA fragments relative to nucleosomal dyads. (A) The density of fragment start and end sites with respect to nucleosome dyads for patients and controls. The vertical lines are drawn at 75 bp downstream and upstream of the nucleosome center to signify the region of nucleosome core and linker. (B) Copy number aberration quantification of all technical replicates containing fragments starting and ending within nucleosome core or linker regions. (C) The average relative copy number values from the 10 replicates of patient 2. The different colors indicate results from using fragments that start and end within nucleosome core region, and fragments that start and end within nucleosome linker region as well as all fragments with no selection.
Figure 4Tri-nucleotide motif score. (A) Copy number aberration analysis of all technical replicates containing fragments with tri-nucleotide motif scores less than or greater than − 0.3, − 0.15, 0, 0.15, and 0.3. The CCF of all the random samples is shown relative to median CCF from unselected data. (B) The average relative copy number values from the 10 replicates of patient 2. The different colors indicate results from using fragments with no selection, fragments with tri-nucleotide motif score less than or equal to − 0.3, − 0.15, 0, and greater than or equal to 0, 0.15, 0.3. (C) The fragment size distribution in the control pool and HGSOC patient 5 of fragments with no selection, fragments with tri-nucleotide motif score less than or equal to -0.3 and greater than or equal to 0.3.
Figure 5Analysis of fragments selected for multiple features. (A) Copy number aberration analysis of all technical replicates containing fragments with different lengths and motif score combinations. (B) Aggregated TP53 mutant allele fraction measurements after selecting fragments with various features. The fragments from Patient 2, Patient 3, Patient 4, and Patient 5 were aggregated as these samples had a single point mutation while Patient 1 had an insertion. (C) The aggregated TP53 mutant allele fraction is shown relative to the aggregated MAF with no-selection for different fragment feature selections.