| Literature DB >> 36010184 |
Angela Oberhofer1, Abel J Bronkhorst1, Carsten Uhlig1, Vida Ungerer1, Stefan Holdenrieder1.
Abstract
All cell and tissue types constantly release DNA fragments into human body fluids by various mechanisms including programmed cell death, accidental cell degradation and active extrusion. Particularly, cell-free DNA (cfDNA) in plasma or serum has been utilized for minimally invasive molecular diagnostics. Disease onset or pathological conditions that lead to increased cell death alter the contribution of different tissues to the total pool of cfDNA. Because cfDNA molecules retain cell-type specific epigenetic features, it is possible to infer tissue-of-origin from epigenetic characteristics. Recent research efforts demonstrated that analysis of, e.g., methylation patterns, nucleosome occupancy, and fragmentomics determined the cell- or tissue-of-origin of individual cfDNA molecules. This novel tissue-of origin-analysis enables to estimate the contributions of different tissues to the total cfDNA pool in body fluids and find tissues with increased cell death (pathologic condition), expanding the portfolio of liquid biopsies towards a wide range of pathologies and early diagnosis. In this review, we summarize the currently available tissue-of-origin approaches and point out the next steps towards clinical implementation.Entities:
Keywords: cell-free DNA; epigenetics; liquid biopsy; tissue-of-origin
Year: 2022 PMID: 36010184 PMCID: PMC9406971 DOI: 10.3390/diagnostics12081834
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Tissue-of-origin analysis of cell-free DNA. (a) Different organs and various cell types release cell-free DNA (cfDNA) into blood plasma. (b) This clinical biospecimen represents a highly heterogeneous mixture of cfDNA molecules, often complicating the analytical differentiation between different cfDNA subtypes. (c) Multiple different epigenetic characteristics can be employed for tissue-of-origin analysis such as unique methylation patterns, fragmentation profiles and fragment end-points, transcription-factor binding sites occupancy, nucleosome positioning, as well as post-translational histone modifications. (d) Analysis of these features poses an analytical challenge, but various approaches developed recently enable determination of tissue-of-origin of individual cfDNA molecules, facilitating localization of tumors or tissue damage in specific regions such as the heart or liver.
Overview of key studies that performed tissue-of-origin analysis on cfDNA using various approaches based on methylation patterns, nucleosome positioning patterns, TFBS occupancy, histone modifications, and fragmentomics. The data literature search was performed with the PubMed NCBI database. All deconvolution methods listed in this table are reference-based and employ a classification, unless stated otherwise.
| Epigenetic Feature | Method | Approach | Disease | Deconvolution Method | References |
|---|---|---|---|---|---|
|
| CpG islands analysis | WGBS | HCC, NIPT, Transplant | QP | [ |
|
| Analysis of adjacent CpG sites | Bisulfite amplicon-seq | PDAC, CRC, Diabetes, Transplant, MS, TBI, IBD | Read-specific binary classification | [ |
|
| Methylation haplotype block analysis | scRRBS, WGBS | CRC, LCP | QP, Random forest, feature extension “haplotype blocks” | [ |
|
| Analysis of differentially methylated regions (DMRs) + ctDNA abundance | CfMeDIP-seq | PDAC, AML, lung and breast cancer, CRC, RCC, bladder cancer | Limma, binomial GLM (GLMnet) | [ |
|
| Cell-type methylation atlas | Microarray | Sepsis, islet transplantation, CRC, lung, breast, prostate cancer, CUP | NNLS | [ |
|
| CancerDetector | Microarray, WGBS | Liver cancer | Maximizing log-likelihood model (grid search) | [ |
|
| Cell-type methylation atlas | Deep WGBS | COVID-19 | dynamically programmed probabilistic Bayes model, NNLS, wgbstools | [ |
|
| MCED test validation | Bisulfite amplicon-seq | 12 cancer types | Ensemble logistic regression, | [ |
|
| Windowed protection score (L-WPS/S-WPS) analysis of long/short fragments | Deep WGS | Small-cell lung cancer, squamous cell lung cancer, colorectal adenocarcinoma, HCC, ductal carcinoma in situ breast cancer | Windowed approach, | [ |
|
| Nucleosome-depleted region (NDR) analysis | WGS | Breast cancer | ABSOLUTE [ | [ |
|
| Accessibility score analysis | sWGS | Prostate adenocarcinoma, breast cancer, colon adenocarcinoma | Logistic regression | [ |
|
| Analysis of activating histone modifications | cfChIP-seq | Colorectal carcinoma, diverse liver diseases, AMI | Robust linear regression (rlm R-package) | [ |
|
| Fragment size distribution analysis | sWGS + in vitro size selection | High-grade serous ovarian cancer | Logistic regression, | [ |
|
| Orientation-aware plasma DNA fragmentation analysis (OCF) | WGS | Pregnancy, transplant, HCC, CRC, lung cancer | Nucleosome-depletion signal used to calculate OCF value, | [ |
|
| DNA evaluation of fragments for early interception (DELFI) | sWGS + genome-wide fragmentation pattern analysis | Breast, colorectal, lung, ovarian, pancreatic, gastric, and bile duct cancer | Multi-classifying approach, | [ |
|
| Motif diversity score (MDS) analysis using an adopted normalized Shannon entropy | WGS | HCC, pregnancy, liver transplantation, CRC, lung cancer; head and neck squamous cell, and nasopharyngeal carcinoma | SVM, logistic regression | [ |
|
| Promoter fragmentation entropy (PFE) analysis using a modified Shannon index | Epigenetic expression inference from cfDNA-seq (EPIC-seq) | Non-small-cell lung cancer, diffuse large B cell lymphoma | Dirichlet-multinomial model, | [ |
Abbreviations: TFBS: transcription factor binding site; HCC: hepatocellular carcinoma; NIPT: non-invasive prenatal testing; PDAC: pancreatic ductal adenocarcinoma; CRC: colorectal cancer; MS: multiple sclerosis; TBI: traumatic brain injury; IBD: inflammatory bowel disease; LCP: lung cancer primary; AML: acute myeloid leukemia; RCC: renal cell carcinoma; AMI: acute myocardial infarction; MCEP: multi-cancer early prediction; WGBS: whole genome bisulfite sequencing; scRRBS: single-cell reduced representation bisulfte sequencing; cfMeDIP-seq: cell-free methylated DNA immunoprecipitation and sequencing; cfChIP-seq: cell-free chromatin immunoprecipitation and sequencing; GLM: generalized linear model; NNLS: non-negative least squares; QP: quadratic programming; SVM: support vector machine.