| Literature DB >> 35109903 |
Tanjina Kader1,2, Magnus Zethoven1, Kylie L Gorringe3,4.
Abstract
Clonal analysis of tumour sequencing data enables the evaluation of the relationship of histologically distinct synchronous lesions, such as co-existing benign areas, and temporally distinct tumours, such as primary-recurrence comparisons. In this review, we summarise statistical approaches that are commonly employed to define tumour clonal relatedness using data from bulk DNA technologies. We discuss approaches using total copy number, allele-specific copy number and mutation data, and the relative genomic resolution required for analysis and summarise some of the current tools for inferring clonal relationships. We argue that the impact of the biological context is critical in selecting any particular approach, such as the relative genomic complexity of the lesions being compared, and we recommend considering this context before employing any method to a new dataset.Entities:
Mesh:
Year: 2022 PMID: 35109903 PMCID: PMC8809045 DOI: 10.1186/s13059-022-02600-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Summary of clonality approaches illustrating their features in relation to the type of genetic event. Clonality methods are placed horizontally relative to the minimal assay type that can be used and vertically by the relative confidence each method provides in the clonal relationship. CN, copy number; SV, structural variation; BAF, B-allele frequency and the assay method; LC-WGS, low coverage whole genome sequencing; CGH, comparative genomic hybridisation; SNP, single nucleotide polymorphism; WES, whole exome sequencing; WGS, whole genome sequencing
Summary of the types of technologies used in clonality analysis
| Methods | Depth/resolution | CNA | BAF | Variant detection | Subclonal reconstruction | Comment |
|---|---|---|---|---|---|---|
| CGH arrays | Low: 5–10 Mb | ✓ | Suitable for clonality analysis in tumours with high SCNA | |||
| Low coverage WGS | Low: <4× | ✓ | ||||
| SNP arrays | 300 K–1 M SNPs | ✓ | ✓ | Suitable for clonality analysis in tumours with high and low SCNA | ||
| Targeted sequencing panel | High depth (>200×) but only on target regions | ✓ | ✓ | ✓ | SCNA called from off-target sequence suited to clonality analysis but BAF may be unreliable with so few genes; phylogenetic analysis is possible with ultra-deep sequencing (>500×) using TRONCO but limited power due to selective genomic markers [ | |
| Low coverage WES | 30–60× | ✓ | ✓ | ✓ | Read depth might be too low for subclonal analyses but suited to clonality analyses of paired/multi-region samples. Needs matching normal DNA for maximum power. | |
| High coverage WES | High: >60× | ✓ | ✓ | ✓ | ✓ | Powerful for clonality analyses but needs matching normal DNA. Low purity might interfere with the estimation of subclonal SCNA especially at lower depth. |
| WGS | High breakpoint resolution but depth can be low >30× | ✓ | ✓ | ✓ | ✓ | |
| Single cell sequencing | Low individual cell resolution | ✓ | ✓ | ✓ | Individual cell allele-specific CN for deep subclonal reconstruction [ |
CGH comparative genomic hybridisation, SNP single nucleotide polymorphism, WES whole exome sequencing, WGS whole genome sequencing, SCNA somatic copy number alterations, BAF B-allele frequency
Fig. 2Summary of evolutionary models. This figure depicts different aspects of cancer evolution and is adapted from [20, 21]. In parallel evolution, the tumour arises from a single cancer-initiating cell (at the base of the trunk) in the same patient, whereas in convergent evolution, phenotypically similar tumours arise from different cancer-initiating cells in the same or different patients. Purple circles depict tumour subclones with a similar phenotype
Fig. 3Example of allele-specific copy number changes. The top panel shows an example of copy number changes with B-Allele Frequency. The bottom panel shows an example of mirrored subclonal allelic imbalance. This term is used when within the same tumour, one subclone has gained or lost the maternal allele but the other subclone has gained or lost the paternal allele of the same chromosome, indicative of a parallel evolutionary events. Only high depth sequencing data can estimate with this level of resolution
Summary of the statistical approaches for clonality analysis from bulk DNA sequencing
| Author/publication | Technology, type of genomic profile | Tissues (types of tumour), Samples, sample types, treatment types | Method 1 | Method 2 | Method 3 | Preferred method by the author(s) | Significance |
|---|---|---|---|---|---|---|---|
| Waldman et al. 2000 [ | CGH arrays, SCNA | FFPE (DCIS), 1 mm surgical margin ( | (see text for details) | Unsupervised hierarchical clustering based on | In house developed | None | 17/18 were classified as pairs based on Method 1 and 2; 16/18 were classified as pairs based on Method 3; 1 discordant case was common among all methods |
| Teixeira et al. 2004 [ | CGH arrays, SCNA | FF (IBC), | Probability = dividing number of occurrences of that particular genetic alteration/number of tumours analysed | N/A | None | All cases were concordant between these two methods except one (patient 12 presented with both ipsilateral and bilateral tumours i.e. two in each breast) | |
| Bollet et al. 2008 [ | SNP arrays, SCNA | FF (IBC), cases | Hierarchical clustering of Pearson correlation was used to derive a dendrogram | Clinical definition with matched histopathological subtypes, location of the recurrent tumour, grades and hormonal status (see text for details) | Shared breakpoints/partial identity score outperforms the clinical definition/overall SCNA | Method 2 provides significant difference for metastasis free survival than Method 3 ( | |
| Clonality R Package 2011 [ | CGH arrays (SCNA, LOH), Mutation analysis (NGS) | Publicly available data as well as own cohort: FF, FFPE: IBC, lobular Carcinoma in situ [ | SCNA ( | N/A | N/A | N/A | Statistical approach deriving |
| Updated Clonality Package 2019-2020 [ | Mutation analysis (NGS) | As above | Estimated marginal probability of occurrence of a shared mutation in TCGA, Test developed only for metastasised tumours at a different site [ | N/A | N/A | N/A | Updated R package estimating individual probability for clonal relatedness. |
| Newburger et al. 2013 [ | WGS (median 53.4x), somatic SNVs, aneuploidy | FFPE (matched normal, early neoplasia w/wo atypia, carcinoma), | Using | N/A | N/A | N/A | IDC had 2.5x private somatic SNVs than the early neoplasia, and 10x than normal tissues; 4/6 cases early neoplasia shared a common ancestor (neoplasia and IDC shared a significant number of SNVs); genome of shared ancestor are already aneuploid. |
| Weng et al. 2015 [ | TSP, SNV | FFPE (synchronous early breast neoplasia with IBC and/ or DCIS); | Highly accurate | N/A | N/A | N/A | Atypical hyperplasia (AH) and DCIS/IBC shared a most common ancestor while DCIS/IDC have more private SNVs than AH. AH also has a greater mutation burden than typical ductal hyperplasia lesions. |
| Schultheis et al. 2016 [ | WES (105x n=5), TSP (453x; n=18): method validation b/w WES & TSP ( | FFPE (synchronous endometrioid endometrial and endometrioid ovarian carcinoma) | CI2: based on somatic mutations and their frequency in TCGA or a given cohort | N/A | None: CI and CI2 provide concordant results: CI was used by others in multiple studies [ | CI was discordant with clinical definitions (22/23 was clonal | |
| Biermann et al. 2018 [ | aCGH, SNP arrays (SCNA), DNA methylation, Whole RNA sequencing | FFPE (IBC), | - - - | - - | SI | -Discordance between methods with clinical definition - Clonality analysis by SI are in agreement with other approaches except 5 patients | |
| Roth et al. 2014 [ | WGS, WES (>100×) | 1000 Genomes Project samples | Hierarchical Bayes statistical model: PyClone estimates the clonal architecture and composition using somatic mutant allelic fractions adjusted for sequencing errors, tumour cell content, ploidy and local CN profile | N/A | N/A | N/A | Subclonal reconstruction utilising somatic mutations |
| Deshwar et al. 2015 [ | WGS | Simulated data | PhyloWGS: subclonal reconstruction using both somatic mutations and SCNA | N/A | N/A | N/A | Incorporates critical contribution of SCNA for subclonal reconstruction |
| Kaufmann et al. 2021 [ | High depth WGS | Pan-cancer Analysis of Whole Genomes, | MEDICC2: defines minimum event distance between pairs of SCNA profiles and uses neighbour-joining to infer relatedness | N/A | N/A | N/A | Incorporates whole genome doubling events |
N/A not applicable, WES whole exome sequencing, TSP targeted sequencing panel, WGS whole genome sequencing, SCNA somatic copy number alterations, SNV single nucleotide variants, VAF variant allele frequencies, BCS breast conserving surgery, CI Clonality index, NGS next-generation sequencing, FFPE formalin-fixed paraffin-embedded, DCIS ductal carcinoma in situ, IBC invasive breast cancer. *Methods are described only for SCNA, LOH and mutational data. MEDICC minimum event distance for intra-tumour copy-number comparisons