| Literature DB >> 35159019 |
Shuoying Qu1, John W M Martens1, Antoinette Hollestelle1, Marcel Smid1.
Abstract
Patients with cancers that are deficient for homologous recombination repair (HRD) may benefit from PARP inhibitor treatment. Therefore, methods that identify such cancers are crucial. Using whole genome sequencing data, specific genomic scars derived from somatic mutations and genomic rearrangements can identify HRD tumors, with only BRCA1-like HRD cancers profoundly displaying small (<10 kb) tandem duplications (TDs). In this manuscript we describe a method of detecting BRCA1-type HRD in breast cancer (BC) solely from RNA sequencing data by identifying TDs surfacing in transcribed genes. We find that the number of identified TDs (TD-score) is significantly higher in BRCA1-type vs. BRCA2-type BCs, or vs. HR-proficient BCs (p = 2.4 × 10-6 and p = 2.7 × 10-12, respectively). A TD-score ≥2 shows an 88.2% sensitivity (30 out of 34) to detect a BRCA1-type BC, with a specificity of 64.7% (143 out of 221). Pathway enrichment analyses showed genes implicated in cancer to be affected by TDs of which PTEN was found significantly more frequently affected by a TD in BRCA1-type BC. In conclusion, we here describe a novel method to identify TDs in transcripts and classify BRCA1-type BCs with high sensitivity.Entities:
Keywords: BRCA1-type breast cancer; PTEN inactivation; RNA sequencing; cancer driver enrichment; classification algorithm; homologous recombination repair deficiency; tandem duplication
Year: 2022 PMID: 35159019 PMCID: PMC8833645 DOI: 10.3390/cancers14030753
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Identifying transcripts with a TD region. Top panel schematically shows how a junction read (green) would be located on the DNA reference. Introns are shown as horizontal lines. The 5′ of the junction read maps to the 3′ end of exon 3, while the remainder of the read maps to the 5′ of exon 2. Since this is an unexpected orientation and depending on where the majority of the read maps, the read is annotated as mapping to one of the boundaries. The unmapped bases are annotated as soft clipped (S in the CIGAR score). The chromosomal positions of the junction ‘breakpoints’ are used as the start and end of a region. The bottom panel shows a TD of exon 2–3, with the read mates (purple) are partly or completely located outside the region defined by the chromosomal position of the junction read locations.
Figure 2Distribution of TD-score in all samples (A) and by CHORD-call (B). p-values are from Mann–Whitney U-test. Sample sizes are n = 34, n = 17 and n = 204 for BRCA1- and BRCA2-type HRD and HRP, respectively.
Figure 3ROC curve of TD score to predict a BRCA1-type BC. Diagonal black line indicates a diagnostic test with performance no better than chance.
Genes recurrently affected by a TD in BRCA1-type BC. The FDR is calculated using the Hochberg method.
| Gene | N of BRCA1-Type Samples with a TD | N of BRCA2-Type & HRP Samples with a TD | Nominal | FDR |
|---|---|---|---|---|
|
| 4 | 13 | 0.257 | 0.627 |
|
| 3 | 12 | 0.431 | 0.627 |
|
| 3 | 1 | 0.008 | 0.119 |
|
| 3 | 0 | 0.002 |
|
|
| 3 | 4 | 0.052 | 0.26 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 1 | 0.048 | 0.26 |
|
| 2 | 8 | 0.627 | 0.627 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 4 | 0.184 | 0.627 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 0 | 0.017 | 0.119 |
|
| 2 | 0 | 0.017 | 0.119 |
N, number; TD, tandem duplication; HRP, homologous recombination repair proficient; FDR, false discovery rate. Bold indicates the significant finding.
Recurrent TDs in PTEN. Breakpoint coordinates are based on human reference genome GRCh38.
| Sample | N TD Read-Pairs | Size (bp) | Start of Region (Exon N) | End of Region (Exon N) | Effect on Protein | DNA 5’ Breakpoint | DNA 3’ Breakpoint |
|---|---|---|---|---|---|---|---|
| p1 | 6 | 7739 | 3 | 5 | out of frame | 87,924,739 | 87,934,651 |
| p2 | 13 | 2206 | 4 | 5 | out of frame | 87,926,191 | 87,941,211 |
| p3 | 2 | 239 | 5 | 5 | in frame | 87,932,213 | 87,935,777 |
N, number; TD, tandem duplication; bp, base pairs.
Figure 4Example of a TD in PTEN. DNA breakpoints were identified before exon 3 and after exon 5, leading to a TD of these 3 exons. In the RNAseq data, sequence reads were identified spanning both the intron breakpoints as well as the resulting transcriptome junction of exon 5 to exon 3. Introns are indicated by the arrowed lines and are not to scale. Coordinates are from the GRCh38 reference sequence.