| Literature DB >> 32193467 |
Li C Xia1, Paul Van Hummelen1, Matthew Kubit2, HoJoon Lee1, John M Bell2, Susan M Grimes2, Christina Wood-Bouwens1, Stephanie U Greer1, Tyler Barker3, Derrick S Haslem3, James M Ford1, Gail Fulde3, Hanlee P Ji4,5, Lincoln D Nadauld6.
Abstract
DNA copy number aberrations (CNA) are frequently observed in colorectal cancers (CRC). There is an urgent need for CNA-based biomarkers in clinics,. n For Stage III CRC, if combined with imaging or pathologic evidence, these markers promise more precise care. We conducted this Stage III specific biomarker discovery with a cohort of 134 CRCs, and with a newly developed high-efficiency CNA profiling protocol. Specifically, we developed the profiling protocol for tumor-normal matched tissue samples based on low-coverage clinical whole-genome sequencing (WGS). We demonstrated the protocol's accuracy and robustness by a systematic benchmark with microarray, high-coverage whole-exome and -genome approaches, where the low-coverage WGS-derived CNA segments were highly accordant (PCC >0.95) with those derived from microarray, and they were substantially less variable if compared to exome-derived segments. A lasso-based model and multivariate cox regression analysis identified a chromosome 17p loss, containing the TP53 tumor suppressor gene, that was significantly associated with reduced survival (P = 0.0139, HR = 1.688, 95% CI = [1.112-2.562]), which was validated by an independent cohort of 187 Stage III CRCs. In summary, this low-coverage WGS protocol has high sensitivity, high resolution and low cost and the identified 17p-loss is an effective poor prognosis marker for Stage III patients.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32193467 PMCID: PMC7081316 DOI: 10.1038/s41598-020-61643-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The study design and whole genome sequencing analytical workflow. (A) The whole-genome sequencing (WGS) analysis share the same sample preparation, DNA extraction and quality control steps as WES (color shaded light green). The prepared genomic DNA libraries are pooled for WGS directly, while they require additional PCR amplification and hybridization steps to generate exomic libraries for pooled WES. (B) We integrated CNVkit, Gistic2 and various R packages to perform copy number segmentation, CNA calling and biomarker discovery analyses.
Figure 2Benchmarks to evaluate low-coverage WGS approach and bioinformatics. (A) Pearson’s correlation coefficients (PCC) between low-coverage WGS and microarray segments as stratified by segmentation tools; (B) PCC between low-coverage and high-coverage WGS as stratified by tumor purity; (C) The means of robust standard deviation (MAD) as stratified by low-coverage WGS and WES analysis platforms.
Summary statistics and multivariate cox regression results for the Stage III colorectal cancer discovery cohort.
| Discovery Cohort (n = 134) | Patient Characteristics | Count (%)/Mean (Range) | Hazard Ratio | 95% CI | p-value | significance |
|---|---|---|---|---|---|---|
| Age | At Diagnosis | 73 (22–93) | 0.997 | (0.9715, 1.0034) | 0.795 | n.s. |
| Gender | Male (Reference) | 66 (49%) | ||||
| Female | 68 (51%) | 0.935 | (0.5523, 1.0701) | 0.801 | n.s. | |
| Ethnicity | White (Reference) | 128 (96%) | ||||
| Hispanic | 2 (1%) | 1.236 | (0.7066, 2.1606) | 0.458 | n.s. | |
| Smoking Status | Smoker (Reference) | 36 (27%) | ||||
| Nonsmoker | 92 (69%) | 0.631 | (0.3637, 1.0951) | 0.102 | n.s. | |
| Body weight | BMI | 29 (18–60) | 1.001 | (0.9487, 1.0552) | 0.984 | n.s. |
| Treatment | Chemotherapy (Reference) | 74 (55%) | ||||
| Refused/Not recommended | 43 (32%) | 0.416 | (0.2393, 0.7249) | 0.002 | ** | |
| Tumor Grade | High | 58 (43%) | 1.271 | (0.7667, 2.1082) | 0.352 | n.s. |
| Low (Reference) | 76 (57%) | |||||
| Tumor Side | Right Colon (Reference) | 90 (72%) | ||||
| Left Colon | 44 (28%) | 0.576 | (0.3423, 0.9707) | 0.038 | * | |
| Recurrence | None | 50 (37%) | 0.652 | (0.3579, 1.1892) | 0.163 | n.s. |
| Recurrence (Reference) | 72 (54%) | |||||
| Overall Survival | >5-year from diagnosis | 16 (12%) |
1Statistical significance is based on the fitted multivariate cox model (Eq. 1). n.s.: not significant, *P < 0.05, **P < 0.01.
2Treatment is any treatment received after surgical resection. Chemotherapy: if received any forms of 5FU, Folfox, or Capecitabine.
3Missing data for each variable was <13%.
Figure 3Copy number profiles of the discovery cohort. Copy number ratios (CNR) are shown for upper split panel: patients had chr17p loss; and lower split panel patients had no chr17p loss – all based on Gistic2 calls. Row color coding: black for shorter survival patients (the lower 50%) and grey for longer survival patients (the upper 50%).
Figure 4Arm-level chr17p loss predicts for poorer survival in Stage III CRC. (A) Venn diagram for shared arm-level CNAs between the discovery and TCGA validation cohorts. (B) The Kaplan–Meier plots of the Stage III CRC discovery cohort as stratified by patients’ status of carrying the chr17p arm loss (SCNA_CHR_ARM_17p_del = 1 for yes, otherwise 0). Also shown are box plots for comparing (C) number of focal CNAs (D_N_Focals) and (D) number of arm-level CNAs (D_N_Arms) between patients carrying or not carrying the chr17p loss.