| Literature DB >> 29109393 |
Viktor A Adalsteinsson1,2, Gavin Ha3,4,5, Samuel S Freeman3,5, Atish D Choudhury4, Daniel G Stover4,5, Heather A Parsons4,5, Gregory Gydush3, Sarah C Reed3, Denisse Rotem3, Justin Rhoades3, Denis Loginov3,6, Dimitri Livitz3, Daniel Rosebrock3,5, Ignaty Leshchiner3, Jaegil Kim3, Chip Stewart3, Mara Rosenberg3, Joshua M Francis3,4, Cheng-Zhong Zhang3,4,5, Ofir Cohen3,4, Coyin Oh3, Huiming Ding6, Paz Polak3,5,7, Max Lloyd4, Sairah Mahmud4, Karla Helvie4, Margaret S Merrill4, Rebecca A Santiago4, Edward P O'Connor4, Seong H Jeong4, Rachel Leeson6, Rachel M Barry6, Joseph F Kramkowski4, Zhenwei Zhang4, Laura Polacek4, Jens G Lohr3,4, Molly Schleicher3, Emily Lipscomb3, Andrea Saltzman3, Nelly M Oliver4, Lori Marini4, Adrienne G Waks4,8, Lauren C Harshman4, Sara M Tolaney4, Eliezer M Van Allen3,4,5,8, Eric P Winer4, Nancy U Lin4, Mari Nakabayashi4,5, Mary-Ellen Taplin4, Cory M Johannessen3, Levi A Garraway3,4,5,8,9, Todd R Golub3,4,5,9, Jesse S Boehm3, Nikhil Wagle3,4,5, Gad Getz10,11,12, J Christopher Love13,14, Matthew Meyerson15,16,17,18.
Abstract
Whole-exome sequencing of cell-free DNA (cfDNA) could enable comprehensive profiling of tumors from blood but the genome-wide concordance between cfDNA and tumor biopsies is uncertain. Here we report ichorCNA, software that quantifies tumor content in cfDNA from 0.1× coverage whole-genome sequencing data without prior knowledge of tumor mutations. We apply ichorCNA to 1439 blood samples from 520 patients with metastatic prostate or breast cancers. In the earliest tested sample for each patient, 34% of patients have ≥10% tumor-derived cfDNA, sufficient for standard coverage whole-exome sequencing. Using whole-exome sequencing, we validate the concordance of clonal somatic mutations (88%), copy number alterations (80%), mutational signatures, and neoantigens between cfDNA and matched tumor biopsies from 41 patients with ≥10% cfDNA tumor content. In summary, we provide methods to identify patients eligible for comprehensive cfDNA profiling, revealing its applicability to many patients, and demonstrate high concordance of cfDNA and metastatic tumor whole-exome sequencing.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29109393 PMCID: PMC5673918 DOI: 10.1038/s41467-017-00965-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Copy number and tumor fractions from ULP-WGS. a cfDNA workflow. b Genome-wide copy number from 0.1× ULP-WGS of cfDNA from a healthy donor. c Genome-wide copy number from 25× WGS and 0.1× WGS of cell-free DNA from a metastatic breast cancer patient (MBC_315), and 1× WGS and WES of matched tumors from this patient. SCNA for tumor WES and cfDNA 25× coverage WGS were predicted using TITAN[17] (“Methods”). d Comparison of copy ratios between ULP-WGS of cfDNA with deep (>10×) WGS of the same cfDNA sample, WGS (1×) of matched tumors from 22 metastatic breast cancer (MBC) patients, and WES (average mean target coverage 173×) of matched tumors from 41 MBC and prostate cancer (CRPC) patients. Log2 copy ratios were computed as normalized read coverage for each 1 Mb (WGS/ULP-WGS) and the mean of overlapping 50 kb bins (WES) after adjustment for tumor fraction/purity. The correlation of copy ratios between tumor and cfDNA was computed using Spearman rank correlation (coefficient ρ). F-measure (F1) is the harmonic mean of the CNA positive predictive value (precision) and sensitivity (recall) performance. Recall is defined as the proportion of SCNA gain/loss in tumor biopsy also observed in ULP-WGS of cfDNA (“Methods”). e Comparison of tumor fractions estimated from ULP-WGS and WES of cfDNA. Samples (n = 35) with similar tumor ploidy (difference < 0.75 and ploidy ≥1.5) estimated in both ULP-WGS and tumor WES are shown. The correlation between the two data types was calculated using Pearson correlation (coefficient r). Red line denotes y = x. WES tumor fractions were estimated using ABSOLUTE[16] (shown) and TITAN (Supplementary Fig. 15, Supplementary Data 6)
Fig. 2Comparison of whole-exome sequencing of cfDNA to whole-exome sequencing of matched tumor biopsies. a Fraction of clonal (≥0.9 cancer cell fraction, CCF) and subclonal (<0.9 CCF) SSNVs detected by MuTect in WES of tumor biopsies and confirmed (i.e., supported by ≥3 variant reads) in WES of cfDNA. Sites with <3 reads that had power <0.9 for mutation calling were not included when computing the fraction of SNVs confirmed (“Methods”). b Fraction of clonal and subclonal SSNVs detected in WES of cfDNA and confirmed in WES of tumor biopsies. For 18 patients with WES of cfDNA at a second time point t 2, SSNVs not detected in the matched tumor biopsy but confirmed at t 2 are indicated with black. c Analysis of clonal dynamics in an ER+ breast cancer patient diagnosed with metastatic disease 1.5 years (yrs) prior to biopsy and cfDNA collection (t 1, Day 0). Clustering analysis of CCF for SSNVs between matched tumor biopsy and cfDNA (t 1) is shown in the left panel. The right panel shows the CCF of four mutation clusters, one containing ESR1 L536P (Subclonal Cluster 1, orange) and the other containing ESR1 D538G (Subclonal Cluster 2, light blue), at t 1 and t 2 (51 days apart) from a patient with ER+ metastatic breast cancer being treated with a SERD. The lymph node biopsy was taken at the same time as cfDNA t 1. Mutations were clustered by the CCFs for each pair of samples using Phylogic[39] (“Methods”). Error bars represent the 95% credible interval of the joint posterior density of the clusters. Mutations, excluding indels, having ≥90% estimated power based on coverage in both samples are shown; clusters with fewer than three mutations are excluded. The number of mutations in each cluster is indicated in the legend in parentheses
Fig. 3Genomic alterations of known significance and applicability to large cohorts. a, b The alteration status of significantly mutated genes predicted by MutSig2CV[23, 24] (a), focal SCNAs (b), and known cancer-associated genes[22] are shown for cfDNA and tumor biopsies from 27 metastatic breast cancer (MBC) patients. Mutated genes with MutSig2CV q-value < 0.1 are statistically significant. Mutations that were exclusively detected in one sample may be present at low CCF in the other matched sample but were excluded from the frequency calculation. SCNA frequencies were computed for oncogenes (MYC to ERBB2) and tumor suppressors (BRCA1 to ATM) using only amplification and deletion status, respectively. Mutations were predicted using MuTect and SCNAs were predicted using ReCapSeg and ABSOLUTE. Red dot indicates distinct mutations in tumor and cfDNA. c Mutational signatures in whole-exome sequencing of cfDNA and tumor biopsies were predicted using a Bayesian non-negative matrix factorization (NMF) approach[29] (“Methods”). Samples with predicted biallelic inactivation of BRCA1/2 are indicated in red and blue. Black line denotes y = x; blue line denotes model fit using linear least squares regression. d Neoantigen burden, defined as the number of predicted neoantigen SSNVs, was calculated using NetMHCpan[33] (“Methods”). Black line denotes y = x; blue line denotes model fit using linear least squares regression. e Applicability to many patients with metastatic cancer. Tumor fractions estimated from ULP-WGS of cfDNA from 903 blood samples from 391 patients with metastatic breast cancer and 536 blood samples from 129 patients with metastatic prostate cancers. The earliest blood drawn for each patient is shown. Samples with coverage <0.05× were excluded