| Literature DB >> 33391473 |
Tao Jiang1, Liyan Jiang2, Xiaorong Dong3, Kangsheng Gu4, Yueyin Pan5, Qin Shi6, Guojun Zhang7, Huijuan Wang8, Xiaochun Zhang9, Nong Yang10, Yuping Li11, Jianping Xiong12, Tienan Yi13, Min Peng14, Yong Song15, Yun Fan16, Jiuwei Cui17, Gongyan Chen18, Wei Tan19, Aimin Zang20, Qisen Guo21, Guangqiang Zhao22, Ziping Wang23, Jianxing He24, Wenxiu Yao25, Xiaohong Wu26, Kai Chen27, Xiaohua Hu28, Chunhong Hu29, Lu Yue30, Da Jiang31, Guangfa Wang32, Junfeng Liu33, Guohua Yu34, Junling Li35, Henghui Zhang36, Lihong Wu36, Lu Fang36, Dandan Liang36, Yi Zhao36, Weihong Zhao37, Wenmin Xie37, Shengxiang Ren1, Caicun Zhou1.
Abstract
Rationale: Platinum-based chemotherapy is one of treatment mainstay for patients with advanced lung squamous cell carcinoma (LUSC) but it is still a "one-size fits all" approach. Here, we aimed to investigate the predictive and monitoring role of circulating cell-free DNA (cfDNA) profiling for the outcome of first-line chemotherapy in patients with advanced LUSC.Entities:
Keywords: Non-small-cell lung cancer; cell-free DNA; chemotherapy; machine learning
Mesh:
Substances:
Year: 2021 PMID: 33391473 PMCID: PMC7681090 DOI: 10.7150/thno.51243
Source DB: PubMed Journal: Theranostics ISSN: 1838-7640 Impact factor: 11.556
Figure 1Schematic illustration of the overall investigation. A. Machine learning algorithm to generate CNV-based RS for response prediction via integrating cfDNA molecular features; B. ICP-based dynamic change of VAF as baseline and cycle 2 treatment monitored the treatment response.
Figure 2The mutational landscape of included patients. Upper panel: The frequency of listed driver genes. Middle panel: The matrix of mutations in a selection of frequently mutated genes. Columns represent samples. Right panel: The total number of patients harboring mutations in each gene. LP, paclitaxel liposome plus cisplatin; GP, gemcitabine plus cisplatin; CR, complete response; PR, partial response; SD, stable disease; PD, disease progression.
Figure 3CNV pattern of 31 genes including CASP8, PPHLN1, PIGF, KEAP1, SDHC, MOV10L1, CCND3, MTRR, ID3, STK11, SEL1L3, ARMC5, MYCL, SMARCA4, BAT, MYO10, SMO, TSHR, IRFB, SOX9, CIC, CCR4, HSPA1B, FLCN, PRPF39, RRP1B, PRKCI, ARPC2, SOCS1, ERCC2, CEBPA showed obviously distinct distribution between patients with CR/PR and SD/PD in all (A), LP (B) and GP (C) group. From inside to out of each circus plot: the first circle represents the CNVs of patients in SD and PD group (orange represents amplification, green represents loss or deletion); the second circle represents the CNVs of patients in PR and CR group (red represents amplification, blue represents loss or deletion). Outermost circle represents the chromosomes. CR, complete response; PR, partial response; SD, stable disease; PD, disease progression. LP, paclitaxel liposome plus cisplatin; GP, gemcitabine plus cisplatin.
Figure 4Generation of CNV-based RS for response prediction. A. Different co-efficient importance values in this model via selecting features with the best accuracy score in the ensemble or LASSO supervised method; B. Receiver operator characteristic curve analysis result in training set; C. Receiver operator characteristic curve analysis result in validation set. CNV, copy number variation; AUC, area under the ROC curve. Feature selection (Fig. A) was carried out with two steps. First, several statistical methods were utilized to evaluate the difference between two groups of samples in training set for each feature, including deviation, mutual information, AUC and p-values of Chi-Square test, Wilcoxon rank sum test, ANOVA and Student's t test, after which features with significantly different signal in at least four of criteria mentioned above were selected. Then, the method of LASSO was conducted to select features with the best accuracy score.
Figure 5Relationship between RS and treatment outcomes. A. ORR comparison between RS high and low group in training set; B. ORR comparison between RS high and low group in validation set; C. Kaplan-Meier curve of PFS comparison between RS high and low group in training set; D. Kaplan-Meier curve of PFS comparison between RS high and low group in validation set; E. Kaplan-Meier curve of OS comparison between RS high and low group in training set; F. Kaplan-Meier curve of OS comparison between RS high and low group in validation set. RS, RESPONSE SCORE; ORR, objective response rate; PFS, progression-free survival; HR, hazard ratio. Unpaired student t test were applied for comparison of response rate between RS high and low groups. The Kaplan-Meier curve with log-rank test was used to test the significance of differences between two groups.
Figure 6ICP-based dynamic change of VAF monitored the treatment response. A. ICP-based change of VAF between CR/PR and SD/PD at baseline and cycle 2 treatment. B. ORR comparison between VAF detectable and undetectable at cycle 2 treatment; C. Kaplan-Meier curve of PFS comparison between VAF detectable and undetectable at cycle 2 treatment. ORR, objective response rate; PFS, progression-free survival; BL, baseline; C2, cycle 2 treatment; CR, complete response; PR, partial response; SD, stable disease; PD, disease progression. Paired student t test were applied for the dynamic change of cfDNA VAF between baseline and C2 detection time. Unpaired student t test were applied for comparison of response rate between C2 detectable and undetectable groups. The Kaplan-Meier curve with log-rank test was used to test the significance of differences between two groups.