| Literature DB >> 34257842 |
Yu Rong1, Shan-Shan Dong1, Wei-Xin Hu1, Yan Guo1, Yi-Xiao Chen1, Jia-Bin Chen1, Dong-Li Zhu1, Hao Chen1, Tie-Lin Yang1,2.
Abstract
Detecting SNPs associated with drug efficacy or toxicity is helpful to facilitate personalized medicine. Previous studies usually find SNPs associated with clinical outcome only in patients received a specific treatment. However, without information from patients without drug treatment, it is possible that the detected SNPs are associated with patients' clinical outcome even without drug treatment. Here we aimed to detect drug response SNPs based on data from patients with and without drug treatment through combing the cox proportional-hazards model and pairwise Kaplan-Meier survival analysis. A pipeline named Detection of Drug Response SNPs (DDRS) was built and applied to TCGA breast cancer data including 363 patients with doxorubicin treatment and 321 patients without any drug treatment. We identified 548 doxorubicin associated SNPs. Drug response score derived from these SNPs were associated with drug-resistant level (indicated by IC50) of breast cancer cell lines. Enrichment analyses showed that these SNPs were enriched in active epigenetic regulation markers (e.g., H3K27ac). Compared with random genes, the cis-eQTL genes of these SNPs had a shorter protein-protein interaction distance to doxorubicin associated genes. In addition, linear discriminant analysis showed that the eQTL gene expression levels could be used to predict clinical outcome for patients with doxorubicin treatment (AUC = 0.738). Specifically, we identified rs2817101 as a drug response SNP for doxorubicin treatment. Higher expression level of its cis-eQTL gene GSTA1 is associated with poorer survival. This approach can also be applied to identify new drug associated SNPs in other cancers.Entities:
Keywords: Breast cancer; Drug response; Prognosis; SNP
Year: 2021 PMID: 34257842 PMCID: PMC8254081 DOI: 10.1016/j.csbj.2021.06.026
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1An overview of the approach. Step 1: We performed SNP × drug interaction analysis in all patients (including patients with drug treatment and patients without any drug treatment). SNPs with significant drug term (P < 0.05 for) and significant interaction term (P < 0.05 for) were remained. Step 2: For SNPs obtained from the first step, we performed Kaplan-Meier (KM) analysis in subjects with different genotypes to select SNPs associated with drug response in patients with drug treatment. KM analysis in patients without any drug treatment was also performed to remove the SNPs associated with survival but this association was not related to drug treatment. Step 3: We performed univariate cox proportional hazards analysis for each drug response SNPs get from the first two steps. The coefficients for all SNPs was used to calculate drug response score (DRS) in another population to test the performance of drug response prediction.
Pre-treatment characteristics of the patients.
| Patient with doxorubicin treatment | Patients without any drug treatment | All patients | |
|---|---|---|---|
| Number | 363 | 321 | 684 |
| Age | |||
| <=50 | 173 (48%) | 71 (22%) | 244 (36%) |
| >50 | 189 (52%) | 240 (77%) | 429 (64%) |
| Mean (SD) | 52 (10) | 62 (14) | 56 (13) |
| Unknown | 1 | 10 | 11 |
| Nodal status | |||
| Positive | 238 (66%) | 152 (48%) | 390 (57%) |
| Negative | 122 (33%) | 158 (49%) | 280 (41%) |
| Unknown | 3 (1%) | 11 (3%) | 14 (2%) |
| T stage | |||
| 1 | 73 (21%) | 79 (25%) | 152 (22%) |
| 2 | 233 (64%) | 177 (55%) | 410 (60%) |
| 3 | 52 (14%) | 41 (13%) | 93 (14%) |
| 4 | 5 (1%) | 22 (7%) | 27 (4%) |
| Unknown | 0 (0%) | 2 (0%) | 2 (0%) |
| Pathologic stage | |||
| 1 | 36 (10%) | 59 (18%) | 95 (14%) |
| 2 | 213 (59%) | 174 (54%) | 387 (57%) |
| 3 | 108 (30%) | 67 (21%) | 175 (26%) |
| 4 | 2 (0%) | 9 (3%) | 11 (1%) |
| Unknown | 4 (1%) | 12 (4%) | 16 (2%) |
| ER Status | |||
| Positive | 244 (67%) | 225 (70%) | 469 (69%) |
| Negative | 106 (29%) | 79 (25%) | 185 (27%) |
| Unknown | 13 (4%) | 17 (5%) | 30 (4%) |
| PR Status | |||
| Positive | 209 (58%) | 188 (59%) | 397 (58%) |
| Negative | 139 (38%) | 115 (36%) | 254 (37%) |
| Unknown | 15 (4%) | 18 (5%) | 33 (5%) |
| HER2 Status | |||
| Positive | 45 (12%) | 61 (19%) | 106 (15%) |
| Negative | 207 (57%) | 144 (45%) | 351 (51%) |
| Unknown | 111 (31%) | 116 (36%) | 227 (34%) |
Fig. 2A: Boxplot of DRS of patients in different molecular subtypes. Patients were stratified into groups of Luminal A, Luminal B, HER2 and Basal like by the ER, PR and HER2 markers. B: Boxplot of DRS between doxorubicin-resistant cells (High IC50) and doxorubicin-sensitive (Low IC50) cells. C: Enrichment analysis heatmap plot of those significant SNPs’ epigenetic annotation. Enrichment score (ES) are color-coded from light to dark.
Fig. 3A: Violin plot of mean shortest PPI distances to doxorubicin target. Red bars represent the mean PPI distance of the cis-eQTL genes to doxorubicin target genes, doxorubicin related enzymes, doxorubicin related enzymes/carriers/transporters, and doxorubicin-interacting genes. Blue bars represent the mean shortest PPI distance of 10,000 groups of randomly selected genes. B: ROC curve for the predictive performance of the LDA genomic pCR predictor with these eQTL genes as features. C: Volcano plot of univariate cox proportional hazards results of eQTL genes. Horizontal axis showed the univariate cox proportional hazards confidences and vertical axis showed the negative log of the P values. Doxorubicin-resistant genes are shown on upper and doxorubicin-sensitive genes are shown on below. D: Volcano plot of univariate cox proportional hazards results of PAS of enriched pathways. Horizontal axis shows the univariate cox proportional hazards confidences and vertical axis showed the negative log of the P values. Positive pathways are shown on upper and negative pathways are shown on below. E: Univariate cox proportional hazards confidences of 6 positive pathways had significant risk PAS and 7 negative pathways had significant protective PAS. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4A: Multivariate cox proportional hazards results about doxorubicin, rs2817101, rs2817101 × doxorubicin (interaction term) factors in all patients. B: Pairwise Kaplan-Meier survival analysis in patients with and without drug treatment separately of rs2817101. C: Multivariate cox proportional hazards results about rs2817101, age, pathologic stage, histological subtypes, Lymph nodes status, ER, PR and HER2 status factors respectively patients with and without drug treatment. Significant P-value threshold was set at 0.05. D: Boxplot of GSTA1 expression levels (log2(TPM + 1)) based on rs2817101 genotypes. E. Kaplan-Meier survival analysis between high and low GSTA1 expression patients in GSE25055.