| Literature DB >> 33838170 |
Nicola Rares Franco1, Michela Carlotta Massi2, Francesca Ieva3, Andrea Manzoni4, Anna Maria Paganoni5, Paolo Zunino6, Liv Veldeman7, Piet Ost8, Valérie Fonteyne9, Christopher J Talbot10, Tim Rattay11, Adam Webb12, Kerstie Johnson13, Maarten Lambrecht14, Karin Haustermans15, Gert De Meerleer16, Dirk de Ruysscher17, Ben Vanneste18, Evert Van Limbergen19, Ananya Choudhury20, Rebecca M Elliott21, Elena Sperk22, Marlon R Veldwijk23, Carsten Herskind24, Barbara Avuzzi25, Barbara Noris Chiorda26, Riccardo Valdagni27, David Azria28, Marie-Pierre Farcy-Jacquet29, Muriel Brengues30, Barry S Rosenstein31, Richard G Stock32, Ana Vega33, Miguel E Aguado-Barrera34, Paloma Sosa-Fajardo35, Alison M Dunning36, Laura Fachal37, Sarah L Kerns38, Debbie Payne39, Jenny Chang-Claude40, Petra Seibold41, Catharine M L West42, Tiziana Rancati43.
Abstract
AIM: To identify the effect of single nucleotide polymorphism (SNP) interactions on the risk of toxicity following radiotherapy (RT) for prostate cancer (PCa) and propose a new method for polygenic risk score incorporating SNP-SNP interactions (PRSi).Entities:
Keywords: Epistasis; Genetic risk factors; Late toxicity; Prostate cancer; Radiotherapy; SNPs
Mesh:
Year: 2021 PMID: 33838170 PMCID: PMC8754257 DOI: 10.1016/j.radonc.2021.03.024
Source DB: PubMed Journal: Radiother Oncol ISSN: 0167-8140 Impact factor: 6.901
Fig. 1.An illustration of the methodology used to generated polygenic risk scores incorporating SNP-SNP interactions (PRSi). (a) Data are available for multiple SNPs for patients identified as with (red) or without (blue) radiotherapy toxicity. (b) Our algorithm computes frequent (arbitrarily defined as seen in ≥ 10% of patients) SNP-SNP combinations, termed SNP-allele sets, associated with radiotherapy toxicity (i.e. the minority class). For example both the first and the third patient have a SNP2 value of 2 (i.e. homozygosity of the minor allele) and SNP10 value of 0 (i.e. homozygosity of the major allele). We call this SNP2 = 2, SNP10 = 0 combination a SNP-allele set. As a further example both the fifth and the sixth patient have SNP2 = 1, SNP5 = 2, SNP10 = 1 and SNP23 = 2: this is another SNP-allele set. (c) SNP-allele sets are transformed into patient-specific features, with a “1/yes” value if the patient harbours the considered SNP-allele set and a “0/no” value if the patient does not. Odds ratios are calculated for each SNP-allele set on the risk of toxicity. (d) Lists of risk SNP-allele sets associated with increased (risk) and decreased (protection) toxicity probability are generated. (e) Risk Score (RS) and Protection Score (PS) are calculated for each patient as the frequency in an individual’s genome of SNP-allele sets in the “Risk List” and in the “Protection List”, thus generating a table as in the Figure. Patients with toxicity should have RS near 1 and PS near 0, the converse for patients without toxicity. RS and PS data are fit to a logistic regression model to estimate weights for RS and PS for calculating the final PRSi. The distribution of PRSi should be different for patients with and without toxicity. The more separated the two distributions are, the better the PRSi is discriminating patients with toxicity.
Considered SNPs for each toxicity endpoint. SNPs selected for SNP-allele sets learning for each toxicity endpoint. The SNPs in the table are those identified as relevant in [8] to separate between patients with/without late toxicity symptoms. ▼.
| Late Urinary Frequency grade ≥ 2 | Late Haematuria grade ≥ 1 | Late Nocturia grade ≥ 2 | Late Decreased Urinary Stream grade ≥ 1 | Late Rectal Bleeding grade ≥ 1 |
|---|---|---|---|---|
| rs141799618 | rs10101158 | rs10969913 | rs10209697 | rs62091368 |
| rs8075565 | rs708498 | rs77530448 | rs1799983 | rs4775602 |
| rs12591436 | rs77530448 | rs62091368 | rs17362923 | rs264631 |
| rs76273496 | rs17055178 | rs11219068 | rs673783 | rs17599026 |
| rs10969913 | rs147596965 | rs264651 | rs8098701 | rs11122573 |
| rs1799983 | rs7366282 | rs1799983 | rs77530448 | rs76273496 |
| rs8098701 | rs10969913 | rs8098701 | rs6535028 | rs17362923 |
| rs17599026 | rs12591436 | rs7366282 | rs7366282 | rs10969913 |
| rs7366282 | rs79604958 | rs11122573 | rs845552 | rs6535028 |
| rs708498 | rs8098701 | rs17055178 | rs1045485 | rs10209697 |
| rs17055178 | rs845552 | rs17599026 | rs76273496 | rs141799618 |
| rs11122573 | rs7829759 | rs10497203 | rs17055178 | rs8098701 |
| rs10209697 | rs10209697 | rs6432512 | rs11122573 | rs1045485 |
Performance of Logistic Models fitted with RS and PS. Results of logistic models for the 5 considered toxicity endpoints. The values chosen for the hyperparameter K are reported in the third row. The fitted values for α and β are reported together with their 95% confidence intervals. (*) The rows Sensitivity, Specificity and OR, refer to the metrics obtained by thresholding the predicted probabilities in the logistic model using the cutoff that maximizes the Youden index. The value of such cutoff is also reported in the table (last row). ▼.
| Late Urinary Frequency grade ≥ 2 | Late Haematuria grade ≥ 1 | Late Nocturia grade ≥ 2 | Late Decreased Urinary Stream grade ≥ 1 | Late Rectal Bleeding grade ≥ 1 | |
|---|---|---|---|---|---|
| Patients | 1,334 | 1,343 | 1,250 | 1,234 | 1,366 |
| With toxicity N (%) | 56 (4.2%) | 74 (5.5%) | 223 (17.8%) | 211 (17.1%) | 160 (11.7%) |
| K | 15 | 13 | 8 | 15 | 12 |
| α | 13.25 ± 3.86 | 9.63 ± 3.43 | 3.22 ± 1.57 | 7.04 ± 1.94 | 3.73 ± 1.84 |
| β | −5.37 ± 2.62 | −4.60 ± 2.53 | −3.82 ± 1.57 | −4.51 ± 1.66 | −2.48 ± 1.66 |
| γ | −3.27 | −3.13 | −1.32 | −1.63 | −2.16 |
| AUC | 0.78 | 0.71 | 0.61 | 0.68 | 0.63 |
| Sensitivity* | 67.9% | 71.6% | 77.6% | 64.9% | 75.6% |
| Specificity* | 77.9% | 60.2% | 38.6% | 65.6% | 45.5% |
| OR* | 7.456 | 3.818 | 2.171 | 3.529 | 2.593 |
| Probability cutoff | 5.1% | 4.5% | 17.6% | 18.8% | 10.3% |
Comparison of PRSi distribution between patients with and without toxicity. Comparison of the polygenic risk score incorporating SNP-SNP interactions (PRSi) distribution for patients with and without toxicity (separately for each of the 5 considered toxicity endpoints). The PRSi medians in the two classes are reported and compared with the Wilcoxon test for independent samples; the p-value of such test is reported (third row). The distribution of the score as a whole is also compared in the two classes using the Kolmogorov-Smirnov two-samples test; p-values of the latter are reported in the table (last row). ▼.
| Late Urinary Frequency grade ≥ 2 | Late Haematuria grade ≥ 1 | Late Nocturia grade ≥ 2 | Late Decreased Urinary Stream grade ≥ 1 | Late Rectal Bleeding grade ≥ 1 | |
|---|---|---|---|---|---|
| Median (patients with toxicity) | 0.611 | 0.740 | −0.149 | 0.168 | 0.208 |
| Median (patients without toxicity) | −0.357 | 0.033 | −0.224 | −0.133 | 0.001 |
| Wilcoxon | |||||
| Kolmogorov-Smirnov |
Fig. 2.Results for grade 2 late urinary frequency. Panel (a): SNP-allele sets participating in the definition of Risk Score (RS); panel (b) SNP-allele sets participating in the definition of the Protection Score (PS). In both cases, each row identifies a SNP-allele set, with SNPs running along columns. Different colours correspond to different alleles of each single SNP in the SNP-allele set. Note that SNP-allele sets are, in general, defined by different numbers of alleles. For example, the first Risk SNP-allele set (starting from top) is defined through 8 alleles (rs141799618 = 0, rs8075565 = 1, rs12591436 = 2, rs1096913 = 0, rs17599026 = 0, rs808498 = 2, rs11122573 = 0 and rs10209697 = 0) while the last one involves 4 alleles only (rs12591436 = 1, rs76273496 = 0, rs17599026 = 1 and rs7366282 = 0). Each SNP can participate in the definition of multiple SNP-allele sets (e.g. rs10209697 is included in 7 SNP-allele sets for RS and in 7 SNP-allele sets for PS). Panel (c) ROC curve for the logistic model described in Equation 1 and calculated with best-fit parameters α and β reported in Table 1. AUC and the point in the ROC curve identifying the best probability cutoff value (according to the Youden index) are also reported; panel (d) Box-plot representation for the distribution of the polygenic risk score incorporating SNP-SNP interactions (PRSi) for patients with and without toxicity calculated using equation (2). The red-dashed line represents the thresholding value for the PRSi related to the probability cutoff in Table 2: patients with a score above this threshold will have a predicted probability, according to equation 1, that is above the cutoff, and viceversa.
Fig. 3.Results for grade ≥ 1 late rectal bleeding. Panels read as in Fig. 2.