| Literature DB >> 33178576 |
Michela Carlotta Massi1,2, Francesca Gasperoni3, Francesca Ieva1,2,4, Anna Maria Paganoni1,2,4, Paolo Zunino1, Andrea Manzoni1, Nicola Rares Franco1, Liv Veldeman5,6, Piet Ost5,6, Valérie Fonteyne5,6, Christopher J Talbot7, Tim Rattay7, Adam Webb7, Paul R Symonds7, Kerstie Johnson7, Maarten Lambrecht8, Karin Haustermans8, Gert De Meerleer8, Dirk de Ruysscher9,10, Ben Vanneste10, Evert Van Limbergen9,10, Ananya Choudhury11, Rebecca M Elliott11, Elena Sperk12, Carsten Herskind12, Marlon R Veldwijk12, Barbara Avuzzi13, Tommaso Giandini14, Riccardo Valdagni13,15,16, Alessandro Cicchetti16, David Azria17, Marie-Pierre Farcy Jacquet18, Barry S Rosenstein19,20, Richard G Stock19, Kayla Collado19, Ana Vega21,22,23, Miguel Elías Aguado-Barrera21,22, Patricia Calvo22,24, Alison M Dunning25, Laura Fachal25,26, Sarah L Kerns27, Debbie Payne28, Jenny Chang-Claude29,30, Petra Seibold29, Catharine M L West11, Tiziana Rancati16.
Abstract
Background: REQUITE (validating pREdictive models and biomarkers of radiotherapy toxicity to reduce side effects and improve QUalITy of lifE in cancer survivors) is an international prospective cohort study. The purpose of this project was to analyse a cohort of patients recruited into REQUITE using a deep learning algorithm to identify patient-specific features associated with the development of toxicity, and test the approach by attempting to validate previously published genetic risk factors.Entities:
Keywords: autoencoder; deep learning; late toxicity; prostate cancer; snps; validation
Year: 2020 PMID: 33178576 PMCID: PMC7593843 DOI: 10.3389/fonc.2020.541281
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Full list of SNPs selected from the literature for validation and associated toxicity endpoint following prostate radiotherapy.
| rs10519410 | 3.7 | 1.3 × 10−6 | ( |
| rs17055178 | 1.95 | 6.2 × 10−10 | ( |
| rs17599026 | 3.12 | 4.16 × 10−8 | ( |
| rs342442 | 0.51 | 3.86 × 10−7 | ( |
| rs8098701 | 2.41 | 2.11 × 10−6 | ( |
| rs7366282 | 3.2 | 2.03 × 10−6 | ( |
| rs10209697 | 2.66 | 2.27 × 10−6 | ( |
| rs4997823 | 0.49 | 2.35 × 10−6 | ( |
| rs7356945 | 1.74 | 3.71 × 10−6 | ( |
| rs6003982 | 0.51 | 4.28 × 10−6 | ( |
| rs10101158 | 1.8 | 4.39 × 10−6 | ( |
| rs7720298 | 2.71 | 3.21 × 10−8 | ( |
| rs17362923 | 2.7 | 6.79 × 10−7 | ( |
| rs76273496 | 3.68 | 2.71 × 10−6 | ( |
| rs144596911 | 3.6 | 2.94 × 10−6 | ( |
| rs62091368 | 4.36 | 3.95 × 10−6 | ( |
| rs141342719 | 3.5 | 3.97 × 10−6 | ( |
| rs673783 | 2.49 | 4.33 × 10−6 | ( |
| rs10969913 | 3.92 | 2.9 × 10−10 | ( |
| rs11122573 | 1.92 | 1.8 × 10−8 | ( |
| rs708498 | 0.24 | n.a. | ( |
| rs845552 | 0.95 | n.a. | ( |
| rs1799983 | 0.19 | n.a. | ( |
| rs1045485 | 0.27 | n.a. | ( |
| rs10497203 | 1.48 | 8.84 × 10−11 | ( |
| rs7582141 | 1.45 | 4.64 × 10−11 | ( |
| rs6432512 | 1.42 | 1.97 × 10−10 | ( |
| rs264651 | 1.49 | 1.48 × 10−7 | ( |
| rs264588 | 1.45 | 3.08 × 10−10 | ( |
| rs264631 | 1.43 | 6.4 × 10−10 | ( |
| rs147596965 | 1.95 | 6.19 × 10−8 | ( |
| rs77530448 | 1.43 | 7.36 × 10−8 | ( |
| rs4906759 | 1.73 | 1.55 × 10−7 | ( |
| rs71610881 | 1.82 | 5.41 × 10−7 | ( |
| rs141799618 | 1.55 | 1.22 × 10−6 | ( |
| rs2842169 | 1.32 | 1.45 × 10−6 | ( |
| rs11219068 | 1.32 | 1.74 × 10−6 | ( |
| rs8075565 | 1.32 | 2.20 × 10−6 | ( |
| rs6535028 | 1.34 | 2.70 × 10−6 | ( |
| rs4775602 | 1.26 | 3.20 × 10−6 | ( |
| rs7829759 | 1.39 | 3.84 × 10−6 | ( |
| rs79604958 | 1.60 | 4.33 × 10−6 | ( |
| rs12591436 | 1.20 | 5.66 × 10−6 | ( |
overall toxicity as defined by calculating the Standardized Total Average Toxicity (STAT) score (.
All these variants are highly correlated in European populations and represent the same association signal. See also correlation matrix as determined in the REQUITE population in the .
Hazard Ratio.
SNPs were selected using Least Absolute Shrinkage and Selection Operator (LASSO) multivariable regression out of a panel of 384 previous identified SNPs, p-value not available.
Figure 1Simplified scheme of a Deep AutoEncoder.
Figure 2Schematic representation of the workflow used to identify which features to select to characterize the minority class (i.e., patients with toxicity) with respect to the majority class (patients without toxicity).
Figure 3Results for late rectal bleeding grade≥1 from the Deep Sparse AutoEncoder. The 43 considered SNPs are reported in the x-axis and the averaged Reconstruction Errors (RE) are reported in the y-axis (top panel), red columns refer to patients with toxicity, while blue columns refer to patients without toxicity. In the lower panel the difference between averaged Reconstruction Errors between the two classes are represented for each SNP (i.e., differences between red and blue columns). For most SNPs, the difference is close to zero (red line in the bottom panel of the figure). The chosen thresholds in this difference (i.e., highest 30, 20, 10, and 5% differences) are selecting SNPs associated to the toxicity outcome. Green circles refer to SNPs that were previously identified as associated with late rectal bleeding, while blue circles refer to SNPs that were previously associated with overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (24). Red stars indicate SNPs (either specific for this endpoint or related to overall toxicity) defining patients with toxicity as outliers with respect to the characteristics of patients without toxicity. Labels show SNPs that not directly associated with late rectal bleeding/overall toxicity, but contributing to their identification. The label states for which toxicity endpoint the SNPs were originally associated with in the literature: FREQ=urinary frequency, HEMA=haematuria, NOCT=nocturia, STREAM=decreased urinary stream.
Deep Sparse AutoEncoder testing of SNPs associated with Late Rectal Bleeding.
| rs10519410 | ( | Not validated | Not validated | Not validated | Not validated |
| rs17055178 | ( | Not validated | Not validated | Not validated | Not validated |
| rs264631 | ( | Not validated | Not validated | ||
| rs141799618 | ( | Not validated | Not validated | ||
grade≥1 (all considered SNPs reported in the table) and to overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (.
Figure 4Results for late urinary frequency grade≥2 from the Deep Sparse AutoEncoder. The 43 considered SNPs are reported in the x-axis and the averaged Reconstruction Errors (RE) are reported in the y-axis (top panel), red columns refer to patients with toxicity, while blue columns refer to patients without toxicity. In the lower panel the difference between averaged Reconstruction Errors between the two classes are represented for each SNP (i.e., differences between red and blue columns). For most SNPs, the difference is close to zero (red line in the bottom panel of the figure). The chosen thresholds in this difference (i.e., highest 30, 20, 10, and 5% differences) are selecting SNPs associated to the toxicity outcome. Green circles refer to SNPs that were previously identified as associated with late urinary frequency, while blue circles refer to SNPs that were previously associated with overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (24). Red stars indicate SNPs (either specific for this endpoint or related to overall toxicity) defining patients with toxicity as outliers with respect to the characteristics of patients without toxicity. Labels show SNPs that not directly associated with late urinary frequency/overall toxicity, but contributing to their identification. The label states for which toxicity endpoint the SNPs were originally associated with in the literature: BLEE=rectal bleeding, HEMA=haematuria, NOCT=nocturia, STREAM=decreased urinary stream.
Results from Deep Sparse AutoEncoder testing of SNPs associated with Urinary Frequency.
| rs17599026 | ( | Not validated | |||
| rs342442 | ( | Not validated | Not validated | Not validated | Not validated |
| rs8098701 | ( | ||||
| rs7366282 | ( | ||||
| rs10209697 | ( | Not validated | Not validated | ||
| rs4997823 | ( | Not validated | Not validated | Not validated | Not validated |
| rs7356945 | ( | Not validated | Not validated | Not validated | Not validated |
| rs6003982 | ( | Not validated | Not validated | Not validated | Not validated |
| rs10101158 | ( | Not validated | Not validated | Not validated | Not validated |
| rs147596965 | ( | Not validated | Not validated | Not validated | |
| rs77530448 | ( | ||||
| rs8075565 | ( | Not validated | Not validated | Not validated | |
| rs12591436 | ( | Not validated | Not validated | Not validated | |
Late Urinary Frequency grade≥2 (all considered SNPs reported in the table) and to overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (24) (in this case only “Identified” SNPs were reported in the table). The SNPs that were correctly identified by the algorithm are flagged as “Identified.”
Figure 5Results for late haematuria grade≥1 from the Deep Sparse AutoEncoder. The 43 considered SNPs are reported in the x-axis and the averaged Reconstruction Errors (RE) are reported in the y-axis (top panel), red columns refer to patients with toxicity, while blue columns refer to patients without toxicity. In the lower panel the difference between averaged Reconstruction Errors between the two classes are represented for each SNP (i.e., differences between red and blue columns). For most SNPs, the difference is close to zero (red line in the bottom panel of the figure). The chosen thresholds in this difference (i.e., highest 30, 20, 10, and 5% differences) are selecting SNPs associated to the toxicity outcome. Green circles refer to SNPs that were previously identified as associated with late haematuria, while blue circles refer to SNPs that were previously associated with overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (24). Red stars indicate SNPs (either specific for this endpoint or related to overall toxicity) defining patients with toxicity as outliers with respect to the characteristics of patients without toxicity. Labels show SNPs that not directly associated with late haematuria/overall toxicity, but contributing to their identification. The label states for which toxicity endpoint the SNPs were originally associated with in the literature: BLEE=rectal bleeding, FREQ=urinary frequency, NOCT=nocturia, STREAM=decreased urinary stream.
Results from Deep Sparse AutoEncoder testing of SNPs associated with Late Haematuria.
| rs11122573 | ( | Not validated | Not validated | Not validated | Not validated |
| rs708498 | ( | Not validated | |||
| rs845552 | ( | Not validated | Not validated | ||
| rs147596965 | ( | Not validated | |||
| rs77530448 | ( | Not validated | Not validated | ||
| rs7829759 | ( | Not validated | Not validated | ||
| rs79604958 | ( | Not validated | Not validated | ||
| rs12591436 | ( | Not validated | Not validated | ||
Late Haematuria grade≥1 (all considered SNPs reported in the table) and to overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (.
Figure 6Results for late nocturia grade≥2 from the Deep Sparse AutoEncoder. The 43 considered SNPs are reported in the x-axis and the averaged Reconstruction Errors (RE) are reported in the y-axis (top panel), red columns refer to patients with toxicity, while blue columns refer to patients without toxicity. In the lower panel the difference between averaged Reconstruction Errors between the two classes are represented for each SNP (i.e., differences between red and blue columns). For most SNPs, the difference is close to zero (red line in the bottom panel of the figure). The chosen thresholds in this difference (i.e., highest 30, 20, 10, and 5% differences) are selecting SNPs associated to the toxicity outcome. Green circles refer to SNPs that were previously identified as associated with late nocturia, while blue circles refer to SNPs that were previously associated with overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (24). Red stars indicate SNPs (either specific for this endpoint or related to overall toxicity) defining patients with toxicity as outliers with respect to the characteristics of patients without toxicity. Labels show SNPs that not directly associated with late nocturia/overall toxicity, but contributing to their identification. The label states for which toxicity endpoint the SNPs were originally associated with in the literature: BLEE=rectal bleeding, FREQ=urinary frequency, HEMA=haematuria, STREAM=decreased urinary stream.
Results from Deep Sparse AutoEncoder testing of SNPs associated with Late Nocturia.
| rs1799983 | ( | Not validated | Not validated | Not validated | |
| rs1045485 | ( | Not validated | Not validated | Not validated | Not validated |
| rs10497203 | ( | Not validated | Not validated | ||
| rs264651 | ( | Not validated | Not validated | ||
| rs77530448 | ( | Not validated | Not validated | ||
| rs11219068 | ( | Not validated | Not validated | ||
Late Nocturia grade≥2 (all considered SNPs reported in the table) and to overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (.
Figure 7Results for late decreased urinary stream grade≥1 from the Deep Sparse AutoEncoder. The 43 considered SNPs are reported in the x-axis and the averaged Reconstruction Errors (RE) are reported in the y-axis (top panel), red columns refer to patients with toxicity, while blue columns refer to patients without toxicity. In the lower panel the difference between averaged Reconstruction Errors between the two classes are represented for each SNP (i.e., differences between red and blue columns). For most SNPs, the difference is close to zero (red line in the bottom panel of the figure). The chosen thresholds in this difference (i.e., highest 30, 20, 10, and 5% differences) are selecting SNPs associated to the toxicity outcome. Green circles refer to SNPs that were previously identified as associated with late decreased urinary stream, while blue circles refer to SNPs that were previously associated with overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (24). Red stars indicate SNPs (either specific for this endpoint or related to overall toxicity) defining patients with toxicity as outliers with respect to the characteristics of patients without toxicity. Labels show SNPs that not directly associated with late decreased urinary stream /overall toxicity, but contributing to their identification. The label states for which toxicity endpoint the SNPs were originally associated with in the literature: BLEE=rectal bleeding FREQ=urinary frequency, HEMA=haematuria, NOCT=nocturia.
Results from Deep Sparse AutoEncoder testing of SNPs associated with Late Decreased Urinary Stream*.
| rs7720298 | ( | Not validated | Not validated | Not validated | Not validated |
| rs17362923 | ( | Not validated | Not validated | Not validated | Not validated |
| rs76273496 | ( | Not validated | |||
| rs144596911 | ( | Not validated | Not validated | Not validated | Not validated |
| rs62091368 | ( | Not validated | Not validated | Not validated | Not validated |
| rs141342719 | ( | Not validated | Not validated | Not validated | Not validated |
| rs673783 | ( | Not validated | Not validated | Not validated | |
| rs10969913 | ( | Not validated | Not validated | Not validated | Not validated |
| rs77530448 | ( | Not validated | Not validated | Not validated | |
| rs6535028 | ( | Not validated | Not validated | Not validated | |
Late Decreased Urinary Stream grade≥1 (all considered SNPs reported in the table) and to overall toxicity as defined by calculation of the Standardized Total Average Toxicity (STAT) score (.