| Literature DB >> 33919687 |
Lisa Bang1, Manu Shivakumar2, Tullika Garg3,4, Dokyoon Kim2,5.
Abstract
Urothelial carcinoma of the bladder (UC) is the fifth most common cancer in the United States. Germline variants, especially rare germline variants, may account for a portion of the disparity seen among patients in terms of UC incidence, presentation, and outcomes. The objectives of this study were to identify rare germline variant associations in UC incidence and to determine its association with clinical outcomes. Using exome sequencing data from the DiscovEHR UC cohort (n = 446), a European-ancestry, North American population, the complex influence of germline variants on known clinical phenotypes were analyzed using dispersion and burden metrics with regression tests. Outcomes measured were derived from the electronic health record (EHR) and included UC incidence, age at diagnosis, and overall survival (OS). Consequently, key rare variant association genes were implicated in MR1 and ADGRL2. The Kaplan-Meier survival analysis reveals that individuals with MR1 germline variants had significantly worse OS than those without any (log-rank p-value = 3.46 × 10-7). Those with ADGRL2 variants were found to be slightly more likely to have UC compared to a matched control cohort (FDR q-value = 0.116). These associations highlight several candidate genes that have the potential to explain clinical disparities in UC and predict UC outcomes.Entities:
Keywords: biobank; bladder cancer; electronic health record; rare variant analysis
Year: 2021 PMID: 33919687 PMCID: PMC8069815 DOI: 10.3390/cancers13081864
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Schematic overview of the association study to identify genetic correlations between germline variants and urothelial carcinoma (UC) endophenotypes. For the classic GWAS, PLINK files from the single nucleotide polymorphisms (SNP) array were used; for the rare variant association, WES sequencing was used to identify rare variants and BioBin used to bin them into biologically relevant gene bins for association testing using either a burden-based or dispersion-based test. For the association study of germline variants with UC incidence, the germline variants found in the UC cohort (n = 446) were compared to a matched non-cancer cohort (n = 892).
Clinical phenotypic variables of the urothelial cancer cohort within DiscovEHR study derived from the electronic health record (EHR) data.
| All in Cohort | Males (%) | Females (%) | ||
|---|---|---|---|---|
| Age at Diagnosis | <50 | 21 (4.7%) | 14 (66.7%) | 7 (33.3%) |
| 50–59.9 | 66 (14.8%) | 57 (86.4%) | 9 (13.6%) | |
| 60–69.9 | 135 (30.2%) | 98 (72.5%) | 37 (27.4%) | |
| 70+ | 224 (50.2%) | 184 (82.1%) | 40 (17.9%) | |
| Sex | 446 | 353 (79.1%) | 93 (20.9%) | |
| Race | White | 446 (100%) | ||
| Smoking Status | Smoking | 302 (67.7%) | 257 (85.1%) | 48 (14.9%) |
| Nonsmoking | 144 (32.3%) | 97 (67.4%) | 45 (32.6%) | |
| Grade | Grade I | 42 (9.4%) | 32 (76.2%) | 10 (23.8%) |
| Grade II | 117 (26.2%) | 88 (75.2%) | 29 (24.8%) | |
| Grade III | 29 (6.5%) | 24 (82.8%) | 5 (18.2%) | |
| Grade IV | 233 (52.2%) | 190 (81.5%) | 43 (18.5%) | |
| Other | 25 (5.6%) | 19 (76.0%) | 6 (24.0%) | |
| Stage | Ta | 213 (47.8%) | 163 (76.5%) | 50 (23.5%) |
| Tis | 9 (2%) | 6 (66.7%) | 3 (33.3%) | |
| T1 | 99 (22.2%) | 81 (81.8%) | 18 (18.2%) | |
| T2 | 42 (9.4%) | 34 (81.0%) | 6 (19.0%) | |
| T3 | 17 (3.8%) | 13 (76.4%) | 4 (23.6%) | |
| T4 | 7 (1.6%) | 6 (85.7%) | 1 (14.3%) | |
| Unknown | 59 (13.2%) | 50 (84.7%) | 9 (15.3%) | |
| Vital Status | Alive | 315 (70.6%) | 246 (78.1%) | 69 (21.9%) |
| Deceased | 131 (29.4%) | 107 (81.7%) | 24 (18.3%) | |
| Recurrence | None or Unknown | 425 (95.3%) | 334 (73.1%) | 90 (26.9%) |
| 1+ | 21 (4.7%) | 18 (85.7%) | 3 (14.3%) |
Results of rare germline variant analysis for UC incidence by gene using dispersion metrics sequence kernel association test (SKAT) and optimal-SKAT (SKAT-O), with SKAT-O p-value adjusted for false discovery rate.
| Bin | Num_Loci | SKAT_Logistic | SKAT-O | FDR |
|---|---|---|---|---|
|
| 59 | 0.000385 | 7.32 × 10−6 | 0.11797 |
|
| 67 | 0.000129 | 6.53 × 10−5 | 0.460917 |
|
| 70 | 4.01 × 10−5 | 8.58 × 10−5 | 0.460917 |
|
| 9 | 0.000203 | 0.000203 | 0.59549 |
|
| 19 | 0.000125 | 0.000262 | 0.59549 |
|
| 50 | 0.000118 | 0.00027 | 0.59549 |
|
| 35 | 0.000138 | 0.000282 | 0.59549 |
|
| 133 | 0.000129 | 0.000307 | 0.59549 |
|
| 28 | 0.000332 | 0.000332 | 0.59549 |
Figure 2Functional annotation of ADGRL2 (a). Lollipop diagram of variants of interest binned to gene ADGRL2 found within DiscovEHR urothelial carcinoma cohort. Patients with such variants in ADGRL2 exhibited a marginally greater likelihood of UC incidence. Blue variant markers denote synonymous variants (resulting in the same amino acid codon after translation) and green variants are nonsynonymous mutations. (b). Predicted consequences of VEP-annotated variants in ADGRL2 found in the DiscovEHR UC cohort. Four variants were predicted to be regulatory region variants and 18 predicted to be missense variants.
Figure 3Comprehensive analyses of MR1 (a). Kaplan–Meier survival curves after diagnosis with UC for MR1 rare germline variant carrier patients compared to patients without germline variants in MR1. (b). MR1 gene expression in RPKM from RNA-seq of 95 human individuals of 27 tissues from Human Protein Atlas, with top results including urinary bladder, spleen, adrenal gland. (c). MuPIT visualization of MR1 protein with predicted protein impact areas from variants in MR1 found in DiscovEHR UC cohort (green) and predicted areas from TCGA (purple).
Cox proportional hazards for MR1 variant carrier status, patient sex, and smoking status.
| Coefficient |
| Coefficient SE | z |
| Lower 0.95 | Upper 0.95 | |
|---|---|---|---|---|---|---|---|
| Sex | −0.05 | 0.95 | 0.24 | −0.20 | 0.84 | −0.51 | 0.41 |
| Smoking | −0.17 | 0.85 | 0.22 | −0.77 | 0.44 | −0.60 | 0.26 |
|
| −1.44 | 0.24 | 0.31 | −4.72 | <0.005 | −2.04 | −0.84 |
Figure 4Multi-tissue- eQTL analysis for rs3747956, variant within MR1 found in UC cohort calculated to have the greatest likelihood of impacting mRNA expression of MR1 (p = 1 × 10−12). Top result (NES = 0.335) found from GTEx adrenal gland samples (n = 175) and second from top result (NES = 0.323) found from GTEx spleen samples.