| Literature DB >> 32079541 |
Arjun Bhattacharya1, Montserrat García-Closas2,3, Andrew F Olshan4,5, Charles M Perou5,6,7, Melissa A Troester4,7, Michael I Love8,9.
Abstract
BACKGROUND: The relationship between germline genetic variation and breast cancer survival is largely unknown, especially in understudied minority populations who often have poorer survival. Genome-wide association studies (GWAS) have interrogated breast cancer survival but often are underpowered due to subtype heterogeneity and clinical covariates and detect loci in non-coding regions that are difficult to interpret. Transcriptome-wide association studies (TWAS) show increased power in detecting functionally relevant loci by leveraging expression quantitative trait loci (eQTLs) from external reference panels in relevant tissues. However, ancestry- or race-specific reference panels may be needed to draw correct inference in ancestrally diverse cohorts. Such panels for breast cancer are lacking.Entities:
Keywords: Breast cancer; Expression quantitative trait loci (eQTL); Polygenic traits; Survival; Transcriptome-wide analysis (TWAS)
Mesh:
Year: 2020 PMID: 32079541 PMCID: PMC7033948 DOI: 10.1186/s13059-020-1942-6
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1CBCS eQTL results across race and compared with GTEx. a Cis-trans plot of top eQTL by gene stratified by self-reported race. Each point represents the top eQTL for a given gene. The color and size of each point reflects the Benjamini-Bogomolov FDR-adjusted P value (BBFDR) for that eQTL. eGenes with BBFDR < 0.01 are labeled. b Comparison of effect sizes of eGenes with significant cis-eQTLs in CBCS (Y-axis) and GTEx (X-axis) over tissue type, stratified by race. eGenes are colored by the GTEx tissue that shows the largest effect size. GTEx effect sizes on the X-axis are multiplied by the sign of the correlation between the genotypes of the GTEx and CBCS eSNPs
Fig. 2Predictive performance of models in cross-validation, external validation, and across race. a Comparison of cross-validation R2 across race in CBCS. Cross-validation R2 in CBCS WW women (X-axis) and CBCS AA women (Y-axis) for each of the 151 analyzed genes. Scales are logarithmic. Dotted lines represent R2 = 0.01. Colors represent the model with which a given gene can be predicted at R2 > 0.01. b Comparison of validation R2 across race in TCGA for 149 analyzed genes found in TCGA expression data. c Comparison of validation R2 across race in held-out CBCS samples for 50 analyzed genes. d Comparison of R2 of genes in TCGA AA sample imputed from WW models (X-axis) and the AA models (Y-axis). e Comparison of R2 of genes in held-out CBCS AA sample imputed from WW models (X-axis) and the AA models (Y-axis)
Fig. 3Predictive performance of key genes, accounting for sampling variability. Validation R2 across PAM50 molecular subtype and estrogen receptor status, stratified by race, for example genes with highly variable R2 in TCGA (a) and held-out CBCS (b). Squared Spearman correlation (Y-axis), denoted R2, between observed and predicted gene expression is plotted for different genes (X-axis), stratified by PAM50 subtype and estrogen receptor status. Points are colored and shaped according to subtype. Error bars provide 90% confidence intervals inverted from the corresponding permutation test
Genes with GReX found in association with breast cancer-specific survival in AA women
| Region | Gene | Hazard ratio (90% CI)a | GReX | ||
|---|---|---|---|---|---|
| 0.83 (0.73, 0.95) | −2.52 | 1.5 × 10−3 | 0.021 (0.055) | ||
| 1.22 (1.07, 1.41) | 2.76 | 5.4 × 10−4 | 0.011 (0.047) | ||
| 0.85 (0.74, 0.97) | −2.34 | 3.2 × 10−3 | 0.020 (0.033) | ||
| 0.82 (0.72, 0.93) | −2.85 | 3.4 × 10−4 | 0.010 (0.026) |
aHazard ratio and FDR-adjusted 90% confidence intervals, Z-statistic, and P value of association of GReX with breast cancer-specific survival
bCross-validation R2 of gene expression in AA models
Fig. 4GWAS and TWAS results in AA women. a Manhattan plot of traditional GWAS on breast cancer survival. Genomic regions found to be significantly associated with survival in TWAS are represented in various colors. No SNVs reach Benjamini-Hochberg FDR-adjusted genome-wide significance. b Manhattan plot of TWAS on breast cancer survival. Genomic regions found to be significant at FDR-adjusted P < 0.10 are highlighted in red. The blue line represents a cutoff of FDR-adjusted α = 0.05 and the dotted black line represents a cutoff of FDR-adjusted α = 0.10. c Caterpillar plot of log-hazard rates with FDR-adjusted 90% confidence levels (X-axis) and genomic position (Y-axis). Results shown are significant at nominal P < 0.10. Genes highlighted in red represent genes with GReX significantly associated with survival at FDR-adjusted P < 0.10
Genes with GReX found in association with breast cancer-specific survival
| Gene | Closest survival-associated SNPa | Distance to closest survival-associated SNPa | Hazard ratio, adjusting for adjacent GWAS-SNP (90% CI)b | |
|---|---|---|---|---|
| rs202100873 | 87.1 kb | 0.84 (0.74, 0.94) | 0.027 | |
| rs72068647 | 266.9 kb | 1.18 (1.04, 1.33) | 0.046 | |
| rs66487567 | 271.9 kb | 0.88 (0.78, 1.00) | 0.096 | |
| rs376302305 | 89.4 kb | 0.84 (0.75, 0.94) | 0.028 |
aTop survival-associated SNP in cis-region of the given gene from GWAS for survival and distance of top cis-SNP from gene
bFDR-adjusted hazard ratio, 90% confidence interval, and P value for association of GReX and breast cancer-specific survival, adjusting for adjacent survival-associated SNPs