| Literature DB >> 29844655 |
Yi Yang1, Saonli Basu1, Lisa Mirabello2, Logan Spector3, Lin Zhang1.
Abstract
Osteosarcoma is considered to be the most common primary malignant bone cancer among children and young adults. Previous studies suggest growth spurts and height to be risk factors for osteosarcoma. However, studies on the genetic cause are still limited given the rare occurrence of the disease. In this study, we investigated in a family trio data set that is composed of 209 patients and their unaffected parents and conducted a genome-wide association study (GWAS) to identify genetic risk factors for osteosarcoma. We performed a Bayesian gene-based GWAS based on the single-nucleotide polymorphism (SNP)-level summary statistics obtained from a likelihood ratio test of the trio data, which uses a hierarchically structured prior that incorporates the SNP-gene hierarchical structure. The Bayesian approach has higher power than SNP-level GWAS analysis due to the reduced number of tests and is robust by accounting for the correlations between SNPs so that it borrows information across SNPs within a gene. We identified 217 genes that achieved genome-wide significance. Ingenuity pathway analysis of the gene set indicated that osteosarcoma is potentially related to TP53, estrogen receptor signaling, xenobiotic metabolism signaling, and RANK signaling in osteoclasts.Entities:
Keywords: Bayesian HSVS; fused lasso; gene-based GWAS; multiple testing; trio data
Year: 2018 PMID: 29844655 PMCID: PMC5967162 DOI: 10.1177/1176935118775103
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1.-log(1-Pˆg) for the 11 119 genes. The horizontal line at 3.0 ) indicates the critical value for the selection of genes. Genes are indexed in order of their genomic locations.
Figure 2.Examples of effect estimates for SNPs within 4 genes identified by the HSVS. The x-axis represents the index of SNPs in an order that reflects the relationship of their adjacent positions in that particular gene. The solid black dot indicates the z score of the association test. The asterisk indicates the posterior median. The vertical line indicates the 95% credible interval. The horizontal line indicates the marker for 0.
The top 10 ingenuity pathway analysis (IPA) canonical pathways enriched with the 217 selected genes.
| IPA canonical pathways | Ratio | |
|---|---|---|
| Xenobiotic metabolism signaling | .047 | 0.028 |
| PXR/RXR activation | .051 | 0.062 |
| Estrogen receptor signaling | .067 | 0.039 |
| LPS/IL-1–mediated inhibition of RXR function | .095 | 0.027 |
| TR/RXR activation | .098 | 0.041 |
| Hepatic cholestasis | .100 | 0.031 |
| RANK signaling in osteoclasts | .103 | 0.040 |
| .114 | 0.111 | |
| Neuropathic pain signaling in dorsal horn neurons | .124 | 0.035 |
| Autophagy | .125 | 0.05 |
The selected ingenuity pathway analysis (IPA) upstream regulators.
| Upstream regulator | Molecule type | |
|---|---|---|
| TP53 | Transcription regulator | 8.27E−06 |
| ERN1 | Kinase | 1.37E−04 |
| STAT6 | Transcription regulator | 5.01E−04 |
| IL4 | Cytokine | 5.14E−04 |
| TGFB1 | Growth factor | 6.79E−04 |
| Topotecan | Chemical drug | 1.14E−03 |
| LY294002 | Chemical—kinase inhibitor | 1.36E−03 |
| Dexamethasone | Chemical drug | 2.50E−03 |
| RARA | Ligand-dependent nuclear receptor | 3.81E−03 |
| Camptothecin | Chemical drug | 5.15E−03 |
| NFYB | Transcription regulator | 7.56E−03 |
| CREB1 | Transcription regulator | 8.11E−03 |
Figure 3.The Venn diagram that shows the number of significant genes identified by the HSVS method, the SNP-level GWAS with the Benjamini-Hochberg procedure, and the minimum P value method with with the Benjamini-Hochberg procedure. The number in each nonoverlapping region is the number of genes exclusively identified by that particular method. For example, the HSVS method was able to identify 65 genes not identifiable by the other 2 methods. GWAS indicates genome-wide association study; HSVS, hierarchical structured variable selection; SNP, single-nucleotide polymorphism.
Figure 4.Examples of effect estimates for SNPs within 4 genes exclusively identified by the minimum P value method with the Benjamini-Hochberg procedure. The x-axis represents the index of SNPs in an order that reflects the relationship of their adjacent positions in that particular gene. The solid black dot indicates the z score of the association test. The asterisk indicates the posterior median. The vertical line indicates the 95% credible interval. The horizontal line indicates the marker for 0.
Comparison of the average power and type I error rate of 3 methods averaging over 20 simulations.
| Method | Power | Type I error |
|---|---|---|
| HSVS | 0.852 | 0.058 |
| SNP-level GWAS | 0.782 | 0.042 |
| Minimum | 0.836 | 0.070 |