| Literature DB >> 35574500 |
Kellie J Archer1, Anna Eames Seffernick1, Shuai Sun1, Yiran Zhang2.
Abstract
The stage of cancer is a discrete ordinal response that indicates the aggressiveness of disease and is often used by physicians to determine the type and intensity of treatment to be administered. For example, the FIGO stage in cervical cancer is based on the size and depth of the tumor as well as the level of spread. It may be of clinical relevance to identify molecular features from high-throughput genomic assays that are associated with the stage of cervical cancer to elucidate pathways related to tumor aggressiveness, identify improved molecular features that may be useful for staging, and identify therapeutic targets. High-throughput RNA-Seq data and corresponding clinical data (including stage) for cervical cancer patients have been made available through The Cancer Genome Atlas Project (TCGA). We recently described penalized Bayesian ordinal response models that can be used for variable selection for over-parameterized datasets, such as the TCGA-CESC dataset. Herein, we describe our ordinalbayes R package, available from the Comprehensive R Archive Network (CRAN), which enhances the runjags R package by enabling users to easily fit cumulative logit models when the outcome is ordinal and the number of predictors exceeds the sample size, P > N, such as for TCGA and other high-throughput genomic data. We demonstrate the use of this package by applying it to the TCGA cervical cancer dataset. Our ordinalbayes package can be used to fit models to high-dimensional datasets, and it effectively performs variable selection.Entities:
Keywords: LASSO; cumulative logit; penalized models; spike-and-slab; variable inclusion indicators
Year: 2022 PMID: 35574500 PMCID: PMC9097970 DOI: 10.3390/stats5020021
Source DB: PubMed Journal: Stats (Basel) ISSN: 2571-905X
ordinalbayes parameters available for all models.
| Parameter | Description and Default Values |
|---|---|
|
| |
|
| Variance for |
|
| Variance associated with any unpenalized predictors in the MCMC chain (default 10) |
|
| Number of iterations for adaptation (default 5000) |
|
| Number of iterations of the Markov chain to run (default 5000) |
|
| Number of parallel chains to run (default 3) |
|
| Number of saved steps for each chain (default 9999) |
|
| The thinning interval for monitors (default 3) |
|
| Run the MCMC on multiple processors (default |
|
| Specify which penalized ordinal model to fit (default |
|
| If |
|
| If |
|
| An integer value for the random seed to ensure reproducibility |
|
| If |
ordinalbayes parameters for each penalized ordinal Bayesian model.
| Model | Parameters in Ordinalbayes Call to Specify | Description |
|---|---|---|
|
| ||
| lasso | The penalty parameter | |
|
| ||
| normalss |
| The variance for the spike (set to some small positive value, e.g., 0.01) |
|
| The variance for the slab (set to some large positive value, e.g., 10) | |
|
| Use a constant prior for | |
|
| Use a random prior for | |
|
| ||
| dess |
| The penalty parameter |
|
| The parameter value for the spike, e.g., | |
|
| Use a constant prior for | |
|
| Use a random prior for | |
|
| ||
| regressvi |
| The penalty parameter |
|
| Use a constant prior for | |
|
| Use a random prior for | |
Transcripts significant from the regression-based variable inclusion indicator Bayesian ordinal model when testing H0 : γ = 0 versus H : γ = 1 using the Bayes factor and a threshold of 4. Annotation information obtained on 28 February 2022 from https://www.ncbi.nlm.nih.gov/gene, https://www.genecards.org, and https://lncipedia.org.
| Ensemble ID | Gene Symbol | Chr |
|
|---|---|---|---|
|
| |||
| ENSG00000076344 |
| 16 | 0.179 |
| ENSG00000077274 |
| X | 0.264 |
| ENSG00000101888 |
| X | 0.194 |
| ENSG00000115548 |
| 2 | 0.174 |
| ENSG00000122884 |
| 10 | 0.186 |
| ENSG00000125430 |
| 17 | 0.286 |
| ENSG00000131370 |
| 3 | 0.175 |
| ENSG00000135443 |
| 12 | 0.334 |
| ENSG00000136457 |
| 17 | 0.179 |
| ENSG00000138398 |
| 2 | 0.240 |
| ENSG00000150636 |
| 18 | 0.281 |
| ENSG00000161277 |
| 19 | 0.283 |
| ENSG00000163510 |
| 2 | 0.301 |
| ENSG00000164485 |
| 6 | 0.196 |
| ENSG00000164651 |
| 7 | 0.231 |
| ENSG00000166091 |
| 14 | 0.215 |
| ENSG00000166342 |
| 18 | 0.197 |
| ENSG00000171121 |
| 3 | 0.186 |
| ENSG00000177173 | Pseudogene, parent | 1 | 0.258 |
| ENSG00000180229 |
| 15 | 0.196 |
| ENSG00000188817 |
| 3 | 0.236 |
| ENSG00000197360 |
| 19 | 0.214 |
| ENSG00000203601 | LINC00970 | 1 | 0.316 |
| ENSG00000225449 |
| 2 | 0.235 |
| ENSG00000230201 | Pseudogene, parent | 17 | 0.286 |
| ENSG00000233996 | Pseudogene, parent | 2 | 0.248 |
| ENSG00000236138 |
| 3 | 0.247 |
| ENSG00000236819 | LINC01563 | 17 | 0.311 |
| ENSG00000250602 | lnc-ALDH7A1-1 | 5 | 0.246 |
| ENSG00000253923 | Pseudogene, parent | 8 | 0.302 |
| ENSG00000256980 |
| 6 | 0.207 |
| ENSG00000259083 | lnc-TRAPPC6B-1 | 14 | 0.263 |
| ENSG00000259134 | LINC00924 | 15 | 0.352 |
| ENSG00000260484 | lnc-OPRK1-2 | 8 | 0.263 |
| ENSG00000263612 | lnc-ZNF517-4 | 8 | 0.228 |
| ENSG00000264049 | MIR4737 | 17 | 0.266 |
| ENSG00000264954 |
| 17 | 0.221 |
| ENSG00000265579 | lnc-CBLN2-1 | 18 | 0.227 |
| ENSG00000271711 | Pseudogene, parent | 3 | 0.264 |
| ENSG00000272071 | lnc-PAPD7-2 | 5 | 0.279 |
| ENSG00000276517 | Lnc-TTC27-9 | 2 | 0.221 |
| 1 | 2 | 3 | |
|---|---|---|---|
| 1 | 120 | 15 | 2 |
| 2 | 4 | 32 | 13 |
| 3 | 0 | 14 | 42 |
| 1 | 2 | 3 | |
|---|---|---|---|
| 1 | 124 | 28 | 0 |
| 2 | 0 | 20 | 0 |
| 3 | 0 | 13 | 57 |
| 1 | 2 | 3 | |
|---|---|---|---|
| 1 | 120 | 9 | 1 |
| 2 | 4 | 45 | 7 |
| 3 | 0 | 7 | 49 |