| Literature DB >> 25983544 |
Mateusz Makowski1, Kellie J Archer1.
Abstract
The cytokinesis-block micronucleus (CBMN) assay can be used to quantify micronucleus (MN) formation, the outcome measured being MN frequency. MN frequency has been shown to be both an accurate measure of chromosomal instability/DNA damage and a risk factor for cancer. Similarly, the Agilent 4×44k human oligonucleotide microarray can be used to quantify gene expression changes. Despite the existence of accepted methodologies to quantify both MN frequency and gene expression, very little is known about the association between the two. In modeling our count outcome (MN frequency) using gene expression levels from the high-throughput assay as our predictor variables, there are many more variables than observations. Hence, we extended the generalized monotone incremental forward stagewise method for predicting a count outcome for high-dimensional feature settings.Entities:
Keywords: Poisson regression; gene expression; high-throughput; micronuclei; penalized model
Year: 2015 PMID: 25983544 PMCID: PMC4415688 DOI: 10.4137/CIN.S17278
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Number of predictors correctly identified as nonzero. This figure shows the distribution of the number of predictors correctly identified as nonzero over 100 simulations. There were five predictors that were set as nonzero. Boxplots are separated by the type of distribution used to generate the data and the number of observations.
Figure 3Accuracy of count predictions. This figure shows the distribution of the sum of residuals squared over 100 simulations using a learning data set and an independent test data set. Boxplots are separated by the type of model fit to the data and the number of observations. The results for glmpath with Gaussian family using an offset are not displayed because both values are above 50000.
Figure 2Number of predictors incorrectly identified as nonzero. This figure shows the distribution of the number of predictors incorrectly identified as nonzero over 100 simulations. There were 95 predictors for which their coefficients were set to zero. Boxplots are separated by the type of distribution used to generate the data and the number of observations.
Figure 4Histogram of MN counts.
Figure 5Plot of actual MN counts versus predicted MN counts using GMIFS.
Genes identified as associated with MN count by both GMIFS and glmpath.
| PROBE ID | GENE SYMBOL | GENE NAME | ASSOCIATED WITH CANCER | GMIFS | GLMPATH |
|---|---|---|---|---|---|
| A-23-P100196 | USP10 | ubiquitin specific peptidase 10 | Glioblastoma multiforme | X | X |
| A-23-P138967 | SDHD | succinate dehydrogenase complex | Tumor Suppressor | X | X |
| A-23-P42331 | HMGA1 | high mobility group AT-hook 1 | Pancreatic Adenocarcinoma | X | X |
| A-23-P9293 | TJP2 | tight junction protein 2 | Breast | X | X |
| A-24-P19410 | CBX7 | chromobox homolog 7 | Carcinomas | X | X |
| A-24-P214858 | TREML2 | triggering receptor expressed on myeloid cells-like 2 | Pancreatic | X | X |
| A-24-P2463 | WHSC1 | Wolf-Hirschhorn syndrome candidate 1 | Carcinogenesis | X | X |
| A-24-P397584 | TBCC | tubulin folding cofactor C | None Found | X | X |
| A-24-P398064 | KIAA0258 | KIAA0258 | Colorectal | X | X |
| A-32-P18547 | C21ORF57 | chromosome 21 open reading frame 57 | Breast | X | X |
| A-23-P103824 | FAU | Finkel-Biskis-Reilly murine sarcoma virus (FBR-MuSV) ubiquitously expressed | None Found | X | |
| A-23-P209394 | CFLAR | CASP8 and FADD-like apoptosis regulator | Human cancers | X | |
| A-23-P79911 | PSMF1 | proteasome (prosome, macropain) inhibitor subunit 1 (PI31) | Breast | X | |
| A-24-P202567 | ITPKC | inositol 1,4,5-trisphosphate 3-kinase C | Cervical | X | |
| A-24-P31235 | EIF5A | eukaryotic translation initiation factor 5A | Chronic myeloid leukemia | X | |
| A-24-P405054 | C1ORF144 | chromosome 1 open reading frame 144 | Mantle cell lymphoma | X | |
| A-32-P156549 | C1ORF144 | X | |||
| A-23-P118313 | GABARAPL2 | GABA(A) receptor-associated protein-like 2 | Lung | X | |
| A-23-P143817 | MYLK | myosin, light polypeptide kinase | Gastric | X | |
| A-23-P156809 | LOC642880 | similar to FKSG62 | None Found | X | |
| A-23-P394304 | PDZK1IP1 | PDZK1 interacting protein 1 | Thyroid | X | |
| A-23-P39665 | SLC11A1 | solute carrier family 11, member 1 | Esophageal | X | |
| A-23-P67529 | KCNN4 | potassium intermediate/small conductance calcium-activated channel, subfamily N, member 4 | Colorectal | X | |
| A-24-P594683 | LOC645592 | similar to peptidylprolyl isomerase A isoform 1 | X | ||
| A-24-P708161 | X | ||||
| A-24-P98086 | GNA12 | guanine nucleotide binding protein (G protein) alpha 12 | Oral | X | |
| A-32-P10067 | X | ||||
| A-32-P137849 | X | ||||
| A-32-P169754 | LOC145221 | EST | X | ||
| A-32-P208078 | MTHFR | 5,10-methylenetetrahydrofolate reductase (NADPH) | Breast | X |