| Literature DB >> 26562156 |
Jessie J Hsu1,2,3, Dianne M Finkelstein1,2, David A Schoenfeld1,2.
Abstract
One goal of cluster analysis is to sort characteristics into groups (clusters) so that those in the same group are more highly correlated to each other than they are to those in other groups. An example is the search for groups of genes whose expression of RNA is correlated in a population of patients. These genes would be of greater interest if their common level of RNA expression were additionally predictive of the clinical outcome. This issue arose in the context of a study of trauma patients on whom RNA samples were available. The question of interest was whether there were groups of genes that were behaving similarly, and whether each gene in the cluster would have a similar effect on who would recover. For this, we develop an algorithm to simultaneously assign characteristics (genes) into groups of highly correlated genes that have the same effect on the outcome (recovery). We propose a random effects model where the genes within each group (cluster) equal the sum of a random effect, specific to the observation and cluster, and an independent error term. The outcome variable is a linear combination of the random effects of each cluster. To fit the model, we implement a Markov chain Monte Carlo algorithm based on the likelihood of the observed data. We evaluate the effect of including outcome in the model through simulation studies and describe a strategy for prediction. These methods are applied to trauma data from the Inflammation and Host Response to Injury research program, revealing a clustering of the genes that are informed by the recovery outcome.Entities:
Mesh:
Year: 2015 PMID: 26562156 PMCID: PMC4643008 DOI: 10.1371/journal.pone.0141874
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Concordance Maps for Simulated Data with Outcome Included.
Concordance varies from 0% (white) to 100% (black). When the variance ratio (τ 2/σ 2) is small, including outcome produces clearer clusters.
Fig 2Concordance Maps for Simulated Data with Outcome Excluded.
Concordance varies from 0% (white) to 100% (black). When the variance ratio (τ 2/σ 2) is small, including outcome produces clearer clusters.
Simulation Results: Parameter estimates resulting from simulation for the model with and without outcome with N = 80 patients, J = 50 genes, and 100 replications with 1000 iterations each.
| Parameter | True value | Mean | SE | Mean of SE |
|---|---|---|---|---|
|
| 1 | 0.99 | 0.03 | 0.03 |
|
| 4 | 4.09 | 0.38 | 0.35 |
|
| -5 | -4.99 | 0.12 | 0.13 |
|
| 0 | -0.03 | 0.13 | 0.13 |
|
| 5 | 4.97 | 0.14 | 0.14 |
Fig 3Concordance Map For Glue Trauma Data with Outcome Included.
List of Genes Sorted by Increasing β.
|
| Gene Symbol | Gene Name |
|---|---|---|
| -8.544 | MMP8 | Matrix metallopeptidase 8 (neutrophil collagenase) |
| -8.544 | MMP8 | Matrix metallopeptidase 8 (neutrophil collagenase) |
| -8.544 | OLFM4 | Olfactomedin 4 |
| -8.544 | LCN2 | Lipocalin 2 |
| -8.544 | CD24 | CD24 molecule |
| -8.544 | LTF | Lactotransferrin |
| -8.544 | CEACAM8 | Carcinoembryonic antigen-related cell adhesion molecule 8 |
| -8.544 | TCN1 | Transcobalamin I (vitamin B12 binding protein, R binder family) |
| -8.544 | CD24 | CD24 molecule |
| -8.544 | CEACAM6 | Carcinoembryonic antigen-related cell adhesion molecule 6 |
| -3.047 | PTGS2 | Prostaglandin-endoperoxide synthase 2 (prostaglandin G/H synthase and cyclooxygenase) |
| -3.047 | PTGS2 | Prostaglandin-endoperoxide synthase 2 (prostaglandin G/H synthase and cyclooxygenase) |
| -0.219 | HLA-DQB1 | Major histocompatibility complex, class II, DQ beta 1 |
| -0.219 | HLA-DQA1 | Major histocompatibility complex, class II, DQ alpha 1 |
| 2.155 | OLAH | Oleoyl-ACP hydrolase |
| 2.155 | OLAH | Oleoyl-ACP hydrolase |
| 2.222 | IFIT2 | Interferon-induced protein with tetratricopeptide repeats 2 |
| 2.222 | HLA-DRA | Major histocompatibility complex, class II, DR alpha |
| 2.222 | HLA-DPA1 | Major histocompatibility complex, class II, DP alpha 1 |
| 2.222 | EPSTI1 | Epithelial stromal interaction 1 (breast) |
| 2.222 | HLA-DRA | Major histocompatibility complex, class II, DR alpha |
| 2.222 | HLA-DRB1 | Major histocompatibility complex, class II, DR beta 1 |
| 2.222 | OAS3 | 2’-5’-Oligoadenylate synthetase 3, 100kDa |
| 2.222 | HLA-DRB1 | Major histocompatibility complex, class II, DR beta 1 |
| 2.222 | HLA-DRB1 | Major histocompatibility complex, class II, DR beta 1 |
| 2.222 | HLA-DPB1 | Major histocompatibility complex, class II, DP beta 1 |
| 2.222 | PMAIP1 | Phorbol-12-myristate-13-acetate-induced protein 1 |
| 2.222 | HLA-DMB | Major histocompatibility complex, class II, DM beta |
| 2.222 | HLA-DMA | Major histocompatibility complex, class II, DM alpha |
| 2.222 | HLA-DRB1 | Major histocompatibility complex, class II, DR beta 1 |
| 2.222 | GNLY | Granulysin |
| 2.222 | TGFBI | Transforming growth factor, beta-induced, 68kDa |
| 2.222 | HLA-DPA1 | Major histocompatibility complex, class II, DP alpha 1 |
| 2.222 | GNLY | Granulysin |
| 2.222 | OAS1 | 2’-5’-oligoadenylate synthetase 1, 40/46kDa |
| 2.222 | CD74 | CD74 molecule, major histocompatibility complex, class II invariant chain |
| 2.222 | HLA-DQB1 | Major histocompatibility complex, class II, DQ beta 1 |
| 2.701 | TDRD9 | Tudor domain containing 9 |
| 2.701 | ANKRD55 | Ankyrin repeat domain 55 |
| 2.701 | VNN1 | Vanin 1 |
| 2.701 | NSUN7 | NOP2/Sun domain family, member 7 |
| 2.701 | VNN1 | Vanin 1 |
| 2.701 | CDK5RAP2 | CDK5 Regulatory subunit associated protein 2 |
| 2.701 | OLAH | Oleoyl-ACP hydrolase |
| 2.701 | CDK5RAP2 | CDK5 regulatory subunit associated protein 2 |
| 2.701 | IL1R2 | Interleukin 1 receptor, type II |
| 2.701 | LOC399972 | Hs.585206 hypothetical LOC399972 |
| 2.701 | MIAT | Myocardial infarction associated transcript (non-protein coding) |
| 2.701 | IL1R2 | Interleukin 1 receptor, type II |
| 2.701 | ENSG000002542 | Homo sapiens, clone IMAGE:4696931, mRNA |
| 2.701 | NAIP | NLR family, apoptosis inhibitory protein |
| 2.701 | SLC26A8 | Solute carrier family 26, member 8 |
| 2.701 | LRG1 | Leucine-rich alpha-2-glycoprotein 1 |
| 2.701 | GRB10 | Growth factor receptor-bound protein 10 |
| 2.701 | ATP6V1C1 | ATPase, H+ transporting, lysosomal 42kDa, V1 subunit C1 |
| 2.701 | IL1R1 | Interleukin 1 receptor, type I |
| 2.701 | HGF | Hepatocyte growth factor (hepapoietin A; scatter factor) |
| 2.701 | PDGFC | Platelet derived growth factor C |
| 2.701 | GALNT14 | UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 14 |
| 2.701 | DACH1 | Dachshund homolog 1 (Drosophila) |
| 2.701 | AGFG1 | ArfGAP with FG repeats 1 |
| 2.701 | LOC100127983 | Hypothetical protein LOC642730 |
| 2.701 | FOLR3 | Folate receptor 3 (gamma) |
| 2.701 | SIPA1L2 | Signal-induced proliferation-associated 1 like 2 |
| 3.559 | IFIT1 | Interferon-induced protein with tetratricopeptide repeats 1 |
| 3.559 | IFI44L | Interferon-induced protein 44-like |
| 3.559 | RSAD2 | Radical S-adenosyl methionine domain containing 2 |
| 3.559 | RSAD2 | Radical S-adenosyl methionine domain containing 2 |
| 3.559 | IFI44 | Interferon-induced protein 44 |
| 3.559 | HERC5 | Hect domain and RLD 5 |
| 3.559 | IFIT3 | Interferon-induced protein with tetratricopeptide repeats 3 |
| 3.559 | IFIT3 | Interferon-induced protein with tetratricopeptide repeats 3 |
| 3.559 | EPSTI1 | Epithelial stromal interaction 1 (breast) |
| 3.559 | IFIT2 | Interferon-induced protein with tetratricopeptide repeats 2 |
| 3.559 | CMPK2 | Cytidine monophosphate (UMP-CMP) kinase 2, mitochondrial |
| 3.559 | MX1 | Myxovirus (influenza virus) resistance 1, interferon-inducible protein p78 |
| 3.559 | IFI6 | Interferon, alpha-inducible protein 6 |
| 3.559 | XAF1 | XIAP associated factor 1 |
| 3.559 | ISG15 | ISG15 ubiquitin-like modifier |
| 3.559 | IFI44 | Hs.82316 Interferon-induced protein 44 |
| 3.559 | XAF1 | XIAP associated factor 1 |
| 3.559 | IFIT5 | Interferon-induced protein with tetratricopeptide repeats 5 |
| 3.559 | XAF1 | XIAP associated factor 1 |
| 3.559 | OAS2 | 2’-5’-oligoadenylate synthetase 2, 69/71kDa |
| 3.559 | IFIT5 | Interferon-induced protein with tetratricopeptide repeats 5 |
| N/A | PCOLCE2 | Procollagen C-endopeptidase enhancer 2 |
| N/A | THBS1 | Thrombospondin 1 |