| Literature DB >> 16549007 |
Abstract
BACKGROUND: Genetic markers hold great promise for refining our ability to establish precise prognostic prediction for diseases. The development of comprehensive gene expression microarray technology has allowed the selection of relevant marker genes from a large pool of candidate genes in early-phased, developmental prognostic marker studies. The primary analytical task in such studies is to select a small fraction of relevant genes, typically from a list of significant genes, for further investigation in subsequent studies.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16549007 PMCID: PMC1544357 DOI: 10.1186/1471-2105-7-156
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
The averaged ACVL for several values of K across 100 simulations. Of the 2,000 genes, 50 genes were configured to be informative. The number of patients was 100.
| Correlation matrix | ||||||
| Exchangeable | Lymphoma data* | |||||
| CC† | MC‡ | CC | MC | CC | MC | |
| 1 | 41.2 | 41.2 | 51.6 | 51.6 | 48.2 | 48.2 |
| 5 | 33.3 | 34.1 | 46.7 | 48.2 | 45.6 | 46.4 |
| 10 | 29.3 | 33.2 | 43.3 | 46.6 | 45.2 | 48.2 |
| 20 | 24.9 | 49.0 | 38.4 | 62.1 | 45.0 | 57.4 |
| 50 | 17.3 | - | 34.6 | - | 41.7 | - |
| 75 | 21.3 | - | 40.1 | - | 42.3 | - |
| 100 | 25.3 | - | 44.0 | - | 43.2 | - |
| 200 | 36.0 | - | 53.3 | - | 47.0 | - |
| 500 | 50.6 | - | 63.6 | - | 53.9 | - |
| 1000 | 56.6 | - | 70.3 | - | 58.8 | - |
*The correlation matrix obtained from the lymphoma data for the top 50 genes in the gene filtering with model (1).
†Cox model with the compound covariate (model (3))
‡Multivariate Cox model (model (6)) for the top K (≤ 20) genes
Figure 1Kaplan-Meier estimates of survival for the low IPI stratum (solid curves) and the high IPI stratum (dashed curves) in the validation set. The P-value pertains to a two-sided log-rank test.
Figure 2Plot of ACVL as K is varied. The ACVL is minimized when K = 75.
Figure 3Kaplan-Meier estimates of survival for the two groups based on the compound covariate (CC) (the low CC group (solid curves) and the high CC group (dashed curves) for all patients (Panel A) and for each stratum by IPI Score (Panel B) in the validation set. The P-value in graph pertains to a two-sided log-rank test. The P-value for a stratified log-rank test is 0.0941.
Figure 4Plot of ACVL as K is varied after adjustment for IPI Score. The ACVL is minimized when K = 85.
Figure 5Kaplan-Meier estimates of survival for the two groups based on the prognostic index (the low prognostic index group (solid curves) and the high prognostic index group (dashed curves) for all patients (Panel A) and for each stratum by IPI Score (Panel B) in the validation set. The P-value in graph pertains to a two-sided log-rank test. The P-value for a stratified log-rank test is 0.0293.
The selected genes from the gene filtering with no adjustment for IPI Score and from that with the adjustment by gene-expression signature.
| Group | Gene filtering | Gene class | |||||
| No Adjust. | Adjust. | MHC-II | Proliferation | Lymph-node | Unknown | Total | |
| I | Selected | Selected | 16 | 20 | 1 | 6 | 43 |
| II | Selected | Not Selected | 1 | 20 | 0 | 11 | 32 |
| III | Not Selected | Selected | 7 | 23 | 0 | 12 | 42 |