| Literature DB >> 17176468 |
Abstract
BACKGROUND: Microarray studies provide a way of linking variations of phenotypes with their genetic causations. Constructing predictive models using high dimensional microarray measurements usually consists of three steps: (1) unsupervised gene screening; (2) supervised gene screening; and (3) statistical model building. Supervised gene screening based on marginal gene ranking is commonly used to reduce the number of genes in the model building. Various simple statistics, such as t-statistic or signal to noise ratio, have been used to rank genes in the supervised screening. Despite of its extensive usage, statistical study of supervised gene screening remains scarce. Our study is partly motivated by the differences in gene discovery results caused by using different supervised gene screening methods.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17176468 PMCID: PMC1764766 DOI: 10.1186/1471-2105-7-537
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Empirical study: validity of supervised gene screening. The percentages of individual genes being included in the 20% top ranked genes computed from 1000 bootstrap samples.
Leukemia data: number of genes and overlaps identified by the logistic + TGDR models using genes passed eight different supervised screenings.
| Statistic | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| 1 | 34 | 22 | 25 | 33 | 28 | 24 | 22 | 27 |
| 2 | 35 | 28 | 22 | 23 | 22 | 27 | 29 | |
| 3 | 36 | 25 | 25 | 18 | 27 | 31 | ||
| 4 | 36 | 28 | 24 | 22 | 27 | |||
| 5 | 31 | 24 | 22 | 22 | ||||
| 6 | 33 | 18 | 20 | |||||
| 7 | 35 | 26 | ||||||
| 8 | 36 |
Concordance evaluation of 20% top ranked genes identified using the eight different supervised screening statistics. The numbers are relative fractions of overlapped genes. Marginal: genes are ranked based on marginal statistics; BRI: genes are ranked based on BRI.
| Marginal | BRI | |||||||||||||||
| Statistic | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| 1 | 1.00 | 0.87 | 0.86 | 0.93 | 0.82 | 0.72 | 0.86 | 0.87 | 1.00 | 0.86 | 0.85 | 0.94 | 0.82 | 0.75 | 0.85 | 0.86 |
| 2 | 1.00 | 0.97 | 0.94 | 0.88 | 0.75 | 0.98 | 0.99 | 1.00 | 0.97 | 0.93 | 0.89 | 0.79 | 0.98 | 0.99 | ||
| 3 | 1.00 | 0.93 | 0.90 | 0.76 | 0.98 | 0.97 | 1.00 | 0.91 | 0.90 | 0.79 | 0.98 | 0.97 | ||||
| 4 | 1.00 | 0.85 | 0.74 | 0.93 | 0.94 | 1.00 | 0.86 | 0.76 | 0.91 | 0.92 | ||||||
| 5 | 1.00 | 0.82 | 0.88 | 0.88 | 1.00 | 0.87 | 0.88 | 0.89 | ||||||||
| 6 | 1.00 | 0.76 | 0.75 | 1.00 | 0.78 | 0.78 | ||||||||||
| 7 | 1.00 | 0.98 | 1.00 | 0.97 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.80 | 0.79 | 0.91 | 0.78 | 0.71 | 0.75 | 0.80 | 1.00 | 0.78 | 0.77 | 0.90 | 0.74 | 0.71 | 0.72 | 0.78 |
| 2 | 1.00 | 0.96 | 0.89 | 0.89 | 0.78 | 0.90 | 0.96 | 1.00 | 0.96 | 0.88 | 0.87 | 0.81 | 0.88 | 0.96 | ||
| 3 | 1.00 | 0.88 | 0.88 | 0.79 | 0.91 | 0.96 | 1.00 | 0.87 | 0.88 | 0.81 | 0.89 | 0.95 | ||||
| 4 | 1.00 | 0.84 | 0.75 | 0.83 | 0.89 | 1.00 | 0.82 | 0.78 | 0.81 | 0.88 | ||||||
| 5 | 1.00 | 0.81 | 0.85 | 0.88 | 1.00 | 0.84 | 0.83 | 0.86 | ||||||||
| 6 | 1.01 | 0.75 | 0.79 | 1.00 | 0.76 | 0.82 | ||||||||||
| 7 | 1.00 | 0.88 | 1.00 | 0.85 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.81 | 0.81 | 0.91 | 0.75 | 0.69 | 0.68 | 0.82 | 1.00 | 0.78 | 0.78 | 0.89 | 0.73 | 0.70 | 0.61 | 0.79 |
| 2 | 1.00 | 1.00 | 0.90 | 0.85 | 0.75 | 0.78 | 0.95 | 1.00 | 0.99 | 0.88 | 0.85 | 0.78 | 0.70 | 0.94 | ||
| 3 | 1.00 | 0.90 | 0.85 | 0.75 | 0.79 | 0.94 | 1.00 | 0.89 | 0.86 | 0.78 | 0.70 | 0.94 | ||||
| 4 | 1.00 | 0.82 | 0.73 | 0.74 | 0.90 | 1.00 | 0.82 | 0.75 | 0.67 | 0.89 | ||||||
| 5 | 1.00 | 0.80 | 0.75 | 0.84 | 1.00 | 0.84 | 0.68 | 0.84 | ||||||||
| 6 | 1.00 | 0.64 | 0.77 | 1.00 | 0.61 | 0.80 | ||||||||||
| 7 | 1.00 | 0.73 | 1.00 | 0.65 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.75 | 0.75 | 0.89 | 0.65 | 0.58 | 0.64 | 0.78 | 1.00 | 0.72 | 0.72 | 0.87 | 0.64 | 0.59 | 0.54 | 0.74 |
| 2 | 1.00 | 1.00 | 0.86 | 0.74 | 0.58 | 0.84 | 0.91 | 1.00 | 1.00 | 0.85 | 0.78 | 0.63 | 0.70 | 0.91 | ||
| 3 | 1.00 | 0.86 | 0.74 | 0.58 | 0.84 | 0.90 | 1.00 | 0.85 | 0.78 | 0.63 | 0.70 | 0.91 | ||||
| 4 | 1.00 | 0.72 | 0.60 | 0.74 | 0.87 | 1.00 | 0.72 | 0.61 | 0.63 | 0.85 | ||||||
| 5 | 1.00 | 0.67 | 0.69 | 0.73 | 1.00 | 0.71 | 0.61 | 0.76 | ||||||||
| 6 | 1.00 | 0.50 | 0.61 | 1.00 | 0.46 | 0.66 | ||||||||||
| 7 | 1.00 | 0.75 | 1.00 | 0.61 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
Concordance evaluation of 40% top ranked genes identified using the eight different supervised screening statistics. The numbers are relative fractions of overlapped genes. Marginal: genes are ranked based on marginal statistics; BRI: genes are ranked based on BRI.
| Marginal | BRI | |||||||||||||||
| Statistic | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| 1 | 1.00 | 0.94 | 0.93 | 0.97 | 0.88 | 0.93 | 0.78 | 0.94 | 1.00 | 0.93 | 0.93 | 0.96 | 0.88 | 0.93 | 0.79 | 0.93 |
| 2 | 1.00 | 0.99 | 0.97 | 0.89 | 0.79 | 1.00 | 0.99 | 1.00 | 0.99 | 0.96 | 0.90 | 0.99 | 0.81 | 1.00 | ||
| 3 | 1.00 | 0.96 | 0.89 | 0.79 | 0.99 | 0.99 | 1.00 | 0.96 | 0.90 | 0.99 | 0.81 | 0.99 | ||||
| 4 | 1.00 | 0.89 | 0.78 | 0.97 | 0.96 | 1.00 | 0.90 | 0.96 | 0.80 | 0.97 | ||||||
| 5 | 1.00 | 0.85 | 0.89 | 0.89 | 1.00 | 0.90 | 0.87 | 0.90 | ||||||||
| 6 | 1.00 | 0.79 | 0.79 | 1.00 | 0.81 | 0.99 | ||||||||||
| 7 | 1.00 | 0.99 | 1.00 | 0.81 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.88 | 0.88 | 0.94 | 0.84 | 0.78 | 0.88 | 0.87 | 1.00 | 0.87 | 0.87 | 0.94 | 0.83 | 0.85 | 0.77 | 0.87 |
| 2 | 1.00 | 0.98 | 0.94 | 0.89 | 0.80 | 0.98 | 0.97 | 1.00 | 0.98 | 0.93 | 0.89 | 0.95 | 0.80 | 0.98 | ||
| 3 | 1.00 | 0.94 | 0.90 | 0.80 | 0.97 | 0.97 | 1.00 | 0.93 | 0.90 | 0.96 | 0.81 | 0.97 | ||||
| 4 | 1.00 | 0.88 | 0.79 | 0.94 | 0.92 | 1.00 | 0.87 | 0.91 | 0.79 | 0.93 | ||||||
| 5 | 1.00 | 0.85 | 0.89 | 0.89 | 1.00 | 0.88 | 0.86 | 0.89 | ||||||||
| 6 | 1.00 | 0.80 | 0.78 | 1.00 | 0.79 | 0.93 | ||||||||||
| 7 | 1.00 | 0.95 | 1.00 | 0.81 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.87 | 0.87 | 0.94 | 0.80 | 0.77 | 0.89 | 0.83 | 1.00 | 0.87 | 0.87 | 0.94 | 0.81 | 0.80 | 0.76 | 0.87 |
| 2 | 1.00 | 1.00 | 0.93 | 0.85 | 0.76 | 0.93 | 0.93 | 1.00 | 1.00 | 0.93 | 0.86 | 0.87 | 0.79 | 0.95 | ||
| 3 | 1.00 | 0.93 | 0.85 | 0.76 | 0.93 | 0.93 | 1.00 | 0.93 | 0.86 | 0.87 | 0.78 | 0.96 | ||||
| 4 | 1.00 | 0.83 | 0.77 | 0.93 | 0.88 | 1.00 | 0.84 | 0.85 | 0.78 | 0.92 | ||||||
| 5 | 1.00 | 0.82 | 0.82 | 0.83 | 1.00 | 0.78 | 0.85 | 0.86 | ||||||||
| 6 | 1.00 | 0.79 | 0.72 | 1.00 | 0.71 | 0.82 | ||||||||||
| 7 | 1.00 | 0.86 | 1.00 | 0.80 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.85 | 0.85 | 0.93 | 0.75 | 0.69 | 0.87 | 0.84 | 1.00 | 0.82 | 0.82 | 0.90 | 0.74 | 0.77 | 0.66 | 0.82 |
| 2 | 1.00 | 1.00 | 0.92 | 0.81 | 0.66 | 0.92 | 0.97 | 1.00 | 1.00 | 0.90 | 0.81 | 0.86 | 0.68 | 0.96 | ||
| 3 | 1.00 | 0.91 | 0.80 | 0.65 | 0.92 | 0.98 | 1.00 | 0.90 | 0.81 | 0.86 | 0.68 | 0.96 | ||||
| 4 | 1.00 | 0.78 | 0.68 | 0.92 | 0.90 | 1.00 | 0.78 | 0.86 | 0.67 | 0.90 | ||||||
| 5 | 1.00 | 0.74 | 0.77 | 0.80 | 1.00 | 0.73 | 0.77 | 0.81 | ||||||||
| 6 | 1.00 | 0.69 | 0.64 | 1.00 | 0.60 | 0.83 | ||||||||||
| 7 | 1.00 | 0.90 | 1.00 | 0.70 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
Concordance evaluation of 60% top ranked genes identified using the eight different supervised screening statistics. The numbers are relative fractions of overlapped genes. Marginal: genes are ranked based on marginal statistics; BRI: genes are ranked based on BRI.
| Marginal | BRI | |||||||||||||||
| Statistic | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| 1 | 1.00 | 0.96 | 0.96 | 0.98 | 0.88 | 0.79 | 0.96 | 0.96 | 1.00 | 0.94 | 0.94 | 0.97 | 0.88 | 0.94 | 0.79 | 0.94 |
| 2 | 1.00 | 0.99 | 0.98 | 0.89 | 0.80 | 1.00 | 1.00 | 1.00 | 0.99 | 0.97 | 0.90 | 1.00 | 0.80 | 1.00 | ||
| 3 | 1.00 | 0.97 | 0.89 | 0.80 | 0.99 | 0.99 | 1.00 | 0.97 | 0.90 | 0.99 | 0.80 | 0.99 | ||||
| 4 | 1.00 | 0.89 | 0.80 | 0.98 | 0.98 | 1.00 | 0.89 | 0.97 | 0.80 | 0.97 | ||||||
| 5 | 1.00 | 0.87 | 0.89 | 0.89 | 1.00 | 0.90 | 0.86 | 0.89 | ||||||||
| 6 | 1.00 | 0.80 | 0.80 | 1.00 | 0.80 | 1.00 | ||||||||||
| 7 | 1.00 | 1.00 | 1.00 | 0.80 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.94 | 0.94 | 0.97 | 0.89 | 0.82 | 0.94 | 0.94 | 1.00 | 0.91 | 0.91 | 0.96 | 0.87 | 0.91 | 0.79 | 0.91 |
| 2 | 1.00 | 0.99 | 0.97 | 0.91 | 0.82 | 0.99 | 0.99 | 1.00 | 0.99 | 0.95 | 0.91 | 0.98 | 0.81 | 0.99 | ||
| 3 | 1.00 | 0.97 | 0.91 | 0.81 | 0.98 | 0.99 | 1.00 | 0.95 | 0.92 | 0.98 | 0.81 | 0.99 | ||||
| 4 | 1.00 | 0.90 | 0.82 | 0.96 | 0.96 | 1.00 | 0.90 | 0.95 | 0.80 | 0.95 | ||||||
| 5 | 1.00 | 0.86 | 0.90 | 0.91 | 1.00 | 0.90 | 0.85 | 0.91 | ||||||||
| 6 | 1.00 | 0.82 | 0.81 | 1.00 | 0.80 | 0.97 | ||||||||||
| 7 | 1.00 | 0.98 | 1.00 | 0.81 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.92 | 0.92 | 0.96 | 0.85 | 0.80 | 0.93 | 0.92 | 1.00 | 0.89 | 0.89 | 0.95 | 0.83 | 0.89 | 0.77 | 0.90 |
| 2 | 1.00 | 1.00 | 0.96 | 0.87 | 0.78 | 0.94 | 0.99 | 1.00 | 1.00 | 0.92 | 0.87 | 0.91 | 0.79 | 0.98 | ||
| 3 | 1.00 | 0.96 | 0.87 | 0.78 | 0.94 | 0.99 | 1.00 | 0.92 | 0.87 | 0.91 | 0.79 | 0.98 | ||||
| 4 | 1.00 | 0.86 | 0.79 | 0.95 | 0.95 | 1.00 | 0.84 | 0.94 | 0.77 | 0.93 | ||||||
| 5 | 1.00 | 0.83 | 0.84 | 0.87 | 1.00 | 0.82 | 0.84 | 0.86 | ||||||||
| 6 | 1.00 | 0.81 | 0.78 | 1.00 | 0.74 | 0.91 | ||||||||||
| 7 | 1.00 | 0.94 | 1.00 | 0.79 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
| 1 | 1.00 | 0.92 | 0.92 | 0.96 | 0.81 | 0.74 | 0.91 | 0.92 | 1.00 | 0.85 | 0.85 | 0.93 | 0.77 | 0.86 | 0.69 | 0.85 |
| 2 | 1.00 | 1.00 | 0.97 | 0.83 | 0.73 | 0.96 | 1.00 | 1.00 | 1.00 | 0.89 | 0.83 | 0.91 | 0.72 | 0.97 | ||
| 3 | 1.00 | 0.97 | 0.83 | 0.73 | 0.96 | 1.00 | 1.00 | 0.89 | 0.84 | 0.91 | 0.72 | 0.97 | ||||
| 4 | 1.00 | 0.83 | 0.73 | 0.94 | 0.96 | 1.00 | 0.78 | 0.93 | 0.68 | 0.90 | ||||||
| 5 | 1.00 | 0.79 | 0.81 | 0.83 | 1.00 | 0.78 | 0.78 | 0.82 | ||||||||
| 6 | 1.00 | 0.75 | 0.73 | 1.00 | 0.67 | 0.92 | ||||||||||
| 7 | 1.00 | 0.73 | 1.00 | 0.71 | ||||||||||||
| 8 | 1.00 | 1.00 | ||||||||||||||
Summary of BRI of genes passed supervised screenings: median and inter-quartile range.
| Statistic | Colon | Leukemia | Estrogen-ER | Estrogen-LN |
| 1 | 0.82 [0.62, 0.96] | 0.84 [0.63, 0.98] | 0.77 [0.55, 0.96] | 0.59 [0.46, 0.75] |
| 2 | 0.77 [0.57, 0.95] | 0.75 [0.54, 0.94] | 0.73 [0.51, 0.94] | 0.55 [0.43, 0.72] |
| 3 | 0.77 [0.58, 0.95] | 0.75 [0.55, 0.94] | 0.73 [0.51, 0.94] | 0.55 [0.43, 0.72] |
| 4 | 0.78 [0.60, 0.96] | 0.79 [0.59, 0.96] | 0.76 [0.53, 0.95] | 0.57 [0.45, 0.74] |
| 5 | 0.76 [0.56, 0.94] | 0.75 [0.54, 0.93] | 0.72 [0.53, 0.94] | 0.55 [0.43, 0.73] |
| 6 | 0.68 [0.50, 0.88] | 0.72 [0.54, 0.91] | 0.72 [0.53, 0.92] | 0.57 [0.46, 0.73] |
| 7 | 0.77 [0.57, 0.94] | 0.75 [0.54, 0.94] | 0.72 [0.51, 0.93] | 0.59 [0.43, 0.77] |
| 8 | 0.76 [0.58, 0.94] | 0.76 [0.56, 0.94] | 0.74 [0.52, 0.95] | 0.56 [0.44, 0.73] |
| 1 | 0.84 [0.64, 0.97] | 0.84 [0.61, 0.98] | 0.79 [0.59, 0.96] | 0.66 [0.52, 0.83] |
| 2 | 0.83 [0.62, 0.97] | 0.81 [0.60, 0.97] | 0.77 [0.56, 0.97] | 0.64 [0.50, 0.82] |
| 3 | 0.83 [0.62, 0.97] | 0.81 [0.59, 0.97] | 0.77 [0.57, 0.97] | 0.64 [0.50, 0.82] |
| 4 | 0.83 [0.64, 0.97] | 0.83 [0.61, 0.98] | 0.78 [0.57, 0.96] | 0.65 [0.51, 0.82] |
| 5 | 0.81 [0.60, 0.96] | 0.81 [0.60, 0.97] | 0.79 [0.60, 0.96] | 0.65 [0.51, 0.81] |
| 6 | 0.83 [0.62, 0.96] | 0.82 [0.60, 0.97] | 0.78 [0.58, 0.96] | 0.67 [0.51, 0.85] |
| 7 | 0.76 [0.56, 0.93] | 0.78 [0.59, 0.95] | 0.78 [0.60, 0.95] | 0.70 [0.58, 0.84] |
| 8 | 0.83 [0.62, 0.96] | 0.81 [0.59, 0.97] | 0.78 [0.57, 0.96] | 0.64 [0.51, 0.82] |
| 1 | 0.86 [0.65, 0.97] | 0.86 [0.66, 0.99] | 0.82 [0.64, 0.97] | 0.73 [0.61, 0.89] |
| 2 | 0.85 [0.65, 0.97] | 0.85 [0.64, 0.98] | 0.83 [0.63, 0.98] | 0.73 [0.59, 0.89] |
| 3 | 0.85 [0.64, 0.97] | 0.85 [0.64, 0.99] | 0.83 [0.63, 0.98] | 0.73 [0.59, 0.89] |
| 4 | 0.85 [0.65, 0.97] | 0.85 [0.65, 0.99] | 0.82 [0.64, 0.97] | 0.73 [0.61, 0.88] |
| 5 | 0.82 [0.62, 0.97] | 0.85 [0.64, 0.98] | 0.84 [0.64, 0.98] | 0.73 [0.59, 0.88] |
| 6 | 0.85 [0.65, 0.97] | 0.85 [0.65, 0.99] | 0.84 [0.65, 0.97] | 0.75 [0.60, 0.90] |
| 7 | 0.80 [0.63, 0.96] | 0.85 [0.68, 0.98] | 0.86 [0.71, 0.98] | 0.83 [0.71, 0.93] |
| 8 | 0.85 [0.65, 0.97] | 0.85 [0.64, 0.98] | 0.82 [0.63, 0.97] | 0.73 [0.60, 0.89] |