| Literature DB >> 25559769 |
Abstract
MOTIVATION: It is common to get an optimal combination of markers for disease classification and prediction when multiple markers are available. Many approaches based on the area under the receiver operating characteristic curve (AUC) have been proposed. Existing works based on AUC in a high-dimensional context depend mainly on a non-parametric, smooth approximation of AUC, with no work using a parametric AUC-based approach, for high-dimensional data.Entities:
Mesh:
Year: 2014 PMID: 25559769 PMCID: PMC4304290 DOI: 10.1186/1471-2164-15-S10-S1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Box plots in .
Figure 4Box plots in DLBCL data for AUC (left) and sensitivity when specificity equals 0.95 (right).
The median number of genes being selected in four microarray studies.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| 42 | 30 | 37 | 3 | 16 | |
| 30 | 22 | 3 | 2 | 22 | |
| 51 | 36 | 7 | 5 | 10 | |
| 51 | 35 | 26 | 9 | 17 |
Figure 5ROC curves for four real data sets.
Top 10 frequently selected genes by AucEN.
| data | gene id | gene symbol | description | Coverage |
|---|---|---|---|---|
| Hsa.2097 | R14852 | Human vasoactive intestinal peptide (vip) mrna, complete cds | AucL, LogL, LogEN, [ | |
| Hsa.3331 | T86473 | Nucleoside diphosphate kinase a (Human) | AucL [ | |
| Hsa.37937 | R87126 | Myosin heavy chain, nonmuscle (Gallus gallus) | AucL, LogL, LogEN, Msauc, [ | |
| Hsa.601 | J05032 | Human aspartyl-tRNA synthetase alpha-2 subunit mRNA, complete cds | AucL, LogEN, [ | |
| Hsa.36952 | H43887 | Complement factor d precursor (Homo sapiens) | AucL,[ | |
| Hsa.8125 | T71025 | Human (HUMAN) | AucL, [ | |
| Hsa.8147 | M63391 | Human desmin gene, complete cds | AucL, LogL, LogEN, Msauc, [ | |
| Hsa.3306 | X12671 | Human gene for heterogeneous nuclear ribonucleoprotein (hnRNP) core protein A1 | LogL, LogEN, [ | |
| Hsa.26673 | R76825 | RNA-specific gtpase-activating protein (Homo sapiens) | AucL, [ | |
| Hsa.14069 | T67077 | Sodium/potassium-transporting atpase gamma chain (Ovis aries) | [ | |
| X59711 at | NFYA | NFYA Nuclear transcription factor Y, alpha | [ | |
| M30938 at | XRCC5 | ATP-DEPENDENT DNA HELICASE II, 86 KD SUBUNIT | [ | |
| U57721 at | Kynu | L-kynurenine hydrolase mRNA | [ | |
| X07834 at | Sod2 | SOD2 Superoxide dismutase 2, mitochondrial | [ | |
| U37408 at | Ctbp1 | CtBP mRNA | [ | |
| M98539 at | ptgds | Prostaglandin D2 synthase gene | [ | |
| U35113 at | Mta1 | Metastasis-associated mta1 mRNA | [ | |
| X13973 at | rnh1 | RNH Ribonuclease/angiogenin inhibitor | [ | |
| D49817 at | pfkfb3 | Fructose 6-phosphate,2-kinase/fructose 2,6-bisphosphatase | [ | |
| M83233 at | TCF12 | TCF12 Transcription factor 12 (HTF4, helix-loop-helix transcription factors 4) | LogL, LogEN, [ | |
| U96113 at | WWP1 | Nedd-4-like ubiquitin-protein ligase WWP1 mRNA, partial cds | AucL, [ | |
| U46006 s at | CSRP2 | Smooth muscle LIM protein (h-SmLIM) mRNA | AucL, LogL, LogEN, [ | |
| M35878 at | igfbp3 | INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECURSOR | AucL, [ | |
| U77949 at | cdc6 | Cdc6-related protein (HsCDC6) mRNA | AucL, [ | |
| L41067 at | Nfatc3 | Transcription factor NFATx mRNA | AucL, [ | |
| U95006 at | STRA13 | D9 splice variant A mRNA | [ | |
| U64863 at | Pdcd1 | HPD-1 (hPD-1) mRNA | AucL, [ | |
| AB002409 at | ccl21 | SLC | AucL, MSauc, [ | |
| HG2279-HT2375 at | TPI1 | Triosephosphate Isomerase | AucL | |
| U17969 at | eif5a | EIF5A Eukaryotic translation initiation factor 5A | [ | |
The "Coverage" column shows the genes frequently selected by other methods or reported in the literature.
Summary of simulation results for different sizes of each block, when ρ = 0.6 and block = 1 under a normal scenario. nIMS and nTMS stand for the number of the true informative markers selected and the total number of markers selected, respectively.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 5 | 0.84 | 0.50 | 3 | 4 | |
| 0.82 | 0.45 | 3 | 4 | ||
| 0.82 | 0.45 | 2 | 2 | ||
| 0.80 | 0.40 | 1 | 1 | ||
| 0.81 | 0.40 | 1 | 4 | ||
| 40 | 0.86 | 0.55 | 20 | 24 | |
| 0.86 | 0.55 | 13 | 18 | ||
| 0.85 | 0.50 | 4 | 4 | ||
| 0.82 | 0.45 | 2 | 2 | ||
| 0.81 | 0.45 | 1 | 3 | ||
Summary of simulation results for different ρ, when size = 5 and block = 1, under a normal scenario.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 0.3 | 0.81 | 0.45 | 3 | 12 | |
| 0.81 | 0.45 | 3 | 6 | ||
| 0.85 | 0.50 | 3 | 3 | ||
| 0.85 | 0.50 | 2 | 2 | ||
| 0.82 | 0.45 | 2 | 6 | ||
| 0.9 | 0.81 | 0.45 | 3 | 4 | |
| 0.81 | 0.45 | 2 | 2 | ||
| 0.80 | 0.40 | 2 | 2 | ||
| 0.80 | 0.40 | 1 | 1 | ||
| 0.79 | 0.40 | 1 | 3 | ||
Summary of simulation results for different block, when size = 20 and ρ = 0.6, under a normal scenario.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | 0.86 | 0.55 | 12 | 14 | |
| 0.85 | 0.55 | 10 | 12 | ||
| 0.83 | 0.47 | 4 | 4 | ||
| 0.81 | 0.40 | 2 | 2 | ||
| 0.81 | 0.40 | 1 | 4 | ||
| 3 | 0.96 | 0.85 | 25 | 40 | |
| 0.95 | 0.85 | 20 | 32 | ||
| 0.95 | 0.85 | 16 | 16 | ||
| 0.94 | 0.75 | 8 | 8 | ||
| 0.92 | 0.70 | 10 | 31 | ||
Figure 6The number of noisy genes selected by AucL vs. the sample size of the simulation study, with ρ = 0.6, block = 2, and size = 20.