| Literature DB >> 27034613 |
Burook Misganaw1, Eren Ahsen2, Nitin Singh1, Keith A Baggerly3, Anna Unruh4, Michael A White5, M Vidyasagar6.
Abstract
Ovarian cancer is the fifth leading cause of death among female cancers. Front-line therapy for ovarian cancer is platinum-based chemotherapy. However, the response of patients is highly nonuniform. The TCGA database of serous ovarian carcinomas shows that ~10% of patients respond poorly to platinum-based chemotherapy, with tumors relapsing in seven months or less. Another 10% or so enjoy disease-free survival of three years or more. The objective of the present research is to identify a small number of highly predictive biomarkers that can distinguish between the two extreme responders and then extrapolate to all patients. This is achieved using the lone star algorithm that is specifically developed for biological applications. Using this algorithm, we are able to identify biomarker panels of 25 genes (of 12,000 genes) that can be used to classify patients into one of the three groups: super responders, medium responders, and nonresponders. We are also able to determine a discriminant function that can divide the entire patient population into two classes, such that one group has a clear survival advantage over the other. These biomarkers are developed using the TCGA Agilent platform data and cross-validated on the TCGA Affymetrix platform data, as well as entirely independent data from Tothill et al. The P-values on the training data are extremely small, sometimes below machine zero, while the P-values on cross-validation are well below the widely accepted threshold of 0.05.Entities:
Keywords: ovarian cancer; platinum chemotherapy; prediction of patient response
Year: 2016 PMID: 27034613 PMCID: PMC4806766 DOI: 10.4137/CIN.S30803
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Volcano plot of the negative logarithm of the t-test scores on the vertical axis and the fold changes on the horizontal axis.
The four classifiers studied in this paper.
| CLASSIFIER NO. | CLINICAL PARAMETER | PRE-FILTERIN |
|---|---|---|
| Classifier No. 1 | Overall survival | Tight |
| Classifier No. 2 | Overall survival | Loose |
| Classifier No. 3 | Progression-free survival | Tight |
| Classifier No. 4 | Progression-free survival | Loose |
Classifier nos. 1 and 2 – classifiers for OS.
| ENTREZ GENE ID | GENE SYMBOL | WEIGHT | ENTREZ GENE ID | GENE SYMBOL | WEIGHT |
|---|---|---|---|---|---|
| 241 | ALOX5AP | −0.5630 | 953 | ENTPD1 | −0.7066 |
| 3764 | kcnj8 | −0.4870 | 54704 | Pdp1 | −0.6358 |
| 26290 | GALNT8 | −0.4393 | 1410 | CRYAB | −0.5821 |
| 25790 | Ccdc19 | −0.3118 | 3764 | kcnj8 | −0.4942 |
| 2857 | GPR34 | −0.3116 | 5266 | PI3 | −0.4846 |
| 8483 | CILP | −0.3039 | 25790 | Ccdc19 | −0.4362 |
| 794 | CALB2 | −0.2889 | 6236 | RRAD | −0.4067 |
| 55016 | MARCH1 | −0.2662 | 10218 | angptl7 | −0.3554 |
| 6356 | Ccl11 | −0.2458 | 26290 | GALNT8 | −0.3522 |
| 64231 | MS4A6A | 0.1312 | 9033 | pkd2l1 | −0.3252 |
| 25924 | MYRIP | 0.2086 | 10753 | Capn9 | 0.2204 |
| 962 | Cd48 | 0.2207 | 728621 | CCDC30 | 0.2306 |
| 10753 | Capn9 | 0.2422 | 57612 | KIAA1466 | 0.2680 |
| 51284 | TLR7 | 0.2827 | 3918 | LAMC2 | 0.3141 |
| 1634 | DCN | 0.2845 | 404093 | CUEDC1 | 0.3931 |
| 123872 | LRRC50 | 0.2881 | 4147 | Matn2 | 0.3981 |
| 79623 | Galnt14 | 0.2937 | 1674 | DES | 0.4020 |
| 26585 | GREM1 | 0.3075 | 203102 | ADAM32 | 0.4241 |
| 5016 | Ovgp1 | 0.3079 | 5521 | PPP2R2B | 0.4337 |
| 5276 | serpini2 | 0.3439 | 9450 | LY86 | 0.4763 |
| 6387 | CXCL12 | 0.3519 | 8470 | SORBS2 | 0.4815 |
| 56143 | PCDHA5 | 0.3777 | 135138 | Pacrg | 0.5750 |
| 135138 | Pacrg | 0.3992 | 7130 | TNFAIP6 | 0.5876 |
| 4147 | Matn2 | 0.4722 | 1118 | CHIT1 | 0.6102 |
| 1118 | CHIT1 | 0.4867 | 1360 | Cpb1 | 0.6115 |
| 23144 | ZC3H3 | 0.7026 | |||
| Bias | 0.0150 | Bias | 0.0924 |
Classifier nos. 3 and 4 – classifiers for PFS.
| ENTREZ GENE ID | GENE SYMBOL | WEIGHT | ENTREZ GENE ID | GENE SYMBOL | WEIGHT |
|---|---|---|---|---|---|
| 26290 | GALNT8 | −0.3878 | 3696 | ITGB8 | −0.5790 |
| 79908 | BTNL8 | −0.3711 | 1301 | COL11A1 | −0.5332 |
| 6356 | Ccl11 | −0.3313 | 1421 | CRYGD | −0.5161 |
| 8483 | CILP | −0.3161 | 219699 | Unc5b | −0.4445 |
| 2043 | EPHA4 | −0.3061 | 27010 | TPK1 | −0.4264 |
| 1421 | CRYGD | −0.2971 | 79908 | BTNL8 | −0.3970 |
| 23148 | NACAD | −0.2522 | 79933 | SYNPO2L | −0.3925 |
| 27010 | TPK1 | −0.2273 | 8483 | CILP | −0.3620 |
| 54532 | usp53 | −0.2214 | 55083 | KIF26B | −0.2996 |
| 1281 | COL3A1 | −0.1851 | 27335 | EIF3K | −0.2712 |
| 1301 | COL11A1 | −0.0626 | 65263 | PYCRL | −0.2676 |
| 29989 | OBP2B | 0.0028 | 898 | CCNE1 | 0.2501 |
| 26576 | SRPK3 | 0.1299 | 79815 | NIPAL2 | 0.2547 |
| 26585 | GREM1 | 0.1378 | 64220 | STRA6 | 0.2579 |
| 8842 | PROM1 | 0.1870 | 10017 | Bcl2l10 | 0.3016 |
| 203102 | ADAM32 | 0.1898 | 203102 | ADAM32 | 0.3387 |
| 4222 | meox1 | 0.2153 | 64067 | Npas3 | 0.3950 |
| 79623 | Galnt14 | 0.2268 | 6361 | Ccl17 | 0.4034 |
| 10017 | Bcl2l10 | 0.2286 | 79696 | Fam164c | 0.4242 |
| 1896 | eda | 0.2519 | 50626 | CYHR1 | 0.4313 |
| 29991 | OBP2A | 0.2629 | 3752 | Kcnd3 | 0.4829 |
| 64067 | Npas3 | 0.2664 | 6778 | STAT6 | 0.4922 |
| 122616 | C14orf79 | 0.2947 | 6387 | CXCL12 | 0.5239 |
| 4147 | Matn2 | 0.3110 | 56143 | PCDHA5 | 0.5241 |
| 1634 | DCN | 0.3338 | 8842 | PROM1 | 0.5826 |
| 9723 | Sema3e | 0.3420 | 1290 | Col5a2 | 0.7540 |
| 56143 | PCDHA5 | 0.4219 | |||
| 6361 | Ccl17 | 0.4895 | |||
| Bias | 0.0084 | Bias | −0.0189 |
Figure 2ROC curves with tight prefiltering. Both classifiers started with 59 initial features, of which each classifier chose 25 features (which are different from one case to the other). (A) OS as the clinical parameter and (B) PFS as the clinical parameter.
Figure 3ROC curves with loose prefiltering. (A) The results using OS as the clinical parameter. The initial number of features was 208, of which 26 features were chosen finally. AUCs of the three curves are 0.8922, 0.6833, and 0.5781. (B) The results using PFS as the clinical parameter. The initial number of features was 181, of which 26 features were finally chosen. AUCs of the three curves are 0.8721, 0.6683, and 0.6886.
Three-way classification based on OS and tight prefiltering.
| LABEL | TCGA AGILENT | TCGA AFFYMETRIX | TOTHILL | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 107 | 57 | 24 | 188 | 89 | 59 | 39 | 188 | 13 | 29 | 6 | 48 | |
| 66 | 67 | 56 | 189 | 64 | 64 | 60 | 188 | 27 | 57 | 14 | 98 | |
| 15 | 64 | 109 | 188 | 34 | 65 | 88 | 189 | 8 | 12 | 8 | 28 | |
| 188 | 188 | 188 | 565 | 187 | 188 | 187 | 565 | 48 | 98 | 28 | 174 | |
Three-way classification based on OS and loose prefiltering.
| LABEL | TCGA AGILENT | TCGA AFFYMETRIX | TOTHILL | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 117 | 58 | 13 | 188 | 79 | 59 | 49 | 187 | 11 | 25 | 4 | 40 | |
| 60 | 63 | 66 | 189 | 76 | 62 | 50 | 188 | 24 | 48 | 18 | 90 | |
| 11 | 67 | 110 | 188 | 32 | 67 | 88 | 187 | 13 | 25 | 6 | 44 | |
| 188 | 188 | 189 | 565 | 187 | 188 | 187 | 562 | 48 | 98 | 28 | 174 | |
Figure 4Kaplan–Meier curves for classifier using OS to define classes and tight prefiltering. P-values are computed using log-rank test.
Figure 7Kaplan–Meier curves for classifier using PFS to define classes and loose prefiltering. P-values are computed using log-rank test.
Three-way classification based on PFS and tight prefiltering.
| LABEL | TCGA AGILENT | TCGA AFFYMETRIX | TOTHILL | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 110 | 54 | 24 | 188 | 85 | 63 | 39 | 188 | 15 | 18 | 7 | 40 | |
| 59 | 66 | 62 | 189 | 61 | 62 | 63 | 189 | 20 | 48 | 20 | 88 | |
| 18 | 67 | 103 | 188 | 40 | 60 | 87 | 188 | 4 | 24 | 15 | 43 | |
| 187 | 187 | 185 | 565 | 186 | 185 | 189 | 565 | 39 | 90 | 42 | 171 | |
Three-way classification based on PFS and loose prefiltering.
| LABEL | TCGA AGILENT | TCGA AFFYMETRIX | TOTHILL | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 114 | 58 | 16 | 188 | 84 | 60 | 43 | 187 | 12 | 22 | 6 | 40 | |
| 62 | 66 | 59 | 187 | 64 | 64 | 58 | 186 | 23 | 46 | 19 | 88 | |
| 11 | 63 | 114 | 188 | 38 | 61 | 88 | 187 | 4 | 22 | 17 | 43 | |
| 187 | 187 | 189 | 563 | 186 | 185 | 189 | 560 | 39 | 90 | 44 | 171 | |