| Literature DB >> 32434554 |
Einar Holsbø1, Vittorio Perduca2, Lars Ailo Bongo3, Eiliv Lund4,5, Etienne Birmelé2.
Abstract
OBJECTIVE: In this exploratory work we investigate whether blood gene expression measurements predict breast cancer metastasis. Early detection of increased metastatic risk could potentially be life-saving. Our data comes from the Norwegian Women and Cancer epidemiological cohort study. The women who contributed to these data provided a blood sample up to a year before receiving a breast cancer diagnosis. We estimate a penalized maximum likelihood logistic regression. We evaluate this in terms of calibration, concordance probability, and stability, all of which we estimate by the bootstrap.Entities:
Keywords: Breast cancer; Epidemiology; Metastasis; Predictive models; Transcriptomics
Mesh:
Year: 2020 PMID: 32434554 PMCID: PMC7238609 DOI: 10.1186/s13104-020-05088-0
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Fig. 1Bootstrap distribution of optimism-corrected estimates for Brier score, concordance/AUC, and stability for the Elasticnet model. The solid vertical lines show point estimates, and the dotted vertical lines show the middle .8 of each distribution
Fig. 2Expected calibration of predicted probabilities shown in solid black. The dotted line shows middle .8 of the bootstrap distribution. Ideally, .8 of the observations for which .8 metastasis probability was predicted should turn out to show metastasis. In other words the ideal calibration is a diagonal line (shown in grey). Our model tends to overestimate lower probabilities and underestimate higher ones
Resampling selection probability for the 108 elasticnet-selected genes
| GRK5a | 0.853 | C1orf115 | 0.290 | ANO8 | 0.221 | FBLN5 | 0.157 |
| GPATCH4 | 0.682 | LOC654055 | 0.287 | PTTG1IP | 0.219 | BLMH | 0.156 |
| GNGT2 | 0.474 | RNF214 | 0.280 | 3NDg8gVCdb | 0.218 | FCRL3 | 0.149 |
| PDGFDc | 0.467 | SULT1A1 | 0.278 | USF1 | 0.216 | TDRD9 | 0.143 |
| FAM24B | 0.457 | ZNF365 | 0.271 | BCCIP | 0.210 | ACY1 | 0.142 |
| PTPRN2 | 0.442 | USE1 | 0.267 | MGC29506 | 0.209 | ZFP57 | 0.142 |
| CBLB | 0.440 | DNMT3A | 0.267 | GRK5a | 0.207 | SLIC1 | 0.138 |
| PDCL | 0.410 | LOC649210 | 0.266 | WTIP | 0.205 | PICK1 | 0.135 |
| RASA2 | 0.380 | CNTNAP2 | 0.265 | BCL10 | 0.204 | RTN4IP1 | 0.134 |
| C11orf48 | 0.376 | IL2RA | 0.265 | DLGAP2 | 0.200 | CDCA7L | 0.132 |
| TCEB1 | 0.374 | CCT5 | 0.264 | HRAS | 0.199 | BEX4 | 0.131 |
| CAPN3 | 0.354 | R3HDM1 | 0.263 | RAD1 | 0.189 | FCAR | 0.130 |
| STK19 | 0.351 | MRPL43 | 0.260 | PRKCE | 0.187 | ANKRD35 | 0.111 |
| GUCY1A3 | 0.348 | SLC38A1 | 0.256 | UBAP2L | 0.186 | USP39 | 0.109 |
| ZDHHC11 | 0.345 | GNG8 | 0.255 | BPI | 0.186 | KIAA0495 | 0.106 |
| SULT1A3 | 0.336 | PLA2G4C | 0.251 | DTX1 | 0.184 | BRI3BP | 0.106 |
| Z6FIQGkeod | 0.335 | TCF4 | 0.248 | LASS5 | 0.182 | TUBA4A | 0.105 |
| FAM89A | 0.328 | uX15cu4f_e | 0.247 | GSTT1 | 0.182 | IDH1 | 0.102 |
| rh13dQX04f | 0.324 | C20orf107 | 0.245 | SPATA20 | 0.182 | DDX52 | 0.100 |
| LANCL2 | 0.323 | VCL | 0.242 | IGLL1 | 0.172 | ANKRD57 | 0.094 |
| SERPINE2 | 0.318 | EZH2 | 0.242 | SPG3A | 0.172 | TFG | 0.087 |
| ADIPOR2 | 0.314 | PRPSAP2 | 0.237 | PPAP2A | 0.172 | LILRA6 | 0.080 |
| GPR177 | 0.312 | ISY1 | 0.235 | NOTCH2NL | 0.172 | C6orf47 | 0.078 |
| PDGFDc | 0.299 | UGDH | 0.234 | TAF6 | 0.168 | WDR60 | 0.075 |
| LOC647460 | 0.294 | ABCF2 | 0.230 | CCDC90B | 0.166 | AHCYL2 | 0.068 |
| WEE1 | 0.293 | C16orf5 | 0.229 | LOC731486 | 0.158 | HAUS4 | 0.068 |
| ITM2C | 0.291 | VAV3 | 0.225 | CDH2 | 0.157 | MAD2L2 | 0.053 |
a Two probes map to the same gene GRK5. Combined selection probability is 1.06, implying that both get selected together at least some of the time
b Illumina probe id 3NDg8gVCdQkNdcg.Ko, missing annotation
c Two probes map to the same gene PDGFD. Combined selection probability is 0.766
d Ilummina probe id Z6FIQGkeoCSiVAoKeg, missing annotation
e Illumina probe id uX15cu4f_VUIuXoST0, missing annotation
f Illumina probe id rh13dQX04hUS7uOpRQ, missing annotation