Literature DB >> 23467471

Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data.

Juhyeon Kim1, Hyunjung Shin.   

Abstract

BACKGROUND: Prognostic studies of breast cancer survivability have been aided by machine learning algorithms, which can predict the survival of a particular patient based on historical patient data. However, it is not easy to collect labeled patient records. It takes at least 5 years to label a patient record as 'survived' or 'not survived'. Unguided trials of numerous types of oncology therapies are also very expensive. Confidentiality agreements with doctors and patients are also required to obtain labeled patient records. PROPOSED
METHOD: These difficulties in the collection of labeled patient data have led researchers to consider semi-supervised learning (SSL), a recent machine learning algorithm, because it is also capable of utilizing unlabeled patient data, which is relatively easier to collect. Therefore, it is regarded as an algorithm that could circumvent the known difficulties. However, the fact is yet valid even on SSL that more labeled data lead to better prediction. To compensate for the lack of labeled patient data, we may consider the concept of tagging virtual labels to unlabeled patient data, that is, 'pseudo-labels,' and treating them as if they were labeled.
RESULTS: Our proposed algorithm, 'SSL Co-training', implements this concept based on SSL. SSL Co-training was tested using the surveillance, epidemiology, and end results database for breast cancer and it delivered a mean accuracy of 76% and a mean area under the curve of 0.81.

Entities:  

Keywords:  Breast Cancer Survivability; Co Training; Machine Learning; Semi Supervised Learning

Mesh:

Year:  2013        PMID: 23467471      PMCID: PMC3721173          DOI: 10.1136/amiajnl-2012-001570

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  8 in total

1.  Predicting breast cancer survivability: a comparison of three data mining methods.

Authors:  Dursun Delen; Glenn Walker; Amit Kadam
Journal:  Artif Intell Med       Date:  2005-06       Impact factor: 5.326

2.  Improved breast cancer prognosis through the combination of clinical and genetic markers.

Authors:  Yijun Sun; Steve Goodison; Jian Li; Li Liu; William Farmerie
Journal:  Bioinformatics       Date:  2006-11-26       Impact factor: 6.937

3.  Neighborhood property-based pattern selection for support vector machines.

Authors:  Hyunjung Shin; Sungzoon Cho
Journal:  Neural Comput       Date:  2007-03       Impact factor: 2.026

4.  Graph sharpening plus graph integration: a synergy that improves protein functional classification.

Authors:  Hyunjung Shin; Andreas Martin Lisewski; Olivier Lichtarge
Journal:  Bioinformatics       Date:  2007-10-31       Impact factor: 6.937

5.  A computer program for period analysis of cancer patient survival.

Authors:  H Brenner; O Gefeller; T Hakulinen
Journal:  Eur J Cancer       Date:  2002-03       Impact factor: 9.162

6.  On Efficient Large Margin Semisupervised Learning: Method and Theory.

Authors:  Junhui Wang; Xiaotong Shen; Wei Pan
Journal:  J Mach Learn Res       Date:  2009-03-01       Impact factor: 3.654

7.  Semi-supervised methods to predict patient survival from gene expression data.

Authors:  Eric Bair; Robert Tibshirani
Journal:  PLoS Biol       Date:  2004-04-13       Impact factor: 8.029

8.  Applications of machine learning in cancer prediction and prognosis.

Authors:  Joseph A Cruz; David S Wishart
Journal:  Cancer Inform       Date:  2007-02-11
  8 in total
  16 in total

1.  Stage-Specific Survivability Prediction Models across Different Cancer Types.

Authors:  Elham Sagheb Hossein Pour; Rohit J Kate
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  Applied Informatics Decision Support Tool for Mortality Predictions in Patients With Cancer.

Authors:  Dimitris Bertsimas; Jack Dunn; Colin Pawlowski; John Silberholz; Alexander Weinstein; Ying Daisy Zhuo; Eddy Chen; Aymen A Elfiky
Journal:  JCO Clin Cancer Inform       Date:  2018-12

3.  Applying under-sampling techniques and cost-sensitive learning methods on risk assessment of breast cancer.

Authors:  Jia-Lien Hsu; Ping-Cheng Hung; Hung-Yen Lin; Chung-Ho Hsieh
Journal:  J Med Syst       Date:  2015-02-25       Impact factor: 4.460

4.  Application and Clinical Value of Machine Learning-Based Cervical Cancer Diagnosis and Prediction Model in Adjuvant Chemotherapy for Cervical Cancer: A Single-Center, Controlled, Non-Arbitrary Size Case-Control Study.

Authors:  Yang Wang; Lidan Shen; Jun Jin; Guohua Wang
Journal:  Contrast Media Mol Imaging       Date:  2022-06-15       Impact factor: 3.009

5.  Online cancer communities as informatics intervention for social support: conceptualization, characterization, and impact.

Authors:  Shaodian Zhang; Erin O'Carroll Bantum; Jason Owen; Suzanne Bakken; Noémie Elhadad
Journal:  J Am Med Inform Assoc       Date:  2017-03-01       Impact factor: 4.497

6.  Network mirroring for drug repositioning.

Authors:  Sunghong Park; Dong-Gi Lee; Hyunjung Shin
Journal:  BMC Med Inform Decis Mak       Date:  2017-05-18       Impact factor: 2.796

7.  Multiple Machine Learnings Revealed Similar Predictive Accuracy for Prognosis of PNETs from the Surveillance, Epidemiology, and End Result Database.

Authors:  Yiyan Song; Shaowei Gao; Wulin Tan; Zeting Qiu; Huaqiang Zhou; Yue Zhao
Journal:  J Cancer       Date:  2018-10-10       Impact factor: 4.207

8.  A coupling approach of a predictor and a descriptor for breast cancer prognosis.

Authors:  Hyunjung Shin; Yonghyun Nam
Journal:  BMC Med Genomics       Date:  2014-05-08       Impact factor: 3.063

Review 9.  Machine learning applications in cancer prognosis and prediction.

Authors:  Konstantina Kourou; Themis P Exarchos; Konstantinos P Exarchos; Michalis V Karamouzis; Dimitrios I Fotiadis
Journal:  Comput Struct Biotechnol J       Date:  2014-11-15       Impact factor: 7.271

10.  CLASH: Complementary Linkage with Anchoring and Scoring for Heterogeneous biomolecular and clinical data.

Authors:  Yonghyun Nam; Myungjun Kim; Kyungwon Lee; Hyunjung Shin
Journal:  BMC Med Inform Decis Mak       Date:  2016-07-25       Impact factor: 2.796

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.