Literature DB >> 18003163

Multi-class classification of cancer stages from free-text histology reports using support vector machines.

Anthony Nguyen1, Darren Moore, Iain McCowan, Mary-Jane Courage.   

Abstract

Multi-class machine learning techniques using support vector machines (SVM) are proposed to classify the TNM stage of lung cancer patients from analysis of their free-text histology reports. Stages obtained automatically can be used for retrospective population-level studies of lung cancer outcomes. While the system could in principle be applied to stage different cancer types, the paper focuses on staging lung cancer due to data availability. Experiments have quantified system performance on a corpus of reports from 710 lung cancer patients using four different SVM architectures for multi-class classification. Results show that a system based on standard binary SVM classifiers organised in a hierarchical architecture show the most promise with overall accuracy results of 0.64 and 0.82 across T and N stages, respectively.

Entities:  

Mesh:

Year:  2007        PMID: 18003163     DOI: 10.1109/IEMBS.2007.4353497

Source DB:  PubMed          Journal:  Conf Proc IEEE Eng Med Biol Soc        ISSN: 1557-170X


  6 in total

1.  Classification of hepatocellular carcinoma stages from free-text clinical and radiology reports.

Authors:  Wen-Wai Yim; Sharon W Kwan; Guy Johnson; Meliha Yetisgen
Journal:  AMIA Annu Symp Proc       Date:  2018-04-16

2.  Machine Learning Approaches for Extracting Stage from Pathology Reports in Prostate Cancer.

Authors:  Raphael Lenain; Martin G Seneviratne; Selen Bozkurt; Douglas W Blayney; James D Brooks; Tina Hernandez-Boussard
Journal:  Stud Health Technol Inform       Date:  2019-08-21

3.  Collection of cancer stage data by classifying free-text medical reports.

Authors:  Iain A McCowan; Darren C Moore; Anthony N Nguyen; Rayleen V Bowman; Belinda E Clarke; Edwina E Duhig; Mary-Jane Fry
Journal:  J Am Med Inform Assoc       Date:  2007-08-21       Impact factor: 4.497

4.  A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories.

Authors:  Mehedi Hasan; Alexander Kotov; April Carcone; Ming Dong; Sylvie Naar; Kathryn Brogan Hartlieb
Journal:  J Biomed Inform       Date:  2016-05-13       Impact factor: 6.317

5.  MLW-gcForest: a multi-weighted gcForest model towards the staging of lung adenocarcinoma based on multi-modal genetic data.

Authors:  Yunyun Dong; Wenkai Yang; Jiawen Wang; Juanjuan Zhao; Yan Qiang; Zijuan Zhao; Ntikurako Guy Fernand Kazihise; Yanfen Cui; Xiaotong Yang; Siyuan Liu
Journal:  BMC Bioinformatics       Date:  2019-11-14       Impact factor: 3.169

6.  Synergistic Effects of Different Levels of Genomic Data for the Staging of Lung Adenocarcinoma: An Illustrative Study.

Authors:  Yingxia Li; Ulrich Mansmann; Shangming Du; Roman Hornung
Journal:  Genes (Basel)       Date:  2021-11-24       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.