Literature DB >> 28552128

Application of data mining techniques and data analysis methods to measure cancer morbidity and mortality data in a regional cancer registry: The case of the island of Crete, Greece.

Iraklis Varlamis1, Ioannis Apostolakis2, Dimitra Sifaki-Pistolla3, Nilanjan Dey4, Vassilios Georgoulias5, Christos Lionis3.   

Abstract

BACKGROUND AND
OBJECTIVE: Micro or macro-level mapping of cancer statistics is a challenging task that requires long-term planning, prospective studies and continuous monitoring of all cancer cases. The objective of the current study is to present how cancer registry data could be processed using data mining techniques in order to improve the statistical analysis outcomes.
METHODS: Data were collected from the Cancer Registry of Crete in Greece (counties of Rethymno and Lasithi) for the period 1998-2004. Data collection was performed on paper forms and manually transcribed to a single data file, thus introducing errors and noise (e.g. missing and erroneous values, duplicate entries etc.). Data were pre-processed and prepared for analysis using data mining tools and algorithms. Feature selection was applied to evaluate the contribution of each collected feature in predicting patients' survival. Several classifiers were trained and evaluated for their ability to predict survival of patients. Finally, statistical analysis of cancer morbidity and mortality rates in the two regions was performed in order to validate the initial findings.
RESULTS: Several critical points in the process of data collection, preprocessing and analysis of cancer data were derived from the results, while a road-map for future population data studies was developed. In addition, increased morbidity rates were observed in the counties of Crete (Age Standardized Morbidity/Incidence Rates ASIR= 396.45 ± 2.89 and 274.77 ±2.48 for men and women, respectively) compared to European and world averages (ASIR= 281.6 and 207.3 for men and women in Europe and 203.8 and 165.1 in world level). Significant variation in cancer types between sexes and age groups (the ratio between deaths and reported cases for young patients, less than 34 years old, is at 0.055 when the respective ratio for patients over 75 years old is 0.366) was also observed.
CONCLUSIONS: This study introduced a methodology for preprocessing and analyzing cancer data, using a combination of data mining techniques that could be a useful tool for other researchers and further enhancement of the cancer registries.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Cancer data; Crete; Data mining; Feature selection; Greece

Mesh:

Year:  2017        PMID: 28552128     DOI: 10.1016/j.cmpb.2017.04.011

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  2 in total

1.  Forecasting the Amount of Blood Ordered in the Obstetrics and Gynaecology Ward with the Data Mining Approach.

Authors:  Tahmineh Aldaghi; Ghasemi H Morteza; Mehrdad Kargari
Journal:  Indian J Hematol Blood Transfus       Date:  2019-11-14       Impact factor: 0.900

2.  Effective Image Processing and Segmentation-Based Machine Learning Techniques for Diagnosis of Breast Cancer.

Authors:  Sushovan Chaudhury; Alla Naveen Krishna; Suneet Gupta; K Sakthidasan Sankaran; Samiullah Khan; Kartik Sau; Abhishek Raghuvanshi; F Sammy
Journal:  Comput Math Methods Med       Date:  2022-04-08       Impact factor: 2.809

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.