Roya Nikbakht¹, Abbas Bahrampour¹. ¹Department of Biostatistics and Epidemiology, Modeling in Health Research Center, Faculty of Health, Institute for Futures Studies in Health, Kerman University of Medical Sciences, Kerman, Iran.
Abstract
BACKGROUND: The fuzzy logistic regression model can be used to determine the influential factors of a disease. This study explores the factors that actually predict survival in patients with breast cancer. MATERIALS AND METHODS: We used breast cancer data collected by the cancer registry of Kerman University of Medical Sciences during 2000-2007. Variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were entered into the fuzzy logistic regression model. Model performance was assessed by the mean degree of membership (MDM). RESULTS: Almost 41% of patients were in the malignant neoplasm group, and more than two-thirds were still alive after 5 years of follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, in that order. Furthermore, the MDM criterion showed that the fuzzy logistic regression fit the data well (MDM = 0.86). CONCLUSION: The fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy for the survival of patients with breast cancer. The model can also calculate the possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Because few studies have applied fuzzy logistic models, we recommend using this model in various research areas.
Keywords:
Breast cancer; fuzzy logistic regression; mean degree of membership; survival
Cancer is a leading cause of death in the world.[1] The burden of cancer is rising gradually throughout the world. Among all types of cancer, lung cancer is the most prevalent cancer worldwide in men. In women, breast cancer is the leading cause of cancer death in both developed and developing countries.[2,3] According to the global cancer statistics report in 2011, breast cancer accounted for around 23% of all cancers. In addition, the age-standardized incidence rate of breast cancer was highest (89.9%) in western European countries (2011).[4] Nearly 7% of women diagnosed with breast cancer are younger than 40 years.[5] The expected number of deaths from breast cancer in the US in 2015 was estimated at about 40,290 women.[6] Southern Africa had the highest breast cancer mortality rate (19.3%). In addition, western Asian countries had breast cancer incidence and mortality rates of 31.8% and 18.9%, respectively.[4] In Iran, breast cancer is the second most common cancer in females (21.4%).[7,8]
Etiology and predictive factors of breast cancer
The main predictive risk factors of breast cancer are “age, geographic area of residence, age at first birth, certain indicators of ovarian activity, history of benign breast disease, and familiar history of breast cancer.”[9] Breast cancer treatment usually involves a combination of surgery, radiation therapy, chemotherapy, and hormone therapy. In addition, prognosis and selection of therapy can be determined by age, menopausal status, stage of disease, and histologic and nuclear grade of the primary tumor.[10,11] Some studies indicated that the survival of patients with breast cancer depends on tumor size and biological factors.[12] Among 99 patients who received radiotherapy before chemotherapy, the actual 5-year breast failure rate was 4%. Among 54 patients who received chemotherapy followed by radiotherapy without concurrent chemotherapy, this rate was 8%. Furthermore, the failure rate was 6% in 116 patients who received concurrent chemotherapy and radiotherapy.[13]
The fuzzy logistic regression model can be used to determine the actual predictive survival factors in breast cancer. In addition, the model can predict the status of new patients through possibilistic odds (fuzzy logistic regression is based on the possibilistic odds approach). Few studies have applied this model to the survival of cancer patients. In this study, we fitted the fuzzy logistic regression model, a new statistical regression model, to determine the important factors influencing the survival of breast cancer patients and to predict the status of a new patient. The main purpose of this study is to introduce a new statistical model and to apply its results in clinical research.
MATERIALS AND METHODS
Patients
This study used breast cancer data registered by the cancer registry of Kerman (the largest province of Iran) during 2000–2007. The aim of the study was to determine the important factors in the survival of breast cancer patients. There were 1311 patients with breast cancer (both genders). Males and patients with incomplete information were excluded from the study. Finally, 924 patients remained, but we studied only 71 of them: we used Lingo software to solve the inequalities of the fuzzy logistic regression model, and Lingo could solve them only for a sample of 71. Fuzzy logistic regression was performed with a binary dependent variable (survival status, with two states: alive or dead) and independent variables such as age, morphology, grade, and treatments (radiotherapy, chemotherapy, and surgery).
Data on deaths resulting from breast cancer were available. Therefore, we determined 5-year survival from incidence (diagnosis of the disease during a given period of time) to death.
The morphology types were malignant neoplasm, carcinoma, and infiltrating duct carcinoma. The treatment approaches were radiotherapy, chemotherapy, and surgery. Some patients received these treatments to cure or control the disease, while others did not. Tumor grade had three levels (I = well differentiated, II = moderately differentiated, III = poorly differentiated). Furthermore, the status of patients who died from 2000 to 2007 was coded as zero.
Statistical analysis
Fuzzy logistic regression is an important tool for evaluating the relationship between independent variables (crisp or fuzzy) and a fuzzy binary outcome (the status of some patients was one and that of others was zero, with a probability of μ and 1 − μ, respectively).
In the first step, we fitted a logistic regression model to the data. We then used the predicted probabilities as μ for the fuzzy logistic regression. Note that possibility is used instead of probability in fuzzy logistic regression. In this model, μ = Poss(Y = 1) is the possibility of having the property of interest, and μ/(1 − μ) gives the possibilistic odds. We then applied the fuzzy logistic regression to the data. To estimate the fuzzy coefficients, R and Lingo software were used, but Lingo could not handle data with large sample sizes.
Suppose that $X = (x_1, x_2, \ldots, x_n)$ denotes a vector of explanatory variables. The fuzzy logistic regression with fuzzy coefficients is then defined as:
$$\tilde{W} = \tilde{A}_0 + \tilde{A}_1 x_1 + \cdots + \tilde{A}_n x_n \qquad \text{Eq. (1)}$$
where $\tilde{W} = \ln\left(\mu/(1-\mu)\right)$ and $\mu/(1-\mu)$ is the possibilistic odds of survival of breast cancer for each patient. Also, $\tilde{A}_1, \ldots, \tilde{A}_n$ and $\tilde{A}_0$ are the slopes of the explanatory variables and the intercept, respectively (more details are given in Appendix 1).
Finally, the mean degree of membership (MDM), a value between 0 and 1, was used to check the goodness of fit of the fuzzy logistic regression (Eq. 2). The closer the MDM is to one, the better the model fits the data.[14,15]
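The two-step procedure and the MDM check described above can be illustrated with a small Python sketch. It is not the authors' R/Lingo implementation. It assumes that each fuzzy coefficient is a symmetric triangular fuzzy number (center, spread) and that the MDM is the average membership degree of each observed log-odds value in its estimated fuzzy output; both are assumptions, since the details are deferred to Appendix 1. All function names and numbers are hypothetical.

```python
import math


def possibilistic_odds(mu):
    """Return mu / (1 - mu) for a possibility 0 < mu < 1."""
    if not 0.0 < mu < 1.0:
        raise ValueError("mu must lie strictly between 0 and 1")
    return mu / (1.0 - mu)


def fuzzy_output(x, centers, spreads):
    """Center and spread of the estimated fuzzy log-odds for one patient.

    ASSUMPTION: every coefficient is a symmetric triangular fuzzy number
    (center, spread); index 0 is the intercept, the rest pair with x.
    """
    center = centers[0] + sum(c * xj for c, xj in zip(centers[1:], x))
    spread = spreads[0] + sum(s * abs(xj) for s, xj in zip(spreads[1:], x))
    return center, spread


def membership(w, center, spread):
    """Membership degree of a crisp w in a symmetric triangular fuzzy number."""
    if spread <= 0.0:
        return 1.0 if w == center else 0.0
    return max(0.0, 1.0 - abs(w - center) / spread)


def mean_degree_of_membership(observed_w, outputs):
    """Average membership of each observed log-odds in its fuzzy output."""
    degrees = [membership(w, c, s) for w, (c, s) in zip(observed_w, outputs)]
    return sum(degrees) / len(degrees)


# Hypothetical toy data, not values from the study.
mu_hat = [0.80, 0.60, 0.30]  # step 1: predicted probabilities used as mu
observed_w = [math.log(possibilistic_odds(mu)) for mu in mu_hat]
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
centers, spreads = [0.2, 1.0, -0.5], [0.5, 0.2, 0.2]
outputs = [fuzzy_output(x, centers, spreads) for x in X]
mdm = mean_degree_of_membership(observed_w, outputs)
print(f"MDM = {mdm:.2f}")
```

In this sketch a wider spread raises the membership degrees and therefore the MDM, which is why estimation procedures of this kind usually also penalize the total spread of the coefficients.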
RESULTS
According to Table 1, the mean (SD) age of the patients with breast cancer was 47.9 (12.5) years. Nearly 41% of patients were in the malignant neoplasm group, while 2.8% and 56.3% were in the carcinoma and infiltrating duct carcinoma groups, respectively. More than two-thirds of patients were still alive after 5 years of follow-up. Most patients had a moderate grade (90.1%). Depending on their condition, patients received surgery (54.9%), radiotherapy (46.9%), and chemotherapy (19.7%).
Table 1
The basic and clinical characteristics of study participants
We obtained the fuzzy logistic regression model shown in Eq. [3]. The coefficients of some variables, such as age and radiotherapy, were fuzzy and not directly interpretable. Therefore, we defuzzified the fuzzy coefficients (more information about defuzzification is given in Appendix 1). After defuzzification, the coefficients were compared to determine the importance of the variables. According to the fitted model, the most important factors influencing survival were, in order, chemotherapy, morphology, and radiotherapy (Eq. [3]) [Table 2].
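As an illustration of the defuzzification step, the following Python sketch converts triangular fuzzy coefficients into crisp values and ranks them. The defuzzification method used in the study is described in Appendix 1 and is not reproduced here, so the centroid rule below is an assumption. Ranking by absolute value is likewise an assumed way of comparing coefficients, and the variable names and numbers are hypothetical placeholders rather than results from Table 2.

```python
def centroid(left, mode, right):
    """Centroid of a triangular fuzzy number (left, mode, right).

    ASSUMPTION: centroid defuzzification; the study's own rule is in
    its Appendix 1 and may differ.
    """
    if not left <= mode <= right:
        raise ValueError("expected left <= mode <= right")
    return (left + mode + right) / 3.0


def rank_by_importance(fuzzy_coefs):
    """Defuzzify each coefficient and sort by absolute crisp value."""
    crisp = {name: centroid(*tri) for name, tri in fuzzy_coefs.items()}
    return sorted(crisp.items(), key=lambda item: abs(item[1]), reverse=True)


# Hypothetical fuzzy coefficients, not values from the study.
example = {
    "x1": (0.5, 1.0, 1.5),
    "x2": (-0.4, -0.1, 0.2),
    "x3": (0.3, 0.3, 0.3),   # a crisp coefficient is a degenerate triangle
}
for name, value in rank_by_importance(example):
    print(f"{name}: {value:+.2f}")
```

A crisp coefficient passes through unchanged, so fuzzy and non-fuzzy coefficients can be compared on the same scale after this step.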
Table 2
Coefficients defuzzification
In addition, the possibilistic odds of all cases were determined. The possibilistic odds of five patients are reported in Table 3.
Table 3
The observed and the estimated fuzzy outputs of breast cancer
For example, the possibilistic odds of survival of breast cancer for the first patient is 0.84. The possibilistic odds for the other patients can be interpreted in a similar way. Furthermore, the possibilistic odds for a new case can be obtained from Eq. [3]. For instance, suppose we have a new case with the following characteristics: age = 45 years, grade II, morphology = infiltrating duct carcinoma, and all treatments received (surgery, radiotherapy, and chemotherapy). Substituting these values into Eq. [3] gives possibilistic odds of survival of breast cancer of 0.73 for this case after defuzzification.
Finally, model performance was assessed by the MDM. The MDM was 0.86, which showed that the fuzzy logistic regression fit the data well.
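The calculation for a new case can be sketched in Python as follows. The sketch assumes the usual log-odds link, W = ln(μ/(1 − μ)), and already defuzzified (crisp) coefficients. The coefficient values, the variable names, and the 0/1 coding of the categorical variables are hypothetical placeholders, because the fitted Eq. [3] is not reproduced in this text; the output is therefore not expected to match the value reported above.

```python
import math


def predict_new_case(case, coefs, intercept):
    """Return (possibilistic odds, possibility mu) for one new case.

    ASSUMPTION: W = ln(mu / (1 - mu)) is linear in the covariates and the
    coefficients have already been defuzzified to crisp numbers.
    """
    w = intercept + sum(coefs[name] * value for name, value in case.items())
    odds = math.exp(w)
    mu = odds / (1.0 + odds)
    return odds, mu


# New case from the text, with a hypothetical 0/1 coding of the categories.
new_case = {
    "age": 45,
    "grade_II": 1,
    "infiltrating_duct_carcinoma": 1,
    "surgery": 1,
    "radiotherapy": 1,
    "chemotherapy": 1,
}

# Placeholder coefficients, NOT the estimates of the study.
placeholder_coefs = {
    "age": -0.01,
    "grade_II": 0.10,
    "infiltrating_duct_carcinoma": 0.20,
    "surgery": 0.05,
    "radiotherapy": 0.15,
    "chemotherapy": 0.30,
}

odds, mu = predict_new_case(new_case, placeholder_coefs, intercept=0.5)
print(f"possibilistic odds = {odds:.2f}, mu = {mu:.2f}")
```

Replacing the placeholder dictionary with the defuzzified coefficients from the fitted model would reproduce the calculation described in the text, provided the same coding of the variables is used.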
DISCUSSION
In this study, we used the fuzzy logistic regression model to determine the importance of factors that may affect survival in breast cancer. The results showed that the important factors were chemotherapy, morphology, and radiotherapy. Furthermore, the fuzzy logistic regression fit the data well (MDM = 0.86). These findings are consistent with a similar study, which reported that a short-term surgical approach combined with chemotherapy was effective in pre- and post-menopausal patients at high risk of breast cancer.[16] In another study, the factor that influenced 5-year survival was the number of involved axillary lymph nodes, not menopausal status. Furthermore, full-dose combination chemotherapy was vital for achieving clinical benefit.[17] In a similar study, the combination of radiotherapy with chemotherapy reduced the rate of locoregional recurrence after modified radical mastectomy.[18]
Regarding morphological factors, the fuzzy logistic regression model showed that they had an important role in the survival status of cancer patients. Similarly, morphological assessment studies showed that specific morphological characteristics are strongly associated with “basal-like breast carcinoma” and can provide helpful prognostic information in breast cancer.[19] Morphological assessment of the degree of differentiation has been shown in numerous studies to provide useful information in breast cancer,[20] which shows the power of fuzzy logistic regression for exploring influential factors.
This method has been applied in other areas, such as diabetes, lupus, and tuberculosis. In one study, the fuzzy logistic regression model was introduced as a new possibilistic model for determining diabetic status.[14] Another study measured the association between tuberculosis and smoking with this model.[21] Furthermore, Pourahmad applied fuzzy logistic regression to suspected cases of systemic lupus erythematosus.[22]
In this research, we introduced fuzzy logistic regression, a new statistical method, for determining the survival factors of breast cancer. Because the model yields possibilistic odds of survival and predicts status, the odds of survival can also be calculated for a new patient. Furthermore, the results of this study can be applied in clinical research.
CONCLUSION
Based on our findings, we recommend the fuzzy logistic regression model. First, the model is useful for determining the survival of patients with breast cancer based on real data. Second, the possibilistic odds of survival status can be calculated for a new case.
Carol E DeSantis; Stacey A Fedewa; Ann Goding Sauer; Joan L Kramer; Robert A Smith; Ahmedin Jemal. CA Cancer J Clin. 2015-10-29.
Giridhara R Babu; Goleen Samari; Sharon Phoebe Cohen; Tanmay Mahapatra; Randa May Wahbe; Sherin Mermash; Osman M Galal. Asian Pac J Cancer Prev. 2011.
J F Simpson; R Gray; L G Dressler; C D Cobau; C I Falkson; K W Gilchrist; K J Pandya; D L Page; N J Robert. J Clin Oncol. 2000-05.
Alireza Sadjadi; Mehdi Nouraie; Mohammad Ali Mohagheghi; Alireza Mousavi-Jarrahi; Reza Malekezadeh; Donald Maxwell Parkin. Asian Pac J Cancer Prev. 2005 Jul-Sep.