Pelumi E Oguntunde, Adebowale O Adejumo, Hilary I Okagbue.
Abstract
Breast cancer develops from breast tissue. It is most common in women, although it also occurs in men, and it is one of the most studied diseases, largely because of its high mortality (second to lung cancer). This article presents a statistical study of the distribution of age, gender, length of stay, mode of diagnosis, status (dead or alive) after treatment, and the location of the breast cancer among 300 patients admitted to the University of Ilorin Teaching Hospital, Ilorin, Nigeria. The study covers a period of five (5) years, from 2011 to 2016, and logistic regression was used to perform the basic analysis. The age of the patients and the location of the breast cancer (right or left) were found to contribute significantly to the survival of the patients. Early detection and treatment of the disease are strongly encouraged. The study also recommends that awareness be taken to the grassroots and that males not be excluded from this discussion.
Keywords: Breast cancer; Logistic regression; Mortality; Oncology
Year: 2017 PMID: 28971122 PMCID: PMC5612794 DOI: 10.1016/j.dib.2017.08.038
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Analysis of age.
| Statistic | Age (years) |
|---|---|
| N (valid) | 300 |
| N (missing) | 0 |
| Mean | 49.71 |
| Median | 50.00 |
| Mode | 60 |
| Std. Deviation | 13.884 |
| Variance | 192.768 |
| Skewness | .572 |
| Std. Error of Skewness | .141 |
| Kurtosis | .479 |
| Std. Error of Kurtosis | .281 |
| Minimum | 20 |
| Maximum | 96 |
| 25th percentile | 40.00 |
| 50th percentile | 50.00 |
| 75th percentile | 60.00 |
Fig. 1. The distribution of age using a histogram.
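The descriptive statistics for age can be cross-checked with a few lines of arithmetic. The sketch below is not part of the original analysis; it assumes the usual sample formulas for the standard errors of skewness and excess kurtosis (which depend only on the sample size) and uses only the values printed in the table. The ratio of a statistic to its standard error is a rule-of-thumb indicator, not a formal test.

```python
import math

# Values copied from the "Analysis of age" table (SPSS output).
n = 300
std_dev = 13.884
skewness, kurtosis = 0.572, 0.479

# The variance should equal the squared standard deviation, up to rounding.
variance_from_sd = std_dev ** 2

# Standard errors of skewness and excess kurtosis as functions of n only
# (assumed formulas; they reproduce the .141 and .281 shown in the table).
se_skew = math.sqrt(6 * n * (n - 1) / ((n - 2) * (n + 1) * (n + 3)))
se_kurt = math.sqrt(4 * (n**2 - 1) * se_skew**2 / ((n - 3) * (n + 5)))

# Ratio of each statistic to its standard error.
z_skew = skewness / se_skew
z_kurt = kurtosis / se_kurt

print(f"variance from SD: {variance_from_sd:.3f}")
print(f"SE skewness: {se_skew:.3f}, SE kurtosis: {se_kurt:.3f}")
print(f"z skewness: {z_skew:.2f}, z kurtosis: {z_kurt:.2f}")
```

A skewness ratio well above 2 is consistent with the mild right tail suggested by the maximum age of 96 against a median of 50.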
Classification of age of the patients.
| Age group | Frequency | Percent | Valid Percent | Cumulative Percent |
|---|---|---|---|---|
| < 41 years | 88 | 29.3 | 29.3 | 29.3 |
| 41–55 years | 115 | 38.3 | 38.3 | 67.7 |
| > 55 years | 97 | 32.3 | 32.3 | 100.0 |
| Total | 300 | 100.0 | 100.0 | |
Fig. 2. Bar chart showing the classification of age.
Classification of length of stay.
| Length of stay | Frequency | Percent | Valid Percent | Cumulative Percent |
|---|---|---|---|---|
| < 11 days | 106 | 35.3 | 35.3 | 35.3 |
| 11–21 days | 101 | 33.7 | 33.7 | 69.0 |
| > 21 days | 93 | 31.0 | 31.0 | 100.0 |
| Total | 300 | 100.0 | 100.0 | |
Fig. 3. Bar chart showing the classification of length of stay.
Distribution of gender of the patients.
| Gender/sex | Frequency | Percent | Cumulative Percent |
|---|---|---|---|
| Female | 275 | 91.7 | 91.7 |
| Male | 25 | 8.3 | 100.0 |
| Total | 300 | 100.0 | |
Fig. 4. Bar chart showing the distribution of gender.
Crosstabulation for gender and outcome of patients.
| Sex (count) | Outcome: Alive | Outcome: Dead | Total |
|---|---|---|---|
| Female | 188 | 87 | 275 |
| Male | 15 | 10 | 25 |
| Total | 203 | 97 | 300 |
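To make the crosstabulation easier to read, the following sketch derives the death rate by sex, the unadjusted odds ratio, and a Pearson chi-square statistic from the four cell counts. This is an illustrative calculation added here, not an analysis reported in the article; the variable names are hypothetical, and the chi-square is computed without a continuity correction.

```python
import math

# Cell counts from the gender-by-outcome crosstabulation.
table = {
    "female": {"alive": 188, "dead": 87},
    "male": {"alive": 15, "dead": 10},
}

def death_rate(row):
    """Proportion of patients in a row who died."""
    return row["dead"] / (row["alive"] + row["dead"])

rates = {sex: death_rate(row) for sex, row in table.items()}

# Unadjusted odds ratio of death, female relative to male.
odds_ratio = (table["female"]["dead"] / table["female"]["alive"]) / (
    table["male"]["dead"] / table["male"]["alive"]
)

# Pearson chi-square for the 2x2 table (no continuity correction).
n = sum(sum(row.values()) for row in table.values())
col_totals = {k: sum(row[k] for row in table.values()) for k in ("alive", "dead")}
chi_square = 0.0
for row in table.values():
    row_total = sum(row.values())
    for outcome, observed in row.items():
        expected = row_total * col_totals[outcome] / n
        chi_square += (observed - expected) ** 2 / expected

# With 1 degree of freedom the upper-tail probability is erfc(sqrt(x / 2)).
p_value = math.erfc(math.sqrt(chi_square / 2))

print({sex: round(rate, 3) for sex, rate in rates.items()})
print(f"odds ratio: {odds_ratio:.3f}, chi-square: {chi_square:.3f}, p: {p_value:.3f}")
```

The unadjusted association between sex and outcome is weak in this sample, which agrees with the non-significant `sex(1)` term in the regression tables that follow.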
Categorical variable coding.
| Variable | Category | Frequency | Parameter coding (1) | Parameter coding (2) |
|---|---|---|---|---|
| Loscode | < 11 days | 106 | 1.00 | 0.00 |
| | 11–21 days | 101 | 0.00 | 1.00 |
| | > 21 days | 93 | 0.00 | 0.00 |
| Agecode | < 41 years | 88 | 1.00 | 0.00 |
| | 41–55 years | 115 | 0.00 | 1.00 |
| | > 55 years | 97 | 0.00 | 0.00 |
| Location of Cancer | Both breasts | 25 | 1.00 | 0.00 |
| | Left breast | 140 | 0.00 | 1.00 |
| | Right breast | 135 | 0.00 | 0.00 |
| Mode of Diagnosis | Cytological | 166 | 1.00 | |
| | Histological | 134 | 0.00 | |
| Sex | Female | 275 | 1.00 | |
| | Male | 25 | 0.00 | |
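The coding table describes indicator (dummy) coding in which the last category of each variable is the reference and receives zeros throughout. A minimal sketch of that scheme is given below; the dictionary and function names are hypothetical and only mirror the categories listed in the table.

```python
# Categories in the order shown in the coding table; the last entry of each
# list is the reference category (all indicator columns equal 0).
CODING_SCHEME = {
    "loscode": ["< 11 days", "11-21 days", "> 21 days"],
    "agecode": ["< 41 years", "41-55 years", "> 55 years"],
    "location": ["Both breasts", "Left breast", "Right breast"],
    "mode_of_diagnosis": ["Cytological", "Histological"],
    "sex": ["Female", "Male"],
}

def indicator_code(variable, value):
    """Return the indicator columns for `value`, with the last category as reference."""
    categories = CODING_SCHEME[variable]
    if value not in categories:
        raise ValueError(f"unknown category {value!r} for {variable!r}")
    return [1.0 if value == category else 0.0 for category in categories[:-1]]

print(indicator_code("location", "Left breast"))   # second indicator is set
print(indicator_code("agecode", "> 55 years"))     # reference: all zeros
print(indicator_code("sex", "Female"))
```

Under this coding, each coefficient in the later tables compares one category with the reference: for example, `agecode(1)` compares patients under 41 with those over 55.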
Classification Table.
| Step | Observed outcome | Predicted: Alive | Predicted: Dead | Percentage Correct |
|---|---|---|---|---|
| Step 0 | Alive | 203 | 0 | 100.0 |
| | Dead | 97 | 0 | .0 |
| | Overall Percentage | | | 67.7 |
Variables in the equation.
| Step | Variable | B | S.E. | Wald | df | Sig. | Exp(B) |
|---|---|---|---|---|---|---|---|
| Step 0 | Constant | −.738 | .123 | 35.797 | 1 | .000 | .478 |
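The Step 0 (intercept-only) row can be reproduced from the outcome counts alone. The sketch below assumes that the model estimates the log-odds of the outcome "Dead", which is consistent with the sign of the constant; the standard error uses the usual large-sample formula for a log-odds estimated from two counts.

```python
import math

# Outcome counts from the Step 0 classification table.
alive, dead = 203, 97

# Intercept-only logistic model: B is the log-odds of death in the sample.
b0 = math.log(dead / alive)

# Large-sample standard error of a log-odds estimated from two counts.
se = math.sqrt(1 / dead + 1 / alive)

# Wald statistic and the odds themselves, Exp(B).
wald = (b0 / se) ** 2
exp_b = math.exp(b0)

# With no predictors every patient is assigned the majority class (Alive).
baseline_accuracy = 100 * max(alive, dead) / (alive + dead)

print(f"B = {b0:.3f}, S.E. = {se:.3f}, Wald = {wald:.3f}, Exp(B) = {exp_b:.3f}")
print(f"baseline percentage correct: {baseline_accuracy:.1f}")
```

The 67.7% overall percentage at Step 0 is therefore the benchmark that the fitted models in the later classification table have to beat.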
Tests of model coefficients.
| Step | Test | Chi-square | df | Sig. |
|---|---|---|---|---|
| Step 1 | Step | 20.742 | 8 | .008 |
| | Block | 20.742 | 8 | .008 |
| | Model | 20.742 | 8 | .008 |
| Step 2 | Step | −.892 | 2 | .640 |
| | Block | 19.850 | 6 | .003 |
| | Model | 19.850 | 6 | .003 |
| Step 3 | Step | −.235 | 1 | .628 |
| | Block | 19.616 | 5 | .001 |
| | Model | 19.616 | 5 | .001 |
| Step 4 | Step | −.461 | 1 | .497 |
| | Block | 19.155 | 4 | .001 |
| | Model | 19.155 | 4 | .001 |
A negative chi-square value indicates that the chi-square value has decreased from the previous step.
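The significance values in this table can be checked by hand for the rows with an even number of degrees of freedom, because the chi-square upper-tail probability then has a closed form. The helper below is a sketch written for this check (its name is hypothetical); rows with odd degrees of freedom, such as the Step 3 model with df = 5, are left out because this closed form does not apply to them.

```python
import math

def chi2_sf_even_df(x, df):
    """Upper-tail probability of a chi-square variable; valid only for even df."""
    if df <= 0 or df % 2:
        raise ValueError("this closed form requires a positive even df")
    half = x / 2
    terms = sum(half**k / math.factorial(k) for k in range(df // 2))
    return math.exp(-half) * terms

# (chi-square, df) pairs taken from the table above.
model_rows = {"Step 1": (20.742, 8), "Step 2": (19.850, 6), "Step 4": (19.155, 4)}
p_values = {step: chi2_sf_even_df(x, df) for step, (x, df) in model_rows.items()}

# Removal test at Step 2: the sign only records a decrease in chi-square,
# so the magnitude of the change is what gets tested.
p_removal_step2 = chi2_sf_even_df(0.892, 2)

# The model chi-square after each step is the previous value plus the change.
running = 20.742
model_after = []
for change in (-0.892, -0.235, -0.461):
    running += change
    model_after.append(round(running, 3))

print({step: round(p, 3) for step, p in p_values.items()})
print(round(p_removal_step2, 3), model_after)
```

The running totals agree with the reported model chi-squares to within rounding of the last digit.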
Model summary.
| Step | -2 Log likelihood | Cox & Snell R Square | Nagelkerke R Square |
|---|---|---|---|
| 1 | 356.872 | .067 | .093 |
| 2 | 357.764 | .064 | .089 |
| 3 | 357.998 | .063 | .088 |
| 4 | 358.459 | .062 | .086 |
Estimation terminated at iteration number 4 because parameter estimates changed by less than .001.
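The two pseudo R-square columns follow from the -2 log likelihood values and the outcome counts. The sketch below assumes the standard definitions of the Cox & Snell and Nagelkerke statistics and computes the null (intercept-only) -2 log likelihood from the 203 alive and 97 dead patients; it is a consistency check, not part of the published analysis.

```python
import math

n, alive, dead = 300, 203, 97

# -2 log likelihood of the intercept-only model, from the outcome counts.
null_deviance = -2 * (alive * math.log(alive / n) + dead * math.log(dead / n))

# -2 log likelihood at each step, copied from the model summary table.
deviance_by_step = {1: 356.872, 2: 357.764, 3: 357.998, 4: 358.459}

def pseudo_r2(deviance):
    """Return (Cox & Snell, Nagelkerke) R-square for a fitted model."""
    cox_snell = 1 - math.exp(-(null_deviance - deviance) / n)
    max_cox_snell = 1 - math.exp(-null_deviance / n)
    return cox_snell, cox_snell / max_cox_snell

results = {step: pseudo_r2(dev) for step, dev in deviance_by_step.items()}

print(f"null -2LL: {null_deviance:.3f}")
for step, (cs, nk) in results.items():
    print(f"step {step}: Cox & Snell = {cs:.3f}, Nagelkerke = {nk:.3f}")
```

Both statistics stay below 0.1 at every step, so the predictors explain only a small share of the variation in outcome even though the model as a whole is statistically significant.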
Variables in the equation.
| Step | Variable | B | S.E. | Wald | df | Sig. | Exp(B) | 95% C.I. for Exp(B): Lower | 95% C.I. for Exp(B): Upper |
|---|---|---|---|---|---|---|---|---|---|
| Step 1 | sex(1) | −.232 | .454 | .261 | 1 | .609 | .793 | .325 | 1.932 |
| | agecode | | | 9.641 | 2 | .008 | | | |
| | agecode(1) | −.827 | .332 | 6.194 | 1 | .013 | .437 | .228 | .839 |
| | agecode(2) | −.875 | .309 | 7.996 | 1 | .005 | .417 | .227 | .765 |
| | Location of Cancer | | | 9.209 | 2 | .010 | | | |
| | Location of Cancer(1) | 1.092 | .470 | 5.407 | 1 | .020 | 2.981 | 1.187 | 7.485 |
| | Location of Cancer(2) | .721 | .276 | 6.847 | 1 | .009 | 2.057 | 1.198 | 3.531 |
| | Mode of Diagnosis(1) | −.156 | .263 | .353 | 1 | .552 | .855 | .511 | 1.432 |
| | loscode | | | .883 | 2 | .643 | | | |
| | loscode(1) | −.238 | .319 | .559 | 1 | .455 | .788 | .422 | 1.471 |
| | loscode(2) | .031 | .316 | .010 | 1 | .921 | 1.032 | .555 | 1.918 |
| | Constant | −.271 | .503 | .289 | 1 | .591 | .763 | | |
| Step 2 | sex(1) | −.220 | .453 | .237 | 1 | .626 | .802 | .330 | 1.948 |
| | agecode | | | 9.669 | 2 | .008 | | | |
| | agecode(1) | −.827 | .331 | 6.253 | 1 | .012 | .437 | .229 | .836 |
| | agecode(2) | −.871 | .309 | 7.964 | 1 | .005 | .419 | .229 | .766 |
| | Location of Cancer | | | 9.573 | 2 | .008 | | | |
| | Location of Cancer(1) | 1.093 | .468 | 5.460 | 1 | .019 | 2.983 | 1.193 | 7.462 |
| | Location of Cancer(2) | .742 | .274 | 7.323 | 1 | .007 | 2.100 | 1.227 | 3.593 |
| | Mode of Diagnosis(1) | −.166 | .263 | .397 | 1 | .529 | .847 | .506 | 1.418 |
| | Constant | −.359 | .459 | .613 | 1 | .434 | .698 | | |
| Step 3 | agecode | | | 10.684 | 2 | .005 | | | |
| | agecode(1) | −.852 | .326 | 6.814 | 1 | .009 | .427 | .225 | .809 |
| | agecode(2) | −.898 | .304 | 8.743 | 1 | .003 | .407 | .225 | .739 |
| | Location of Cancer | | | 9.389 | 2 | .009 | | | |
| | Location of Cancer(1) | 1.076 | .466 | 5.325 | 1 | .021 | 2.933 | 1.176 | 7.318 |
| | Location of Cancer(2) | .728 | .272 | 7.154 | 1 | .007 | 2.072 | 1.215 | 3.533 |
| | Mode of Diagnosis(1) | −.178 | .261 | .461 | 1 | .497 | .837 | .502 | 1.398 |
| | Constant | −.528 | .303 | 3.033 | 1 | .082 | .590 | | |
| Step 4 | agecode | | | 10.359 | 2 | .006 | | | |
| | agecode(1) | −.832 | .324 | 6.581 | 1 | .010 | .435 | .230 | .822 |
| | agecode(2) | −.877 | .302 | 8.446 | 1 | .004 | .416 | .230 | .752 |
| | Location of Cancer | | | 9.581 | 2 | .008 | | | |
| | Location of Cancer(1) | 1.114 | .463 | 5.784 | 1 | .016 | 3.047 | 1.229 | 7.554 |
| | Location of Cancer(2) | .722 | .272 | 7.055 | 1 | .008 | 2.059 | 1.208 | 3.509 |
| | Constant | −.640 | .256 | 6.256 | 1 | .012 | .528 | | |
Variable(s) entered on step 1: sex, agecode, LocationofCancer, ModeofDiagnosis, loscode.
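To show how the Step 4 coefficients translate into odds ratios and predicted probabilities, the sketch below recomputes Exp(B) with a 95% Wald interval and evaluates the fitted model for each combination of age group and cancer location. It assumes the modelled outcome is "Dead" (consistent with the Step 0 constant) and uses the reference categories from the coding table (over 55 years, right breast). The names `STEP4`, `odds_ratio_ci`, and `death_probability` are hypothetical, and small differences from the table arise because B and S.E. are printed to three decimals.

```python
import math

# Step 4 estimates (B, S.E.) from the "Variables in the equation" table.
STEP4 = {
    "agecode(1)": (-0.832, 0.324),    # < 41 years vs > 55 years
    "agecode(2)": (-0.877, 0.302),    # 41-55 years vs > 55 years
    "location(1)": (1.114, 0.463),    # both breasts vs right breast
    "location(2)": (0.722, 0.272),    # left breast vs right breast
}
CONSTANT = -0.640
Z_95 = 1.96  # normal quantile for a two-sided 95% interval

def odds_ratio_ci(b, se):
    """Return (Exp(B), lower, upper) for a 95% Wald interval."""
    return math.exp(b), math.exp(b - Z_95 * se), math.exp(b + Z_95 * se)

def death_probability(age_term=None, location_term=None):
    """Predicted probability of death; None selects the reference category."""
    logit = CONSTANT
    for term in (age_term, location_term):
        if term is not None:
            logit += STEP4[term][0]
    return 1 / (1 + math.exp(-logit))

odds_ratios = {name: odds_ratio_ci(b, se) for name, (b, se) in STEP4.items()}

AGE = {"< 41 years": "agecode(1)", "41-55 years": "agecode(2)", "> 55 years": None}
LOCATION = {"Both breasts": "location(1)", "Left breast": "location(2)",
            "Right breast": None}
probabilities = {
    (age, loc): death_probability(age_term, loc_term)
    for age, age_term in AGE.items()
    for loc, loc_term in LOCATION.items()
}
# Covariate patterns classified as "Dead" at the .500 cut value.
above_cut = [pattern for pattern, p in probabilities.items() if p > 0.5]

for name, (or_, lo, hi) in odds_ratios.items():
    print(f"{name}: Exp(B) = {or_:.3f}, 95% CI = ({lo:.3f}, {hi:.3f})")
print(above_cut)
```

Under these estimates, only patients over 55 years with cancer in the left breast or in both breasts have a predicted probability of death above the .500 cut value, which helps explain why the model classifies few patients as "Dead".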
Hosmer and Lemeshow Test.
| Step | Chi-square | df | Sig. |
|---|---|---|---|
| 1 | 8.566 | 8 | .380 |
| 2 | 1.502 | 8 | .993 |
| 3 | 1.380 | 8 | .995 |
| 4 | 1.193 | 5 | .946 |
Classification Table.
| Step | Observed outcome | Predicted: Alive | Predicted: Dead | Percentage Correct |
|---|---|---|---|---|
| Step 1 | Alive | 187 | 16 | 92.1 |
| | Dead | 74 | 23 | 23.7 |
| | Overall Percentage | | | 70.0 |
| Step 2 | Alive | 193 | 10 | 95.1 |
| | Dead | 81 | 16 | 16.5 |
| | Overall Percentage | | | 69.7 |
| Step 3 | Alive | 180 | 23 | 88.7 |
| | Dead | 68 | 29 | 29.9 |
| | Overall Percentage | | | 69.7 |
| Step 4 | Alive | 180 | 23 | 88.7 |
| | Dead | 68 | 29 | 29.9 |
| | Overall Percentage | | | 69.7 |
a. The cut value is .500
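The percentages in the classification tables are simple ratios of the cell counts. The sketch below recomputes them for every step, including the Step 0 baseline; the dictionary layout and function name are hypothetical.

```python
# Counts per step: observed outcome -> (predicted alive, predicted dead).
STEPS = {
    0: {"alive": (203, 0), "dead": (97, 0)},
    1: {"alive": (187, 16), "dead": (74, 23)},
    2: {"alive": (193, 10), "dead": (81, 16)},
    3: {"alive": (180, 23), "dead": (68, 29)},
    4: {"alive": (180, 23), "dead": (68, 29)},
}

def summarize(table):
    """Percentage of correct predictions per observed outcome and overall."""
    (alive_as_alive, alive_as_dead) = table["alive"]
    (dead_as_alive, dead_as_dead) = table["dead"]
    total = alive_as_alive + alive_as_dead + dead_as_alive + dead_as_dead
    return {
        "alive_correct": 100 * alive_as_alive / (alive_as_alive + alive_as_dead),
        "dead_correct": 100 * dead_as_dead / (dead_as_alive + dead_as_dead),
        "overall": 100 * (alive_as_alive + dead_as_dead) / total,
    }

summary = {step: summarize(table) for step, table in STEPS.items()}

for step, s in summary.items():
    print(f"step {step}: alive {s['alive_correct']:.1f}%, "
          f"dead {s['dead_correct']:.1f}%, overall {s['overall']:.1f}%")
```

Overall accuracy rises only from 67.7% at Step 0 to about 70% for the fitted models, and fewer than a third of the observed deaths are classified correctly at any step.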
Fig. 5. Diagram of predictive probabilities.
| Subject area | Medicine |
| More specific subject area | Biostatistics, Oncology |
| Type of data | Table and text file |
| How data was acquired | Unprocessed secondary data |
| Data format | Raw, analyzed |
| Experimental factors | Records of Breast cancer patients obtained from University of Ilorin Teaching Hospital (UITH), Nigeria. |
| Experimental features | Computational Analysis: Histogram, Bar-chart, Contingency tables, Logistic regression analysis. |
| Data source location | University of Ilorin Teaching Hospital (UITH), Nigeria |
| Data accessibility | All the data are available in this data article as supplementary materials |