Literature DB >> 28971122

Breast cancer patients in Nigeria: Data exploration approach.

Pelumi E Oguntunde1, Adebowale O Adejumo2, Hilary I Okagbue1.   

Abstract

Breast cancer is the type of cancer that develops from breast tissue; it is mostly common in women and it is one of the most studied diseases, largely because of its high mortality (second to lung cancer). However, it occurs in males also. This article presents a statistical study of the distribution of age, gender, length of stay, mode of diagnosis, status (dead or alive) after treatment and the location of breast cancer among 300 patients admitted in the University of Ilorin teaching hospital, Ilorin, Nigeria. The study covers a period of five (5) years; from 2011 to 2016 and logistic regression was used to perform the basic analysis in this study. It was discovered that the age of patients and the location of the breast cancer (right or left) contributes significantly to the survival of the patients. However, early detection and treatment of the disease is highly encouraged. This study also recommends that awareness should be taken to the grassroots and males should not be excluded from this discussion.

Entities:  

Keywords:  Breast cancer; Logistic regression; Mortality; Oncology

Year:  2017        PMID: 28971122      PMCID: PMC5612794          DOI: 10.1016/j.dib.2017.08.038

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The data on breast cancer could be useful for government and health workers to make decisions that would reduce the risk of breast cancer among the populace. The data provides the analysis of the age, gender, location of the breast cancer, mode of diagnosis, length of stay (LOS), outcome of treatment of breast cancer patients for the population studied. The data can further be analyzed using other statistical tools like chi square test, multiple linear regression and Poisson regression analysis. The result from the analysis can be compared with other oncologic studies. The interpretation of the data could be helpful in educational studies, epidemiologic oncology, molecular pathologic epidemiology, and breast cancer awareness, screening and so on. The study can be replicated or extended to longitudinal studies. The article provides insight on the impact and consequence of age and location of breast cancer on the survivability of breast cancer patients.

Data

The data set used in this article was collected as a secondary data and it contains information on 300 breast cancer patients. The data set was obtained from the Cancer Registry Department under the Department of Admission and Discharge Unit, University of Ilorin Teaching Hospital (UITH) Ilorin, Nigeria. It involves information on 275 females and 25 males and it covers a period of five (5) years; from 2011 to 2016. The patients were all treated as in-patients and were later discharged, of these, 97 patients were discharged dead while 203 patients were discharged alive. The raw data is available and can be assessed as Supplementary data. Descriptive analyses were performed and logistic regression analysis was also used to describe and analyze the data set. The data is summarized under different classifications: gender (sex), location of the breast cancer, mode of diagnosis, survival after treatment, age and length of stay in the hospital during treatment.

Analysis of age of the patients

The frequency table showing the analysis of the age of all the 300 patients is shown in Table 1.
Table 1

Analysis of age.

Statistics
Age







NValid300
Missing0
Mean49.71
Median50.00
Mode60
Std. Deviation13.884
Variance192.768
Skewness.572
Std. Error of Skewness.141
Kurtosis.479
Std. Error of Kurtosis.281
Minimum20
Maximum96







Percentiles2540.00
5050.00
7560.00
Analysis of age. In Table 1, it can be seen that the mean age of the patients is 49.71 years, the minimum and maximum ages are 20 years and 96 years respectively. The data set is slightly positively skewed with a coefficient of skewness of 0.572. A diagrammatic representation of the age of the patients is as shown in Fig. 1.
Fig. 1

The distribution of age using histogram.

The distribution of age using histogram. The age of the patients were classified into three different groups (or classes) and the respective frequencies are as shown in Table 2.
Table 2

Classification of age of the patients.

Agecode
FrequencyPercentValid PercentCumulative Percent
Valid<41years8829.329.329.3
41–55years11538.338.367.7
> 55years9732.332.3100.0
Total300100.0100.0
Classification of age of the patients. It can be seen from Table 2 that majority (115) of the patients are in the age group 41–55 years which accounts for 38.3% of the total population under study. The diagrammatic representation of the information in Table 2 is as shown in Fig. 2.
Fig. 2

Bar chart showing the classification of age.

Bar chart showing the classification of age.

Analysis on length of stay of the patients at the hospital

Information on the length of stay of the patients in the hospital before discharge is as shown in Table 3 and the respective frequencies are also displayed.
Table 3

Classification of length of stay.

Loscode
FrequencyPercentValid PercentCumulative Percent
Valid< 11days10635.335.335.3
11–21days10133.733.769.0
> 21days9331.031.0100.0
Total300100.0100.0
Classification of length of stay. From Table 3, it can be seen that most (106) of the patients were discharged early and particularly in less than 11 days. The diagrammatic representation is as shown in Fig. 3.
Fig. 3

Bar chart showing the classification of length of stay.

Bar chart showing the classification of length of stay.

Analysis on the gender of the patients

The information on the gender of the patients is as shown in Table 4.
Table 4

Distribution of gender of the patients.

Gender/sexFrequencyPercentCumulative Percent
Female27591.791.7
Male258.3100.0
Total300100.0
Distribution of gender of the patients. It can be seen in Table 4 that majority (275) of the patients are females. Also, the table revealed the incidence of breast cancer among male patients. The information in Table 4 is represented diagrammatically in Fig. 4.
Fig. 4

Bar chart showing the distribution of gender.

Bar chart showing the distribution of gender.

Experimental design, materials and methods

Research on breast cancer and other form of cancer are intense because of the high fatality rate of the disease if not properly managed. Several aspects of breast cancer has been studied, some of which have generated data sets. The analysis on those data sets is based on the various experimental designs, research materials and referred scientific methods. Some of such areas are: CT images, growth factor levels in incident breast cancer, hormone receptor status, cytokine circulation, secretagogue users in breast cancer treatments, chemokine levels, breast cancer and diabetes mellitus co-infection and treatment, breast cancer and HIV treatment, breast cancer and pregnancy. Others are: proteome analysis, risk factors analysis, breast examination, screening, management and breast cancer awareness, epidemiology, risk assessment tools, treatment options: radiotherapy treatment versus chemotherapy, survival analysis, breast cancer subtypes, biomarkers, socio-cultural barriers to treatment, socio-demographic factors and alternative medicine approach, genetic risk, dietary patterns, early diagnostics and treatment and others [1], [2], [3], [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26]. Chi-square test of independence can be used to analyze the data collected, for instance, a cross-tabulation of gender and outcome of the patients at the point of discharge can be classified into a r x c contigency table as shown in Table 5. In this research however, logistic regression analysis was used to analyze the data set. See similar analysis in [27], [28], [29], [30]
Table 5

Crosstabulation for gender and outcome of patients.

sex * Outcome Crosstabulation
Count
Outcome
Total
AliveDead
Sexfemale18887275
male151025
Total20397300
Crosstabulation for gender and outcome of patients. Table 6 represents the coding for variables length of stay, age, location of cancer, mode of diagnosis and gender of the patients.
Table 6

Categorical variable coding.

FrequencyParameter coding
(1)(2)
Loscode< 11days1061.000.00
11–21days1010.001.00
> 21days930.000.00
Agecode<41years881.000.00
41–55years1150.001.00
> 55years970.000.00
Location of CancerBoth breasts251.000.00
Left breast1400.001.00
Right breast1350.000.00
Mode of DiagnosisCytological1661.00
Histological1340.00
sexFemale2751.00
Male250.00
Categorical variable coding. Table 7 shows the classification table at step 0.
Table 7

Classification Table.

Classification Tablea,b
ObservedPredicted
Outcome
Percentage Correct
AliveDead

Step 0OutcomeAlive2030100.0
Dead970.0
Overall Percentage67.7
Classification Table. Table 8 shows the variables in the equation at Step 0.
Table 8

Variables in the equation.

BS.E.WalddfSig.Exp(B)
Step 0Constant−.738.12335.7971.000.478
Variables in the equation. Block 1: Method = Backward Stepwise (Conditional). Table 9 shows the omnibus tests of model coefficients.
Table 9

Tests of model coefficients.

Omnibus Tests of Model Coefficients
Chi-squaredfSig.
Step 1Step20.7428.008
Block20.7428.008
Model20.7428.008
Step 2aStep−.8922.640
Block19.8506.003
Model19.8506.003
Step 3aStep−.2351.628
Block19.6165.001
Model19.6165.001
Step 4aStep−.4611.497
Block19.1554.001
Model19.1554.001

A negative Chi-squares value indicates that the Chi-squares value has decreased from the previous step.

Tests of model coefficients. A negative Chi-squares value indicates that the Chi-squares value has decreased from the previous step. Table 10 shows the model summary using the log-likelihood, Cox & Snell R square and Negelkerke R square.
Table 10

Model summary.

Model Summary
Step-2 Log likelihoodCox & Snell R SquareNagelkerke R Square
1356.872a.067.093
2357.764a.064.089
3357.998a.063.088
4358.459a.062.086

Estimation terminated at iteration number 4 because parameter estimates changed by less than .001.

Model summary. Estimation terminated at iteration number 4 because parameter estimates changed by less than .001. Table 11 shows the variables in the equation from Step 1 to Step 4:
Table 11

Variables in the equation.

BS.E.WalddfSig.Exp(B)95% C.I.for EXP(B)
LowerUpper
Step 1asex(1)−.232.454.2611.609.793.3251.932
agecode9.6412.008
agecode(1)−.827.3326.1941.013.437.228.839
agecode(2)−.875.3097.9961.005.417.227.765
Location of Cancer9.2092.010
Location of Cancer(1)1.092.4705.4071.0202.9811.1877.485
Location of Cancer(2).721.2766.8471.0092.0571.1983.531
Mode of Diagnosis(1)−.156.263.3531.552.855.5111.432
loscode.8832.643
loscode(1)−.238.319.5591.455.788.4221.471
loscode(2).031.316.0101.9211.032.5551.918
Constant−.271.503.2891.591.763





















Step 2asex(1)−.220.453.2371.626.802.3301.948
agecode9.6692.008
agecode(1)−.827.3316.2531.012.437.229.836
agecode(2)−.871.3097.9641.005.419.229.766
Location of Cancer9.5732.008
Location of Cancer(1)1.093.4685.4601.0192.9831.1937.462
Location of Cancer(2).742.2747.3231.0072.1001.2273.593
Mode of Diagnosis(1)−.166.263.3971.529.847.5061.418
Constant−.359.459.6131.434.698





















Step 3aagecode10.6842.005
agecode(1)−.852.3266.8141.009.427.225.809
agecode(2)−.898.3048.7431.003.407.225.739
Location of Cancer9.3892.009
Location of Cancer(1)1.076.4665.3251.0212.9331.1767.318
Location of Cancer(2).728.2727.1541.0072.0721.2153.533
Mode of Diagnosis(1)−.178.261.4611.497.837.5021.398
Constant−.528.3033.0331.082.590





















Step 4aagecode10.3592.006
agecode(1)−.832.3246.5811.010.435.230.822
agecode(2)−.877.3028.4461.004.416.230.752
Location of Cancer9.5812.008
Location of Cancer(1)1.114.4635.7841.0163.0471.2297.554
Location of Cancer(2).722.2727.0551.0082.0591.2083.509
Constant−.640.2566.2561.012.528

Variable(s) entered on step 1: sex, agecode, LocationofCancer, ModeofDiagnosis, loscode.

Variables in the equation. Variable(s) entered on step 1: sex, agecode, LocationofCancer, ModeofDiagnosis, loscode. Table 12 shows the Hosmer and Lemeshow Test.
Table 12

Hosmer and Lemeshow Test.

Hosmer and Lemeshow Test
StepChi-squaredfSig.
18.5668.380
21.5028.993
31.3808.995
41.1935.946
Hosmer and Lemeshow Test. Table 13 shows the classification table for all the steps; steps 1–4.
Table 13

Classification Table.

ObservedPredicted
Outcome
Percentage Correct
AliveDead
Step 1OutcomeAlive1871692.1
Dead742323.7
Overall Percentage70.0
Step 2OutcomeAlive1931095.1
Dead811616.5
Overall Percentage69.7
Step 3OutcomeAlive1802388.7
Dead682929.9
Overall Percentage69.7
Step 4OutcomeAlive1802388.7
Dead682929.9
Overall Percentage69.7

a. The cut value is .500

Classification Table. a. The cut value is .500 The predictive probability is as shown in Fig. 5.
Fig. 5

Diagram of predictive probabilities.

Diagram of predictive probabilities. Breast cancer is one of the dangerous diseases. It occurs in both males and females but the incidence is more in females. Based on this present study, the age of the patient and the location of the breast cancer (right breast or left breast) both contribute significantly to whether a patient would survive the breast cancer disease or not.
Subject areaMedicine
More specific subject areaBiostatistics, Oncology
Type of dataTable and text file
How data was acquiredUnprocessed secondary data
Data formatRaw, analyzed
Experimental factorsRecords of Breast cancer patients obtained from University of Ilorin Teaching Hospital (UITH), Nigeria.
Experimental featuresComputational Analysis: Histogram, Bar-chart, Contingency tables, Logistic regression analysis.
Data source locationUniversity of Ilorin Teaching Hospital (UITH), Nigeria
Data accessibilityAll the data are available in this data article as supplementary materials
  25 in total

1.  Determinants of stage at diagnosis of breast cancer in Nigerian women: sociodemographic, breast cancer awareness, health care access and clinical factors.

Authors:  Elima Jedy-Agba; Valerie McCormack; Oluwole Olaomi; Wunmi Badejo; Monday Yilkudi; Terna Yawe; Emmanuel Ezeome; Iliya Salu; Elijah Miner; Ikechukwu Anosike; Sally N Adebamowo; Benjamin Achusi; Isabel Dos-Santos-Silva; Clement Adebamowo
Journal:  Cancer Causes Control       Date:  2017-04-26       Impact factor: 2.506

2.  The severity, outcome and challenges of breast cancer in Nigeria.

Authors:  A R K Adesunkanmi; O O Lawal; K A Adelusola; M A Durosimi
Journal:  Breast       Date:  2005-08-08       Impact factor: 4.380

3.  Five year cancer incidence in Calabar, Nigeria (2009-2013).

Authors:  Ima-Obong A Ekanem; Donald M Parkin
Journal:  Cancer Epidemiol       Date:  2016-05-07       Impact factor: 2.984

4.  An epidemiologic investigation of physical activity and breast cancer risk in Africa.

Authors:  Ningqi Hou; Paul Ndom; Johashaphat Jombwe; Temidayo Ogundiran; Adeyinka Ademola; Imran Morhason-Bello; Oladosu Ojengbede; Antony Gakwaya; Dezheng Huo
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2014-09-21       Impact factor: 4.254

5.  TH1 and TH2 cytokines dataset in insulin users with diabetes mellitus and newly diagnosed breast cancer.

Authors:  Zachary A P Wintrob; Jeffrey P Hammel; George K Nimako; Dan P Gaile; Alan Forrest; Alice C Ceacareanu
Journal:  Data Brief       Date:  2017-02-20

6.  CT images and radiotherapy treatment planning of patients with breast cancer: A dataset.

Authors:  Mohammad Rezaei; Ahmad Mohammadbeigi; Karim Khoshgard; Abbas Haghparast
Journal:  Data Brief       Date:  2017-06-10

7.  Insulin use, hormone receptor status and hematopoietic cytokines׳ circulation in women with diabetes mellitus and breast cancer.

Authors:  Zachary A P Wintrob; Jeffrey P Hammel; George K Nimako; Dan P Gaile; Alan Forrest; Alice C Ceacareanu
Journal:  Data Brief       Date:  2017-02-21

8.  Insulin secretagogue use and circulating inflammatory C-C chemokine levels in breast cancer patients.

Authors:  Zachary A P Wintrob; Jeffrey P Hammel; George K Nimako; Zahra S Fayazi; Dan P Gaile; Alan Forrest; Alice C Ceacareanu
Journal:  Data Brief       Date:  2017-02-16

9.  Management of pregnancy associated breast cancer with chemotherapy in a developing country.

Authors:  Emmanuel A Sule; Festus Ewemade
Journal:  Int J Surg Case Rep       Date:  2015-11-04

10.  Data report on inflammatory C-C chemokines among insulin-using women with diabetes mellitus and breast cancer.

Authors:  Zachary A P Wintrob; Jeffrey P Hammel; George K Nimako; Zahra S Fayazi; Dan P Gaile; Erin E Davis; Alan Forrest; Alice C Ceacareanu
Journal:  Data Brief       Date:  2017-02-22
View more
  8 in total

1.  Quantitative evaluation of pregnant women delivery status' records in Akure, Nigeria.

Authors:  Adebowale O Adejumo; Esivue A Suleiman; Hilary I Okagbue; Pelumi E Oguntunde; Oluwole A Odetunmibi
Journal:  Data Brief       Date:  2017-11-14

Review 2.  Global Inequities in Precision Medicine and Molecular Cancer Research.

Authors:  Thomas M Drake; Stephen R Knight; Ewen M Harrison; Kjetil Søreide
Journal:  Front Oncol       Date:  2018-09-04       Impact factor: 6.244

3.  Learning analytics: Data sets on the academic record of accounting students in a Nigerian University.

Authors:  Folashade O Owolabi; Pelumi E Oguntunde; Dorcas T Adetula; Samuel A Fakile
Journal:  Data Brief       Date:  2018-06-26

Review 4.  Systematic Review of Prevalence of Antepartum Depression during the Trimesters of Pregnancy.

Authors:  Hilary I Okagbue; Patience I Adamu; Sheila A Bishop; Pelumi E Oguntunde; Abiodun A Opanuga; Elvir M Akhmetshin
Journal:  Open Access Maced J Med Sci       Date:  2019-05-14

5.  Data in support of high rate of pregnancy related deaths in Maiduguri, Borno State, Northeast Nigeria.

Authors:  Patience I Adamu; Muminu O Adamu; Hilary I Okagbue
Journal:  Data Brief       Date:  2018-03-16

6.  Statistical data analysis of cancer incidences in insurgency affected states in Nigeria.

Authors:  Patience I Adamu; Pelumi E Oguntunde; Hilary I Okagbue; Olasunmbo O Agboola
Journal:  Data Brief       Date:  2018-05-05

7.  Analysis of selected crime data in Nigeria.

Authors:  Pelumi E Oguntunde; Oluwadare O Ojo; Hilary I Okagbue; Omoleye A Oguntunde
Journal:  Data Brief       Date:  2018-06-02

8.  On the Epidemiology and Statistical Analysis of HIV/AIDS Patients in the Insurgency Affected States of Nigeria.

Authors:  Patience I Adamu; Pelumi E Oguntunde; Hilary I Okagbue; Olasunmbo O Agboola
Journal:  Open Access Maced J Med Sci       Date:  2018-07-19
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.