Literature DB >> 32226801

Malaria patients in Nigeria: Data exploration approach.

Nureni Olawale Adeboye1, Olawale Victor Abimbola1, Sakinat Oluwabukola Folorunso2.   

Abstract

Malaria is a life threatening disease which is usually transmitted to people through the bite of infected female anopheles mosquitoes. However, this article deals with the data exploration of malaria symptoms reported by 337 patients attended to at Federal Polytechnic Ilaro Medical centre, Ogun State Nigeria. The study covers a period of four (4) weeks monitoring of patients attendance, their consultation with physician and malaria test results as compared to their claims of malaria infection. Logistic regression was used for the basic analysis of the dataset and it was discovered that people in the age range 38-47 years are mostly affected with malaria and that females are the most infected gender species with headache being the most significant symptom based on its Wald statistic value. This study strongly recommends the introduction of a long lasting malaria prevention scheme that cut across all categories of ages and genders within the Nigerian community, and that self-medication should be seriously warned against as most claims of malaria were not actually found to be true upon verification.
© 2019 The Authors.

Entities:  

Keywords:  Headache; Logistic regression; Malaria; Mosquitoes

Year:  2019        PMID: 32226801      PMCID: PMC7093799          DOI: 10.1016/j.dib.2019.104997

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table The data on malaria infection could be useful for government and health workers to make decisions that would reduce the risk of malaria infection among the populace. This work provides a deeper understanding of the prevalence and prognosis of malaria infection. The data can be useful in malaria infection awareness, management and treatment. The data could be used as a baseline for comparison in future studies. The data reveals high significant impacts of prevalent factors such as headache, pain, fever, cold etc. on malaria morbidity

Data

The data set used in this article was collected as a secondary data from Federal Polytechnic Ilaro Medical centre, Ilaro Ogun state, Nigeria and it contains information on 337 patients who presented themselves for consultation on malaria related infections. The symptoms reported by the patients were recorded and information about the same patients were collected after been tested for malaria. These patients are between the ages of 3 and 77 years of whom 180 are females and 157 are males, and their data was collected for a period of 4 weeks. The recorded symptoms as reported by the patients were all compared with the results of the malaria test, and the results of the malaria test was used for the target variables. This dataset consist of 15 malaria symptoms which are “Fever, Cold, Rigor, Fatigue, Headache, Bitter-tongue, Vomiting, Diarrhea, Convulsion, Anemia, Jaundice, Cocacola-Urine, Hypoglycemia, Prostration, and Hyperpyrexia” as collected. From the dataset, Ages of the patients are recorded in years while gender were encoded in ordinal form as “0” for Male and “1” for Female. Other features are encoded in integers (“0” for non-presence and “1” for the symptoms presence). This raw dataset which has been approved by the medical director, representing the institutional bioethics committee is available and can be assessed as Supplementary data. Descriptive analyses were performed and logistic regression analysis was also used to describe and analyze the data set. The data is summarized under different classifications which are: classification based on gender (sex), malaria infection classification for age, classification of malaria infection by sex and classification based on some common malaria symptoms.

Analysis of age of the patients

The frequency table showing the analysis of the age of all the 337 patients is shown in Table 1. In Table 1, it can be seen that the mean age of the patients is 30.35 years, the minimum and maximum ages are 3 year and 77 years respectively. The data set is slightly positively skewed and leptokurtic with a coefficient of Skewness and kurtosis of 0.755 and 0.536 respectively.
Table 1

Analysis of age in years.

Statistics
N
 Valid337
 Missing0
Mean30.35
Median29.00
Mode31
Std. Deviation14.721
Variance216.704
Skewness.755
Std. Error of Skewness.133
Kurtosis.536
Std. Error of Kurtosis.265
Range74
Minimum3
Maximum77
Sum10,227
Analysis of age in years. A diagrammatic representation of the age distribution and age range of the patients is as shown in Fig. 1, Fig. 2 respectively. The age of the patients were classified into eight different groups (or classes) and the respective frequencies are as shown in Table 2. It can be seen from Table 2 that majority (50) of the patients are in the age group 38–47 years which is approximately 15% of the total population. The diagrammatic representation of the information in Table 2 is as shown in Fig. 2.
Fig. 1

Age distribution (Years).

Fig. 2

Percentage distribution of Ages (Years).

Table 2

Classification of age of patients (Years).

Age RangeFrequenciesPercentage
8–174814.2
18–274011.8
28–374413.0
38–475014.8
48–574413.0
58–674613.6
68–774914.5
78–87175.0
Total338100
Age distribution (Years). Percentage distribution of Ages (Years). Classification of age of patients (Years). Information on the gender is as shown in Table 3 and the respective frequencies are also displayed. From Table 3, it can be seen that most of the patients were female. The diagrammatic representation is as shown in Fig. 3.
Table 3

Distribution of gender of the patients.

SexFrequencyPercentValid PercentCumulative Percent
Male15746.646.646.6
Female18053.453.4100.0
Total337100.0100.0
Fig. 3

Bar Chart showing the distribution of gender.

Distribution of gender of the patients. Bar Chart showing the distribution of gender.

Analysis on malaria diagnosis using logistic regression

Information on the diagnosis of patients who presented themselves for malaria treatment was shown in Table 4 and it was observed that only 116 of the 337 reported cases were actually found to be infected with malaria, of which most of them are female. The diagrammatic representation of Table 4 is as shown in Fig. 4. It was observed that in Fig. 5, the chart of the predicted probabilities gave a Cut Value/threshold of 0.5 and the goodness of fit test was carried out using Hosmer and Lemeshow Test.
Table 4

Cross tabulation for gender and Malaria of patients.

Sex * Severe Malaria Cross tabulation
Count
Severe Malaria
Total
No MalariaMalaria
Sex
 Male10354157
 Female11862180
Total221116337
Fig. 4

Multiple Bar Chart showing the distribution of gender and Malaria.

Fig. 5

Diagram of predictive probabilities.

Cross tabulation for gender and Malaria of patients. Multiple Bar Chart showing the distribution of gender and Malaria. Diagram of predictive probabilities.

Experimental design, materials and methods

This article shows the strength of the significant level of the perceived as well as diagnosed malaria symptoms using logistic regression analysis. It equally examined the linear relationship between the malaria predicted binary classes. Research on malaria has been a great concerns to government and world health organizations. According to Ref. [1], there were estimated deaths of 435,000 from malaria globally in 2017, compared with 451,000 estimated deaths in 2016, and 607 000 in 2010. According to researches, several aspect of malaria prediction method has been studied. And different forms of dataset have been used such as malaria cell image dataset and different forms of numerical dataset. Artificial neural networks, Machine learning/Data mining and deep learning methods has been helpful to previous researchers in predicting malaria outbreak/infections in different regions and community all over the world. Some have gone as far as using geospatial based and weather based dataset in predicting malaria which has been a very huge success in previous years and different recommendation have been made [[1], [2], [3], [4], [5], [6], [7], [8], [9]]. Malaria is transmitted exclusively through the bites of Anopheles mosquitoes. The intensity of transmission depends on factors related to the parasite, the vector, the human host, and the environment. Symptoms of malaria include fever, headache, and vomiting, and other listed symptoms in the dataset which usually appear between 10 and 15 days after the mosquito bite. If not treated, malaria, more so falciparum malaria, can quickly become life-threatening by disrupting the blood supply to vital organs [[10], [11], [12], [13], [14]]. Chi-square test of independence can equally be used to analyze the data collected. For instance, a cross-tabulation of gender and Malaria outcome of the patients after been tested can be classified into contingency table as shown in Table 4. In this research however, logistic regression analysis was used to analyze the data set. Table 5 shows the classification table at step 1.
Table 5

Classification Table.

ObservedPredicted
Severe Malaria
Percentage Correct
No MalariaMalaria
Step 1Severe Malaria
 No Malaria2041792.3
 Malaria912521.6
Overall Percentage68.0
Classification Table. Table 6 shows the variables in the equation at Step 1.
Table 6

Variables in the equation.

BS.E.WaldDfSig.Exp(B)
Step 0Constant−0.6450.11531.60610.0000.525
Variables in the equation. Table 7 shows the omnibus tests of model coefficients.
Table 7

Test of model coefficients.

Omnibus Tests of Model Coefficients
Chi-squaredfSig.
Step 1Step29.30117.032
Block29.30117.032
Model29.30117.032
Test of model coefficients. Table 8 shows the model summary using the log-likelihood, Cox & Snell R square and Negelkerke R square.
Table 8

Model summary.

Step−2 Log likelihoodCox & Snell R SquareNagelkerke R Square
1404.614a0.0830.115

Estimation terminated at iteration number 4 because parameter estimates changed by less than .001.

Model summary. Estimation terminated at iteration number 4 because parameter estimates changed by less than .001. Table 9 shows the Hosmer and Lemeshow Test.
Table 9

Hosmer and Lemeshow test.

StepChi-squareDfSig.
15.2668.729
Hosmer and Lemeshow test. Table 10 shows Contingency Table for Hosmer and Lemeshow Test.
Table 10

Contingency Table for Hosmer and Lemeshow test.

Severe Maleria = No Malaria
Severe Maleria = Malaria
Total
ObservedExpectedObservedExpected
Step 113129.92834.07234
22527.46896.53234
32525.87498.12634
42424.318109.68234
52323.0771110.92334
62621.622812.37834
72120.1481313.85234
81718.6591715.34134
91517.2101916.79034
101412.6961718.30431
Contingency Table for Hosmer and Lemeshow test. Table 11 shows the classification table for all the step 1.
Table 11

Variables in the equation.

BS.E.WalddfSig.Exp(B)95% C.I.for EXP(B)
LowerUpper
Step 1aAge.013.0082.3801.1231.013.9971.030
sex (1)−.076.250.0921.761.927.5671.514
fever (1).023.287.0061.9371.023.5831.795
cold (1)−.345.2531.8561.173.708.4311.163
rigor (1)−.182.257.5021.478.833.5031.380
fatigue (1)−.267.2521.1171.290.766.4671.256
headace (1)−.795.2867.7031.006.452.258.792
bitter_tongue (1).187.250.5581.4551.205.7381.967
vomitting (1)−.034.480.0051.944.967.3772.479
diarrhea (1)−.478.2543.5351.060.620.3771.020
Convulsion (1).423.2622.6141.1061.527.9142.549
Anemia (1).033.257.0161.8981.033.6251.710
jundice (1)−.139.261.2851.593.870.5221.450
cocacola_urine (1)−.377.2482.3041.129.686.4221.116
hypoglycemia (1)−.772.3963.8061.051.462.2131.004
prostraction (1).603.3153.6711.0551.828.9863.388
hyperpyrexia (1)−.017.362.0021.962.983.4831.999
Constant−.619.767.6501.420.539

Variable(s) entered on step 1: age, sex, fever, cold, rigor, fatigue, headace, bitter_tongue, vomitting, diarrhea, Convulsion, Anemia, jundice, cocacola_urine, hypoglycemia, prostraction, hyperpyrexia.

Variables in the equation. Variable(s) entered on step 1: age, sex, fever, cold, rigor, fatigue, headace, bitter_tongue, vomitting, diarrhea, Convulsion, Anemia, jundice, cocacola_urine, hypoglycemia, prostraction, hyperpyrexia. Fig. 5 shows the diagram of predictive probabilities.

Specifications Table

SubjectMedicine
Specific subject areaEpidemiological, Public health, Biostatistics
Type of dataTable, Text
How data were acquiredUnprocessed Secondary data collected from Federal polytechnic Ilaro Medical Centre
Data formatRaw and partially analysed
Experimental factorsObservation of different Malaria Symptoms and the result of each patients after been tested for malaria
Experimental featuresComputational Analysis: Histogram, Bar-chart, Logistic regression analysis
Data source locationFederal Polytechnic Ilaro Medical Centre, Ilaro, Ogun State, Nigeria
Data accessibilityAll the data are available in this data article as supplementary materials
Value of the Data

The data on malaria infection could be useful for government and health workers to make decisions that would reduce the risk of malaria infection among the populace.

This work provides a deeper understanding of the prevalence and prognosis of malaria infection.

The data can be useful in malaria infection awareness, management and treatment.

The data could be used as a baseline for comparison in future studies.

The data reveals high significant impacts of prevalent factors such as headache, pain, fever, cold etc. on malaria morbidity

  7 in total

Review 1.  Towards a malaria vaccine for Papua New Guinea.

Authors:  J C Reeder
Journal:  P N G Med J       Date:  2001 Mar-Jun

2.  Impact of Plasmodium falciparum infection on haematological parameters in children living in Western Kenya.

Authors:  Robert N Maina; Douglas Walsh; Charla Gaddy; Gordon Hongo; John Waitumbi; Lucas Otieno; David Jones; Bernhards R Ogutu
Journal:  Malar J       Date:  2010-12-13       Impact factor: 2.979

Review 3.  Malaria in school-age children in Africa: an increasingly important challenge.

Authors:  Joaniter Nankabirwa; Simon J Brooker; Sian E Clarke; Deepika Fernando; Caroline W Gitonga; David Schellenberg; Brian Greenwood
Journal:  Trop Med Int Health       Date:  2014-08-22       Impact factor: 2.622

4.  A Weather-Based Prediction Model of Malaria Prevalence in Amenfi West District, Ghana.

Authors:  Esther Love Darkoh; John Aseidu Larbi; Eric Adjei Lawer
Journal:  Malar Res Treat       Date:  2017-01-31

5.  Using mixed effects logistic regression models for complex survey data on malaria rapid diagnostic test results.

Authors:  Chigozie Louisa J Ugwu; Temesgen T Zewotir
Journal:  Malar J       Date:  2018-12-05       Impact factor: 2.979

6.  Analysis of Haematological Parameters as Predictors of Malaria Infection Using a Logistic Regression Model: A Case Study of a Hospital in the Ashanti Region of Ghana.

Authors:  Ellis Kobina Paintsil; Akoto Yaw Omari-Sasu; Matthew Glover Addo; Maxwell Akwasi Boateng
Journal:  Malar Res Treat       Date:  2019-05-21

7.  A comparison of malaria prevalence, control and management strategies in irrigated and non-irrigated areas in eastern Kenya.

Authors:  John Muthii Muriuki; Philip Kitala; Gerald Muchemi; Ian Njeru; Joan Karanja; Bernard Bett
Journal:  Malar J       Date:  2016-08-11       Impact factor: 2.979

  7 in total
  1 in total

1.  Malaria research supported with open access data.

Authors:  Nicholas A Pullen; Emma Bertran
Journal:  Data Brief       Date:  2022-03-02
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.