Literature DB >> 29900251

Statistical analysis of bank deposits dataset.

Pelumi E Oguntunde1, Hilary I Okagbue1, Patience I Adamu1, Omoleye A Oguntunde2, Sola J Oluwatunde3, Abiodun A Opanuga1.   

Abstract

This article presents the statistical analysis of the deposit activities in each of the account types of a leading bank in Nigeria. The mean effect of these account types on the bank was determined using analysis of variance (ANOVA). Further test which include the Tukey's simultaneous test for differences of means was also conducted.

Entities:  

Keywords:  Account; Bank; Deposit; Security; Statistics; Turkey pairwise comparison

Year:  2018        PMID: 29900251      PMCID: PMC5996490          DOI: 10.1016/j.dib.2018.03.096

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The data is useful in calculating loan to deposit ratio. The data could be used as one of vital tools in assessing bank competitiveness [1]. The data analysis could be helpful in detecting non-performing loans (NPL) in credit management [2]. The data could be helpful in monitoring off balance sheet engagements [3]. The data could be used to monitor compliance to banking decision making and strategy implementation; for example, innovative savings products [4], [5], [6]. The data analysis can be applied to monitor statutory policies and regulation; for example, the effect of monetary policies [7]. The data can be extended to include behavioral attitudes and customer preferences for some types of accounts.

Data

The data in this article involves the amount of money (in Naira) deposited into six different account types available in a leading bank in Nigeria on a particular day in year 2017. It also gives information on the number of people that make deposits into the various account types. The bank used has six different account types which we denote as Account Type 1 (Savings), Account Type 2 (Current), Account Type 3 (Corporate), Account Type 4, Account Type 5 and Account Type 6. Since the data is sensitive and a real life data, we would like to protect the privacy policy of the bank. Descriptive statistics was used to summarize the data and to provide plots for proper visualization and understanding. SPSS version 20 and Minitab version 17 were used for the analyses in this paper. The data set is summarized in Table 1.
Table 1

Summary statistics of the dataset.

Account Types123456
No. of Depositors30156874
Minimum (#)3000130,000180,00012,000800015,000
Maximum (#)130,000850,000700,00070,00080,00080,000
Sum (#)649,0006,663,0002,192,000249,000256,000132,000
Mean (#)21,633.33444,200.00365,333.3331,125.0036,671.4333,000
Summary statistics of the dataset. The information contained in Table 1 shows that more people patronize account type 1 which is savings account than any other account types but the total money deposited in the account is not necessarily the largest. The account type that attracts the highest deposits is account type 2 (current account), though, the number of depositors for this account type is not the highest but on the average, customers deposited the highest amount of money there. This is reasonable because in the real sense, current account holders could either be for personal, businesses, and corporate organizations. A chart that summarizes the whole dataset is presented in Fig. 1.
Fig. 1

The chart representing the amount of deposits and account types.

The chart representing the amount of deposits and account types. The deposit patterns for account types 1–6 are provided in form of histogram in Fig. 2, Fig. 3, Fig. 4, Fig. 5, Fig. 6, Fig. 7 respectively.
Fig. 2

The histogram for Savings Account (Account Type 1).

Fig. 3

The histogram for Current Account (Account Type 2).

Fig. 4

The histogram for Corporate Account (Account Type 3).

Fig. 5

The histogram for Account Type 4.

Fig. 6

The histogram for Account Type 5.

Fig. 7

The histogram for Account Type 6.

The histogram for Savings Account (Account Type 1). The histogram for Current Account (Account Type 2). The histogram for Corporate Account (Account Type 3). The histogram for Account Type 4. The histogram for Account Type 5. The histogram for Account Type 6. Also, the boxplot representing the mean amount deposited in the various account types is displayed in Fig. 8.
Fig. 8

A Boxplot representing the data set.

A Boxplot representing the data set. The impact of the current account is also being identified in the plot provided in Fig. 8. The mean deposit in each account type with their respective 95% Confidence Interval (C.I) is displayed in Table 2.
Table 2

95% confidence interval for the mean.

Account TypeNMeanStandard deviation95% C. I
Account Type 1 (Savings)302163326795(−23413, 66679)
Account Type 2 (Current)15444200230460(380495, 507905)
Account Type 3 (Corporate)6365333200107(264607, 466059)
Account Type 483112520401(−56106, 118356)
Account Type 573657130918(−56683, 129826)
Account Type 643300031401(−90364, 156364)
95% confidence interval for the mean. The 95% confidence interval plot for the mean of the amount deposited in the various account types is displayed in Fig. 9.
Fig. 9

A plot for the 95% C.I for the mean amount of deposits.

A plot for the 95% C.I for the mean amount of deposits.

Experimental design, materials and methods

Analysis of variance has traditionally been used to investigate mean effects of groups of subjects. In this research, a one-way ANOVA is applied. ANOVA and other statistical tools have been applied to the analysis of economic data such as in econometric models, credit management, accounting and audit and many others which are too numerous to enumerate. Furthermore, statistical tools are often combined with other tools for better analysis. Some examples include: macroeconomic volatility generation [8], economic impact of transportation [9], economic impact of professional negotiation [10], Gross Domestic product and exchange rate [11], economic impact of tourism [12], income inequality [13], the effects of expenditure [14], human capital in energy growth [15], quality of life [16], economic impact of portfolio selection [17], economics of refugees and asylum seekers [18], economic recovery [19] and energy needs for economic development [20]. Since we are dealing with a one-way ANOVA, the underlying model is:where is the jth observation in the ith treatment, is the overall mean, is the effect of treatment i, is the error term The specific hypothesis used is: H0: The mean deposits in all the account types are equal Versus H1: The mean deposits are not equal for at least one of the account types However, Minitab version 17 was used for the analysis of variance (ANOVA) and further tests. Also, the level of significance used for all the analyses is 0.05. The result is displayed in Table 3.
Table 3

Analysis of variance (ANOVA) table.

Source of variation (SV)Degree of freedom (df)Sum of square (SS)Mean square (MS)FP-value
Account types52.32688E+124.65377E+1130.510.000
Error649.76204E+1115,253,184,208
Total693.30309E+12
Analysis of variance (ANOVA) table. Decision Rule: Reject H0 if p-value is less or equal to the level of significance. Decision: We reject H0 since p-value (0.000) is less than the level of significance (0.05). Inference: The mean deposits are not equal for at least one of the account types. The ANOVA model is summarized in Table 4.
Table 4

Model summary.

StatisticValue
Pooled standard deviation123,504
R-square70.45%
R-square (Adjusted)68.14%
R-square (predicted)64.24%
Model summary.

Turkey pairwise comparisons

Since H0 was rejected, we are interested in knowing which pair of the means is actually significantly different from each other using Turkey pairwise comparisons. The means are paired, the differences between the means are calculated and the Tukey's simultaneous test for differences of means of the deposits is obtained. The result is displayed in Table 5.
Table 5

Tukey simultaneous tests for differences of means.

Difference between meansDifferenceStandard error95% C.IT-valuep-value
Acct Type 2 - Acct Type 1422,56739,055(307,959, 537,174)10.820.000
Acct Type 3 - Acct Type 1343,70055,233(181,620, 505,780)6.220.000
Acct Type 4 - Acct Type 1949249,144(−134,720, 153,703)0.191.000
Acct Type 5 - Acct Type 114,93851,841(−137,188, 167,064)0.291.000
Acct Type 6 - Acct Type 111,36765,740(−181,547, 204,280)0.171.000
Acct Type 3 - Acct Type 2−78,86759,658(−253,933, 96,199)−1.320.772
Acct Type 4 - Acct Type 2−413,07554,070(−571,742, − 254,408)−7.640.000
Acct Type 5 - Acct Type 2−407,62956,532(−573,522, − 241,735)−7.210.000
Acct Type 6 - Acct Type 2−411,20069,499(−615,146, − 207,254)−5.920.000
Acct Type 4 - Acct Type 3−334,20866,700(−529,938, − 138,479)−5.010.000
Acct Type 5 - Acct Type 3−328,76268,711(−530,394, − 127,129)−4.780.000
Acct Type 6 - Acct Type 3−332,33379,721(−566,275, − 98,392)−4.170.001
Acct Type 5 - Acct Type 4544663,919(−182,124, 193,017)0.091.000
Acct Type 6 - Acct Type 4187575,630(−220,062, 223,812)0.021.000
Acct Type 6 - Acct Type 5−357177,410(−230,731, 223,588)− 0.051.000
Tukey simultaneous tests for differences of means. The pairs with p-value that is less than 0.05 are significantly different from each other. For us to have a clearer picture, the result is summarized in Table 6.
Table 6

Grouping by Turkey's method.

Account TypeNMeanGrouping
Account Type 2 (Current)15444,200A
Account Type 3 (Corporate)6365,333A
Account Type 5736,571B
Account Type 6433,000B
Account Type 4831,125B
Account Type 1 (Savings)30216,33B
Grouping by Turkey's method. Remark: The means that do not share the same letter are significantly different from each other The residuals are represented in form of histogram and are displayed in Fig. 10.
Fig. 10

Plot for the residuals.

Plot for the residuals. The normal probability plot for the residuals is displayed in Fig. 11.
Fig. 11

The normal probability plot for the residuals.

The normal probability plot for the residuals.

Key information from the results

The mean effect of current account and corporate account on the bank are the same. The mean effect of Savings account, account types 4, 5 and 6 on the bank are the same. Current account and corporate account attract more deposits than the other account types. ANOVA has been applied to different research works which yielded some interesting results similar to this research [21], [22], [23], [24], [25].
Subject areaEconomics
More specific subject areaBanking and Finance, Social Statistics
Type of dataTable and text file
How data was acquiredSecondary data
Data formatRaw and partially analyzed (Descriptive and Inferential)
Experimental factorsData sets on the amount of money deposited in a bank in different account types
Experimental featuresObservations on the number of customers that made deposit into the six various accounts of the bank and the amount they deposited.
Data source locationThe data was obtained from one of the leading banks in Nigeria
Data accessibilityAll the data are available this data article
  16 in total

1.  Dataset for an analysis of tourism and economic growth: A study of Sri Lanka.

Authors:  Ronald Ravinesh Kumar; Peter Josef Stauvermann
Journal:  Data Brief       Date:  2016-07-06

2.  The data of GDP and exchange rate used in the Balassa-Samuelson hypothesis.

Authors:  Weiguo Wang; Jing Xue; Chonghua Du
Journal:  Data Brief       Date:  2016-10-03

3.  Data on some socio-economic parameters explaining the movement of extra-EU asylum seekers in Europe.

Authors:  Silvia Angeloni
Journal:  Data Brief       Date:  2016-11-09

4.  Random number datasets generated from statistical analysis of randomly sampled GSM recharge cards.

Authors:  Hilary I Okagbue; Abiodun A Opanuga; Pelumi E Oguntunde; Paulinus O Ugwoke
Journal:  Data Brief       Date:  2016-12-09

5.  Measuring resilience to financial instability: A new dataset.

Authors:  Domenico Lombardi; Pierre Siklos
Journal:  Data Brief       Date:  2016-11-09

6.  Macroeconomic dataset for generating macroeconomic volatility among selected countries in the Asia Pacific region.

Authors:  Yee Peng Chow; Junaina Muhammad; Bany Ariffin Amin Noordin; Fan Fah Cheng
Journal:  Data Brief       Date:  2017-11-08

7.  Learning analytics for smart campus: Data on academic performances of engineering undergraduates in Nigerian private university.

Authors:  Segun I Popoola; Aderemi A Atayero; Joke A Badejo; Temitope M John; Jonathan A Odukoya; David O Omole
Journal:  Data Brief       Date:  2018-01-03

8.  Dataset for analysing the relationships among economic growth, fossil fuel and non-fossil fuel consumption.

Authors:  John Asafu-Adjaye; Dominic Byrne; Maximiliano Alvarez
Journal:  Data Brief       Date:  2016-11-26

9.  Survey data on factors affecting negotiation of professional fees between Estate Valuers and their clients when the mortgage is financed by bank loan: A case study of mortgage valuations in Ikeja, Lagos State, Nigeria.

Authors:  Chukwuemeka O Iroham; Hilary I Okagbue; Olalekan A Ogunkoya; James D Owolabi
Journal:  Data Brief       Date:  2017-05-01

10.  Data on the key performance indicators for quality of service of GSM networks in Nigeria.

Authors:  Segun I Popoola; Aderemi A Atayero; Nasir Faruk; Joke A Badejo
Journal:  Data Brief       Date:  2017-12-14
View more
  3 in total

1.  Survey dataset on analysis of queues in some selected banks in Ogun State, Nigeria.

Authors:  Sheila A Bishop; Hilary I Okagbue; Pelumi E Oguntunde; Abiodun A Opanuga; Oluwole A Odetunmibi
Journal:  Data Brief       Date:  2018-05-24

2.  Datasets on factors influencing trading on pedestrian bridges along Ikorodu road, Lagos, Nigeria.

Authors:  Olabisi O Ajakaiye; Hammed A Afolabi; Adedotun O Akinola; Hilary I Okagbue; Omoniyi O Olagunju; Olufumilayo O Adetoro
Journal:  Data Brief       Date:  2018-06-22

3.  Datasets on factors influencing the urban environmental quality of intra-urban motor parks across density areas of Lagos metropolis.

Authors:  Olayinka O Agunloye; Olabisi O Ajakaiye; Adedotun O Akinola; Hilary I Okagbue; Adedeji O Afolabi
Journal:  Data Brief       Date:  2018-07-03
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.