Literature DB >> 30101162

Exploration of daily Internet data traffic generated in a smart university campus.

Oluwaseun J Adeyemi1, Segun I Popoola2, Aderemi A Atayero2, David G Afolayan1, Mobolaji Ariyo1, Emmanuel Adetiba1,2,3.   

Abstract

In this data article, a robust data exploration is performed on daily Internet data traffic generated in a smart university campus for a period of twelve consecutive (12) months (January-December, 2017). For each day of the one-year study period, Internet data download traffic and Internet data upload traffic at Covenant University, Nigeria were monitored and properly logged using required application software namely: FreeRADIUS; Radius Manager Web application; and Mikrotik Hotspot Manager. A comprehensive dataset with detailed information is provided as supplementary material to this data article for easy research utility and validation. For each month, descriptive statistics of daily Internet data download traffic and daily Internet data upload traffic are presented in tables. Boxplot representations and time series plots are provided to show the trends of data download and upload traffic volume within the smart campus throughout the 12-month period. Frequency distributions of the dataset are illustrated using histograms. In addition, correlation and regression analyses are performed and the results are presented using a scatter plot. Probability Density Functions (PDFs) and Cumulative Distribution Functions (CDFs) of the dataset are also computed. Furthermore, Analysis of Variance (ANOVA) and multiple post-hoc tests are conducted to understand the statistical difference(s) in the Internet traffic volume, if any, across the 12-month period. The robust data exploration provided in this data article will help Internet Service Providers (ISPs) and network administrators in smart campuses to develop empirical model for optimal Quality of Service (QoS), Internet traffic forecasting, and budgeting.

Entities:  

Keywords:  Internet Protocol; Internet data traffic; Nigerian university; Smart campus; Smart education

Year:  2018        PMID: 30101162      PMCID: PMC6083017          DOI: 10.1016/j.dib.2018.07.039

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data The data provided in this data article can be used to accurately predict Internet data traffic in a smart campus environment. Predictions of Internet data traffic will help network engineers to improve the Quality of Service (QoS) of computer networks and also ensure efficient utilization of the networks in a smart university campus [1], [2]. Availability of dataset on Internet data traffic obtained from real scenarios will facilitate more empirical research in the areas of computer networking and Internet traffic engineering [3], [4]. This dataset is made available to give correct facts and figures on Internet data traffic in a Nigerian university campus that is driven by Information and Communication Technologies (ICTs) [5], [6]. Free access to daily Internet data traffic of a period of one year will facilitate the development of empirical prediction models that can be used by Internet Service Providers (ISPs) and Internet subscribers in a smart university campus for effective network planning and traffic forecasting [7], [8], [9], [10], [11], [12]. Robust data exploration that is performed in this data article will help the university network administrators to gain useful insights about the traffic peak and off-peak periods. Also, the descriptive statistics, frequency and probability distribution plots, correlation analysis, ANOVA test and multiple post-hoc test results will give better understanding of the relationships between the Internet data download traffic and the Internet data upload traffic in a smart campus [13], [14], [15].

Data

Ubiquitous access to reliable Internet services is pivotal to achieving sustainable smart education in university campuses [16], [17], [18]. Accurate Internet data traffic prediction models are required for computer network planning and forecasting to guarantee efficient Quality of Service (QoS) in enterprise computer networks and applications. However, computer network planning are usually carried out based on theoretical formulations and simulations due to paucity of empirical data from real life scenarios. In this data article, a robust data exploration is performed on daily Internet data traffic in a smart university campus for a period of twelve consecutive (12) months (January–December, 2017). For each month, descriptive statistics of daily Internet data download traffic and daily Internet data upload traffic are presented in tables. The mean, median, mode, standard deviation, variance, kurtosis, Skewness, range, minimum, maximum, and sum of the daily Internet data traffic download and upload for January–December, 2017 are presented in Tables 1 and 2 respectively.
Table 1

Descriptive statistics of daily IP data download traffic in Terabytes (TB).

JanFebMarAprMayJunJulAugSepOctNovDec
Mean2.282.302.882.722.412.231.891.203.153.203.172.33
Median2.602.402.902.602.202.201.900.923.253.203.002.40
Mode3.402.002.902.502.002.002.100.823.503.003.002.40
Standard Deviation1.210.660.790.520.820.490.710.800.400.490.690.97
Variance1.460.440.630.270.660.240.510.640.160.240.480.94
Kurtosis1.614.385.123.803.183.675.586.032.631.785.563.96
Skewness−0.47−1.160.48−0.160.950.810.872.00−0.690.110.760.75
Range3.512.744.202.603.302.003.703.301.501.603.804.09
Minimum0.190.361.201.301.201.500.600.502.202.401.400.81
Maximum3.703.105.403.904.503.504.303.803.704.005.204.90
Sum70.8164.3689.4081.6074.8067.0058.4637.2094.4099.1095.0044.21
Table 2

Descriptive statistics of daily IP data upload traffic in Terabytes (TB).

JanFebMarAprMayJunJulAugSepOctNovDec
Mean0.290.430.490.480.360.310.280.190.580.650.640.33
Median0.320.470.470.500.290.310.290.150.590.640.650.28
Mode0.140.070.150.140.140.200.060.060.400.690.230.14
Standard Deviation0.160.140.140.090.180.060.110.140.060.060.120.17
Variance0.020.020.020.010.030.000.010.020.000.000.020.03
Kurtosis1.713.555.438.613.492.452.555.074.472.897.236.52
Skewness−0.24−1.040.19−2.071.190.18−0.071.80−1.160.16−0.391.74
Range0.510.560.730.470.660.250.480.520.270.280.770.72
Minimum0.030.070.150.140.140.200.060.060.400.530.230.14
Maximum0.540.630.880.600.800.440.540.580.670.801.000.86
Sum8.9612.1715.1214.2711.269.238.665.9417.2920.1019.066.24
Descriptive statistics of daily IP data download traffic in Terabytes (TB). Descriptive statistics of daily IP data upload traffic in Terabytes (TB).

Experimental design, materials and methods

A robust data exploration was performed on daily Internet data traffic in a smart university campus for a period of twelve consecutive (12) months (January–December, 2017). For each day of the one-year study period, Internet data download traffic and Internet data upload traffic at Covenant University, Nigeria were monitored and properly logged using an open source software, FreeRADIUS, Radius Manager web application, and Mikrotik Hotspot Manager. FreeRADIUS software was installed in Linux Operating System (OS) for authentication, authorization, and accounting services. Radius Manager Web application was used to add users, to edit and create cards, and to harvest data in a more user-friendly format. Mikrotik Hotspot Manager was used to integrate the smart campus network to the enterprise edge. Statistical computations were done using the Machine Learning and Statistics toolbox in MATLAB 2016a software. Boxplot representations of the daily download traffic and the daily upload traffic for the 12-month period are shown in Figs. 1 and 2 respectively.
Fig. 1

Boxplot representation of daily data download traffic in Terabytes (TB).

Fig. 2

Boxplot representation of daily data upload traffic in Terabytes (TB).

Boxplot representation of daily data download traffic in Terabytes (TB). Boxplot representation of daily data upload traffic in Terabytes (TB).

Data exploration

Time series plots are provided to show the trends of data download and upload volume within the smart campus throughout the 12-month period. Fig. 3, Fig. 4, Fig. 5, Fig. 6 show the trends in data download traffic for the first, second, third, and fourth quarters of year 2017 respectively. Similarly, the patterns of data upload traffic for the first, second, third, and fourth quarters of year 2017 are shown in Fig. 7, Fig. 8, Fig. 9, Fig. 10 respectively. Frequency distributions of the dataset are illustrated using histograms. Fig. 11, Fig. 12, Fig. 13, Fig. 14 show the histogram distributions of the data traffic volume for first, second, third, and fourth quarters of 2017.
Fig. 3

(a)–(c). Download traffic volume in first quarter, 2017.

Fig. 4

(a)–(c). Download traffic volume in second quarter, 2017.

Fig. 5

(a)–(c). Download traffic volume in third quarter, 2017.

Fig. 6

(a)–(c). Download traffic volume in fourth quarter, 2017.

Fig. 7

(a)–(c). Upload traffic volume in first quarter, 2017.

Fig. 8

(a)–(c). Upload traffic volume in second quarter, 2017.

Fig. 9

(a)–(c). Upload traffic volume in third quarter, 2017.

Fig. 10

(a)–(c). Upload traffic volume in fourth quarter, 2017.

Fig. 11

(a)–(f). Frequency distributions of data download and upload traffic in first quarter, 2017.

Fig. 12

(a)–(f). Frequency distributions of data download and upload traffic in second quarter, 2017.

Fig. 13

(a)–(f). Frequency distributions of data download and upload traffic in third quarter, 2017.

Fig. 14

(a)–(f). Frequency distributions of data download and upload traffic in fourth quarter, 2017.

(a)–(c). Download traffic volume in first quarter, 2017. (a)–(c). Download traffic volume in second quarter, 2017. (a)–(c). Download traffic volume in third quarter, 2017. (a)–(c). Download traffic volume in fourth quarter, 2017. (a)–(c). Upload traffic volume in first quarter, 2017. (a)–(c). Upload traffic volume in second quarter, 2017. (a)–(c). Upload traffic volume in third quarter, 2017. (a)–(c). Upload traffic volume in fourth quarter, 2017. (a)–(f). Frequency distributions of data download and upload traffic in first quarter, 2017. (a)–(f). Frequency distributions of data download and upload traffic in second quarter, 2017. (a)–(f). Frequency distributions of data download and upload traffic in third quarter, 2017. (a)–(f). Frequency distributions of data download and upload traffic in fourth quarter, 2017. Correlation and regression analyses are performed to establish a linear relationship the data download traffic and data upload traffic. The relationship yielded a correlation coefficient (R) of 0.8791. A linear regression equation that represent the relationship is provided in the scatter plot shown in Fig. 15. Probability Density Functions (PDFs) and Cumulative Distribution Functions (CDFs) of the dataset are also computed. PDF and CDF models of Normal, Logistic, Non-parametric, Rician, Weibull and Nakagami distributions were used to fit the empirical data as shown in Figs. 16 and 17. The CDF model distribution fittings of the dataset are shown in Figs. 18 and 19. Distribution fitting parameters for download data traffic (January–December, 2017) based on the six distribution models are presented in Table 3. Estimates and standard errors of download data traffic distribution parameters for the six models are presented in Table 4. Similarly, the distribution fitting parameters for upload data traffic (January–December, 2017) based on the six distribution models are presented in Table 5. Estimates and standard errors of download data traffic distribution parameters for the six models are presented in Table 6.
Fig. 15

Scatter plot of data download traffic and data upload traffic.

Fig. 16

Download data traffic distribution fittings using PDF models.

Fig. 17

Upload data traffic distribution fittings using PDF models.

Fig. 18

Download data traffic distribution fittings using CDF models.

Fig. 19

Upload data traffic distribution fittings using CDF models.

Table 3

Distribution fitting parameters for download data traffic (January–December, 2017).

NormalLogisticRicianWeibullNakagami
Log Likelihood−473.562−477.028−472.879−475.457−485.289
Domain−∞<y<∞−∞<y<∞0<y<∞0<y<∞0<y<∞
Mean2.4832.5242.4822.4772.462
Variance0.8590.9190.8580.83130.958
Table 4

Estimates and standard errors of download data traffic distribution parameters.

Normal
Logistic
Rician
Weibull
Nakagami
ParameterApproxStd ErrApproxStd ErrApproxStd ErrApproxStd ErrApproxStd Err
µ2.4830.0492.5240.0492.2490.0612.7760.0521.6800.116
σ0.9270.0350.5280.0230.9910.0432.9580.1277.0200.288
Table 5

Distribution fitting parameters for upload data traffic (January–December, 2017).

NormalLogisticRicianWeibullNakagami
Log Likelihood86.96975.9293.03290.01186.668
Domain−∞<y<∞−∞<y<∞0<y<∞0<y<∞0<y<∞
Mean0.4200.4230.4210.4200.417
Variance0.03590.0410.0350.0350.038
Table 6

Estimates and standard errors of upload data traffic distribution parameters.

Normal
Logistic
Rician
Weibull
Nakagami
ParameterApproxStd ErrApproxStd ErrApproxStd ErrApproxStd ErrApproxStd Err
µ0.4200.0100.4220.0100.3450.0170.4730.0111.2110.082
σ0.1890.0070.1120.0040.2160.0122.3750.1040.2120.010
Scatter plot of data download traffic and data upload traffic. Download data traffic distribution fittings using PDF models. Upload data traffic distribution fittings using PDF models. Download data traffic distribution fittings using CDF models. Upload data traffic distribution fittings using CDF models. Distribution fitting parameters for download data traffic (January–December, 2017). Estimates and standard errors of download data traffic distribution parameters. Distribution fitting parameters for upload data traffic (January–December, 2017). Estimates and standard errors of upload data traffic distribution parameters. Furthermore, Analysis of Variance (ANOVA) and multiple post-hoc tests are conducted to understand the statistical difference(s) in the Internet traffic volume, if any, across the 12-month period. The results of the ANOVA test and the multiple post-hoc test conducted on download data traffic are presented in Tables 7 and 8 respectively. Likewise, the results of the ANOVA test and the multiple post-hoc test conducted on upload data traffic are presented in Tables 9 and 10 respectively. The multiple post-hoc comparison results for download data traffic and upload data traffic are depicted graphically in Figs. 20 and 21.
Table 7

ANOVA test results for download data traffic.

Source of VariationSum of SquaresDegree of FreedomMean SquaresF StatisticProb>F
Columns116.441110.5919.413.16×10−30
Error185.933410.55
Total302.37352
Table 8

Multiple post-hoc test results for download data traffic.

Groups ComparedLower limits for 95% confidence intervalsMean DifferenceUpper limits for 95% confidence intervalsp-value
JanFeb−0.6435−0.01440.61481.0000
JanMar−1.2126−0.59970.01330.0619
JanApr−1.0538−0.43580.18220.4732
JanMay−0.7416−0.12870.48420.9999
JanJun−0.56720.05090.66891.0000
JanJul−0.21450.39841.01130.6053
JanAug0.47131.08421.69710.0000
JanSep−1.4805−0.8625−0.24450.0003
JanOct−1.5255−0.9126−0.29960.0001
JanNov−1.5005−0.8825−0.26450.0002
JanDec−0.7457−0.04260.66041.0000
FebMar−1.2144−0.58530.04380.0971
FebApr−1.0555−0.42140.21270.5702
FebMay−0.7435−0.11430.51481.0000
FebJun−0.56890.06520.69931.0000
FebJul−0.21640.41281.04190.5907
FebAug0.46941.09861.72770.0000
FebSep−1.4822−0.8481−0.21400.0008
FebOct−1.5273−0.8982−0.26910.0002
FebNov−1.5022−0.8681−0.23400.0005
FebDec−0.7455−0.02830.68901.0000
MarApr−0.45410.16390.78190.9994
MarMay−0.14200.47101.08390.3327
MarJun0.03250.65051.26860.0288
MarJul0.38510.99811.61100.0000
MarAug1.07091.68392.29680.0000
MarSep−0.8808−0.26280.35520.9658
MarOct−0.9258−0.31290.30000.8827
MarNov−0.9008−0.28280.33520.9423
MarDec−0.14610.55701.26010.2857
AprMay−0.31090.30710.92510.9007
AprJun−0.13640.48671.10970.3072
AprJul0.21620.83421.45220.0006
AprAug0.90201.52002.13800.0000
AprSep−1.0497−0.42670.19640.5217
AprOct−1.0948−0.47680.14120.3264
AprNov−1.0697−0.44670.17640.4458
AprDec−0.31440.39321.10070.8096
MayJun−0.43840.17960.79760.9986
MayJul−0.08580.52711.14000.1754
MayAug0.60001.21291.82580.0000
MaySep−1.3518−0.7338−0.11570.0059
MayOct−1.3968−0.7839−0.17090.0017
MayNov−1.3718−0.7538−0.13570.0039
MayDec−0.61700.08610.78911.0000
JunJul−0.27050.34750.96550.7972
JunAug0.41531.03331.65140.0000
JunSep−1.5364−0.9133−0.29030.0001
JunOct−1.5815−0.9634−0.34540.0000
JunNov−1.5564−0.9333−0.31030.0001
JunDec−0.8010−0.09350.61401.0000
JulAug0.07290.68581.29870.0136
JulSep−1.8789−1.2609−0.64280.0000
JulOct−1.9239−1.3110−0.69800.0000
JulNov−1.8989−1.2809−0.66280.0000
JulDec−1.1441−0.44100.26200.6587
AugSep−2.5647−1.9467−1.32860.0000
AugOct−2.6097−1.9968−1.38380.0000
AugNov−2.5847−1.9667−1.34860.0000
AugDec−1.8299−1.1268−0.42380.0000
SepOct−0.6681−0.05010.56791.0000
SepNov−0.6431−0.02000.60311.0000
SepDec0.11230.81981.52730.0084
OctNov−0.58790.03010.64811.0000
OctDec0.16690.86991.57300.0031
NovDec0.13230.83981.54730.0059
Table 9

ANOVA test results for upload data traffic.

Source of VariationSum of SquaresDegree of FreedomMean SquaresF StatisticProb > F
Columns7.38110.6743.582.03 × 10−58
Error5.253410.02
Total12.63352
Table 10

Multiple post-hoc test results for upload data traffic.

Groups ComparedLower limits for 95% confidence intervalsMean DifferenceUpper limits for 95% confidence intervalsp-value
JanFeb−0.2513−0.1456−0.03990.0004
JanMar−0.3018−0.1988−0.09580.0000
JanApr−0.2904−0.1866−0.08270.0000
JanMay−0.1771−0.07410.02890.4391
JanJun−0.1224−0.01860.08521.0000
JanJul−0.09330.00970.11261.0000
JanAug−0.00560.09740.20040.0843
JanSep−0.3911−0.2873−0.18350.0000
JanOct−0.4622−0.3593−0.25630.0000
JanNov−0.4499−0.3461−0.24220.0000
JanDec−0.1573−0.03920.07890.9953
FebMar−0.1589−0.05320.05250.8930
FebApr−0.1474−0.04090.06560.9843
FebMay−0.03420.07150.17720.5413
FebJun0.02050.12700.23360.0055
FebJul0.04960.15530.26100.0001
FebAug0.13740.24310.34880.0000
FebSep−0.2482−0.1416−0.03510.0009
FebOct−0.3193−0.2136−0.10790.0000
FebNov−0.3070−0.2004−0.09390.0000
FebDec−0.01410.10640.22690.1456
MarApr−0.09160.01220.11611.0000
MarMay0.02170.12470.22770.0044
MarJun0.07640.18020.28400.0000
MarJul0.10550.20850.31140.0000
MarAug0.19320.29620.39920.0000
MarSep−0.1923−0.08850.01530.1862
MarOct−0.2634−0.1605−0.05750.0000
MarNov−0.2511−0.1473−0.04340.0002
MarDec0.04150.15960.27770.0006
AprMay0.00860.11240.21630.0206
AprJun0.06330.16800.27260.0000
AprJul0.09240.19620.30000.0000
AprAug0.18010.28400.38780.0000
AprSep−0.2054−0.10070.00390.0723
AprOct−0.2765−0.1727−0.06890.0000
AprNov−0.2642−0.1595−0.05480.0000
AprDec0.02850.14730.26620.0030
MayJun−0.04830.05550.15940.8460
MayJul−0.01920.08380.18680.2473
MayAug0.06860.17150.27450.0000
MaySep−0.3170−0.2132−0.10930.0000
MayOct−0.3881−0.2851−0.18220.0000
MayNov−0.3758−0.2719−0.16810.0000
MayDec−0.08320.03490.15300.9983
JunJul−0.07560.02830.13210.9992
JunAug0.01220.11600.21980.0139
JunSep−0.3734−0.2687−0.16400.0000
JunOct−0.4445−0.3407−0.23680.0000
JunNov−0.4321−0.3275−0.22280.0000
JunDec−0.1395−0.02060.09831.0000
JulAug−0.01520.08780.19070.1862
JulSep−0.4008−0.2969−0.19310.0000
JulOct−0.4719−0.3689−0.26590.0000
JulNov−0.4596−0.3557−0.25190.0000
JulDec−0.1670−0.04890.06930.9721
AugSep−0.4885−0.3847−0.28090.0000
AugOct−0.5597−0.4567−0.35370.0000
AugNov−0.5473−0.4435−0.33960.0000
AugDec−0.2548−0.1366−0.01850.0086
SepOct−0.1758−0.07200.03190.5017
SepNov−0.1635−0.05880.04590.7988
SepDec0.12920.24810.36690.0000
OctNov−0.09060.01320.11701.0000
OctDec0.20190.32000.43820.0000
NovDec0.18800.30690.42570.0000
Fig. 20

Graphical representation of multiple post-hoc test result for download data traffic.

Fig. 21

Graphical representation of multiple post-hoc test result for download data traffic.

ANOVA test results for download data traffic. Multiple post-hoc test results for download data traffic. ANOVA test results for upload data traffic. Multiple post-hoc test results for upload data traffic. Graphical representation of multiple post-hoc test result for download data traffic. Graphical representation of multiple post-hoc test result for download data traffic.
Subject areaEngineering
More specific subject areaInformation and Communication Engineering
Type of dataTables, graphs, figures, and spreadsheet file
How data was acquiredFor each day of the one-year study period, Internet data download traffic and Internet data upload traffic at Covenant University, Nigeria were monitored and properly logged using an open source software, FreeRADIUS, Radius Manager web application, and Mikrotik Hotspot Manager.
Data formatRaw, analyzed
Experimental factorsInternet data download traffic and Internet data upload traffic were monitored and logged for only nineteen (19) days in December, 2017 because the university proceeded to end-of-year break afterward.
Experimental featuresDescriptive statistics, boxplot representations, time series plots, frequency distributions, correlation and regression analyses, Probability Density Functions (PDFs), Cumulative Distribution Functions (CDFs), Analysis of Variance (ANOVA) test, and multiple post-hoc test are performed to explore the dataset provided in this data article. All statistical computations were done using the Machine Learning and Statistics toolbox in MATLAB 2016a software.
Data source locationThe dataset on Internet data traffic provided in this article were collected at Covenant University, Canaanland, Ota, Nigeria (Latitude 6.6718° N, Longitude 3.1581° E)
Data accessibilityA comprehensive dataset is provided in Microsoft Excel spreadsheet file and attached assupplementary materialto this data article for easy research utility and validation
  4 in total

1.  Received signal strength and local terrain profile data for radio network planning and optimization at GSM frequency bands.

Authors:  Segun I Popoola; Aderemi A Atayero; Nasir Faruk
Journal:  Data Brief       Date:  2017-12-19

2.  Smart campus: Data on energy consumption in an ICT-driven university.

Authors:  Segun I Popoola; Aderemi A Atayero; Theresa T Okanlawon; Benson I Omopariola; Olusegun A Takpor
Journal:  Data Brief       Date:  2017-12-07

3.  Learning analytics for smart campus: Data on academic performances of engineering undergraduates in Nigerian private university.

Authors:  Segun I Popoola; Aderemi A Atayero; Joke A Badejo; Temitope M John; Jonathan A Odukoya; David O Omole
Journal:  Data Brief       Date:  2018-01-03

4.  Data on the key performance indicators for quality of service of GSM networks in Nigeria.

Authors:  Segun I Popoola; Aderemi A Atayero; Nasir Faruk; Joke A Badejo
Journal:  Data Brief       Date:  2017-12-14
  4 in total
  1 in total

1.  A Research on the Realization Algorithm of Internet of Things Function for Smart Education.

Authors:  Kuiliang Fu
Journal:  Comput Intell Neurosci       Date:  2022-04-29
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.