Further Applied Mathematics, 1 (2) (2021): 1-10 www.furthersci.com
ON FORECASTING MALARIA INFECTION USING AUTOREGRESSIVE INTEGRATED MOVING AVERAGE (ARIMA) MODELS.
AKINTUNDE MUTAIRU OYEWALE
Department of Statistics, Federal Polytechnic, Ede, Osun State
Received: 11 April 2021; Accepted: 6 may 2021; Published: 12 May 2021
Abstract
The study examines the forecast performance of ARIMA model with reference to monthly malaria data obtained from the record of federal Polytechnic, Ede Osun state from 2015 to 2019. The stationarity conditions of the series were examined using Augmented Dickey-Fuller test. At first difference the data become stationary, indicating that the malaria infection data is integrated of order 1. Twelve models were fitted which make up of six different specifications from each class of models. Akaike and Bayesian information criteria were used as benchmark to select optimal models. The optimal models comprise of from the group models and from the group models. ARIMA (1,1,3) and ARIMA (3,1,1) were compared using performance measure indices (RMSE, MSE, MAPE, MAE, and Theil U-inequality), with ARIMA (3,1,1) having better in-sample and out-of-sample forecast measures.
Keywords: models, in and out of sample, forecast accuracy, correlogram, unit root, ,
1. Introduction
The need to project into the future is critical in all human endeavors, because a combination of a very excellent record of yesterday and today will reveal what the future will look like. Forecasting is the science of seeing into the future. Any model(s) that can't project into the future won't be effective for policymaking, and thus won't survive the test of time. When forecasting future situations, time series models remain an invaluable asset that could project into the future. Autoregressive integrated moving average model was applied by [7] for the determination of the seasonal variation of demand by making use of historical data and verified these models by checking its forecast performance. In enhancing forecast accuracy [10] combined seasonal factors in their model and calculated the seasonal factors using multiplicative model. [11] elongated [10] research utilized different connections between trend and seasonality within the context of seasonal ARIMA hypothesis. [3] explained Forecasting as an operation of making guess(es) into the future values of variables under study. ARIMA model is a classical or traditional model with so many applications in stock market data [12]. The approaches of statistical and artificial intelligence techniques are used to generate this model [12]; [2]. ARIMA model are applied in so many fields of human endeavour for example ARIMA models was applied by [14] to demonstrate time correlation and probability distribution to decide wind-pace time collection information problem. ARIMA model architecture was applied by [13] to forecast electricity price. [8] devised ARIMA model in the software program reliability. Several researchers have improved forecasting models prominent among models so improved are exponential method, regression model, GARCH model and many others. However, in the present study only related works that made use of ARIMA model in forecasting stock market data are cited, for more details one may consult [9]; [4]; [5]; [6]; [12].
In this paper the author used ARIMA model for forecasting malaria infection data, from which some results of forecasting were obtained. The results obtained from analysed data pointed to the potential asset of ARIMA model in offering potential academia, policy analyst and other would-be users the forecasting model that could assist in decision making.
ARIMA model is divided into three components: The autoregressive component ( ), the differencing component ( ), and the moving average component ( ). The objective of this study is to forecast malaria infection at federal Polytechnic, Ede. Monthly data were obtained from the health record of the institution for the period 2015-2019.
The remaining part of the paper is organized as follow section 2 covers mathematical specification of ARIMA model and stationarity test. Section 3 Empirical analysis with malaria data, illustration of stationarity test, results and discussion. Parameter estimation of ARIMA model, within and out of sample forecast performance of the model fitted. Section 4 and 5 are Conclusion and References.
MATHEMATICAL SPECIFICATION OF THE MODEL USED
ARIMA model in time series analysis is an extension of an autoregressive moving average model by introducing integration order. ARIMA models are fitted to time series data either to enhance the data performance or to forecast the future point. They are applied in cases when series are chaotic, volatile and not stable. The model is commonly referred to as the , where parameter p d, and q are positive integers that refer to the order of the autoregressive, integrated and moving average parts of the model respectively. ARIMA Model’s form is a very important part of the Box-Jenkins approach to time series modeling. The Autoregressive Integrated Moving Average Process is represented using the model
Where and
If and
then the general model
may be written as
2.2 STATIONARITY TEST
Augmented Dickey Fuller test was employed to examine the stationary condition of the series used. This is defined as
Where is a constant, is the coefficient on a time trend and is the lag order of the autoregressive process.
The following is the hypothesis of the Augmented Dickey-Fuller test:
: The data is stationary and does not need to be differentiated
: The data is non-stationary and must be differentiated to achieve stationarity.
3,0 EMPIRICAL ANALYSIS WITH MALARIA DATA
The study relied on monthly malaria data obtained from the records of the Federal Polytechnic in Ede, Osun state, from 2015 to 2019.
3.1 STATIONARITY TEST
Tables 1 and 2 show the results of the series' stationarity tests (Malaria infection). We reject the null hypothesis that the data is stationary and does not require differentiation at any level because both AIC and BIC are less than the critical value. However, at the first difference, an alternative hypothesis was accepted, implying that the series is non-stationary and should be differentiated as such.
TABLE 1: STATIONARITY TEST AT LEVEL
UNIT ROOT
|
Lag selection criteria
AIC BIC
Lag =2 Lag = 3
|
ADF test Statistic
|
1.434445 1. 42317
|
Critical value
|
-1.367845 -2.332747
|
Decision
|
Reject Reject
|
*MacKinnon
|
0.00000 0.00000
|
TABLE 2: STATIONARITY TEST AT FIRST DIFFERENCE
UNIT ROOT
|
Lag selection citeria
AIC BIC
Lag =3 Lag = 4
|
ADF test Statistic
|
-2.01375 -2.00231
|
Critical value
|
--2.867859 -2.62318
|
Decision
|
Accept
|
*MacKinnon
|
0.00000 0.00000
|
3.2 RESULTS AND DISCUSSION
After determining the series' stability at the first difference, we proceeded to analyze the series. Six different and six different models were fitted. The optimal models are from the class of models and from the class of models, as shown below.:
TABLE 3: AIC AND BIC VALUES FOR FITTED
MODEL
|
AIC
|
BIC
|
|
51.7234
|
51.5432
|
|
45.3588
|
44.9872
|
|
42.2314
|
42.0912
|
|
54.9087
|
54.7865
|
|
42.1198
|
41.9876
|
|
47.8731
|
47.6597
|
TABLE 4: AIC AND BIC VALUES FOR FITTED
MODEL
|
AIC
|
BIC
|
|
51.7234
|
51.5432
|
|
67.9621
|
67.5631
|
|
41.7432
|
41.6743
|
|
50.9651
|
50.8712
|
|
46.4732
|
46.3214
|
|
47.8702
|
47.6532
|
3.3 PARAMETER ESTIMATION FOR THE ARIMA MODEL FITTED
and ARIMA are fitted and their parameters are estimated in tables 5 and 6 below.
TABLE 5: MODEL PARAMETER ESTIMATES
MODEL
|
COEFFICIENT
|
STANDARD ERROR
|
|
0.2012
|
0.2265
|
|
-0.4523
|
0.2134
|
|
-0.6532
|
0.2457
|
|
0.7213
|
0.4321
|
|
-0.7654
|
0.3214
|
|
-0.8321
|
0.5213
|
|
0.6314
|
0.7432
|
The mathematical specification of the model fitted is stated below
TABLE 6: MODEL PARAMETER ESTIMATES
MODEL
|
COEFFICIENT
|
STANDARD ERROR
|
|
-0.7231
|
0.3421
|
|
0.7914
|
0.4102
|
|
-0.6751
|
0.1672
|
|
0.9231
|
0.3217
|
|
-0.6534
|
0.5100
|
|
0.8762
|
0.4123
|
|
0.9431
|
0.6241
|
In the same way as given above the mathematical specification of the model fitted is stated below
3.4 WITHIN-SAMPLE FORECAST PERFORMANCE OF AND MODELS
To evaluate the within-sample forecast performance of the models fitted, indices such as RMSE, MSE, MAPE, MAE, and Theil U-inequality were used. The performances of the two models are competitive based on the eight performance indices used, but outperformed ) as shown below:
TABLE 7: IN-SAMPLE FORECAST PERFORMANCE OF AND MODELS
FORECAST MEASURES
|
|
|
Root mean square error
|
32.4321
|
38.7632
|
Mean absolute error
|
19.4321
|
31.3251
|
Mean absolute percent error
|
23.2654
|
25.6531
|
Mean square error
|
1.02345
|
1.02316
|
Theil inequality coefficient
|
0.5321
|
0.5621
|
Bias proportion
|
0.1043
|
0.1057
|
Variance proportion
|
0.0021
|
0.0132
|
Covariance proportion
|
0.7842
|
0.7663
|
3.5 IN-SAMPLE FORECAST PERFORMANCE OF AND MODELS
The same performance measure indices that were used previously were used to evaluate the out-of-sample forecast performance of the models and In this case, eight performance indices were used; the performances of the two models are comparable, but ) outperformed as illustrated in table 8 below:
TABLE 8: OUT- OF SAMPLE FORECAST PERFORMANCE OF AND MODELS
FORECAST MEASURES
|
|
|
Root mean square error
|
30.6512
|
32.4532
|
Mean absolute error
|
23.1254
|
24.7632
|
Mean absolute percent error
|
33.4571
|
35.5533
|
Mean square error
|
1.1254
|
1.2132
|
Theil inequality coefficient
|
0.6432
|
0.7236
|
Bias proportion
|
0.2315
|
0.3567
|
Variance proportion
|
0.0001
|
0.0034
|
Covariance proportion
|
0.9231
|
0.9123
|
CONCLUSION
This paper focuses on modeling and forecasting Malaria infection using ARIMA models. The series' stationarity was examined and found to be stable at first difference using the Augmented Dickey- Fuller test. Following that, we analyzed the data using ) and . These models were chosen based on how well they fit the series under study. The model fitted for in and out of sample forecast performance results show that ) performed better than for the two forecast scenarios (the in and out of samples forecast). It should be noted that the two models are competitive; in the absence of ), we can make do with
Share with your friends: |