An Econometric Time Series GDP Model Analysis: Statistical Evidences and Investigations

This article analyzes time series data on the gross domestic product (GDP) of the Sudan. An econometric time series model with macroeconomic variables is constructed. Since a non-stationary time series must be made stationary before modeling, statistical tests are applied to guide the transformation of the series. After these steps, the series became stationary and integrated of order 1. The Box-Jenkins procedure is used to determine the ARMA orders, and OLS is used to estimate the model parameters. The performance of the chosen ARIMA model is verified on the basis of classical statistical tests and forecasting. The model features are interpreted on the basis of standard measures of forecasting performance.


Introduction
As a measure of the performance of an economy, gross domestic product (GDP) is the value of all final goods and services produced within a country in a year. GDP data are widely used in the field of time series modeling and analysis, and they meet a wide variety of requirements in industry, finance, research institutions, and other fields. Forecasting with economic models is an essential component of a country's economic decision-making process, and GDP forecasts are necessary for policy makers. For these reasons, this paper investigates the performance of a GDP model for the Sudan. It aims at analyzing a time series econometric model of the macroeconomic variable GDP in the country. Time series models and analysis have been discussed in [1] [2] [3] [4] [5] and many others. Time series analysis aims at identifying data patterns and trends as well as explaining the data through modeling and forecasting. Two principal approaches to time series analysis are adopted, depending on whether the analysis is carried out in the time domain or the frequency domain. Several procedures are used to analyze data within these domains. A useful common technique is the Box-Jenkins ARIMA method [1], which can be used for univariate or multivariate data set analyses. The ARIMA technique uses moving averages (MA), smoothing, and regression methods to detect and remove autocorrelation in the data. Tools of time series analysis have been discussed intensively in [6].
Several statistical tests are used in time series modeling to establish that a series is stationary and to determine its order of integration; the Box-Jenkins procedure is then used to determine the ARMA orders, and the OLS method is used to estimate the model parameters.
In the following sections, the techniques that are useful for the analysis are identified. This paper is organized as follows: Section 2 is devoted to the proposed model of the study. Background on data collection and methodology is presented in Section 3, while Section 4 is devoted to data analysis and results, which are discussed in Section 5. A brief conclusion is given in Section 6.

The Proposed Model
The methodology of time series analysis consists of two steps: constructing a data model for the time series, and forecasting future values.
For a regular time series pattern, the value of the series, $Y_t$, should be a function of previous values. If Y is the target value that we are trying to model and predict, and $Y_t$ is the value of Y at time t, then the goal is to build a model of the type:

$$Y_t = f(Y_{t-1}, Y_{t-2}, \ldots, Y_{t-n}) + e_t$$

where $Y_{t-1}$ is the previous observation of Y, $Y_{t-2}$ is the value two observations ago, etc., and $e_t$ (a random shock) represents noise that does not follow a predictable pattern. Values of a variable occurring prior to the current observation are called lag values. In a time series with a repeating pattern, the value of $Y_t$ is usually highly correlated with $Y_{t-\text{cycle}}$. Thus, the goal of constructing a time series model is to build a model such that the error between the predicted value of the target variable and the actual value is as small as possible.
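To make the notion of lag values concrete, the following minimal sketch arranges a series into (lagged inputs, target) pairs of the kind a model of this type would be fitted on. The function name `make_lagged_pairs` and the toy numbers are hypothetical and are not taken from the GDP data.

```python
def make_lagged_pairs(series, n_lags):
    """Return (lags, target) pairs, where lags = [Y_{t-1}, ..., Y_{t-n_lags}]."""
    if n_lags < 1:
        raise ValueError("n_lags must be at least 1")
    pairs = []
    for t in range(n_lags, len(series)):
        # Most recent observation first: Y_{t-1}, then Y_{t-2}, and so on.
        lags = [series[t - k] for k in range(1, n_lags + 1)]
        pairs.append((lags, series[t]))
    return pairs


# Toy series (illustrative values only, not the Sudan GDP data).
toy = [10.0, 12.0, 11.0, 13.0, 15.0]
pairs = make_lagged_pairs(toy, n_lags=2)
```

Each pair holds the two most recent lag values and the observation they are meant to predict; the first `n_lags` observations have no complete set of lags and are therefore skipped.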
Consider a time series of data $X_t$. The ARMA model consists of two parts, an autoregressive (AR) part and a moving average (MA) part. Following [7], the AR(p) model is written in the form:

$$X_t = c + \sum_{i=1}^{p} \varphi_i X_{t-i} + \varepsilon_t$$

where $\varphi_1, \ldots, \varphi_p$ are the model parameters, $c$ is a constant (which may be omitted for simplicity) and $\varepsilon_t$ is an error term. The notation MA(q) stands for the moving average model of order q:

$$X_t = \varepsilon_t + \sum_{i=1}^{q} \theta_i \varepsilon_{t-i}$$

where $\theta_1, \ldots, \theta_q$ are the parameters of the model and $\varepsilon_t, \varepsilon_{t-1}, \ldots$ are the error terms.
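The text states that OLS is used to estimate the model parameters but does not show the computation. As a hedged illustration for the simplest case only, the sketch below estimates the constant and the single coefficient of an AR(1) model by regressing each observation on its predecessor. The function name `fit_ar1_ols` and the noise-free toy series are assumptions made for this example.

```python
def fit_ar1_ols(series):
    """Estimate c and phi in X_t = c + phi * X_{t-1} + eps_t by least squares."""
    x_prev = series[:-1]  # regressor: X_{t-1}
    x_curr = series[1:]   # response:  X_t
    n = len(x_curr)
    if n < 2:
        raise ValueError("need at least three observations")
    mean_prev = sum(x_prev) / n
    mean_curr = sum(x_curr) / n
    sxx = sum((a - mean_prev) ** 2 for a in x_prev)
    sxy = sum((a - mean_prev) * (b - mean_curr) for a, b in zip(x_prev, x_curr))
    phi = sxy / sxx                      # slope of the regression
    c = mean_curr - phi * mean_prev      # intercept of the regression
    return c, phi


# Noise-free toy series generated from X_t = 2 + 0.5 * X_{t-1} (hypothetical).
toy = [0.0]
for _ in range(6):
    toy.append(2.0 + 0.5 * toy[-1])

c_hat, phi_hat = fit_ar1_ols(toy)
```

Because the toy series contains no noise, the regression recovers the generating constant and coefficient up to floating-point error; with real data the estimates would carry sampling error.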
The notation ARMA(p, q) refers to the model with p autoregressive terms and q moving average terms. This model contains the AR(p) and MA(q) models:

$$X_t = c + \varepsilon_t + \sum_{i=1}^{p} \varphi_i X_{t-i} + \sum_{i=1}^{q} \theta_i \varepsilon_{t-i}$$

where the error terms $\varepsilon_t$ are assumed to be independent, identically distributed random variables with mean zero, $\varepsilon_t \sim N(0, \sigma^2)$, where $\sigma^2$ is the variance.
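A short simulation can help show how the autoregressive and moving average terms combine. The sketch below generates an ARMA(p, q) series from a list of shocks; the function name `simulate_arma`, the coefficient values, and the choice to treat pre-sample values as zero are all assumptions made for illustration.

```python
import random


def simulate_arma(phi, theta, shocks, c=0.0):
    """Generate X_t = c + sum_i phi_i * X_{t-i} + eps_t + sum_j theta_j * eps_{t-j}.

    Values of X and eps before the start of the sample are treated as zero.
    """
    x = []
    for t, eps in enumerate(shocks):
        ar_part = sum(coef * x[t - i]
                      for i, coef in enumerate(phi, start=1) if t - i >= 0)
        ma_part = sum(coef * shocks[t - j]
                      for j, coef in enumerate(theta, start=1) if t - j >= 0)
        x.append(c + ar_part + eps + ma_part)
    return x


# Gaussian shocks with mean zero and unit variance (seeded for repeatability).
rng = random.Random(0)
shocks = [rng.gauss(0.0, 1.0) for _ in range(200)]
series = simulate_arma(phi=[0.6], theta=[0.3], shocks=shocks)

# A single unit shock makes the recursion easy to follow by hand.
impulse = simulate_arma(phi=[0.5], theta=[0.5], shocks=[1.0, 0.0, 0.0])
```

Setting `theta=[]` gives a pure AR(p) series and `phi=[]` a pure MA(q) series, which mirrors the statement that the ARMA model contains both.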
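The process $(Y_t)$ is said to be ARIMA(p, d, q) if:

$$\varphi(B)(1 - B)^d Y_t = \theta(B)\varepsilon_t$$

where $B$ is the backshift operator ($B Y_t = Y_{t-1}$), $\varphi(B) = 1 - \varphi_1 B - \cdots - \varphi_p B^p$ and $\theta(B) = 1 + \theta_1 B + \cdots + \theta_q B^q$. The process $(Y_t)$ is stationary if and only if d = 0, in which case it reduces to an ARMA(p, q) process:

$$\varphi(B) Y_t = \theta(B)\varepsilon_t$$

The Box-Jenkins methodology [1] is a five-step technique for identifying, selecting, and assessing models for time series data.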

Data and Methodology of Collection
GDP is equal to the total expenditure on all final goods and services produced within the country in a stipulated period of time. It is the sum of gross value added by all resident producers in the economy plus any product taxes and minus any subsidies not included in the value of the products [8].
The except for the year 2000 [9].
The resulting high external and internal deficits, coupled with the sustained American sanctions and the security concerns in the country, affected the economic situation. This led to measures to supplement the budget, including the devaluation of the currency by 29% and the removal of fuel subsidies worth SDG 3.6 billion (Sudanese pounds), about 1.2% of GDP, which resulted in riots. Economic linkages and value addition were weakened during the period of oil-driven growth (1999-2011), mainly in agriculture (which provided 47.6% of total jobs in 2011). The major field of government expenditure might be the security services, though no official figures were published.
Also, the high taxes along the supply chains, the recent increase in tariffs on imported inputs, and the high costs of energy and infrastructure services raised domestic resource costs and reduced domestic value addition.

Data Analysis
Time series are analyzed in order to understand the underlying structure and mechanism of the function that produces the observations. In this section, the GDP statistics of Sudan, which include the current and constant prices in million US$ for the period 1960-2015, are investigated.
Figures 1-3 show line graphs of GDP levels in the period under consideration. Overall, the line graphs show a clear dominance of a long-term upward trend, suggesting a non-stationary time series in levels. In this analysis of the GDP data, a summary of the descriptive statistics for GDP is given in Table 1, and the one-way ANOVA summary for the same model is shown in Table 2.

GDP-Model_1
observations eliminated. Autocorrelation and partial autocorrelation tables are also provided for the residuals (errors) between the actual and predicted values of the time series. The proportion of variance explained by the model is the best single measure of how well the predicted values match the original values. If the predicted values exactly matched the original values, the model would explain 100% of the variance. In practice this is not always the case: here the model explains 61.6% of the variance, according to the R-square value, as seen in Table 3, Table 6 and Table 7.
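The proportion of variance explained can be computed directly from the actual and predicted values. The sketch below is a minimal illustration with made-up numbers; the function name `r_squared` is hypothetical and the values are not those of the GDP model.

```python
def r_squared(actual, predicted):
    """Proportion of variance explained: 1 - SS_residual / SS_total."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Toy values (illustrative only).
actual = [2.0, 4.0, 6.0, 8.0]
perfect = r_squared(actual, actual)        # predictions match exactly
mean_only = r_squared(actual, [5.0] * 4)   # always predicting the mean
```

An exact match gives a value of 1 (100% of the variance), while a model that only ever predicts the mean explains none of it; a reported value of 61.6% lies between these two extremes.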
Examining the autocorrelation table shown in Table 11, we see that the highest autocorrelation is −0.313, which occurs at a lag of 15. Hence we want to be sure to include lag values up to 15 when building the model.
The autocorrelation (ACF) tables (Table 11 and Figure 6) and partial autocorrelation (PACF) tables (Table 12 and Figure 7) provide valuable information about the significance of the lag variables. An autocorrelation is the correlation between the target variable (GDP) and lag values of the same variable. Correlation values range from −1 to +1. A value of +1 indicates that the two variables move together perfectly; a value of −1 indicates that they move in opposite directions (see the results of Table 12). The second column of the autocorrelation table shows the standard error of the autocorrelation; this is followed by the t-statistic in the third column. The right side of the autocorrelation table is a bar chart with asterisks used to indicate positive or negative correlations to the right or left of the centerline. The dots shown in the chart mark the points two standard deviations from zero. If the autocorrelation bar is longer than the dot marker (that is, it covers it), then the autocorrelation should be considered significant. In this model, significant autocorrelations occurred for all lags except for lag 15. Based on the assumption that the series are not cross correlated and that one of the series is white noise, the cross correlations for lags from −7 to +7 are displayed in Table 13 and Figure 8. The figure shows the confidence limits to be all above zero for GDP. Thus, if we rely on this information, we may conclude that we have a good fit.
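The columns of such an autocorrelation table can be reproduced in a few lines. The sketch below computes the sample autocorrelation, a standard error, the t-statistic, and the two-standard-error significance check for each lag. It assumes the simple large-sample approximation 1/sqrt(n) for the standard error, which may differ from the formula used by the software behind Table 11; the function names and the toy series are hypothetical.

```python
import math


def autocorrelation(series, lag):
    """Sample autocorrelation of the series with itself shifted by `lag`."""
    n = len(series)
    mean = sum(series) / n
    denom = sum((y - mean) ** 2 for y in series)
    num = sum((series[t] - mean) * (series[t - lag] - mean)
              for t in range(lag, n))
    return num / denom


def acf_table(series, max_lag):
    """One row per lag: autocorrelation, standard error, t-statistic, flag."""
    n = len(series)
    se = 1.0 / math.sqrt(n)  # assumed white-noise approximation
    rows = []
    for lag in range(1, max_lag + 1):
        r = autocorrelation(series, lag)
        rows.append({
            "lag": lag,
            "acf": r,
            "se": se,
            "t": r / se,
            # Significant when the bar extends past two standard errors.
            "significant": abs(r) > 2 * se,
        })
    return rows


# Toy series (illustrative only, not the GDP data).
rows = acf_table([1.0, 2.0, 3.0, 4.0, 5.0], max_lag=2)
```

The `significant` flag corresponds to the rule described above: an autocorrelation bar that reaches beyond the two-standard-deviation marker is treated as significant.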

GDP in billion US$ with year
From Table 5, the model can be written in the following form: , or GDP 9.314 0.785 , with standard error 0.120 for (for the year 2020). The annual growth rates are estimated to be about 5.3%, 5.36%, 5.52% and 5.67% for the above years, respectively.

Discussion
We evaluated an autoregressive integrated moving average (ARIMA) model of the GDP series using the Box-Jenkins methodology, together with four different equations: linear, logarithmic, quadratic and exponential. We also successively eliminated the AR or the MA term while leaving the other term in, but still obtained higher values for all test parameters. Based on the parameter values, we found that ARIMA(0, 0, 0) is the best model for the data. Compared with the other models, the ARIMA model was selected as the final model. We provide a method for prediction and forecasting based on the data, which may be applicable and useful to government and business institutions. Sudan GDP annual growth rate forecasts, projected using an ARIMA model, are 4.9 for 2017 and 4.9 for 2020, using analysts' expectations. We model the past behavior of the Sudan GDP annual growth rate using historical data and adjust the coefficients of the econometric model by taking into account analysts' assessments and future expectations. Time series are complex because each observation is somewhat dependent upon the previous observation, and is often influenced by more than one previous observation. Random error also carries over from one observation to another. These influences are called autocorrelation, that is, dependent relationships between successive observations of the same variable. The challenge of time series analysis is to extract the autocorrelation elements of the data, either to understand the trend itself or to model the underlying mechanisms.
A word of caution about using multiple regression techniques with time series data: because of the autocorrelated nature of time series, such data violate the assumption of independence of errors. Type I error rates increase substantially when autocorrelation is present. Also, inherent patterns in the data may dampen or enhance the effect of an intervention; in time series analysis, these patterns are accounted for within the analysis.
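One common way to check regression residuals for first-order autocorrelation is the Durbin-Watson statistic. It is not part of the analysis reported here; the sketch below is an illustration only, and the function name `durbin_watson` and the toy residuals are hypothetical.

```python
def durbin_watson(residuals):
    """Sum of squared successive differences over the sum of squared residuals.

    The statistic ranges from 0 to 4.
    """
    num = sum((residuals[t] - residuals[t - 1]) ** 2
              for t in range(1, len(residuals)))
    den = sum(e ** 2 for e in residuals)
    return num / den


# Toy residuals (hypothetical): alternating signs mimic negative
# autocorrelation, a constant run mimics strong positive autocorrelation.
dw_alternating = durbin_watson([1.0, -1.0, 1.0, -1.0])
dw_constant = durbin_watson([1.0, 1.0, 1.0, 1.0])
```

Values near 2 suggest little first-order autocorrelation in the residuals, values toward 0 suggest positive autocorrelation, and values toward 4 suggest negative autocorrelation.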

Conclusion
This article has discussed the analysis of GDP statistics of the Sudan. The ARIMA method used here might be appropriate only for a time series that is stationary (i.e., its mean, variance, and autocorrelation should be approximately constant through time), and it is recommended that there be at least 50 observations in the input data (the underlying model has 55 observations). It is also assumed that the values of the estimated parameters are constant throughout the series. The article has discussed changes in GDP for the period 1960-2015.
The results of the analysis indicate that the model provides useful information for identifying the GDP trend. An important policy consideration arising from the study is that the model shows an increasing trend in the data. More advanced future work can build on these investigations, particularly in the residual analysis of the model.
Based on the forecasting model results, the forecasted values for Sudan GDP (in billion US$) are 99.51 (for the year 2017), 101 (2018), 106.58 (2019) and 112.62 (2020).

In ARIMA notation, the autoregressive component (AR) is designated as p, the integrated component (I) as d, and the moving average component (MA) as q. The AR component represents the effects of previous data observations. The I component represents trends, including seasonality. The MA component represents the effects of previous random shocks (or errors). To fit an ARIMA model to a time series, the order of each model component must be selected. Usually a small integer value (0, 1, or 2) is determined for each component.
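The role of the integrated component d can be illustrated by differencing, which is the operation that removes a trend before the AR and MA parts are fitted. The sketch below applies first differencing d times; the function name `difference` and the toy series are assumptions made for this example.

```python
def difference(series, d=1):
    """Apply first differencing d times: each pass maps Y_t to Y_t - Y_{t-1}."""
    if d < 0:
        raise ValueError("d must be non-negative")
    out = list(series)
    for _ in range(d):
        out = [out[t] - out[t - 1] for t in range(1, len(out))]
    return out


# Toy series with a quadratic trend (illustrative only, not the GDP data).
toy = [1.0, 4.0, 9.0, 16.0, 25.0]
first = difference(toy, d=1)   # still trending upward
second = difference(toy, d=2)  # constant after two passes
```

Each pass shortens the series by one observation, which is one reason small values of d (0, 1, or 2) are the usual choice.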

Table 3
for the classical regression model summarized in Table 4, and the coefficients are presented in Table 5; the summary and parameter estimates using the linear equation method are shown in Table 6. Significance tests are also summarized for the model. Using Time Series Modeler, the model type was shown in the estimates are based on 5% level of significance. The R-square value is over 61% for the linear, logarithmic, quadratic and exponential methods, and the F value of the regression is highly significant in Table 7 and Figure 4. Then the model

Figure 1. GDP 1960-2015 in billion US$. Source: World Bank.

Table 2. One-way ANOVA for GDP.

Table 4. Classical regression model summary.

Table 6. Model summary and parameter estimates. Dependent variable: GDP in billion US$. The independent variable is year.

Table 7. Model summary and parameter estimates.

Table 9.
The model statistics are shown in Table 10, accompanied by a descriptive graph (Figure 5), which suggests an upward trend for the series.

Table 11. Autocorrelations for GDP in billion US$.

Table 13. Cross correlations, range of lags from −7 to 7. GDP in billion US$ with year. (a) Based on the assumption that the series are not cross correlated and that one of the series is white noise.