The Forecast Research of Linear Regression Forecast Model in National Economy ()

1. Introduction
The analysis method of the national economy has always been linear regression or other various methods for regression analysis, but when we use a single method for forecasting or data analysis, there will often be large errors or large limitations. In this case, the combined model can often be used to calculate more accurate results. For example, when using a multiple linear regression model, it can predict data with a linear relationship, but it cannot predict data with an exponential relationship. When using a gray prediction model, can only make fuzzy predictions on the data, and cannot reflect the linear relationship of the data. The model in this article solves the problem that the linear prediction cannot determine the highest power and function, and at the same time solves the problem that the gray prediction model cannot perform linear prediction. Combining the two can be used in a variety of sequence forecasts with both linear and exponential growth trends in the future. It has broad significance and can make better predictions on the data.
2. Multiple Linear Regression and Grey Forecast Theory Basis
2.1. Introduction to Multiple Linear Regression
Suppose there is a random variable y and a general variable
. The linear regression model between is
(1.1)
In
is
, unknown parameter
can be called regression constant,
is regression coefficients. Y is dependent variable,
is p general variables that can be accurately measured and controlled at the same time are called independent variables. When P = 1, the formula (1) is called a univariate linear regression model. At that time, we can call Equation (1) a multiple linear regression model.
is random error. Regarding the random error term, we usually set
,
is the theoretical regression equation.
For a practical problem, if we have obtained n sets of observation data
then the above linear regression model Equation (1) can be expressed as
(1.2)
It is written in the form of a matrix:
,
,
,
,
(1.3)
The second term X is a
matrix of order, which is called regression design matrix or data matrix. In the design of the experiment, because the elements in X can be set in advance and can be controlled, because there are some subjective factors, so X is called the design matrix [1].
For example, to analyze the relationship between automobile characteristics and automobile sales volume, a multivariate linear regression model is established first. The data used by the model is shown in Figure 1 [2] [3] [4].
Firstly, it is stepwise regressed by SPSS, then selected, analyzed, regressed and linearized, and obtained (Tables 1-3).
![]()
Figure 1. Relationship between automobile characteristics and automobile sales volume.
APredicted value: (constant), resale, length, horsepow, width, wheelbas, engineers, price; BPredicted value: (constant), resale, length, horsepow, width, wheelbas, engineers; CPredicted value: (constant), resale, length, horsepow, wheelbas, engineers; DPredicted value: (constant), resale, length, wheelbas, engineers; EPredicted Value: (constant), resale, length, wheelbase.
AVariable: Sales; BPredicted values: (constant), resale, length, horsepow, width, wheelbas, engineers, price; CPredicted values: (constant), resale, length, horsepow, width, wheelbas, engineers; DPredicted values: (constant), resale, length, horsepow, wheelbas, engineers; EPredicted values: (constant), resale, length, wheelbas, engineers; FPredicted values: (constant), resale, length, wheelbas.
AVariable: Sales; BPredicted values in the model: (constant), resale, length, horsepow, width, wheelbas, engineers; CPredicted values in the model: (constant), resale, length, horsepow, wheelbas, engineers; DPredicted values in model: (constant), resale, length, wheelbas, engineer; EPredicted values in the model: (constant), resale, length, wheelbas.
2.2. Grey Pre-Test and Test
Grey prediction method is a kind of prediction method for some systems with uncertain factors. Grey theory holds that although some systematic behavior phenomena are hazy and the data are complex, they are still orderly and have certain overall functions. Therefore, the generation of grey number is to find the law from the clutter, and the grey theory establishes a generated data model instead of the original data model, so the data used in grey prediction is the result of inverse processing of some predicted values obtained from the GM model of generated data. The method of grey prediction is a branch of theory in the field of fuzzy prediction, which is more perfect in theory and method [5].
Model of grey prediction
First of all, we should do some preprocessing to the data.
For example, let’s say that the original data sequence is
Accumulate this set of data
You can get a new sequence of data
The above formula can be
summed up as, then we can call the sequence represented in this data an accumulation generation of the original data sequence, and we can also call it an accumulation
generation, obviously. Therefore, it can be assumed that an exponential curve or even a straight line is used to approximate this accumulated generated
sequence. In order to return this accumulated sequence to the original sequence, it is necessary to carry out post-subtraction operation, which can also be called subtraction generation, that is, the difference between the two data before and after, as in the above example
Summarizing the above formulas, we can get the following results:
(1.4)
Among them
,
Modeling principle of grey prediction model.
Give a column of observation data
Add it up once and get it.
Let
the first order differential equation be
satisfied, and this equation also satisfies the initial conditions when
,
The solution of is
The method of grey modeling is to estimate the constants and by using the least square method through a a cumulative u sequence. Because
they are used as initial values, they are
all brought into the equation separately,
and the difference is used instead of the differential, and because they are sampled at equal
intervals, they can be obtained
(1.5)
So there are similar
ones,
The
term
is then
moved to the right and written as the quantity product of the vector because it
also
involves accumulating the values
of the two times in the column,
so it is more appropriate to take the average of the two times before and after, that is,
it will
,
be replaced by, so the formula in the form of the quantity product of the vector before will be rewritten into a matrix expression
, (1.6)
Order,
where T means transposition, order
Then
the original matrix expression can also be written as,
it is estimated by the least square method
as, and the
estimated
values
are substituted into such a time response equation
, when
What is calculated from the above
formula is a fitting value;
At
that time, it was a predicted value, which was the fitted value
relative to a cumulative sequence, and then restored by post-subtraction operation. At
that time, the fitted
value could
be
obtained; At that time, the forecast
value of the original series can be obtained.
This chapter introduces the theoretical basis of this paper, introduces the concept of multiple linear regression model and its application examples, and introduces the concept of grey theory and the modeling basis of grey theory, as well as the simple modeling method and calculation method of grey theory [6] [7].
3. Prediction Model Based on Multiple Linear Regression and Grey Prediction
3.1. Grey Forecast and Its Advantages
3.1.1. GM (1, 1) Model
GM series models are the basic models in grey prediction theory, especially GM (1, 1) model is widely used. The model used in this paper is GM (1, 1) model. There are four basic forms of GM (1, 1) model, including mean GM (1, 1) model, original difference GM (1, 1) model, mean difference GM (1, 1) model and discrete GM (1, 1) model. The definitions of these four models are explained as follows:
There is
a
,
sequence,
wherein;
1-AGO sequence of:
,
where; Weighing
(2.1)
It is the original form of GM (1, 1) model, and the original form of GM (1, 1) model is essentially a difference equation. The parameter
vector in Equation (2.1) can also be estimated by least square method
(2.2)
where Y and B are respectively
,
(2.3)
Therefore, based on the original form of GM (1, 1) model and Formula (2.2), the parameters of the model are estimated, and the solution of the original difference Equation (2.1) is directly used as the time response formula, and the obtained model is called the original difference form of GM (1, 1) model [8].
Suppose,
as
shown in Equation (2.1)
, where
,
call
(2.4)
is in the form of the mean value of GM (1, 1) model. The mean value of GM (1, 1) model is essentially a difference equation. The parameter
vectors in Equation (2.4) can also be estimated using Equation (2.2), where it is important to note that some elements in Matrix B differ from Equation (2.3)
(2.5)
(2.6)
is called an albino differential equation in mean form of GM (1, 1) model. The matrix B in Equation (2.2) is replaced by Equation (2.5). The parameter vectors in Equation (2.6) are estimated according to the least square estimation method.
With the help of the solution of the above-mentioned whitening differential equation Equation (2.6), the mixed model of difference and differential of the time response equation of GM (1, 1) is constructed, which is called the mean mixed form of GM (1, 1) model, which can be called the mean GM (1, 1) model. The parameter of the mean GM (1, 1)
model is the development coefficient, and B is the grey action. The development
coefficient reflects
the
development trend of as well. The estimated model parameters in the form of mean value of GM (1, 1) model can be directly taken the solution of mean difference Equation (2.4) as the model obtained in the time response formula, which is called the mean difference form of GM (1, 1) model, that is, GM (1, 1) model called mean difference. The
discrete form called GM (1, 1) model is called discrete GM (1, 1) model [5].
3.1.2. Advantages and Disadvantages of GM (1, 1) Model
GM (1, 1) model can only use the behavior data sequence of the system to establish the prediction model, which belongs to a relatively simple and practical single sequence modeling method. In the case of time data, it only involves some regular time variables, and in the case of horizontal sequence data, it only involves some regular object serial number variables, but does not involve other explanatory variables. It is simple and can discover some valuable development and change information at the same time, so it has been widely used. However, the practice of forecasting shows that GM (1, 1) model is sometimes very effective, sometimes it is very unsatisfactory, and sometimes it will lose its utility. In order to make up for this small deviation, sometimes some combination models can be used to make up for the deficiency that GM (1, 1) model cannot describe some linear relationships among variables [9].
3.2. Combining Multiple Linear Regression with Grey Prediction
3.2.1. The Inadequacy of Multiple Linear Regression
Although multivariate linear regression is widely used, it also has its own shortcomings. When we can’t determine what kind of function conditions the data meet, multivariate linear can’t determine its specific index, and can’t describe the exponential growth trend. This grey prediction can just complement it [10] [11].
3.2.2. Shortcomings of Grey Prediction
Grey prediction sometimes depends too much on historical data, and does not consider the relationship between various factors, which is too much error for medium and long-term prediction results.
3.2.3. Combining Multiple Linear Regression with Grey Prediction
The grey linear regression combination model can make up for the deficiency of the original linear regression model that can’t describe the exponential growth trend and the GM (1) model that can’t describe the linear relationship among variables, and can predict both the linear growth trend and the exponential growth trend, so the data analysis has higher accuracy.
3.3. Grey Linear Combination Model
Using the method of combining multiple linear regression model with grey forecasting GM (1, 1) model, this paper puts forward that grey linear regression combination model can be used for forecasting, that is, the sum between linear regression
equation and exponential
equation can be used
’s1-AGO
for fitting.
This chapter plays an important role in this paper, This chapter introduces the advantages and disadvantages of the traditional multiple linear regression model, and introduces the grey GM (1, 1) model, and some advantages and disadvantages of the grey GM (1, 1) model. After analysis, it puts forward a model combining the two models, which makes comprehensive use of their advantages and makes them complement each other at the same time, so as to achieve better prediction data. At the same time, this chapter also introduces the research significance of the new model proposed in this paper [12].
4. Application of Forecasting Model in National Economic Data
4.1. National Economic Index Data and Preprocessing
4.1.1. National Economic Indicators
The national economic indicators used in this paper come from the data of China Statistical Yearbook 2017 in recent ten years, that is, the data of gross national product from 2007 to 2016.
The data in Figure 2 is the data that needs to be referenced in this article.
4.1.2. Sample Data Preprocessing
If we want to establish a linear relationship between the gross national product and other values, and hope to achieve the purpose of prediction, then we must establish a multiple linear relationship between the attributes of the current year and the gross national product of the next year.
For example:
By analogy, the linear relationship established in this way is the multiple linear relationship between the current year’s gross national income, the primary industry, the secondary industry, the tertiary industry and the per capita GDP and the next year’s GDP, that is to say, the current year’s value is used to predict the next year’s GDP. Import the data in the above table into SPSS as shown in Figure 3.
![]()
Figure 2. National economic data table 2007-2016.
![]()
Figure 3. SPSS national economic data chart.
4.2. Multiple Linear Regression Forecasting Model Predicts National Economy
Import the data used into SPSS and then do the following (Figure 4).
Introduce “next year’s gross national product” into dependent variables, and “GDP, primary industry, secondary industry, tertiary industry and per capita GDP” into independent variables. By using the entry method, the multiple linear relationship between them is obtained. The specific operation method is shown in the following figure (Figure 5).
The results are obtained as in Tables 4-7.
4.3. Grey Forecasting Model Predicts National Economy
Establish a grey prediction model to set a
sequence, which;
,
1-AGO
sequence
of: where; then the
,
parameter
vector is
estimated by the least square
method, where Y and B are
,
Import the model data into MATLAB, and get the following graphics (Figure 6).
![]()
Figure 4. SPSS linear regression operation.
![]()
Table 4. Variables entered into table.
AVariable: Gross national product for the next year (100 million yuan); BHas reached the tolerance = 0.000 limit.
AValue: (constant), per capita GDP (yuan), added value of secondary industry (100 million yuan), added value of primary industry (100 million yuan), gross national income (100 million yuan).
ANumber of changes: Gross National Product of the next year (100 million yuan).
AVariable: Gross national product for the next year (100 million yuan); BForecast value in the model: (constant), per capita GDP (yuan), added value of secondary industry (100 million yuan), added value of primary industry (100 million yuan), gross national income (100 million yuan).
![]()
Figure 5. Spss linear regression operation.
4.4. Grey Linear Combination Model Predicts National Economy
Because there are many sequences of this model, the model of 2.3 is further improved here, and it is
assumed that it is accumulated to
obtain, in which
It
is called grey linear regression combination prediction model, where v and
are
undetermined
parameters.
Orde
,
Then,
from this, because
all
the data in are predicted
values, in order to facilitate v fitting
parameters, all of them are replaced by
sequential accumulation sequences, so that the fitting v values of
parameters to be m identified can be determined, Considering that different values are taken,
the values of the obtained fitting parameters will be
different, so all
the obtained selected mean values
can be selected and calculated separately, and the finally obtained mean values can be taken v as the fitting
values of v the parameters
to be identified: When the fitting values of the parameters are determined,
the grey linear combination prediction model is as follows:
,
,
Then, the matrix
estimation formula of the parameters to be identified C1, C2, C3 can be obtained, and the prediction of
this model can be carried out by the above formula.
As shown in Figure 7, so
Carry out an accumulation generation
Therefore, from the formula provided in this paper, we can see Figure 8 and Figure 9.
Therefore,
Therefore
so
,
![]()
Figure 7. Gross domestic product 2007-2016.
![]()
Figure 8.
and the relationship between m and k.
![]()
Figure 9.
and the relationship between m and k.
4.5. Comparison of Results
From several results, the prediction model used in this paper is more accurate and closer to the predicted value, which better shows the advantages of prediction, and combines the advantages of linear regression with the advantages of grey prediction.
This chapter is the practical part of the data in this paper, using the three models mentioned in this paper to fit the data respectively, and comparing the results obtained, it is found that the model used in this paper is more accurate.
5. Conclusion
In this article, based on the idea of combining linear regression with grey prediction, determines the model of combining the two, and compares it with the separate models of the two. The linear regression model solves the problem that the highest power and function cannot be determined by linear prediction. At the same time, it solves the problem that the grey prediction model can’t make linear prediction. By combining the two models, it can be used in the series prediction of various existing linear trends and exponential growth trends in the future, which not only has extensive significance, but also can make better prediction for data.