A Comprehensive Price Prediction System Based on Inverse Multiquadrics Radial Basis Function for Portfolio Selection
Mengmeng Zheng
Jinan University, Guangzhou, China.
DOI: 10.4236/am.2021.1212076   PDF    HTML   XML   150 Downloads   581 Views  

Abstract

Price prediction plays a crucial role in portfolio selection (PS). However, most price prediction strategies only make a single prediction and do not have efficient mechanisms to make a comprehensive price prediction. Here, we propose a comprehensive price prediction (CPP) system based on inverse multiquadrics (IMQ) radial basis function. First, the novel radial basis function (RBF) system based on IMQ function rather than traditional Gaussian (GA) function is proposed and centers on multiple price prediction strategies, aiming at improving the efficiency and robustness of price prediction. Under the novel RBF system, we then create a portfolio update strategy based on kernel and trace operator. To assess the system performance, extensive experiments are performed based on 4 data sets from different real-world financial markets. Interestingly, the experimental results reveal that the novel RBF system effectively realizes the integration of different strategies and CPP system outperforms other systems in investing performance and risk control, even considering a certain degree of transaction costs. Besides, CPP can calculate quickly, making it applicable for large-scale and time-limited financial market.

Share and Cite:

Zheng, M. (2021) A Comprehensive Price Prediction System Based on Inverse Multiquadrics Radial Basis Function for Portfolio Selection. Applied Mathematics, 12, 1189-1209. doi: 10.4236/am.2021.1212076.

1. Introduction

The target of PS is to achieve some long-term financial goals by constructing an effective investment strategy that can reasonably allocate wealth among a set of assets. There are two main theories about PS. One is the mean-variance theory [1], which aims to balancing the expected return (mean) and risk (variance) of a portfolio and is generally applicable to the single-period PS. The other is Kelly investment [2] [3] which focuses on maximizing the expected log return and is suitable for multiple-period PS. These two theories are also the cornerstone of modern PS research and are constantly exploited and innovated. Considering the real financial market environment, most portfolio researches are more in line with the Kelly investment model. In other words, the essence of PS can be understood as an optimization problem on expected log return. Therefore, in order to make portfolio selection more effective under known historical price information, price prediction is an important link to the construction of portfolio.

Adam’s theory mentioned that there was a high chance that the market would go towards a particular direction [4]. Therefore, price prediction plays an indispensable role in PS. At present, the most commonly used price prediction systems generally follow three principles, namely, trend-following principle, trend-reversing principle and pattern-matching principle [5]. Trend-following principle assumes that a well-performing asset’s price will keep rising over next period, and vice versa. For example, PPT [6] exploits the maximum value of asset prices within a time window to predict future prices. In contrast to the trend-following, the trend-reversing principle assumes that future asset prices will reverse to some kinds of historical mean. The investment behavior is to sell good performance and buy poor performance [7] [8] [9] [10]. For example, RMR and OLMAR exploit L1-median [8] and the exponential moving average (EMA) [10], respectively, to make future price prediction. Pattern-matching principle is to look for historical price pattern that fit the current financial environment and use it to predict the future prices [11] [12] [13].

Although some strategies mentioned above can be applied to some data sets with encouraging results, no attempt has been made to combine these strategies to construct a comprehensive price prediction system. In fact, the exponential moving average (EMA) [5], the PP [6] and L1-median [8] are all classic and widely used price prediction tools. L1-median and EMA are both moderate strategies, while PP is an aggressive strategy that can actively strive for high returns. Depending on the financial environment, sometimes aggressive strategies are needed to achieve high returns, while sometimes moderate strategies are needed to avoid risks. This inspires us to construct a system able to take full advantage of multiple strategies.

Radial basis functions (RBF) are widely exploited in solving partial differential equations and image denoising, [14] - [21]. The Gaussian (GA) function, the multiquadrics (MQ) function and inverse multiquadrics (IMQ) function are three classic expressions of RBF [20] [22]. Despite its good smoothing effect, GA is a function only with good local characteristics, which means it’s significant merely in a neighborhood near the center point. MQ was originally introduced by Hardy [23] and generally accepted by researchers. Although Franke [24] proved that MQ performed best in dealing with the interpolation problem of scatter data, MQ was conditionally positive definite. Therefore, a more application-oriented IMQ was proposed. The outstanding advantages of IMQ are good global feature, strict positive definite and stable eigenvalue [14] [18]. For example, Abbasbandy [14] pointed out that IMQ could be used to approximate the unknown analytic function to get a more stable and accurate solution in solving the global optimization problem. And Tanbay [19] pointed out that compared with GA in the solution of the neutron diffusion equation, IMQ had a more stable performance and could obtain highly numerical solution. Therefore, we incorporate IMQ rather than traditional GA in our novel RBF system to increase efficiency and robustness.

In this paper, a comprehensive price prediction (CPP) system based on IMQ radial basis function is constructed. The system firstly uses IMQ basis function to construct a novel RBF system. Then, combining with the novel RBF system, a portfolio update strategy based on kernel and trace operator is constructed. Now let’s consider H different price prediction strategies. This paper mainly focuses on three strategies, namely EMA, L1-median and PP. Firstly, CPP selects the best-performing strategy according to investing performance of all strategies within the recent window and given it the largest influence in future price prediction. Secondly, CPP exploits the similarity between the best-performing strategy and other price prediction strategies to calculate the influence of other strategies. This system effectively integrates the advantages of all price prediction strategies and innovative measuring the influence by investing performance. In genneral, this paper’s main contributions are as follows:

1) Propose a novel RBF system based on IMQ radial basis function and centered on multiple price predictions, which form a comprehensive price prediction.

2) Propose a comprehensive combination of aggressive strategies and moderate strategies to achieve a better balance between returns and risks.

3) Propose a portfolio update strategy based on kernel and trace operator.

The rest parts of this paper are presented as follows. Section 2 describes the relevant problem setting and related work about PS. The CPP system is introduced and described in detail in Section 3. Experiments on 4 benchmark data sets are carried out to assess CPP in Section 4. Finally, conclusions are presented in Section 5.

2. The Relevant Problem Setting and Related Work

2.1. The Relevant Problem Setting

In this paper, d assets with a time span of n periods in financial market are considered. For the sake of understanding, let’s think of a period as a day. The asset prices of the tth period is presented by the close prices vector q t + d , t = 0 , 1 , 2 , , n , where q t i represents the price of the ith asset and + d is a d-dimensional nonnegative real space. The change of asset prices is presented by the price relative vector [25]

s t = q t q t 1 , s t + d , t = 1 , 2 , 3 , , n , (1)

where division between two vectors represents the division of corresponding components, i.e. s t i = q t i q t 1 i . Note that the price relative vector is an crucial metric in

PS, because it’s always used to evaluate the performance of an asset. When s t i > 1 , the ith asset price goes up, and vice versa. Of course when s t i = 1 , it means the price doesn’t change.

At the beginning of each trading period, wealth needs to be allocated across a range of assets. In this paper, when investing, the proportion of each asset in total weath is recorded as the portfolio vector. Suppose there are d assets and their portfolio vector in the tth period is

v t Δ d : = { v + d : i = 1 d v ( i ) = 1 } , (2)

where Δ d is a d-dimensional simplex. A non-negative constraint indicates that non-short-selling and the equality constraint indicates that self-financing, which means it is not allowed to borrow money and all of the wealth is reinvested. Since all the wealth in the previous periods is invested over next period, the cumulative wealth (CW) increases at a multiple rate, i.e. W t = W t 1 v t Τ s t , which v t Τ s t is the increasing factor. So after n periods, the final cumulative wealth is

W n = W 0 t = 1 n ( v t Τ s t ) , (3)

where W 0 is the initial wealth. In this paper, for the convenience of calculation, it is assumed that W 0 = 1 . Then the final cumulative wealth W n is

W n = t = 1 n ( v t Τ s t ) . (4)

The ultimate purpose of PS system is to maximize the final cumulative wealth W n by constructing a set of the portfolio vectors { v t } t = 1 n , that is

W ˜ n = max { v t Δ d } t = 1 n t = 1 n ( v t Τ s t ) . (5)

From the above equation, this is equivalent to maximizing the increasing factor v t Τ s t . Note that this optimization problem does not require statistical assumption about the changes in asset price.

2.2. Related Work

In this subsection, some classical prediction strategies are introduced to help us understand how to build a PS system.

The UBAH strategy [5], which is generally used as a market strategy to generate a market index, is to start with an equal distribution of wealth among d assets and keep it constant, thus the final cumulative wealth

W n M a r k e t = 1 d i = 1 d t = 1 n s t ( i ) . (6)

Both OLMAR [10] and RMR [8], which keep a moderate attitude, use the mean reversion phenomenon to predict the future asset prices. OLMAR points out that the future asset prices would recover to historical moving average and proposed the exponential moving averages (EMA). The EMA exploits all historical price information to achieve price prediction. The specific representation of EMA is as follow:

s ˜ E , t + 1 ( α ) = E M A t ( α ) q t = α q t + ( 1 α ) E M A t 1 ( α ) q t = α 1 + ( 1 α ) E M A t 1 q t 1 q t 1 q t = α 1 + ( 1 α ) s ˜ E , t s t , (7)

where E M A t represents the previous EMA and 1 is a d-dimensional vector with components of 1. 0 < α < 1 is a decaying factor and s t is the real price relative on the tth period.

When expanding E M A t , then

E M A t = α q t + ( 1 α ) E M A t 1 = α q t + ( 1 α ) α q t 1 + ( 1 α ) 2 E M A t 2 = k = 0 t 1 ( 1 α ) k α q t k + ( 1 α ) t q 0 , (8)

as a result, EMA really makes full use of all historical prices and gives lager weight to more recent price information.

Unlike OLMAR, RMR no longer uses the simple mean, but instead exploits the robustness of L1-median [26] [27] to predict the future asset prices. Statistically speaking, the L1-median has a more attractive property than the simple mean because it’s breakdown point is 0.5, meaning that when 50% of the points in the data set are pollution values, the L1-median can take values that exceed all boundaries. A higher the breakdown point means a more stable estimator, and the breakdown point of the simple mean is 0. The corresponding future price relative of RMR is

s ˜ L , t + 1 = L 1 m e d t + 1 ( ω ) q t , (9)

where

L 1 m e d t + 1 ( ω ) = arg min μ i = 0 k 1 q t i μ , (10)

where represents the Euclidean norm.

OLMAR and RMR exploit the same optimization approach to update strategies as follows:

v ^ t + 1 = arg min v Δ d 1 2 v v ^ t 2 , s .t v Τ s t + 1 ε > 0. (11)

EMA and L1-median essentially exploits the principle of mean reversion. They are cautious in their price prediction. However, there are plenty of evidence in real financial markets that irrational investment can keep prices trends. Therefore, the importance of trend-following strategies should not be ignored. In real financial markets, most investors profit from rising prices. So they are more concerned about recent maximum prices. PPT system suggests using the PPs from different asset prices within a time window [6].

q ˜ t + 1 i = max 0 k ω 1 q t k i , i = 1 , 2 , , d

s ˜ P , t + 1 = q ˜ t + 1 q t , (12)

and s ˜ P , t + 1 can also be understood as the growth potential of the assets.

EMA and L1-median belong to the trend-reversing, both of which are conservative and moderate investment strategies. In contrast, PP is an active and aggressive strategy as it belongs to the trend-following. Depending on the financial environment, sometimes aggressive strategies are needed to achieve high returns, while sometimes moderate strategies are needed to avoid risks. All of these motivate us to construct a comprehensive price prediction system that can effectively integrate the advantages of different strategies.

3. A Comprehensive Price Prediction Based on Inverse Multiquadrics Radial Basis Function

3.1. Novel RBF System Based on IMQ Function

The classical expression of RBF system is as follow:

y = Q ψ ( x ) , ψ = [ ψ 1 , ψ 2 , , ψ H ] Τ ψ h ( x ) = exp ( x ρ h 2 2 σ h 2 ) (13)

where x and y are input and output respectively, Q d × H represents the weight matrix of the RBF system, d is the dimension of output, ψ is a vector composed of GA radial basis function, both ρ h and σ h 2 are the center points and the scale parameters of the GA function, respectively. Note that σ h 2 reflects the width of the function image.

Besides GA, IMQ is another radial basis function that cannot be ignored. Here, a novel RBF system based on IMQ radial basis function is constructed, that is

y = Q ψ ( x ) , ψ = [ ψ 1 , ψ 2 , , ψ H ] Τ ψ h ( x ) = 1 x ρ h 2 + σ h 2 (14)

There is a theoretical basis for this improvement. GA and IMQ are essentially the same and both are positive definite functions [28]. However, in practical application, IMQ performance is more stable and better than GA [14] [19].

As for the formula in this paper, H price prediction strategies are denoted as { s ˜ h , t + 1 } h = 1 H . In this paper, three typical price prediction strategies are integrated, namely s ˜ E , t + 1 , s ˜ L , t + 1 and s ˜ P , t + 1 in formula Equation (7), Equation (9) and Equation (12), corresponding to H = 3 . The next step is to determine the input x and the center ρ h . First, all price prediction strategies { s ˜ h , t + 1 } h = 1 3 serve as the centers of novel RBF system. Secondly, projection operation is exploited to get qualified portfolio v ˜ h , t + 1 , and then choose the strategy with the best performance as the input. The specific methods are as follows:

v ˜ h , t + 1 = arg min s Δ d s s ˜ h , t + 1 2 , h = 1,2,3 (15)

s ˜ , t + 1 = arg min 1 h 3 min 0 k ω 1 v ˜ h , t k Τ s t k , (16)

where Δ d is defined as Equation (2) in Equation (15), v ˜ h , t k Τ s t k represents the increasing factor of the hth price prediction strategy of the ( t k ) th period and s t k is the actual price relative generated by Equation (1). The method is firstly transformed { s ˜ 1, t + 1 , s ˜ 2, t + 1 , s ˜ 3, t + 1 } onto a d-dimensional simplex [31]. Then it exploits { v ˜ h , t k } k = 0 w 1 generate the increasing factors to evaluate the investing performance and select the input s ˜ , t + 1 of the novel system. The essence of s ˜ , t + 1 is select the best-performing strategy, where the best-performing strategy means that it can get the highest return even in the worst financial environment. The general process is that the smallest increasing factors of each strategy is selected within a time window, and then the largest increasing factor is selected from a set composed of the smallest increasing factors. This approach ensures that we get the best price prediction strategy in the worst trading environment, which is the key to improving the overall robustness of the system.

In the novel RBF system mentioned in Equation (14), all qualified portfolios calculated by Equation (15) serve as centers of the novel RBF system, and the best strategy s ˜ , t + 1 serves as fixed inputs. The specific form of novel RBFs in Equation (14) is transformed into

Δ v t + 1 = S ˜ t + 1 ψ ( s ˜ , t + 1 ) , ψ = [ ψ 1 , ψ 2 , ψ 3 ] Τ ψ h ( s ˜ , t + 1 ) = 1 s ˜ , t + 1 v ˜ h , t + 1 2 + σ h 2 , h = 1,2,3 (17)

where Δ v t + 1 represents the updated increment of the portfolio on the ( t + 1 ) th period and S ˜ t + 1 = [ s ˜ 1, t + 1 , s ˜ 2, t + 1 , s ˜ 3, t + 1 ] d × 3 is equivalent to the weight matrix. The reason for this representation of Δ v t + 1 will be explained in the later Section 3.3. From the above formula, it can be seen that the system quantifies the similarity degree between s ˜ , t + 1 and v ˜ h , t + 1 . If v ˜ h , t + 1 is close to s ˜ , t + 1 , then the function ψ h will increase, and s ˜ h , t + 1 will amplify its influence on the increment Δ v t + 1 . With the change of the time t, the input s ˜ , t + 1 also changes between different strategies in order to adapt to the latest changes in the financial environment.

The proposed novel RBF system in Equation (17) is different from in Equation (13) in many different ways which are mainly reflected in the problem setting, data characteristics and the selection of radial basis functions.

1) The centers { ρ h } in Equation (13) are obtained by minimizing the error of fitting y . While the centers { v ˜ h , t + 1 } in Equation (17) represent the corresponding price prediction strategies.

2) Although the inputs of those two RBF systems are fixed, s ˜ , t + 1 is determined by the recent investing performance of all prediction strategies. This means that each price prediction strategy is likely to be an input.

3) The basis functions of those two RBF systems are different. The system in Equation (13) uses Gaussian radial basis function, while Equation (17) uses the IMQ radial basis function.

4) The objective of Equation (13) is to fit y . So the back-propagation methods can be used to solve the problem [29] [30]. However, the objective of Equation (17) aims to maximize the generalized increasing factor and the solution method is different from Equation (13).

3.2. Comprehensive Price Prediction System

In order to apply the theory to practice, the next step is to construct a PS model using the proposed novel RBF system. As described in Section 2.1, in order to obtain better investing performance, the increasing factor v t + 1 Τ s t + 1 should be maximized when price information of t periods is known. Although the price relative s t + 1 is unknown, we can use the price prediction strategy { s ˜ h , t + 1 } to generate the future price relative. In addition, the novel RBF system can be exploited to construct a generalized increasing factor as follow:

v t + 1 = arg max v t r ( V Ψ S ˜ t + 1 Τ ) , V = v 1 ( 3 ) Τ Ψ = d i a g ( ψ ) , S ˜ t + 1 = [ s ˜ 1, t + 1 , s ˜ 2, t + 1 , s ˜ 3, t + 1 ] s .t . v Δ d , v v ^ t ε , ε > 0, (18)

where tr is the trace operator, V is an 3-dimensional vector with component v , Ψ is a diagonal matrix with ψ as diagonal elements, S ˜ t + 1 is a matrix composed of 3 predicted future price relatives, and 1 3 is an 3-dimensional vector with elements of 1. Notice the difference between 1 3 here and d-dimensional 1 , and the v ˜ and v ^ are different vectors. The constraints of v make it act as a qualified portfolio and within a distance ε from v ^ t .

Compared with the classical form of the increasing factor v Τ s ˜ t + 1 in PPT [6], t r ( V Ψ S ˜ t + 1 ) can be considered as a generalized increasing factor. Because t r ( V Ψ S ˜ t + 1 ) includes Ψ as the kernel to adjust the influence of different price prediction strategies. Strategy with the best performance s ˜ , t + 1 given it the largest influence, while other strategies { s ˜ h , t + 1 } have the less influence and the magnitude of influence measured by their similarity to s ˜ , t + 1 .

Inspired by the gradient projection principle [6] [8] [10] [31], when solving v t + 1 , the simplex constraint of v is firstly relaxed to only consider 1 Τ v = 1 , and then the result is projected onto the simplex. And make u = v v ^ t here, then Equation (18) can be further converted into

u t + 1 = arg max u t r ( U Ψ S ˜ t + 1 Τ ) , s .t 1 Τ u = 0, u ε U = u 1 ( 3 ) Τ , v t + 1 = v ^ t + u t + 1 , (19)

the reason for this simplification is that v ^ t is fixed and 1 Τ u = 1 Τ v 1 Τ v ^ t = 0 . Therefore, the optimization goal is now switching from v t + 1 to the update increment u t + 1 .

3.3. Solution Algorithm

In this subsection, the solution algorithm of CPP is introduced in detail, which has briefly concluded in Proposition 1. It is worth noting that our solution is suboptimal, not only because there is a certain bias in estimating the future with historical data but also to avoid over-fitting. So it’s not necessary to get the optimal solution.

Proposition 1. If ( I ( 1 d ) 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) 0 , the unique solution of Equation (19) is

u t + 1 = ε ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) , (20)

note that if ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) = 0 , we make u t + 1 = 0 .

Proof. The first step is to prove that u t + 1 satisfies all constraints in Equation (19), that is

u t + 1 = 1 (21)

1 Τ u t + 1 = ε 1 Τ ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) = ε ( 1 Τ 1 d ( 1 Τ 1 ) 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) = 0 (22)

So u t + 1 satisfies the two constraints. The second step is to prove that the optimization problem in Equation (19) maximizes at u t + 1 . By contradiction we suppose that there exists u ^ that satisfies all constraints and

t r ( U ^ Ψ S ˜ t + 1 Τ ) > t r ( U t + 1 Ψ S ˜ t + 1 Τ ) , (23)

where U ^ = u ^ 1 3 , U t + 1 = u t + 1 1 3 .

On the one hand, the right side of the Equation (23) can be converted to

t r ( U t + 1 Ψ S ˜ t + 1 Τ ) = t r ( u t + 1 1 3 Τ Ψ S ˜ t + 1 Τ ) = t r ( 1 3 Τ Ψ S ˜ t + 1 Τ u t + 1 ) = 1 3 Τ Ψ S ˜ t + 1 Τ u t + 1 (24)

Note that 1 3 Τ Ψ S ˜ t + 1 Τ u t + 1 is a scalar.

Then we substitute u t + 1 into the scalar

1 3 Τ Ψ S ˜ t + 1 Τ u t + 1 = ε 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) = ε 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) 2 S ˜ t + 1 Ψ 1 ( 3 ) ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) = ε ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) (25)

The second equation is derived from the idempotent of ( I 1 d 1 1 Τ ) .

On the other hand, the left side of the Equation (23) can be converted to

t r ( U ^ Ψ S ˜ t + 1 Τ ) = 1 3 Τ Ψ S ˜ t + 1 Τ u ^ = 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) u ^ 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) u ^ (26)

The inequality is derived from Cauchy-Schwarz inequality.

Take Equation (23), Equation (25) and Equation (26) into consideration, we have

ε ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) < 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) u ^ (27)

Hence, we can deduce u ^ < ε . This contradicts u ^ ε . It is proved that the optimization problem in Equation (19) obtain the maximum at u t + 1

Finally, we prove that the solution u t + 1 is unique. If u t + 1 is not the only solution, then there must be another optimal solution u u t + 1 , and

1 3 Τ Ψ S ˜ t + 1 Τ u t + 1 = t r ( U t + 1 Ψ S ˜ t + 1 Τ ) = t r ( U Ψ S ˜ t + 1 Τ ) = 1 3 Τ Ψ S ˜ t + 1 Τ u (28)

This shows that u u t + 1 . So the Cauchy-Schwarz inequality is strict

1 3 Τ Ψ S ˜ t + 1 Τ u = 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) u < 1 3 Τ Ψ S ˜ t + 1 Τ ( I 1 d 1 1 Τ ) u (29)

According to Equation (25) and Equation (29), we obtain u > ε . This contradicts the constraint u < ε .

To sum up, we can get that u t + 1 is the optimal and unique solution of optimization problem in Equation (19). □

Regarding Proposition 1, the update increment

u t + 1 = A ε ( ( I 1 d 1 1 Τ ) S ˜ t + 1 Ψ 1 ( 3 ) ) = A ε ( ( I 1 d 1 1 Τ ) S ˜ t + 1 ψ ) , (30)

where A ε is a mapping projected onto the ε -Euclidean ball. Compared with Equation (17), in order to satisfy the constraint conditions in Equation (19), u t + 1 adds two operators on the basis of Δ v t + 1 . Thus, this is also explains why the update increment of the portfolio is represented as Equation (17).

Then, the portfolio v ^ t + 1 on the next period is shown as follows:

v t + 1 = v ^ t + u t + 1 v ^ t + 1 = arg min v Δ d v v t + 1 2 . (31)

The complete CPP system is outlined in Algorithm 1. CPP is a fast algorithm because it only uses ordinary matrix calculation without any iterative calculation, which significantly reduces the operation time.

4. Experiments

4.1. Data Sets and Comparison Approaches

In this subsection, in order to comprehensively assess the performance of systems, a large number of experiments were carried out on four data sets, namely NYSE(N), DJIA, SP500 and HS300. In fact, they contain the daily price relatives of assets, originating from the New York Stock Exchange, Dow Jones Industrial Average, Standard & Pool 500 and China Stock Index 300, respectively. These data sets have larger assets scales and long time spans. So they can be used to assess the investing performance of systems. The details of these data sets are introduced in Table 1.

Table 1. Summary of 4 benchmark data sets from real-world financial market.

Algorithm 1. The whole CPP system.

Six commonly used PS systems are introduced, namely RMR, OLMAR, PPT, TRLR [32], SSPO [33] and AICTR [34], to compete with CPP that we proposed. A detailed descriptions of these systems are as follow.

1) RMR: RMR uses L1-median to make price prediction as describled in Section 2.2.

2) OLMAR: OLMAR uses the moving averages to make price prediction, which are described in Section 2.2.

3) PPT: PPT is an aggressive strategy that uses the maximum values of different assets to make price prediction as mentioned in Section 2.2.

4) TRLR: TRLR [32] is a novel trend representation strategy. It exploits weighted ridge regression to represent the price trend pattern with timet as a variable, which improves the efficiency of the price prediction.

5) SSPO: SSPO [33] is an aggressive strategy and concentrates wealth on a small number of assets by taking advantage of the inherent sparsity of assets to maximize the final CW.

6) AICTR: AICTR [34] is a composite trend tracking system using Gaussian basis function.

In order to ensure the performance of these systems, the parameter settings of these systems are consistent with the default settings in the original paper [6] [8] [10] [32] [33] [34]. RMR: ω = 5 , ε = 5 ; OLMAR: ν = 0.5 , ε = 10 ; PPT: ω = 5 , ε = 100 ; TRLR: ω = 5 , q 1 = 0.1 , λ = 0.7 , σ = 0.3 , η = 40000 ; SSPO: ω = 5 , γ = 0.01 , λ = 0.5 , η = 0.005 , ζ = 500 ; AICTR: ω = 5 , σ h 2 = 0.0025 , ε = 1000 . Note that for consistency, all time window sizes are set to ω = 5 . In the experiments, the portfolio vector is initialized to v 1 = ( 1 / d ) 1 .

In general, the parameters of CPP are determined based on the results of final cumulative wealth (CW), operating in the same way as previous studies. The calculation of final CW is described in detail in Section 2.1. First, we set the window size ω = 5 , which is widely used and consistent with other systems. Secondly, we change one parameter to fix the other parameters for the experiments. Since ε is the updating strength, it is roughly estimated to be larger value, while σ h 2 is a parameter used to evaluate the difference between two portfolios, it should be a small value. On the one hand, we firstly set ω = 5 , ε = 1400 , and then set σ h 2 change between 0.0007 and 1.0012. According to the results in Figure 1, the investing performance of CPP around σ h 2 = 0.0008 is stable and good. On the other hand, we firstly set ω = 5 , σ h 2 = 0.0008 and then make ε change between 1200 and 1700. The results are shown in Figure 2 and we know that CPP is stable and good around ε = 1400 . Therefore, the parameters of CPP are set as: σ h 2 = 0.0008 , ε = 1400 .

4.2. Experimental Results

In this paper, a scheme containing seven evaluation indicators are designed to assess the performance of different systems and achieve the most excellent results. These seven indicators can be roughly divided into three categories, namely investing performance, risk metrics and application issues. Investing performance

Figure 1. Final CWs of CPP in regard to σ h 2 on four data sets (fixed ω = 5 , ε = 1400 ). (a) NYSE(N), (b) DJIA, (c) SP500, (d) HS300.

Figure 2. Final CWs of CPP in regard to ε on four data sets (fixed ω = 5 , σ h 2 = 0.0008 ). (a) NYSE(N), (b) DJIA, (c) SP500, (d) HS300.

includes CW, mean excess return (MER) and α Factors. Risk metrics consist of sharpe ratio (SR) and information ratio (IR). As for application issues, we chose transaction cost and running times to assess them. Those indications will be discussed in the following subsection.

4.2.1. Investing Performance

1) CW: The final CWs are the primary consideration in evaluating a system. Without taking transaction into account, the results of CWs for various systems mentioned in Section 4.1 are shown in Table 2. CPP outperforms seven commonly used systems on three data sets and ranks third on NYSE(N). For example, CPP(4.74, 18.05) is significantly higher than PPT(3.08, 10.78), TRLR(2.71, 13.17) and OLMAR(1.16, 9.59), respectively. In addition, DJIA is a challenging data set because many systems do not perform well in this data set, such as PPT, and OLMAR. But CPP can reach a value of 4.74, which is 53.90% higher than PPT. These results shows that CPP is an effective PS system and can accumulate more wealth in the real financial market.

In order to show the superiority of the system CPP, the CWs of each system on DJIA is plotted in Figure 3. By observing the Figure 3, the excellent investing performance of CPP can be shown more intuitively.

2) MER: Return is a financial term that describes the proportion of wealth by a PS system gained or lost over one investing period. In this paper, the daily return

Table 2. Final CWs and MERs of different PS systems on 4 data sets.

Figure 3. Daily CWs of different PS systems on DJIA.

on the tth period is r t = v ^ t Τ s t 1 . MER [35] is an indicator that computes the long-term average daily excess return of attended PS systems

MER = r ¯ s r ¯ m = 1 n t = 1 n ( r s , t r m , t ) , (32)

where r s , t and r m , t present the daily returns of attended PS system and the market baseline on the tth period, respectively. Note that, the UBAH system is defined as the market baseline.

The MER results from various PS systems are described in Table 2. CPP obtains the largest MER on three data sets and ranks third on NYSE(N). For instance, CPP(0.0041, 0.0004) is significantly higher than PPT(0.0031, −0.0004), OLMAR(0.0012, 0.0001) and TRLR(0.0030, 0.0001) on DJIA and HS300 respectively. In addition, a small MER gap is likely to produce a lager CW gap in the long-term. Therefore, the above results demonstrate that CPP can achieve an outstanding investing performance.

3) α Factor: MER measures the investing performance of a PS system without considering market risks. However, in the real financial market, the volatility of the market will undoubtedly affect the performance of assets. Capital asset pricing model(CAPM) [36] points out that the expected return sources of PS systems can be divided into two parts: the first part comes from the market return, and the second comes from the inherent excess return, also called α Factor [37] [38]. Therefore, α Factor able to evaluate the investing performance of different PS systems:

E ( r s ) = α + β E ( r m ) , β ^ = c ^ ( r s , r m ) s ^ 2 ( r m ) , α ^ = r ¯ s β ^ r ¯ m (33)

where E ( ) is the mathematical expectation of the sample data, and c ^ ( ) and s ^ ( ) are covariance matrix and standard deviation calculated from the daily return of n trading days, respectively.

The α Factor results from all PS systems mentioned are presented in Table 3. CPP achieved the highest α on three data sets and ranked second on the NYSE(N). For example, CPP (0.0042, 0.0002) achieves a higher α , compared

Table 3. α Factors (with p-Values of t-tests) of different PS systems on four data sets.

with RMR(0.0030, −4.5E−5), PPT(0.0034, −0.0005), OLMAR(0.0013, −0.0003) and TRLR(0.0031, 0.0001) on DJIA and HS300, respectively. The above results show that CPP is still able to achieve higher inherent excess return in the face of market volatility. In addition, the statistical t-test is used to determine whether α is significantly lager than 0, proving that the inherent excess return is not achieved by luck. The results of p-value presented in Table 3 and the α of CPP is significantly lager than 0 at a high confidence level of 99% (with all p-values < 0.01). It shows that CPP achieves good inherent excess returns and the obtained results are not the result of luck.

4.2.2. Risk Metrics

1) Sharpe Ratio: In the real financial market, risks and returns coexist, and high excess returns often mean high risks. Wise investors will balance returns and risks before investing. Sharpe proposed SR [36] [39] to measure risk-adjusted return

SR = r ¯ s r f s ^ ( r s ) (34)

where r f is the return of risk-free assets, and r ¯ s and s ^ ( r s ) are the average return and the standard deviation calculated from the daily return of n periods respectively. Note that, this article don’t consider risk-free assets. So let r f = 0 .

The results of SR for various PS systems are presented in Table 4. CPP stands out among commonly used systems and achieves the highest SR on all data sets. For example, CPP(0.1076, 0.0599) achieves the highest SR compared to AICTR (0.0995, 0.0555), OLMAR(0.0252, 0.0353), and SSPO(0.0919, 0.0192) on DJIA and HS300 respectively. The above results prove that CPP has an excellent ability to balance return and risk compared with commonly used systems.

2) Information Ratio: Unlike SR, IR [40] does not directly measure the risk-adjusted excess return of a system, but measures it compared to the Market

IR = r ¯ s r ¯ m s ^ ( r s r m ) (35)

Table 4. SRs and IRs of different PS systems on four data sets.

The results of IR are presented in Table 4. CPP obtains the highest IR among commonly used systems on all data sets. For example, CPP(0.1012, 0.1417) outperforms AICTR(0.0979, 0.1364), PPT(0.1008,0.1176), OLMAR(0.0970, 0.0457), RMR(0.0953, 0.1092) and SSPO(0.0979, 0.1304) on NYSE(N) and DJIA respectively. Therefore, CPP is a robust system, which can obtain good excess returns and able to control risks effectively.

4.2.3. Application Issues

1) Transaction Cost: In the real-world financial environment, the transaction cost is a problem that cannot be ignored. Suppose that the transaction cost rate required to update the portfolio denoted as γ ( 0,1 ) . The proportional transaction cost model [8] [10] [41] assumes that the final CW at the beginning of the tth period can be expressed as

W n γ = W 0 t = 1 n [ ( v ^ t Τ s t ) × ( 1 γ 2 i = 1 d | v ^ t i v ˜ t 1 ( i ) | ) ]

v ˜ t 1 ( i ) = v ^ t 1 ( i ) s t 1 ( i ) v ^ t 1 Τ s t 1

where v ˜ t 1 ( i ) represents the adjusted portfolio of the tth asset at the end of the ( t 1 ) th period and let v ˜ 0 = 0 . ( γ 2 ) i = 1 d | v ^ t i v ˜ t 1 ( i ) | stands the proportional

transaction cost used to update portfolios from the adjusted portfolio to the next portfolio v ˜ t .

Figure 4. Final CWs of different PS systems in regard to transaction cost rate γ on four data sets. (a) NYSE(N), (b) DJIA, (c) SP300, (d) HS300.

In the case of considering transaction costs, in order to better assess the investment performance of CPP, we let γ change between 0% and 0.5% and carry out experiments to calculate the final CWs of different commonly used systems. The results are presented in Figure 4. CPP achieves the highest CW on all data sets when the transaction cost rate γ fluctuates between 0% and 0.15%. In addition, even when γ reaches a high value ( 0.15 % γ 0.5 % ), CPP still outperforms other PS systems on three data sets. Therefore, it shows that CPP can bear moderate transaction costs and can be applied in real world financial markets.

2) Running Times: Running Time is an important indicator to judge whether a system can be applied in a large-scale and time-limiting environment, such as High-Frequency Trading (HFT) [42]. We use a regular computer equipped an Intel Core i5-8250U CPU and 8GB DDR4 2400 MHZ memory card to perform CPP in the experiments. The average running times of CPP for one trading period are 1.751e−4, 1.876e−4, 1.936e−4 and 1.876e−4 on NYSE(N), DJIA, SP500 and HS300, respectively. Therefore, CPP has good computational efficiency and can be applied in large-scale financial markets.

5. Conclusion

In this paper, we proposed a new CPP system based on IMQ radial basis function with an integration of three different aggressive and moderate strategies for effective and robust PS. Instead of using a traditional GA function, here we chose a more stable and accurate function that is IMQ for the novel RBF system, which centers on multiple strategies. With regard to portfolio update, different from the traditional increasing factor, we propose a generalized growth factor based on a kernel and trace operation. And CPP is fast and can be applied in larger scale and limited time financial environment. Extensive experiments are performed on 4 worldwide benchmark data sets to indicate that CPP can effectively integrate the advantages of different strategies and it was proved to be effective in PS. On the one hand, in most cases CPP outperforms other commonly used systems in performance indicators CW, MER and α Factor. On the other hand, CPP achieves the highest SR and IR compared with other systems. The results show that CPP has not only excellent investing performance but also good risk control ability. In addition, CPP can withstand reasonable transaction costs and fast operation, which have to be considered in the real financial market. In conclusion, CPP is an efficient and robust PS system and deserves further investigation. Of course, the CPP system has its own shortcomings. On one hand, the problem of reducing transaction cost was not considered at the initial stage of modeling. On other hand, this paper considers only the three strategies. In the future, we can improve the performance of the system from these two aspects.

6. Acknowledgements

Thanks to Jinan University for the resources provided, and to tutors for their valuable advice on this manuscript. This research is supported by the National Natural Science Foundation of China under Grants 61703182, the Science and Technology Planning Project of Guangzhou, China under Grants 202102021173 and the Fundamental Research Funds for the Central Universities under Grants 21617347.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1] Markowitz, H.M. (1952) Portfolio Selection. Journal of Finance, 7, 77-91.
https://doi.org/10.1111/j.1540-6261.1952.tb01525.x
[2] Kelly Jr., J. (1956) A New Interpretation of Information Rate. Bell System Technical Journal, 35, 917-926.
https://doi.org/10.1002/j.1538-7305.1956.tb03809.x
[3] Finkelstein, M. and Whitley, R. (1981) Optimal Strategies for Repeated Games. Advance in Applied Probability, 13, 415-428.
https://doi.org/10.2307/1426692
[4] Wilder, W. (2012) The ADAM Theory of Markets. Shanxi Peoples’s Publishing House.
[5] Li, B. and Hoi, S.C.H. (2014) Online Portfolio Selection: A Survey. ACM Computing Surveys, 46, Article No. 35.
https://doi.org/10.1145/2512962
[6] Lai, Z.R., Dai, D.Q., Ren, C.X. and Huang, K.-K. (2018) A Peak Price Tarcking-Based Learning System for Portfolio Selection. IEEE Transactions Neural Newtworks and Learning Systems, 29, 2823-2832.
[7] Borodin, A., EL-Yaniv, R. and Gogan, V. (2004) Can We Learn to Beat the Best Stock. Journal of Artificial Intelligence Research, 21, 579-694.
https://doi.org/10.1613/jair.1336
[8] Huang, D.-J., Zhou, J.-L. and Li, B. (2016) Robust Median Reversion Strategy for Online PS. IEEE Transactions Neural Newtworks and Learning Systems, 28, 2480-2493.
https://doi.org/10.1109/TKDE.2016.2563433
[9] Li, B., Hoi, S.C.H. and Zhao, P.-L. (2013) Confidence Weighted Mean Reversion Strategy for Online PS. ACM Transactions on Knowledge Discovery from Data, 7, Article No. 4.
https://doi.org/10.1145/2435209.2435213
[10] Li, B., Hoi, S.C.H., Sahoo, D. and Liu, Z.-Y. (2015) Moving Average Reversion Strategy for Online Portfolio Selection. Artificial Intelligence, 222, 104-123.
https://doi.org/10.1016/j.artint.2015.01.006
[11] Gyorfi, L., Urban, A. and Vajda, I. (2007) Kernel-Based Semi-Log-Optimal Empirical Empirical Portfolio Selection Strategies. International Journal Theoretical and Applied Finance, 10, 505-516.
https://doi.org/10.1142/S0219024907004251
[12] Gyorfi, L., Lugosi, G. and Udina, F. (2006) Nonparametric Kernel-Based Sequential Investment Strategies. Mathematical Finance, 16, 337-357.
https://doi.org/10.1111/j.1467-9965.2006.00274.x
[13] Gyorfi, L., Udina, F. and Walk, H. (2008) Nonparametric Nearest Neighbor Based Empirical Portfolio Selection Strategies. Statistics & Decisions, 26, 145-157.
https://doi.org/10.1524/stnd.2008.0917
[14] Abbasbandy, S., Roohani Ghehsateh, H., Hashim, I. and Alsaedi, A. (2014) A Comparision Study of Meshfree Techniques for Solving the Two-Dimensional Linear Hyperbolic Telegraph Equation. Engineering Analysis with Boundary Elements, 47, 10-20.
https://doi.org/10.1016/j.enganabound.2014.04.006
[15] Ji, Y. and Kim, S. (2013) An Adaptive Radial Basis Function Method Using Weighted Improvement. Winter Simulation Conference Proceeding, Washington DC, 8-11 December 2013, 957-968.
https://doi.org/10.1109/WSC.2013.6721486
[16] Khan, M., Khan, Z. and Khan, H. (2020) Collocation Method for Multiplicative Noise Removal Model. Mehran University Research Journal of Engineering and Technology, 39, 734-743.
https://doi.org/10.22581/muet1982.2004.05
[17] Ku, C.Y., Liu, C.Y. and Xiao, J.E. (2020) Multiquadrics without the Shape Parameter for Solving Partial Differential Equations. Symmetry, 12, Article No. 1813.
https://doi.org/10.3390/sym12111813
[18] Wang, S.Y. and Wang, M.Y. (2006) Structural Shape and Topology Optimization Using an Implicit Free Boundary Parametrization Method. Computer Modeling in Engineering & Sciences, 13, 119-147.
[19] Tanbay, T. and Ozgener, B. (2014) A Comparison of the Meshless RBF Collocation Method with Finite Element and Boundary Element Methods in Neutron Diffusion Calculation. Engineering Analysis with Boundary Elements, 46, 30-40.
https://doi.org/10.1016/j.enganabound.2014.05.005
[20] Soley, F., Barf, M. and Hagh, F. (2018) Inverse Multi-Quadric RBF for Computing the Weights of FD Method: Application to American Options. Communications in Nonlinear Science and Numerical Simulation, 64, 74-88.
https://doi.org/10.1016/j.cnsns.2018.04.011
[21] Tan, R., James, R. and Nina, F. (2020) Nonstationary Discrete Convolutionnkenel for Multimodeal Process Monitoring. IEEE Transaction on Neural Networks and Learning Systems, 31, 3670-3681.
https://doi.org/10.1109/TNNLS.2019.2945847
[22] Buhmann, D., Marchi, S. and Perra, E. (2020) Analysis of a New Class of Rational RBF Expansions. IMA Journal of Numerical Analysis, 40, 1972-1993.
https://doi.org/10.1093/imanum/drz015
[23] Hardy, R.L. (1990) Theory and Application of the Multiquadric-Biharmonic Method 20 Years of Discovery. Computers and Mathematics with Applications, 19, 163-208.
https://doi.org/10.1016/0898-1221(90)90272-L
[24] Franke, R. (1982) Scattered Data Interpolation: Tests of Some Methods. Mathematics of Computation, 38, 181-200.
https://doi.org/10.1090/S0025-5718-1982-0637296-4
[25] Cover, T.M. (1991) Universal Portfolios. Mathmatical Finance, 1, 1-29.
https://doi.org/10.1111/j.1467-9965.1991.tb00002.x
[26] Weiszfeld, E. (1937) Sur le point pour lequel la somme des distance den points donnes est minimum. Tohoku Mathematical Journal, 43, 355-386.
[27] Vard, Y. and Zhang, C.H. (2000) The Multivariate L1-Median and Associated data Depth. Proceedings of the National Academy of Science of the United States of America, 97, 1423-1426.
https://doi.org/10.1073/pnas.97.4.1423
[28] Boyd, J.P. (2011) The Near-Equivalence of Five Species of Spectrally-Accurate Radial Basis Functions (RBFs): Asymptotic Approximations to the RBF Cardinal Functions on a Uniform Unbounded Grid. Journal of Computational Physics, 230, 1304-1318.
https://doi.org/10.1016/j.jcp.2010.10.038
[29] Brahma, P., Wu, D.-P. and She, Y.-Y. (2016) Why Deep Learning Works: A Manifold Disentanglement Perspective. IEEE Transaction on Neural Networks and Learning Systems, 27, 1997-2008.
https://doi.org/10.1109/TNNLS.2015.2496947
[30] Jenni, R., Serkan, K. and Mon, G. (2016) Training Radial Basis Function Neural Networks for Classification via Class-Specific Clustering. IEEE Transaction on Neural Networks and Learning Systems, 27, 2458-2471.
https://doi.org/10.1109/TNNLS.2015.2497286
[31] John, D., Shai, S.-S. and Yor, S. (2008) Efficient Projections onto the l1-Ball for Learning in High Dimensions. International Conference on Machine Learning, Helsinki, 5-9 July 2008, 272-279.
[32] Lai, Z.-R., Yang, P.-Y., Wu, X.-T. and Fang, L. (2018) Trend Representation Based Log-Density Regularization System for Portfolio Optimization. Pattern Recognition, 76, 14-24.
https://doi.org/10.1016/j.patcog.2017.10.024
[33] Lai, Z.-R., Yang, P.-Y., Fang, L.-D. and Wu, X.-T. (2018) Short-Term Sparse Portfolio Optimization Based on Alternating Direction Method of Multipliers. Journal of Machine Learning Research, 19, 63:1-63:28.
[34] Lai, Z.-R., Dai, D.-Q., Ren, C.-X. and Huang, K.-K. (2018) Radial Basis Functions with Adaptive Input and Composite Trend Representation for PS. IEEE Transaction on Neural Networks and Learning Systems, 29, 6214-6226.
https://doi.org/10.1109/TNNLS.2018.2827952
[35] Jegadeesh, N. (1990) Evidence of Predictable Behavior of Security Returns. The Journal of Finance, 45, 881-898.
https://doi.org/10.1111/j.1540-6261.1990.tb05110.x
[36] Sharpe, W.F. (1964) Capital Asset Prices: A Theory of Market Equilibrium under Conditions of Risk. The Journal of Finance, 19, 425-442.
https://doi.org/10.1111/j.1540-6261.1964.tb02865.x
[37] Lintner, J. (1965) The Valuation of Risk Assets and the Selection of Risky Investments in Stock Portfolios and Capital Budgets. The Review of Economics and Statistics, 47, 13-37.
https://doi.org/10.2307/1924119
[38] Mossin, J. (1966) Equilibrium in a Capital Asset Market. The Econometric Society, 34, 768-783.
https://doi.org/10.2307/1910098
[39] Sharpe, W.F. (1966) Mutual Fund Performance. The Journal of Business, 39, 119-138.
https://doi.org/10.1086/294846
[40] Treynor, J.L. and Black, F. (1973) How to Use Security Analysis to Improve Portfolio Selection. The Journal of Business, 46, 66-86.
https://doi.org/10.1086/295508
[41] Blum, A. and Kalai, A. (1999) Univeral Portfolios with and without Transaction Costs. Machine Learning, 35, 193-205.
https://doi.org/10.1023/A:1007530728748
[42] Aldrige, I. (2013) High-Frequency Trading: A Practical Guide to Algorithmic Strategies and Trading Systems. 2nd Edition, Wiley, Hoboken.
https://doi.org/10.1002/9781119203803

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.