Efficient Solutions of Coupled Matrix and Matrix Differential Equations

Abstract

In Kronecker products works, matrices are sometimes regarded as vectors and vectors are sometimes made in to matrices. To be precise about these reshaping we use the vector and diagonal extraction operators. In the present paper, the results are organized in the following ways. First, we formulate the coupled matrix linear least-squares problem and present the efficient solutions of this problem that arises in multistatic antenna array processing problem. Second, we extend the use of connection between the Hadamard (Kronecker) product and diagonal extraction (vector) operator in order to construct a computationally-efficient solution of non-homogeneous coupled matrix differential equations that useful in various applications. Finally, the analysis indicates that the Kronecker (Khatri-Rao) structure method can achieve good efficient while the Hadamard structure method achieve more efficient when the unknown matrices are diagonal.

Share and Cite:

Al-Zhour, Z. (2012) Efficient Solutions of Coupled Matrix and Matrix Differential Equations. Intelligent Control and Automation, 3, 176-187. doi: 10.4236/ica.2012.32020.

1. Introduction

Linear matrix and matrix differential equations show up in various fields including engineering, mathematics, physics, statistics, control, optimization, economic, linear system and linear differential system problems. For instance, the Lyapunov equations and (where A* is the conjugate transpose of A) are used to analyze of the stability of continuous-time and discrete-time systems, respectively [1]. The generalized Lyapunov equation:

. (1)

(where is the transpose of B) has been used to characterize structured covariance matrices [2]. Most of the existing results, however, are connected with particular systems of such matrix and matrix differential equations.

Coupled matrix and matrix differential equations have also been widely used in stability theory of differential equations, control theory, communication systems, perturbation analysis of linear and non-linear matrix equations and other fields of pure and applied mathematics and also recently in the context of the analysis and numerical simulation of descriptor systems. For instance, the canonical system

(2)

With the boundary conditions and has been used to the solution of optimal control problem with the performance index [3]. In addition, many interesting problems lead to coupled Riccati matrix differential equations [4]:

(3)

and the general class of non-homogeneous coupled matrix differential equations:

(4)

where are given scalar matrices, is a given matrix function, are the unknown diagonal matrix functions to be solved and; and where denotes the derivative of matrix function.. (where is the set of all matrices over the complex number field and when, we write instead of).

Examples of such situation are singular [5] and hybrid system control [6] and nonzero sum differential games [7]. Depending on the problem considered, different coupling terms may appear. However, in all the above mentioned cases the systems are difficult to solve.

Let us recall some concepts that will be used below.

Given two matrices and

, then the Kronecker product of A and B is defined by (e.g. [8-12])

. (5)

While if, , and let and be the columns of A and B, respectively, namely

,.

The columns of the Kronecker product are for all i, j combinations in lexicographic order namely,

(6)

Thus, the Khatri-Rao product of A and B is defined by [13,14]:

(7)

consists of a subset of the columns of. Notice that is of order and is of order. This observation can be expressed in the following form [15]:

, (8)

where the selection matrix is of order and

(9)

and is an column vector with a unity element in the k-th position and zeros elsewhere.

Additionally, if both matrices and have the same size, then the Hadamard product of A and B is defined by [8-11,16]:

. (10)

This product is much simpler than Kronecker and Khatri-Rao products and it can be connected with isomorphic diagonal matrix representations that can have a certain interest in many fields of pure and applied mathematics, for example, Tauber [16] applied the Hadamard product to solving a partial differential equation coming from an air pollution problem. The Hadamard product is clearly commutative, associative, and distributive with respect to addition. It has been known that is a (principal) submatrix of if A and B are (square) of the same size. This can be found in Visick [12] and even in Zhang’s book [17]. Liv-Ari [13, Theorem 3.1, p. 128] gave the following new relations related to Kronecker, Khatri-Rao and Hadamard products:

; (11)

. (12)

The Kronecker product and vector operator affirming their capability of solving some matrix and matrix differential equations. Such equations can be readily converted into the standard linear equation form by using the well-known identity (e.g. [17,18]):

, (13)

Where denotes a vectorization by columns of a matrix. The need to compute the, and are due its appearance in the solutions of coupled matrix differential equations. Here

(14)

For any matrix, the spectral representation of and assures that [9,18]:

(15)

where and are the eigenvalues and the corresponding eigenvectors of A, and is the eigenvectors of matrix.

Finally, for any matrices A, B, C, , we shall make a frequent use the following properties of the Kronecker product (e.g. [9,18-20]) which are used to establish our results.

1)

(16)

2);;

(17)

3);

(18)

4)

. (19)

In this paper, we present the efficient solution of coupled matrix linear least-squares problem and extend the use of diagonal extraction (vector) operator in order to construct a computationally-efficient solution of nonhomogeneous coupled matrix linear differential equations.

2. Coupled Matrix Linear Least-Squares Problem

The multistatic antenna array processing problem can be written in matrix notation as [13]

. (20)

where, and are given (complex valued) matrices; and where the unknown matrix is diagonal. We also assume that n < mp, so that we suggest using a least-squares approach, viz.,

, (21)

where is called Frobenius norm of A. Using the identity in Equation (13) we can transform (21) into the vector LSP form:

. (22)

which has the well-known solution:

, (23)

provided is invertible.

Applying the direct vector transformation in Equation (13) to results in a highly inefficient leastsquare problem, because VecX is very sparse. Liv-Ari [13] described an alternative approach based on:

, X is diagonal   (24)

which involves the so-called Khatri-Rao product, as well as the diagonal extraction operator:

(25)

which forms a column vector consisting of the diagonal elements of the square matrix X, instead of the much longer column vector VecX. In addition, if Y is any matrix of order, then            

. (26)

As we have observed earlier, when the unknown matrix X is diagonal, solving for VecX is highly inefficient, since most of the elements of X vanish. Instead Liv-Ari [13] used the more compact vectorization identity to rewrite matrix LSP (21) in the vector form:

. (27)

Notice that consists of only the nontrivial (i.e., diagonal) elements of the matrix X. The explicit solution of (27) is

. (28)

provided is invertible.

It turns out that this expression can also be implemented using Hadamard product, resulting in a significant reduction in computational cost, as implied the following result [13]:

, (29)

where and.

When, we observe that the left-hand side expression in Equation (29) requires multiplications, while forming the equivalent right-hand side expression requires only multiplications. Thus the latter offers significant computational savings, especially when.

Now, using (26) we can rewrite (28) in the more compact form:

. (30)

This expression which requires (multiply and add) operations is much more efficient than (28), which requires operations. It means that the computational advantage of using the Hadamard product expression is particularly evident when, which implies that . In order to be able to use (30) we must ascertain that the matrix is invertible. This will hold, for instance, when both A and B have full column rank.

As for the diagonal extraction operator, we observe that for any square matrix,

. (31)

If Y is diagonal, then we also have

, Y is diagonal.      (32)

Moreover, the columns of the selection matrix are mutually orthonormal, viz.,  

. (33)

Using (32) and (11), we get the fundamental relation between the Hadamard product and diagonal extraction operator which is given by

, X is diagonal  (34)

where A, B and X is diagonal matrix.

Now we will discuss the efficient and more efficient least-squares solutions of coupled matrix linear equations:

, (35)

where A, , E, are given scalar matrices and X, are unknown matrices to be solved. We also assume that, so that the coupled matrix linear Equations (35) is over-determined, which suggests using a least squares approach. We consider the coupled matrix linear least-squares problem (CLSP):

. (36)

The solution procedure presented here may be considered as a continuation of the method proposed to solve least-squares problem in (21).

Using the identity (13) we can transform (36) into the vector CLSP form [10]:

(37)

which has the following solution

(38)

One can easily show that

(39)

where is a unitary matrix. So

(40)

Suppose that and, we then have

(41)

Now the least—squares solutions (38) can be rewrite into the form:

. (42)

This gives

(43)

where and.

In order to be able to use (38) and (43) we must ascertain that the matrix:

is invertible if and only one  

and

are invertible matrices.

As we observed, when the unknown matrices X and are diagonal, solving for VecX and VecY are highly inefficient, since most of the elements of X and Y vanish. Instead we can use the more compact vectorization identity (24) to rewrite the coupled matrix linear least-squares problem (37) in the reduced-order vector form:

. (44)

Notice that and consists of only the nontrivial (i.e., diagonal) elements of matrices X and Y. The explicit efficient solution of (44) is

(45)

where and.

In order to be able to use (45), we must ascertain that the matrix

and

are invertible matrices.

It turns out that the expression (45) can also be implemented using Hadamard product by the same technique in the expression (30). Note that the least squares solutions in term of Hadamard product is more efficient than (45) and (43).

3. Non-Homogeneous Matrix Differential Equations

The solution procedure presented here may be considered as a continuation of the method proposed to solve the homogenous coupled matrix differential equations in [18]. We will use our knowledge of the solution of the of simplest homogeneous matrix differential equation:

, (46)

where, are given scalar matrices, and is the unknown matrix function to be solved. In fact the unique solution of (46) is given by:

. (47)

Theorem 3.1 Let, are given scalar matrices, is a given matrix function and is the unknown matrix. Then the general solution of the non-homogeneous matrix differential equation:

, (48)

is given by

. (49)

Where is well-definedwhich involves the convolution product of two matrices and.

Proof: Suppose that is the particular solution of (48). The product rule of differentiation gives

.

Substituting these in (48) we obtain 

Thus

. (50)

Multiplying both sides of (50) by gives

(51)

Integrating both sides of (51) between 0 and t gives

(52)

Hence, by assumption, we conclude that the particular solution of equation (48) is

. (53)

Now from (47) and (53) we get (49).

Theorem 3.2 Let, , are given scalar matrices, is a given matrix function and is unknown diagonal matrix function. Then the general solution of non-homogeneous matrix differential equation

(54)

is given by

. (55)

Proof: Using the identity (34) we can transform (54) into the vector form:

(56)

Now, applying (49), then the unique solution of (56) is

If we put in Theorem 3.2 we obtain the following result.

Corollary 3.3 Let A, B, are given scalar matrices. Then general solution of the homogeneous matrix differential equation:

(57)

is given by

(58)

Now we will discuss the general class of non-homogeneous coupled matrix differential equations which defined in (4): By using the -notation of (4), we have

. (59)

Let

(60)

Now (59) can be written as 

and the general solution is given by:

. (61)

Note that there is many special cases can be considered from the above general class coupled matrix differential equations; now we will discuss some important special cases in the next results.  

Theorem 3.4 Let A, B, C, D, E, are given scalar matrices such that

;

are given matrix functions and, are the unknown diagonal matrices. Then the general solution of non-homogeneous coupled matrix differential equations:

(62)

is given by

(63)

Proof: Using the identity (34) we can transform (62) into the vector form:

(64)

From (61), this system has the following solution:

. (65)

Now we will deal with 

. (66)

Since, then we have

Then

But

;

.

So

(67)

Due to (67) we have

; (68)

. (69)

Now substitute (68) and (69) in (65), we get (63).

If we put in Theorem 3.4 we obtain the following result.

Corollary 3.5 Let A, B, C, D, E, are given scalar matrices such that

and, are the unknown diagonal matrices. Then the general solution of homogeneous coupled matrix differential equations:

(70)

is given by 

(71)

Corollary 3.6 Let, , E, are given scalar matrices and, are the unknown diagonal matrices. Then the general solution of homogeneous coupled matrix differential equations:

(72)

is given by

(73)

Proof: For any matrix, it is easily to show that

; (74)

. (75)

Now put in Corollary 3.5 we have

Similarly,

.

While if we applying the fundamental relation between and Kronecker product defined in (13) and using the same technique in the proof of Theorem 3.4 we obtain (for any matrix) the following result.

Theorem 3.7 Let A, B, C, D, E, are given scalar matrices such that, , , are given matrix functions and, are the unknown matrices. Then the general solution of non-homogeneous coupled matrix differential equations:

(76)

is given by

(77)

If we put and in Theorem 3.7 and using properties (16)-(19) we obtain the following results.

Corollary 3.8 Let B, D, E, are given scalar matrices such that and, are the unknown matrices. Then the general solution of homogeneous coupled matrix differential equations:

(78)

is given by

(79)

Corollary 3.9 Let A, C, E, are given scalar matrices such that and, are the unknown matrices .Then the general solution of homogeneous coupled matrix differential equations:

(80)

is given by

(81)

4. Concluding Remarks

We have studied an explicit characterization of the mappings

in terms of the selection matrix as in (11) and (12). We have also observed that the same matrix relates the two operators and as in (31) and (32). We used the fundamental relation between the Hadamard (Kronecker) product and diagonal extraction (vector) operator in (34) and (13) to derive our main results in Section 2 and 3 and, subsequently, to construct a computationally-efficient solution of coupled matrix least-squares problem and non-homogeneous coupled matrix differential equations. In fact, the Kronecker (Hadamard) product and operator () affirming their capability of solving matrix and matrix differential equations fast (more fast when the unknown matrices are diagonal). To demonstrate the usefulness of applying some properties of the Kronecker products, suppose we have to solve, for example, the following system:

, (82)

where A, are given scalar matrices and is unknown matrix to be solved. Then it is not hard by using the -notation to establish the following equivalence:

, (83)

and thus also by using the -notation product to establish the following equivalence:

, X is diagonal.   (84)

If we ignore the Kronecker (Hadamard) product structure, then we need to solve the following both matrix equations:

                                 (85)

Here, Y can be obtained in arithmetic operations (flops) by using LU factorization of matrix B (Forward Substitution).

                                (86)

Here X can be obtained also in operations (flops) by using LU factorization of matrix A (Back Substitution).

Now without exploiting the Kronecker product structure, an system defined in (82) would normally (by Gaussian elimination) require operations to solve. But when we use Kronecker product structure:, the calculations shows that can be obtained only in operations by using LU factorization of matrices A and B [20, pp. 87]. We can say that the system of the form: can be solved fast and the Kronecker structure also a voids the formation of matrices, only the smaller lower and upper triangular matrices LA, LB, UA, UB are needed. While if X is diagonal matrix and use the Hadamard product structure:, the calculations shows that can be obtained only in operations by using LU factorization of.

We can say that the system of the form: can be solved more fast than Kronecker structure, only the very smaller lower and upper triangular matrices and are needed. For example, consider A, B are 3 × 3 matrices and C is 9 × 1 vector. To demonstrate the usefulness of applying Kronecker product and -notation, we return to the system problem. If is non-singular and regarding with LU factorizetions of and, then a solution of system exists and can be written as:

. (87)

First, the lower triangular system can be solved by forward substitution as the following:

i.e.,

which can be solved in operations. The first three equations are:

.                  (88)

.     (89)


           (90)

Now the next three equations are:

.                     (91)

.     (92)


          . (93)

The first boldface expression in (91) can be computed as. The second boldface expression in (92) can be also computed as.

While the third boldface expression in (93) can be also computed as.

We use the previous expressions for obtaining, and in the first set of equations to simplify the second set of three equations. The simplified second set of equations becomes

. (94)

. (95)

. (96)

Solving the second set of equations takes operations and the forward solve step takes operations, so obtaining z4, z5 and z6 takes time. This simplification and using the work from the previous solution step continuous so that solving each of n-sets of n-equations takes time, resulting in an overall solution time of. Exploiting the Kronecker structure reduce the usual, expected time to solve to.

One final note regarding the exploitation of the Kronecker structure of the system remains. Suppose the matrices A and B are different sizes. Then, the time required to solve the system is, where is the size of A and is the size of B. In our work, the modeler has some choice for the size of the A and B matrices. Thus, a wise choice would make small, reducing the effect of the term in the computation time.

While when X is diagonal matrix and applying -notation, we return to the system problem:. If is non-singular matrix and regarding with LU factorizations of , then a solution of system exists and can be written as:

. (97)

First, the lower triangular system can be solved by forward substitution as the following:

which can be solved in operations as follows:

.    (98)

.               (99)

. (100)

5. Conclusion

The solution of coupled matrix linear least-squares problems and coupled matrix differential equations is studied and some important special cases are discussed. The analysis indicates that solving for is efficient and solving for is more efficient when the unknown matrices are diagonal. Although the algorithms are presented for non-homogeneous coupled matrix and matrix linear differential equations, the idea adopted can be easily extended to study coupled matrix nonlinear differential equations, e.g., the coupled matrix Riccati differential equations.

6. Acknowledgements

The author expresses his sincere thanks to referee (s) for careful reading of the manuscript and several helpful suggestions. The author also gratefully acknowledges that this research was partially supported by Deanship of Scientific Research/University of Dammam/Kingdom of Saudi Arabia/under the Grant No. 2012152.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] T. Kailath, “Linear Systems,” Prentice-Hall, Englewood Cliffs, 1980.
[2] T. Kailath and A. H. Sayed, “Displacement Structure: Theory and Applications,” SIAM Review, Vol. 37, No. 3, 1995, pp. 297-386. doi:10.1137/1037082
[3] G. Mouroutsos and P. D Sparis, “Taylor Series Approach to System Identification, Analysis and Optimal Control,” Journal of the Franklin Institute, Vol. 319, No. 3, pp. 359-371. doi:10.1016/0016-0032(85)90056-0
[4] L. Jódar and H. Abou-Kandil, “Kronecker Products and Coupled Matrix Riccati Differential Systems,” Linear Algebra and its Applications, Vol. 121, 1989, pp. 39-51. doi:10.1016/0024-3795(89)90690-3
[5] S. Campbell, “Singular Systems of Differential Equations II,” Pitman, London, 1982.
[6] M. Mariton, “Les Systèmes Linéaires á Sauts Markoviens,” Thèse d’Etat, Université Paris-Sud, 1986.
[7] J. B. Cruz and C. I. Chen, “Series Nash Solution of TwoPerson Nonzero Sum Linear Differential Games,” Journal of Optimization Theory and Applications, Vol. 7, No. 4, 1971, pp. 240-257. doi:10.1007/BF00928706
[8] Z. Al-Zhour and A. Kilicman, “Matrix Equalities and Inequalities Involving Khatri-Rao and Tracy-Singh Sums,” Journal of Inequalities in Pure & Applied Mathematics, Vol. 7, No. 1, 2006, pp. 496-513.
[9] A. Graham, “Kronecker Products and Matrix Calculus with Applications,” Ellis Horwood Ltd., New York, 1981.
[10] A. Kilicman and Z. Al-Zhour, “Vector Least-Squares Solutions of Coupled Singular Matrix Equations,” Journal of Computational and Applied Mathematics, Vol. 206, No. 2, 2007, pp. 1051-1069. doi:10.1016/j.cam.2006.09.009
[11] W.-H. Steeb, “Matrix Calculus and Kronecker Product with Applications and C++ Programs,” World Scientific Publishing Co. Pte. Ltd., Singapore, 1997.
[12] G. Visick, “A Quantitative Version of the Observation that the Hadamard Product Is A Principle Submatrix of the Kronecker Product,” Linear Algebra and its Applications, Vol. 304, No. 1-3, 2000, pp. 45-68. doi:10.1016/S0024-3795(99)00187-1
[13] H. Lev-Ari, “Efficient Solution of Linear Matrix Equations with Application to Multistatic Antenna Array Processing,” Communications in Information & Systems, Vol. 5, No. 1, 2005, pp. 123-130.
[14] C. R. Rao and M. B. Rao, “Matrix Algebra and its Applications to Statistics and Econometrics,” World Scientific Publishing Co. Pte. Ltd., Singapore, 1998.
[15] F. Ding and T. Chen, “Iterative Least-Squares Solutions of Coupled Sylvester Matrix Equations,” Systems & Control Letters, Vol. 54, No. 2, 2005, pp. 95-107. doi:10.1016/j.sysconle.2004.06.008
[16] S. Tauber, “An Applications of the Hadamard Product to Air Pollution,” Applied Mathematics and Computation, Vol. 4, No. 2, 1978, pp. 167-176. doi:10.1016/0096-3003(78)90020-6
[17] F. Zhang, Matrix Theory: Basic Results and Techniques, Springer-Verlag, New York, 1999.
[18] A. Kilicman and Z. Al-Zhour, “The General Common Exact Solutions of Coupled Linear Matrix and Matrix Differential Equations,” Journal of Computational Analysis and Applications, Vol. 1, No. 1, 2005, pp. 15-29.
[19] J. R. Magnus and H. Neudecker, “Matrix Differential Calculus with Applications in Statistics and Econometrics,” John Wiley and Sons Ltd., New York, 1999.
[20] G. F. Van Loan, “The Ubiquitous Kronecker Product,” Journal of Computational and Applied Mathematics, Vol. 123, No. 1-2, 2000, pp. 85-100. doi:10.1016/S0377-0427(00)00393-9

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.