1. Introduction
In general, it is hard to obtain an analytic solution for a nonlinear optimal control problem under state constraints. To solve a constrained optimal control problem, it is common to use a direct discretization approach to approximate the exact solution of the problem [1] [2]. Mathematically, the direct discretization methods should be supported by an investigation of the convergence of the solutions of the discretized problems to the solution of the continuous problem. One usually expects a prescribed error bound between a numerical value and the optimal objective value of the original problem. Although for many engineering applications the direct discretization methods can deal with constrained optimal control problems efficiently, more research on the theoretical foundation of their convergence is still expected [2]. The purpose of this paper is to provide a convergence result for a nonlinear optimal control problem under state constraint.
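Because the paper's problems are stated only abstractly in this excerpt, the direct discretization approach mentioned above can be sketched on a hypothetical scalar example. The dynamics, running cost, control bound, horizon, and solver below are all illustrative assumptions, not the paper's data:

```python
# Minimal sketch of direct discretization (transcription) for a hypothetical
# scalar constrained optimal control problem; all data here are illustrative.
# Minimize sum_k (x_k^2 + u_k^2) * dt subject to the Euler-discretized
# dynamics x_{k+1} = x_k + dt * (-x_k + u_k), the control bound |u_k| <= 1,
# and x_0 = 1, using projected gradient descent with finite-difference gradients.

N, dt, x0 = 20, 0.05, 1.0

def cost(u):
    x, total = x0, 0.0
    for uk in u:
        total += (x * x + uk * uk) * dt   # accumulate running cost
        x = x + dt * (-x + uk)            # explicit Euler step of the dynamics
    return total

def project(u):
    return [min(1.0, max(-1.0, uk)) for uk in u]  # enforce |u_k| <= 1

u = [0.0] * N
baseline = cost(u)
for _ in range(200):                      # projected gradient iterations
    base = cost(u)
    grad = []
    for k in range(N):
        bumped = list(u)
        bumped[k] += 1e-6
        grad.append((cost(bumped) - base) / 1e-6)  # forward-difference gradient
    u = project([uk - 0.5 * g for uk, g in zip(u, grad)])
optimized = cost(u)
```

A direct method of this kind returns only a numerical value; the convergence results developed in this paper concern precisely how such discretized or relaxed values approximate the optimal value of the continuous problem.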
In this paper, following traditional optimal control theory, an admissible control is measurable and bounded on the interval
such that the ordinary differential equation in the control problem has a unique solution.
We consider three nonlinear optimal control problems under state constraint as follows.
(1.1)
where the cost function
is continuously differentiable on
and
is a convex function on
. For this problem the matrix functions
on
are smooth and the vector
satisfying
is given in the control system in (1.1).
(1.2)
where the cost function
is continuously differentiable on
and
is a convex function on
. For this problem the matrix functions
on
are smooth and the vector
satisfying
is given in the control system in (1.2).
(1.3)
where the parameter
is given and the cost function
is continuously differentiable on
and
is a convex function on
. For this problem the matrix functions
on
are smooth and the vector
satisfying
is given in the control system in (1.3).
Main assumption: Throughout this paper, we assume that the admissible control sets of all problems concerned are nonempty.
Remark 1.1. Throughout the paper, by the optimal value of a problem we mean the infimum of the cost functional, i.e.
. On the other hand, noting that
is continuously differentiable and that the matrix functions
on
are smooth, by functional analysis we see that the cost functional
is continuous on the admissible control space.
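The continuity asserted in Remark 1.1 can be illustrated numerically on hypothetical data: for a discretized toy cost functional, a small perturbation of a bounded control changes the cost only slightly. The dynamics and cost below are illustrative assumptions, not the paper's problem:

```python
# Toy numerical illustration (hypothetical data) of the continuity of a cost
# functional with respect to the control, as in Remark 1.1: perturbing a
# bounded control slightly changes the cost only slightly.

N, dt = 50, 0.02

def cost(u):
    """Illustrative discretized cost: x' = u, running cost x^2 + u^2, x(0) = 1."""
    x, total = 1.0, 0.0
    for uk in u:
        total += (x * x + uk * uk) * dt
        x += dt * uk
    return total

u = [-0.5] * N                                   # a hypothetical admissible control
gaps = [abs(cost([uk + d for uk in u]) - cost(u)) for d in (0.1, 0.01, 0.001)]
# The cost gap should shrink with the size of the perturbation.
```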
The rest of the paper is organized as follows. In Section 2, to deal with the problem
, we present a partial differential equation by rewriting the Hamilton-Jacobi-Bellman equation. Then we create an extremal flow by a differential-algebraic equation to compute the optimal value of the problem
We prove a convergence theorem for an approximation approach to the optimal value of the problem
by a series of optimal values of the problem
with different parameters in Section 3. In Section 4, we provide a convergence result for the optimal value of the problem
by proving that the problem
and the problem
have the same optimal value. We give a proof of Theorem 2.1 in Section 5 and a conclusion in Section 6.
2. A Study on an Optimal Control Problem Subject to Mixed Control-State Constraint
In this section, we deal with the optimal control problem
by a partial differential equation. In the following, the positive number
is fixed. For given
, define a set
(2.1)
Note that if
then
. In the following we assume that
(2.2)
We consider the Hamilton-Jacobi-Bellman equation as follows:
(2.3)
For given
with
, define a function
(2.4)
then for
and
, we have
(2.5)
By (2.5), we can rewrite the Hamilton-Jacobi-Bellman equation in (2.3) with global optimization to obtain the following partial differential equation [3]:
(2.6)
We will solve the optimal control problem
in (1.3) by the partial differential equation in (2.6).
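The PDE (2.6) itself is not displayed in this excerpt, so the following sketch only illustrates the general mechanism of solving a Hamilton-Jacobi-Bellman equation with a pointwise minimization over the control, via backward dynamic programming on a grid (a semi-Lagrangian scheme). The dynamics, running cost, control bound, horizon, and grids are hypothetical stand-ins, not the paper's data:

```python
# Semi-Lagrangian dynamic programming sketch for a 1-D HJB equation:
#   -V_t = min_u [ L(x,u) + V_x f(x,u) ],  V(T, x) = 0,
# with illustrative data f(x,u) = u, L(x,u) = x^2 + u^2, |u| <= 1.

T, steps = 1.0, 50
dt = T / steps
xs = [-2.0 + 4.0 * i / 80 for i in range(81)]          # state grid on [-2, 2]
controls = [-1.0 + 2.0 * j / 20 for j in range(21)]    # control grid on [-1, 1]

def interp(V, x):
    """Piecewise-linear interpolation of grid values V at the point x."""
    x = min(max(x, xs[0]), xs[-1])
    i = min(int((x - xs[0]) / (xs[1] - xs[0])), len(xs) - 2)
    w = (x - xs[i]) / (xs[1] - xs[0])
    return (1 - w) * V[i] + w * V[i + 1]

V = [0.0 for _ in xs]                  # terminal condition V(T, .) = 0
for _ in range(steps):                 # march backward in time
    V = [min((x * x + u * u) * dt + interp(V, x + dt * u) for u in controls)
         for x in xs]

# By symmetry of the toy data, the value should be smallest at x = 0.
center = V[len(xs) // 2]
```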
Given a pair
, satisfying
, i.e.
. For
, in the following let
denote the global minimizer of
, i.e. . We need the following lemma to study the expression of
. For given
and
, we then define auxiliary function
. Again let
denote the global minimizer of
, i.e. . We see that, given
satisfying
, for
,
, we have
. By elementary optimization theory we have the following lemma.
Lemma 2.1. Given
and
. If
, then
. On the other hand, if
, then
,
.
For given
such that
, denoting
by
and denoting
by
, by Lemma 2.1, we see that, if
, then
(2.7)
and if
, then
(2.8)
Remark 2.1. By Lemma 2.1 we see that
is continuous with respect to
. We can obtain a viscosity solution of the partial differential equation in (2.6) [4]-[6]. Then the Hamilton-Jacobi-Bellman equation in (2.3) can be solved numerically [7].
Definition 2.1. For a solution
of the partial differential equation in (2.6), we call
an extremal flow related to
if it is a solution of the following differential-algebraic equation:
(2.9)
(2.10)
In the same way as in [7], we can prove the following theorem.
Theorem 2.1. Let
be a solution of the partial differential equation in (2.6) and
be an extremal flow defined by (2.9), (2.10). Then,
is an optimal control of the problem
, and
is the optimal value of the problem
.
Theorem 2.2. If the continuously differentiable function
is a solution of the partial differential equation in (2.6) on
and
is the function defined in (2.7), (2.8), then
is an optimal feedback control of the problem
.
Proof: Since
is continuous, by (2.7), (2.8), we see that
is continuous on
. By the classical theory of ordinary differential equations, we see that the equation
(2.11)
has a solution on
. Let the solution of the ODE in (2.11) be denoted by
and let
be denoted by
. By Lemma 2.1 and (2.7), (2.8), we see that
(2.12)
Noting (2.11), (2.12), by Definition 2.1, the pair
is an extremal flow related to
. It follows from Theorem 2.1 that
is an optimal feedback control of the problem
.
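The feedback law given by (2.7), (2.8) is not displayed in this excerpt, so the closed-loop construction in (2.11) can only be illustrated with a hypothetical stand-in: a saturated linear feedback plugged into assumed dynamics and integrated by the explicit Euler method. Every function below is an illustrative assumption:

```python
# Sketch of integrating a closed-loop ODE  x'(t) = f(x, mu(t, x))  as in (2.11),
# with hypothetical data: f(x, u) = u and a saturated feedback mu(t, x).

def mu(t, x):
    return min(1.0, max(-1.0, -x))      # hypothetical saturated feedback

def f(x, u):
    return u                            # hypothetical dynamics x' = u

def closed_loop(x0, T=5.0, n=1000):
    """Explicit Euler integration of x' = f(x, mu(t, x)), returning the path."""
    dt = T / n
    x, path = x0, [x0]
    for k in range(n):
        t = k * dt
        x = x + dt * f(x, mu(t, x))
        path.append(x)
    return path

path = closed_loop(2.0)
# Under this stabilizing feedback the trajectory should decay toward 0.
```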
3. An Approximation Approach to the Optimal Value of Problem
In this section we show a convergence result for an approximation approach to the optimal value of problem
which is restated as follows.
(3.1)
where the cost function
is continuously differentiable on
and
is a convex function on
. In this problem the matrix functions
on
are continuously differentiable and the vector
such that
are given in the control system in (3.1).
In the following, for a given positive number
, the optimal value of problem
is denoted by
and the optimal value of problem
is denoted by
.
Lemma 3.1. (i). For each given number
,
. (ii). If
, then
.
Proof: Firstly, let
be an admissible pair of the problem
. Note that the functions
and the vector
appearing in
and
are the same. It follows from the fact
,
that
,
. Thus
is also an admissible pair of the problem
. Consequently,
.
Secondly, let
be an admissible pair of the problem
with the parameter
. Note that the functions
and the vector
appearing in
do not depend on the parameter
. Noting
, it follows from the fact
,
that
,
. Thus
is also an admissible pair of the problem
with the parameter
. Consequently,
. The lemma is proved.
Theorem 3.1. For given
there exists a positive number
such that
(3.2)
Proof: Given
. Let
be an admissible pair of the problem
such that
(3.3)
noting that
is the infimum of
for the problem
.
Noting that
is continuous and , we see that there is a
such that
(3.4)
Noting that the admissible control is bounded on
, there is a number
such that
(3.5)
By (3.4), (3.5), there is a
, such that
(3.6)
Thus
is an admissible pair of both problem
and problem
with the parameter
. Then we have
(3.7)
By Lemma 3.1 and (3.3),(3.7) we have
(3.8)
Therefore (3.2) is true and the theorem has been proved.
Corollary 3.1. Let
be a decreasing sequence of positive numbers satisfying
when
. Then
(3.9)
Proof: By Lemma 3.1, noting (3.3), (3.6) in the proof of Theorem 3.1, for each
, there is an admissible pair
of the problem
satisfying
(3.10)
noting that
is the infimum of
for the problem
.
Noting that
is continuous and
, we see that there is a
such that
(3.11)
and noting that the admissible control is bounded on
, there is a number
such that
(3.12)
By (3.11), (3.12), there is a
, such that
(3.13)
Thus
is an admissible pair of both problem
and problem
with the parameter
. Then we have
(3.14)
This process (3.10)-(3.14) begins from
. However, by Lemma 3.1, we see that once a positive number
has been obtained satisfying (3.13), then for every positive number
less than
the process (3.10)-(3.14) still works. For
, we choose
as in (3.10)-(3.14). After the step
has been done, in the next
step we choose
such that
(3.15)
Then we see that in this way the positive sequence
is strictly decreasing and tends to zero when
. By the same deduction as for (3.8) in the proof of Theorem 3.1, or by Lemma 3.1 and (3.10), (3.14), we have, for each
,
(3.16)
Therefore we have
(3.17)
with the positive sequence
being strictly decreasing and tending to zero when
. Corollary 3.1 is proved.
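Since the problems themselves are stated abstractly, the monotone behavior in Lemma 3.1 and the convergence in Corollary 3.1 can only be illustrated on a hypothetical static analogue: minimize (x - 2)^2 subject to x <= 1, relaxed to x <= 1 + eps; the relaxed optimal value (1 - eps)^2 increases to the unrelaxed value 1 as eps decreases to 0. All data below are illustrative:

```python
# Toy illustration of the epsilon-relaxation idea in Section 3 (hypothetical
# data): minimize (x - 2)^2 subject to x <= 1 + eps; the exact optimal value
# of the relaxed problem is (1 - eps)^2.

def relaxed_value(eps, grid_n=10001):
    """Brute-force optimal value of the relaxed toy problem on a fine grid."""
    lo, hi = -3.0, 1.0 + eps
    best = float("inf")
    for i in range(grid_n):
        x = lo + (hi - lo) * i / (grid_n - 1)
        best = min(best, (x - 2.0) ** 2)
    return best

eps_seq = [1.0 / 2 ** k for k in range(8)]   # strictly decreasing, tends to 0
values = [relaxed_value(e) for e in eps_seq]
# As in Lemma 3.1 / Corollary 3.1: the relaxed optimal values are
# nondecreasing as eps decreases, and approach the unrelaxed value.
```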
4. On the Optimal Value of Problem
In this section we deal with the problem
:
(4.1)
In the following, the optimal value of the problem
is denoted by
. Recall that in Section 3 the optimal value of the problem
is denoted by
. In the following lemma, noting that
, we define two sets of admissible controls as follows:
Lemma 4.1. Under the notations above, we have
(4.2)
Proof: Let
be the solution of the linear equation
corresponding to an admissible control
. It is clear that
. Consequently,
(4.4)
The lemma is proved.
In the following lemma we recall that, in the first section of this paper, we have assumed that the admissible control set
is not empty.
Lemma 4.2. Let
. Then for any admissible control
such that
,
, we have, for
,
(4.5)
Proof: Let
and
be the trajectories of the linear system
,
corresponding to
and
respectively. Noting that
is convex, we have, for
,
(4.6)
also noting that
and
,
. The lemma is proved.
Theorem 4.1. Let the notations be as in Lemma 4.1. Then
(4.7)
Proof: Let . We show (4.7) by induction. In the initial step, for given
, we have an admissible control
satisfying ,
and
(4.8)
noting that
is the infimum of
for the problem
.
Recalling Remark 1.1, and noting that each admissible control is bounded, the function
in the cost functional for the concerned problems is continuously differentiable and the cost functional
is continuous on the control space. By Lemma 4.2, there exists a number
, such that the control
(4.9)
satisfies
(4.10)
and
(4.11)
Thus
is also an admissible control for the problem
. Then by Lemma 4.1 and (4.10) we have
(4.12)
consequently,
(4.13)
Next, as in the previous step, we have an admissible control, denoted by
satisfying ,
such that
noting that
is the infimum of
for the problem
. As in the previous step, we have a number
, such that the control
(4.14)
satisfies
(4.15)
and
(4.16)
Thus
is also an admissible control for the problem
. Then by Lemma 4.1 and (4.15) we have
(4.17)
consequently,
(4.18)
Repeating the process from (4.13) to (4.18), by induction, for
, when
, we are in the initial step and have , and at the
-th step, we have
(4.19)
Thus for each positive integer
, we have
Letting
, we have
(4.20)
Therefore (4.7) is true and the theorem has been proved.
By Theorem 4.1 and Corollary 3.1, we have the following convergence result.
Corollary 4.1. Let
be a decreasing sequence of positive numbers satisfying
when
. Then
5. A Proof of Theorem 2.1
By (2.9), (2.10), we have, for
,
(5.1)
Integrating the above equality with respect to
from 0 to
, noting that
,
, we have
(5.2)
and
(5.3)
Now let
be an arbitrary admissible pair of the control system in the problem
. We have, for
,
(5.4)
which implies
. Thus, by (2.5) with
for a
, we have
(5.5)
Then for each
, by the partial differential equation in (2.6), also noting that
is an arbitrary admissible pair of the control system in the problem
, we have, by (5.5),
(5.6)
Integrating the above inequality over
, noting
,
, by (5.6), we have
(5.7)
By (5.3), (5.7), we have
(5.8)
By (5.8), we see that
is an optimal control for the problem
and
is the optimal value of the problem
.
The theorem has been proved.
6. Conclusion
In this paper, we study a nonlinear optimal control problem under state constraints. First, we deal with a nonlinear optimal control problem subject to a mixed control-state constraint, constructing a partial differential equation from the Hamilton-Jacobi-Bellman equation via global optimization. Then we provide a convergence result for an approximation approach to the optimal value of a nonlinear optimal control problem under state constraint.
Conflicts of Interest
The author declares no conflicts of interest.