Optimal control of diffusion processes with terminal constraint in law (Q2082225): Difference between revisions

The author considers the stochastic optimal control problem written as: \[ inf_{\alpha _{t}\in \mathcal{A}}\mathbb{E}[\int_{0}^{T}(f_{1}(t,X_{t},\alpha _{t})+f_{2}(t,\mathcal{L}(X_{t})))dt+g(\mathcal{L}(X_{T}))], \] under the constraint \(\Psi (\mathcal{L}(X_{T}))\leq 0\) for the diffusion solution satisfying: \(dX_{t}=b(t,X_{t},\alpha _{t})dt+\sqrt{2}\sigma (t,X_{t},\alpha _{t})dB_{t}\), with the initial condition \(\mathcal{L}(X_{0})=m_{0}\in \mathcal{P}_{2}(\mathbb{R}^{d})\), the space of probability measures over \( \mathbb{R}^{d}\) with finite second-order moment. Here \(f_{1}:[0,T]\times \mathbb{R}^{d}\times A\rightarrow \mathbb{R}\) and \(f_{2}:[0,T]\times \mathcal{P}_{2}(\mathbb{R}^{d})\rightarrow \mathbb{R}\) are the instantaneous costs, \(g:\mathcal{P}_{2}(\mathbb{R}^{d})\rightarrow \mathbb{R}\) is the terminal cost, \(\Psi :\mathcal{P}_{2}(\mathbb{R}^{d})\rightarrow \mathbb{R}\) is the final constraint, \(b:[0,T]\times \mathbb{R}^{d}\times A\rightarrow \mathbb{R}^{d}\) and \(\sigma :[0,T]\times \mathbb{R}^{d}\times A\rightarrow \mathbb{S}_{d}(\mathbb{R})\), the space of symmetric matrices of size \( d\times d\), are, respectively, the drift and the volatility of the controlled process \(X\), and \(\alpha \) is the control process with values in the control space \(A\), a closed subset of an Euclidean space. The main purpose of the paper is to prove that optimal Markov policies exist and are related to the solutions \((\lambda ,u,m)\in \mathbb{R}^{+}\times C_{b}^{1,2}([0,T]\times \mathbb{R}^{d})\times C^{0}([0,T],\mathcal{P}_{2}( \mathbb{R}^{d}))\) to the following system of partial differential equations (optimality conditions): \(-\partial _{t}u(t,x)+H(t,x,Du(t,x),D^{2}u(t,x))= \frac{\delta f_{2}}{\delta m}(t,m(t),x)\), \(\partial _{t}m-div(\partial _{p}H(t,x,Du(t,x),D^{2}u(t,x))m)+\sum_{i,j}\partial _{ij}^{2}((\partial _{M}H(t,x,Du(t,x),D^{2}u(t,x)))_{ij}m)=0\), in \([0,T]\times \mathbb{R}^{d}\), with the conditions \(u(T,x)=\lambda \frac{\delta \Psi }{\delta m}(m(T),x)+ \frac{\delta g}{\delta m}(m(T),x)\) in \(\mathbb{R}^{d}\), \(m(0)=m0\), \(\lambda (m(T))=0\), \(\Psi (m(T))\leq 0\), \(\lambda \geq 0\). Here \(H(t,x,p,M):=sup_{a \in A}\{-b(t,x,a)\cdot p-\sigma ^{t}\sigma (t,x,a)\cdot M-f_{1}(t,x,a)\}\) is the Hamiltonian of the system. The (backward) first equation is a Hamilton-Jacobi-Bellman equation satisfied by the adjoint state \(u\). The (forward) second equation is a Fokker-Planck equation which describes the evolution of the probability distribution \(m\) of the optimally controlled process. The nonnegative parameter \(\lambda \) is the Lagrange multiplier associated with the terminal constraint. The author introduces the Lagrangian \(L(t,x,q,N)=sup_{(p,M)\in \mathbb{R}^{d}\times \mathbb{S}_{d}( \mathbb{R})}\{-p\cdot q-M\cdot N-H(t,x,p,M)\}=H^{\ast }(t,x,-q,-N)\). The first main result proves, under appropriate hypotheses on the data of the problem, the existence of optimal Markov policies and if \((\alpha _{t})\in \mathcal{A}\) is an optimal Markov policy, there exists \((\lambda ,\varphi ,m)\in \mathbb{R}^{+}\times C_{b}^{1,2}([0,T]\times \mathbb{R}^{d})\times C^{0}([0,T],\mathcal{P}_{2}(\mathbb{R}^{d}))\) such that for \(m(t)\otimes dt\) almost all \((t,x)\in \lbrack 0,T]\times \mathbb{R}^{d}\) \(H(t,x,D\phi (t,x),D^{2}\phi (t,x))=-b(t,x,\alpha (t,x))\cdot D\varphi (t,x)-\sigma ^{t} \sigma (t,x,\alpha (t,x))\cdot D^{2}\phi (t,x)-f_{1}(t,x,\alpha (t,x))\}\), and \((\lambda ,\phi ,m)\) satisfies the above system of optimality conditions. If \(\Psi \), \(f_{2}\) and \(g\) are convex with respect to the measure argument and under different assumptions, the conditions of the previous result are also proved to be sufficient conditions, that is if \( \alpha \in L^{0}([0,T]\times \mathbb{R}^{d},A)\) satisfies the previous Hamiltonian equality for some \((\lambda ,\phi ,m)\) satisfying the system of optimality conditions, then the problem \(dX_{t}=b(t,X_{t},\alpha (t,X_{t}))dt+\sqrt{2}\sigma (t,X_{t},\alpha (t,X_{t}))dB_{t}\), starting from \(X_{0}\), has unique strong solution \(X_{t}\), and \(m(t)=\mathcal{L}(X_{t})\) and \(\alpha _{t}:=\alpha (t,X_{t})\) is a Markovian solution to the minimization problem \(inf_{\alpha \in \mathcal{U}_{ad}}J_{SP}(\alpha )\), where \(J_{SP}\) is the above-defined cost functional and \(\mathcal{U}_{ad}\) is the set of admissible controls \(\mathcal{U}_{ad}=\{\alpha \in \mathcal{A} :\Psi (\mathcal{L}(X_{T}^{\alpha }))\leq 0\) and \(J_{SP}(\alpha )<+\infty \}\) . For the proof, the author first proves that the Hamilton-Jacobi-Bellman equation admits a unique strong solution \(\phi \in C_{b}^{\frac{3+\alpha }{2} ,3+\alpha }([0,T]\times \mathbb{R}^{d})\). He introduces a relaxed problem for which he proves an existence result. He proves Lipschitz continuity properties for the solution to the Hamilton-Jacobi-Bellman equation.

0 references

reviewed by

Alain Brillard

0 references

zbMATH Keywords

stochastic optimal control

0 references

constraints in law

0 references

Hamilton-Jacobi-Bellman equation

0 references

Fokker-Planck equation

0 references

mean field games

0 references

minmax

0 references

convex duality

0 references

existence result

0 references

relaxed problem

0 references

optimal Markov policies

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Mean Field Games: Numerical Methods

0 references

Convex viscosity solutions and state constraints

0 references

Q4954179

0 references

Q4663947

0 references

Stochastic homogenization of quasilinear Hamilton-Jacobi equations and geometric motions

0 references

A weak Bernstein method for fully non-linear elliptic equations

0 references

A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem

0 references

Controllability of a Fokker-Planck equation, the Schrödinger system, and a related stochastic optimal control (revised version)

0 references

Optimal Control under Stochastic Target Constraints

0 references

Stochastic Target Problems with Controlled Loss

0 references

\(C^{1,\beta}\) regularity of viscosity solutions via a continuous-dependence result

0 references

Stable solutions in potential mean field game systems

0 references

Second order mean field games with degenerate diffusion and local coupling

0 references

Probabilistic Theory of Mean Field Games with Applications I

0 references

On dynamic programming principle for stochastic control under expectation constraints

0 references

User’s guide to viscosity solutions of second order partial differential equations

0 references

Q4702909

0 references

Compactification methods in the control of degenerate diffusions: existence of an optimal control

0 references

Existence and uniqueness of martingale solutions for SDEs with rough or degenerate coefficients

0 references

Controlled Markov processes and viscosity solutions

0 references

Convex Duality Approach to the Optimal Control of Diffusions

0 references

Quantile hedging

0 references

Portfolio optimization with a prescribed terminal wealth distribution

0 references

Calibration of local‐stochastic volatility models by optimal transport

0 references

Convexity of solutions and \(C^{1,1}\) estimates for fully nonlinear elliptic equations

0 references

Viscosity solutions of fully nonlinear second-order elliptic partial differential equations

0 references

Q4002114

0 references

SOME NEW RESULTS IN THE THEORY OF CONTROLLED DIFFUSION PROCESSES

0 references

Mean field games via controlled martingale problems: existence of Markovian equilibria

0 references

Mean field games

0 references

Homogenization of degenerate second-order PDE in periodic and almost periodic environments and applications

0 references

Two End Points Marginal Problem by Stochastic Optimal Transportation

0 references

Duality theorem for the stochastic optimal control problem

0 references

Envelope Theorems for Arbitrary Choice Sets

0 references

A variational approach to the mean field planning problem

0 references

Optimality conditions in variational form for non-linear constrained stochastic control problems

0 references

Duality and Approximation of Stochastic Optimal Control Problems under Expectation Constraints

0 references

Mass transportation problems. Vol. 1: Theory. Vol. 2: Applications

0 references

Minimax monotonicity

0 references

Dynamic programming for stochastic target problems and geometric flows

0 references

Q3862204

0 references

Optimal transportation under controlled stochastic dynamics

0 references

Well-posedness of multidimensional diffusion processes with weakly differentiable coefficients

0 references

Q4805362

0 references

Optimal Transport

0 references

On the regularity theory of fully nonlinear parabolic equations: I

0 references

On the regularity theory of fully nonlinear parabolic equations: II

0 references

Q4255599

0 references

Identifiers

zbMATH Open document ID

1503.49023

0 references

DOI

10.1007/s10957-022-02053-8

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2082225

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W3115234400
@@ Property / OpenAlex ID: W3115234400 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.10707
@@ Property / arXiv ID: 2012.10707 / rank @@
+Normal rank
@@ Property / cites work @@
+Mean Field Games: Numerical Methods
@@ Property / cites work: Mean Field Games: Numerical Methods / rank @@
+Normal rank
@@ Property / cites work @@
+Convex viscosity solutions and state constraints
@@ Property / cites work: Convex viscosity solutions and state constraints / rank @@
+Normal rank
@@ Property / cites work @@
+Q4954179
@@ Property / cites work: Q4954179 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4663947
@@ Property / cites work: Q4663947 / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic homogenization of quasilinear Hamilton-Jacobi equations and geometric motions
+Normal rank
@@ Property / cites work @@
+A weak Bernstein method for fully non-linear elliptic equations
+Normal rank
@@ Property / cites work @@
+A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem
+Normal rank
@@ Property / cites work @@
+Controllability of a Fokker-Planck equation, the Schrödinger system, and a related stochastic optimal control (revised version)
+Normal rank
@@ Property / cites work @@
+Optimal Control under Stochastic Target Constraints
+Normal rank
@@ Property / cites work @@
+Stochastic Target Problems with Controlled Loss
@@ Property / cites work: Stochastic Target Problems with Controlled Loss / rank @@
+Normal rank
@@ Property / cites work @@
+\(C^{1,\beta}\) regularity of viscosity solutions via a continuous-dependence result
+Normal rank
@@ Property / cites work @@
+Stable solutions in potential mean field game systems
+Normal rank
@@ Property / cites work @@
+Second order mean field games with degenerate diffusion and local coupling
+Normal rank
@@ Property / cites work @@
+Probabilistic Theory of Mean Field Games with Applications I
+Normal rank
@@ Property / cites work @@
+On dynamic programming principle for stochastic control under expectation constraints
+Normal rank
@@ Property / cites work @@
+User’s guide to viscosity solutions of second order partial differential equations
+Normal rank
@@ Property / cites work @@
+Q4702909
@@ Property / cites work: Q4702909 / rank @@
+Normal rank
@@ Property / cites work @@
+Compactification methods in the control of degenerate diffusions: existence of an optimal control
+Normal rank
@@ Property / cites work @@
+Existence and uniqueness of martingale solutions for SDEs with rough or degenerate coefficients
+Normal rank
@@ Property / cites work @@
+Controlled Markov processes and viscosity solutions
+Normal rank
@@ Property / cites work @@
+Convex Duality Approach to the Optimal Control of Diffusions
+Normal rank
@@ Property / cites work @@
+Quantile hedging
@@ Property / cites work: Quantile hedging / rank @@
+Normal rank
@@ Property / cites work @@
+Portfolio optimization with a prescribed terminal wealth distribution
+Normal rank
@@ Property / cites work @@
+Calibration of local‐stochastic volatility models by optimal transport
+Normal rank
@@ Property / cites work @@
+Convexity of solutions and \(C^{1,1}\) estimates for fully nonlinear elliptic equations
+Normal rank
@@ Property / cites work @@
+Viscosity solutions of fully nonlinear second-order elliptic partial differential equations
+Normal rank
@@ Property / cites work @@
+Q4002114
@@ Property / cites work: Q4002114 / rank @@
+Normal rank
@@ Property / cites work @@
+SOME NEW RESULTS IN THE THEORY OF CONTROLLED DIFFUSION PROCESSES
+Normal rank
@@ Property / cites work @@
+Mean field games via controlled martingale problems: existence of Markovian equilibria
+Normal rank
@@ Property / cites work @@
+Mean field games
@@ Property / cites work: Mean field games / rank @@
+Normal rank
@@ Property / cites work @@
+Homogenization of degenerate second-order PDE in periodic and almost periodic environments and applications
+Normal rank
@@ Property / cites work @@
+Two End Points Marginal Problem by Stochastic Optimal Transportation
+Normal rank
@@ Property / cites work @@
+Duality theorem for the stochastic optimal control problem
+Normal rank
@@ Property / cites work @@
+Envelope Theorems for Arbitrary Choice Sets
@@ Property / cites work: Envelope Theorems for Arbitrary Choice Sets / rank @@
+Normal rank
@@ Property / cites work @@
+A variational approach to the mean field planning problem
+Normal rank
@@ Property / cites work @@
+Optimality conditions in variational form for non-linear constrained stochastic control problems
+Normal rank
@@ Property / cites work @@
+Duality and Approximation of Stochastic Optimal Control Problems under Expectation Constraints
+Normal rank
@@ Property / cites work @@
+Mass transportation problems. Vol. 1: Theory. Vol. 2: Applications
+Normal rank
@@ Property / cites work @@
+Minimax monotonicity
@@ Property / cites work: Minimax monotonicity / rank @@
+Normal rank
@@ Property / cites work @@
+Dynamic programming for stochastic target problems and geometric flows
+Normal rank
@@ Property / cites work @@
+Q3862204
@@ Property / cites work: Q3862204 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal transportation under controlled stochastic dynamics
+Normal rank
@@ Property / cites work @@
+Well-posedness of multidimensional diffusion processes with weakly differentiable coefficients
+Normal rank
@@ Property / cites work @@
+Q4805362
@@ Property / cites work: Q4805362 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal Transport
@@ Property / cites work: Optimal Transport / rank @@
+Normal rank
@@ Property / cites work @@
+On the regularity theory of fully nonlinear parabolic equations: I
+Normal rank
@@ Property / cites work @@
+On the regularity theory of fully nonlinear parabolic equations: II
+Normal rank
@@ Property / cites work @@
+Q4255599
@@ Property / cites work: Q4255599 / rank @@
+Normal rank