Skip to main navigation Skip to search Skip to main content

Bellman equation and viscosity solutions for mean-field stochastic control problem

  • Huyên Pham*
  • , Xiaoli Wei
  • *Corresponding author for this work
  • CNRS

Research output: Contribution to journalArticlepeer-review

Abstract

We consider the stochastic optimal control problem of McKean-Vlasov stochastic differential equation where the coefficients may depend upon the joint law of the state and control. By using feedback controls, we reformulate the problem into a deterministic control problem with only the marginal distribution of the process as controlled state variable, and prove that dynamic programming principle holds in its general form. Then, by relying on the notion of differentiability with respect to probability measures recently introduced by P.L. Lions in [32], and a special Itô formula for flows of probability measures, we derive the (dynamic programming) Bellman equation for mean-field stochastic control problem, and prove a verification theorem in our McKean- Vlasov framework. We give explicit solutions to the Bellman equation for the linear quadratic mean-field control problem, with applications to the mean-variance portfolio selection and a systemic risk model. We also consider a notion of lifted viscosity solutions for the Bellman equation, and show the viscosity property and uniqueness of the value function to the McKean-Vlasov control problem. Finally, we consider the case of McKean-Vlasov control problem with open-loop controls and discuss the associated dynamic programming equation that we compare with the case of closed-loop controls.

Original languageEnglish
JournalESAIM - Control, Optimisation and Calculus of Variations
Volume24
Issue number1
DOIs
StatePublished - 2018
Externally publishedYes

Keywords

  • Bellman equation
  • Dynamic programming
  • McKean-Vlasov SDEs
  • Viscosity solutions
  • Wasserstein space

Fingerprint

Dive into the research topics of 'Bellman equation and viscosity solutions for mean-field stochastic control problem'. Together they form a unique fingerprint.

Cite this