Apprenticeship Learning with Prior Beliefs: Toward Continuous-Time Inverse Optimal Control

seminario tenuto da

Mauricio Junca

Giugno

Giovedì

SEMINARIO INTERDISCIPLINARE

ore 11:00

presso - Aula Da Stabilire -

nell'ambito della serie: STOCHASTICS AND APPLICATIONS

Inverse reinforcement learning seeks to recover a cost function that explains an expert’s behavior, but the problem is generally ill-posed. This talk presents a regularized inverse-optimization approach in which prior beliefs are used to select meaningful costs, even when the observed expert is not exactly optimal. After discussing the main results in the discrete setting, I develop a parallel with continuous-time stochastic control, where occupation measures, HJB inequalities, and Sobolev regularization lead naturally to an inverse PDE and variational-inequality framework.

Torna alla pagina dei seminari del Dipartimento di Matematica di Bologna

Apprenticeship Learning with Prior Beliefs: Toward Continuous-Time Inverse Optimal Control

seminario tenuto da Mauricio Junca

seminario tenuto da

Mauricio Junca