Seminars of the Department of Mathematics
Università di Bologna

Apprenticeship Learning with Prior Beliefs: Toward Continuous-Time Inverse Optimal Control

seminar given by
Mauricio Junca

Thursday, June 25
INTERDISCIPLINARY SEMINAR
11:00
Venue: room to be determined
part of the series: STOCHASTICS AND APPLICATIONS
Inverse reinforcement learning seeks to recover a cost function that explains an expert’s behavior, but the problem is generally ill-posed. This talk presents a regularized inverse-optimization approach in which prior beliefs are used to select meaningful costs, even when the observed expert is not exactly optimal. After discussing the main results in the discrete setting, I develop a parallel with continuous-time stochastic control, where occupation measures, HJB inequalities, and Sobolev regularization lead naturally to an inverse PDE and variational-inequality framework.

organized by: Stefano Pagliarani