Pasar al contenido principal

An Abelian theorem for a Markov decision process in a system of interacting objects with unknown random disturbance law

Autor/es Anáhuac
José D. López-Barrientos
Año de publicación
2024
Journal o Editorial
Pure and Applied Functional Analysis

Abstract

This paper studies a mean-field approach for Markov decision processes in a class of systems of a large number of objects that interact with each other according to an observable but unknown law for the central controller. The central controller acts under the ergodic cost criterion with Borel state and control spaces, bounded costs, and compact action space. We depart from the characterization of the discounted optimal strategies, and then, by means of an Abelian theorem, we study the existence of average cost optimal stationary policies in the original model. We also analyze the performance of the mean field limit optimal policies in the original model.