詳細書目資料

資料來源: Google Book
14
0
0
0
0

Zero-sum discrete-time Markov games with unknown disturbance distribution : discounted and average criteria

  • 作者: Minjarez-Sosa, J. Adolfo, author.
  • 其他作者:
  • 其他題名:
    • SpringerBriefs in probability and mathematical statistics.
  • 出版: Cham : Springer International Publishing :Imprint: Springer
  • 叢書名: SpringerBriefs in probability and mathematical statistics,
  • 主題: Markov processes. , Differential games. , Probability Theory and Stochastic Processes.
  • ISBN: 9783030357207 (electronic bk.) 、 9783030357191 (paper)
  • FIND@SFXID: CGU
  • 資料類型: 電子書
  • 內容註: Zero-sum Markov games -- Discounted optimality criterion -- Average payoff criterion -- Empirical approximation-estimation algorithms in Markov games -- Difference-equation games: examples -- Elements from analysis -- Probability measures and weak convergence -- Stochastic kernels -- Review on density estimation.
  • 摘要註: This SpringerBrief deals with a class of discrete-time zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs, under discounted and average criteria, whose state process evolves according to a stochastic difference equation. The corresponding disturbance process is an observable sequence of independent and identically distributed random variables with unknown distribution for both players. Unlike the standard case, the game is played over an infinite horizon evolving as follows. At each stage, once the players have observed the state of the game, and before choosing the actions, players 1 and 2 implement a statistical estimation process to obtain estimates of the unknown distribution. Then, independently, the players adapt their decisions to such estimators to select their actions and construct their strategies. This book presents a systematic analysis on recent developments in this kind of games. Specifically, the theoretical foundations on the procedures combining statistical estimation and control techniques for the construction of strategies of the players are introduced, with illustrative examples. In this sense, the book is an essential reference for theoretical and applied researchers in the fields of stochastic control and game theory, and their applications.
  • 讀者標籤:
  • 引用連結:
  • Share:
  • 系統號: 005480913 | 機讀編目格式
  • 館藏資訊

    This SpringerBrief deals with a class of discrete-time zero-sum Markov games with Borel state and action spaces, and possibly unbounded payoffs, under discounted and average criteria, whose state process evolves according to a stochastic difference equation. The corresponding disturbance process is an observable sequence of independent and identically distributed random variables with unknown distribution for both players. Unlike the standard case, the game is played over an infinite horizon evolving as follows. At each stage, once the players have observed the state of the game, and before choosing the actions, players 1 and 2 implement a statistical estimation process to obtain estimates of the unknown distribution. Then, independently, the players adapt their decisions to such estimators to select their actions and construct their strategies. This book presents a systematic analysis on recent developments in this kind of games. Specifically, the theoretical foundations on the procedures combining statistical estimation and control techniques for the construction of strategies of the players are introduced, with illustrative examples. In this sense, the book is an essential reference for theoretical and applied researchers in the fields of stochastic control and game theory, and their applications.

    資料來源: Google Book
    延伸查詢 Google Books Amazon
    回到最上