scientific article; zbMATH DE number 7625165

Raihan Seraj, Jayakumar Subramanian, Aditya Mahajan, Amit K. Sinha

Publication date: 29 November 2022

Full work available at URL: https://arxiv.org/abs/2010.08843

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.

partially observable Markov decision processes approximate dynamic programming information state approximate information state partially observed reinforcement learning

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Related Items (3)

Robustness and sample complexity of model-based MARL for general-sum Markov games ⋮ Separation of learning and control for cyber-physical systems ⋮ Unnamed Item

Uses Software

OpenAI Gym
POMDPs.jl
MiniGrid

Cites Work

Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Planning and acting in partially observable stochastic domains
Equivalence of distance-based and RKHS-based statistics in hypothesis testing
Decentralized stochastic control
Finding optimal memoryless policies of POMDPs under the expected average reward criterion
On essential information in sequential decision processes
Equivalence notions and model minimization in Markov decision processes
Lipschitz continuity of value functions in Markovian decision processes
Toward a quantitative theory of self-generated complexity
Stochastic approximation with two time scales
Finite approximations in discrete-time stochastic control. Quantized models and asymptotic optimality
On the empirical estimation of integral probability metrics
Sufficient statistics in the optimum control of stochastic systems
Optimal control of Markov processes with incomplete state information
Information states for linear stochastic systems
Introduction to stochastic control theory
Reinforcement learning of non-Markov decision processes
The Complexity of Optimal Queuing Network Control
Optimally Solving Dec-POMDPs as Continuous-State MDPs
A Concise Introduction to Decentralized POMDPs
Temporal logic motion planning using POMDPs with parity objectives
Decentralized optimal control of Markov chains with a common past information set
Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning
Markov decision processes with noise-corrupted and delayed state observations
Bisimulation Metrics for Continuous Markov Decision Processes
Linear Automaton Transformations
Probability Metrics
Optimal causal coding - decoding problems
Optimal Performance of Networked Control Systems with Nonclassical Information Structures
Recurrent policy gradients
Solution of some nonclassical LQG stochastic decision problems
Convergence of discretization procedures in dynamic programming
Dynamic programming approach to decentralized stochastic control problems
Survey of decentralized control methods for large scale systems
Approximations of Dynamic Programs, I
A separation theorem for periodic sharing information patterns in decentralized control
Integral Probability Metrics and Their Generating Classes of Functions
How Does the Value Function of a Markov Decision Process Depend on the Transition Probabilities?
The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
OnActor-Critic Algorithms
Sufficient Conditions for the Value Function and Optimal Strategy to be Even and Quasi-Convex
Optimal Design of Sequential Real-Time Communication Systems
Sequential Problems in Decentralized Detection With Communication
Optimal Control Strategies in Delayed Sharing Information Structures
Networked Markov Decision Processes With Delays
Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach
Optimal Decentralized Control of Coupled Subsystems With Control Sharing
Learning representations by back-propagating errors
On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability
The Common-Information Approach to Decentralized Stochastic Control
A Counterexample in Stochastic Optimum Control
Team Decision Problems
An example of interaction between information and control: The Transparency of a game
The Complexity of Decentralized Control of Markov Decision Processes
Information Theory
Optimal Transport
Computational mechanics: pattern and prediction, structure and simplicity.

This page was built for publication: