scientific article; zbMATH DE number 7625165
From MaRDI portal
Publication:5054599
Raihan Seraj, Jayakumar Subramanian, Aditya Mahajan, Amit K. Sinha
Publication date: 29 November 2022
Full work available at URL: https://arxiv.org/abs/2010.08843
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
partially observable Markov decision processesapproximate dynamic programminginformation stateapproximate information statepartially observed reinforcement learning
Related Items (3)
Robustness and sample complexity of model-based MARL for general-sum Markov games ⋮ Separation of learning and control for cyber-physical systems ⋮ Unnamed Item
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Planning and acting in partially observable stochastic domains
- Equivalence of distance-based and RKHS-based statistics in hypothesis testing
- Decentralized stochastic control
- Finding optimal memoryless policies of POMDPs under the expected average reward criterion
- On essential information in sequential decision processes
- Equivalence notions and model minimization in Markov decision processes
- Lipschitz continuity of value functions in Markovian decision processes
- Toward a quantitative theory of self-generated complexity
- Stochastic approximation with two time scales
- Finite approximations in discrete-time stochastic control. Quantized models and asymptotic optimality
- On the empirical estimation of integral probability metrics
- Sufficient statistics in the optimum control of stochastic systems
- Optimal control of Markov processes with incomplete state information
- Information states for linear stochastic systems
- Introduction to stochastic control theory
- Reinforcement learning of non-Markov decision processes
- The Complexity of Optimal Queuing Network Control
- Optimally Solving Dec-POMDPs as Continuous-State MDPs
- A Concise Introduction to Decentralized POMDPs
- Temporal logic motion planning using POMDPs with parity objectives
- Decentralized optimal control of Markov chains with a common past information set
- Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning
- Markov decision processes with noise-corrupted and delayed state observations
- Bisimulation Metrics for Continuous Markov Decision Processes
- Linear Automaton Transformations
- Probability Metrics
- Optimal causal coding - decoding problems
- Optimal Performance of Networked Control Systems with Nonclassical Information Structures
- Recurrent policy gradients
- Solution of some nonclassical LQG stochastic decision problems
- Convergence of discretization procedures in dynamic programming
- Dynamic programming approach to decentralized stochastic control problems
- Survey of decentralized control methods for large scale systems
- Approximations of Dynamic Programs, I
- A separation theorem for periodic sharing information patterns in decentralized control
- Integral Probability Metrics and Their Generating Classes of Functions
- How Does the Value Function of a Markov Decision Process Depend on the Transition Probabilities?
- The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
- OnActor-Critic Algorithms
- Sufficient Conditions for the Value Function and Optimal Strategy to be Even and Quasi-Convex
- Optimal Design of Sequential Real-Time Communication Systems
- Sequential Problems in Decentralized Detection With Communication
- Optimal Control Strategies in Delayed Sharing Information Structures
- Networked Markov Decision Processes With Delays
- Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach
- Optimal Decentralized Control of Coupled Subsystems With Control Sharing
- Learning representations by back-propagating errors
- On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability
- The Common-Information Approach to Decentralized Stochastic Control
- A Counterexample in Stochastic Optimum Control
- Team Decision Problems
- An example of interaction between information and control: The Transparency of a game
- The Complexity of Decentralized Control of Markov Decision Processes
- Information Theory
- Optimal Transport
- Computational mechanics: pattern and prediction, structure and simplicity.
This page was built for publication: