The following pages link to \({\mathcal Q}\)-learning (Q1812931):
Displaying 50 items.
- Sequential Advantage Selection for Optimal Treatment Regimes (Q148418) (← links)
- \(Q\)- and \(A\)-learning methods for estimating optimal dynamic treatment regimes (Q252819) (← links)
- Multiscale Q-learning with linear function approximation (Q312650) (← links)
- How hierarchical models improve point estimates of model parameters at the individual level (Q313077) (← links)
- Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design (Q313259) (← links)
- Probabilistic inference for determining options in reinforcement learning (Q331688) (← links)
- Perspectives of approximate dynamic programming (Q333093) (← links)
- Active inference and agency: optimal control without cost functions (Q353847) (← links)
- Machine learning in agent-based stochastic simulation: inferential theory and evaluation in transportation logistics (Q356384) (← links)
- Designing time difference learning for interference management in heterogeneous networks (Q367452) (← links)
- Testing probabilistic equivalence through reinforcement learning (Q383369) (← links)
- Dynamic treatment regimes: technical challenges and applications (Q405345) (← links)
- Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning (Q413845) (← links)
- The optimal unbiased value estimator and its relation to LSTD, TD and MC (Q415609) (← links)
- Q-learning with censored data (Q450048) (← links)
- Adaptive dynamic programming and optimal control of nonlinear nonaffine systems (Q472591) (← links)
- Decentralized reinforcement learning robust optimal tracking control for time varying constrained reconfigurable modular robot based on ACI and \(Q\)-function (Q473552) (← links)
- Reinforcement learning: exploration-exploitation dilemma in multi-agent foraging task (Q505118) (← links)
- Data-based analysis of discrete-time linear systems in noisy environment: controllability and observability (Q508746) (← links)
- Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach (Q511735) (← links)
- Sampled fictitious play for approximate dynamic programming (Q547121) (← links)
- Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning (Q614880) (← links)
- A human-robot collaborative reinforcement learning algorithm (Q614949) (← links)
- Distributed reinforcement learning for coordinate multi-robot foraging (Q614967) (← links)
- Model-free event-triggered control algorithm for continuous-time linear systems with optimal performance (Q680566) (← links)
- Four encounters with system identification (Q693680) (← links)
- Integral \(Q\)-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems (Q694822) (← links)
- A behavioral learning process in games (Q700080) (← links)
- Free energy, value, and attractors (Q764242) (← links)
- Collective behavior of artificial intelligence population: transition from optimization to game (Q784096) (← links)
- Safe learning for near-optimal scheduling (Q832074) (← links)
- Model-free reinforcement learning for branching Markov decision processes (Q832301) (← links)
- Imitation guided learning in learning classifier systems (Q835987) (← links)
- A learning classifier system for mazes with aliasing clones (Q835990) (← links)
- Q-learning agents in a Cournot oligopoly model (Q844790) (← links)
- Tutorial series on brain-inspired computing. IV: Reinforcement learning: machine learning and natural learning (Q867508) (← links)
- Multi-objective optimization of water-using systems (Q877675) (← links)
- The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior (Q894100) (← links)
- Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems (Q900691) (← links)
- Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
- Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison (Q937719) (← links)
- Risk-sensitive reinforcement learning algorithms with generalized average criterion (Q940247) (← links)
- Learning agents in an artificial power exchange: Tacit collusion, market power and efficiency of two double-auction mechanisms (Q943954) (← links)
- Q-learning algorithms with random truncation bounds and applications to effective parallel computing (Q946195) (← links)
- Learning competitive pricing strategies by multi-agent reinforcement learning (Q951423) (← links)
- Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices (Q1003553) (← links)
- If multi-agent learning is the answer, what is the question? (Q1028919) (← links)
- Reinforcement distribution in fuzzy Q-learning (Q1037957) (← links)
- A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game (Q1048261) (← links)
- Learning to compose fuzzy behaviors for autonomous agents (Q1125758) (← links)