Pages that link to "Item:Q1812931"

From MaRDI portal

← \({\mathcal Q}\)-learning (Q1812931)

Jump to:navigation, search

What links here

⧼whatlinkshere-whatlinkshere-target⧽

Page:

⧼whatlinkshere-whatlinkshere-ns⧽

Namespace:

Invert selection

⧼whatlinkshere-whatlinkshere-filter⧽

Hide transclusions

Hide links

Hide redirects

The following pages link to \({\mathcal Q}\)-learning (Q1812931):

Displaying 50 items.

Sequential Advantage Selection for Optimal Treatment Regimes (Q148418) (← links)
\(Q\)- and \(A\)-learning methods for estimating optimal dynamic treatment regimes (Q252819) (← links)
Multiscale Q-learning with linear function approximation (Q312650) (← links)
How hierarchical models improve point estimates of model parameters at the individual level (Q313077) (← links)
Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design (Q313259) (← links)
Probabilistic inference for determining options in reinforcement learning (Q331688) (← links)
Perspectives of approximate dynamic programming (Q333093) (← links)
Active inference and agency: optimal control without cost functions (Q353847) (← links)
Machine learning in agent-based stochastic simulation: inferential theory and evaluation in transportation logistics (Q356384) (← links)
Designing time difference learning for interference management in heterogeneous networks (Q367452) (← links)
Testing probabilistic equivalence through reinforcement learning (Q383369) (← links)
Dynamic treatment regimes: technical challenges and applications (Q405345) (← links)
Learning to compete, coordinate, and cooperate in repeated games using reinforcement learning (Q413845) (← links)
The optimal unbiased value estimator and its relation to LSTD, TD and MC (Q415609) (← links)
Q-learning with censored data (Q450048) (← links)
Adaptive dynamic programming and optimal control of nonlinear nonaffine systems (Q472591) (← links)
Decentralized reinforcement learning robust optimal tracking control for time varying constrained reconfigurable modular robot based on ACI and \(Q\)-function (Q473552) (← links)
Reinforcement learning: exploration-exploitation dilemma in multi-agent foraging task (Q505118) (← links)
Data-based analysis of discrete-time linear systems in noisy environment: controllability and observability (Q508746) (← links)
Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach (Q511735) (← links)
Sampled fictitious play for approximate dynamic programming (Q547121) (← links)
Designing decentralized controllers for distributed-air-jet MEMS-based micromanipulators by reinforcement learning (Q614880) (← links)
A human-robot collaborative reinforcement learning algorithm (Q614949) (← links)
Distributed reinforcement learning for coordinate multi-robot foraging (Q614967) (← links)
Model-free event-triggered control algorithm for continuous-time linear systems with optimal performance (Q680566) (← links)
Four encounters with system identification (Q693680) (← links)
Integral \(Q\)-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems (Q694822) (← links)
A behavioral learning process in games (Q700080) (← links)
Free energy, value, and attractors (Q764242) (← links)
Collective behavior of artificial intelligence population: transition from optimization to game (Q784096) (← links)
Safe learning for near-optimal scheduling (Q832074) (← links)
Model-free reinforcement learning for branching Markov decision processes (Q832301) (← links)
Imitation guided learning in learning classifier systems (Q835987) (← links)
A learning classifier system for mazes with aliasing clones (Q835990) (← links)
Q-learning agents in a Cournot oligopoly model (Q844790) (← links)
Tutorial series on brain-inspired computing. IV: Reinforcement learning: machine learning and natural learning (Q867508) (← links)
Multi-objective optimization of water-using systems (Q877675) (← links)
The relation between reinforcement learning parameters and the influence of reinforcement history on choice behavior (Q894100) (← links)
Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems (Q900691) (← links)
Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
Mathematical properties of neuronal TD-rules and differential Hebbian learning: a comparison (Q937719) (← links)
Risk-sensitive reinforcement learning algorithms with generalized average criterion (Q940247) (← links)
Learning agents in an artificial power exchange: Tacit collusion, market power and efficiency of two double-auction mechanisms (Q943954) (← links)
Q-learning algorithms with random truncation bounds and applications to effective parallel computing (Q946195) (← links)
Learning competitive pricing strategies by multi-agent reinforcement learning (Q951423) (← links)
Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices (Q1003553) (← links)
If multi-agent learning is the answer, what is the question? (Q1028919) (← links)
Reinforcement distribution in fuzzy Q-learning (Q1037957) (← links)
A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game (Q1048261) (← links)
Learning to compose fuzzy behaviors for autonomous agents (Q1125758) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere/Item:Q1812931"