scientific article

From MaRDI portal
Publication:3093180

zbMath1222.68196MaRDI QIDQ3093180

Yishay Mansour, Eyal Even-Dar

Publication date: 12 October 2011

Full work available at URL: http://www.jmlr.org/papers/v5/evendar03a.html

Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.



Related Items (22)

Adaptive stepsizes for recursive estimation with applications in approximate dynamic programmingUnified reinforcement Q-learning for mean field game and control problemsReinforcement learning-based design of side-channel countermeasuresReinforcement learning with algorithms from probabilistic structure estimationError bounds for constant step-size \(Q\)-learningA concentration bound for \(\operatorname{LSPE}( \lambda )\)A Discrete-Time Switching System Analysis of Q-LearningRecent advances in reinforcement learning in financeA stochastic contraction mapping theoremSettling the sample complexity of model-based offline reinforcement learningIntegrated condition-based maintenance and multi-item lot-sizing with stochastic demandCooperative and geometric learning algorithm (CGLA) for path planning of UAVs with limited informationUnnamed ItemEmpirical Dynamic ProgrammingRisk-Averse Approximate Dynamic Programming with Quantile-Based Risk MeasuresFundamental design principles for reinforcement learning algorithmsConvergence Rates and Decoupling in Linear Stochastic Approximation AlgorithmsFinite-sample analysis of nonlinear stochastic approximation with applications in reinforcement learningMean-Field Controls with Q-Learning for Cooperative MARL: Convergence and Complexity AnalysisSpeedy Categorical Distributional Reinforcement Learning and Complexity AnalysisOne-dimensional system arising in stochastic gradient descentConcentration of Contractive Stochastic Approximation and Reinforcement Learning




This page was built for publication: