Reference points and learning
Publication:2138367
DOI: 10.1016/J.JMATECO.2021.102621
zbMATH Open: 1490.91092
OpenAlex: W4206026350
MaRDI QID: Q2138367
FDO: Q2138367
Publication date: 11 May 2022
Published in: Journal of Mathematical Economics
Full work available at URL: https://ora.ox.ac.uk/objects/uuid:48cd5b18-37d1-4108-9ba7-959c31e36de4
Cites Work
- Asymmetric Least Squares Estimation and Testing
- The Dual Theory of Choice under Risk
- Stability theory by Liapunov's direct method
- Temporal Resolution of Uncertainty and Dynamic Choice Theory
- Prospect Theory: An Analysis of Decision under Risk
- Substitution, Risk Aversion, and the Temporal Behavior of Consumption and Asset Returns: A Theoretical Framework
- A Model of Reference-Dependent Preferences
- A Theory of Disappointment Aversion
- Risk Aversion in the Small and in the Large
- Discounted linear exponential quadratic Gaussian control
- An axiomatic characterization of preferences under uncertainty: Weakening the independence axiom
- Axiomatic utility theories with the betweenness property
- Lectures on stochastic programming. Modeling and theory.
- On Cash Equivalents and Information Evaluation in Decisions Under Uncertainty: Part I: Basic Theory
- Risk-averse dynamic programming for Markov decision processes
- Robustness
- On the convergence of reinforcement learning
- Attainability of boundary points under reinforcement learning
- The Structure of Intertemporal Preferences under Uncertainty and Time Consistent Plans
- Unique solutions for stochastic recursive utilities
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Individual Q-Learning in Normal Form Games
- Stochastic approximations for finite-state Markov chains
- Risk-sensitive reinforcement learning
- Asynchronous stochastic approximation with differential inclusions
- Boundedness of iterates in \(Q\)-learning
- Risk-Sensitive Reinforcement Learning
Cited In (3)