Optimal timing of decisions: a general theory based on continuation values
Abstract: Building on insights of Jovanovic (1982) and subsequent authors, we develop a comprehensive theory of optimal timing of decisions based around continuation value functions and operators that act on them. Optimality results are provided under general settings, with bounded or unbounded reward functions. This approach has several intrinsic advantages that we exploit in developing the theory. One is that continuation value functions are smoother than value functions, allowing for sharper analysis of optimal policies and more efficient computation. Another is that, for a range of problems, the continuation value function exists in a lower dimensional space than the value function, mitigating the curse of dimensionality. In one typical experiment, this reduces the computation time from over a week to less than three minutes.
Cites work
- scientific article; zbMATH DE number 5016447
- scientific article; zbMATH DE number 52448
- scientific article; zbMATH DE number 1325008
- A general equilibrium model of sovereign default and business cycles
- A new type of approximation leading to reduction of dimensionality in control processes
- Declining Reservation Wages and Learning
- Dynamic programming with homogeneous functions
- Entry, Exit, and Firm Dynamics in Long Run Equilibrium
- Equilibrium price dispersion with sequential search
- Existence and Uniqueness of Solutions to the Bellman Equation in the Unbounded Case
- Existence and uniqueness of a fixed point for local contractions
- Fatou's lemma for weakly converging probabilities
- Firm Turnover in Imperfectly Competitive Markets
- Learning by Doing vs. Learning About Match Quality: Can We Tell Them Apart?
- Markov chains and stochastic stability
- Markov-Perfect Industry Dynamics: A Framework for Empirical Work
- On the dynamics of unemployment and wage distributions
- On-the-Job Search and Precautionary Savings
- Optimal Lending Contracts and Firm Dynamics
- Recursive utility and optimal growth with bounded or unbounded returns
- Recursive utility and the Ramsey problem
- Selection and the Evolution of Industry
- Spinoffs and the market for ideas
- Stochastic search equilibrium
- The Accumulation of Wealth and the Cyclical Generation of New Technologies: A Search Theoretic Approach
- The Growth and Diffusion of Knowledge
- Two Questions about European Unemployment
- Uncertainty and Learning in Pharmaceutical Demand
- Uncertainty and unemployment
- Uncertainty traps
- Using Randomization to Break the Curse of Dimensionality
Cited in (7)
- Unbounded dynamic programming via the Q-transform
- Dynamic learning and decision making via basis weight vectors
- scientific article; zbMATH DE number 1594532
- Principles and Practice of Constraint Programming – CP 2004
- Reward-rate maximization in sequential identification under a stochastic deadline
- Efficient computation of optimal actions
- Existence and uniqueness of solutions to the Bellman equation in stochastic dynamic programming