Estimating scale-invariant future in continuous time

From MaRDI portal
Publication:5154136

DOI10.1162/NECO_A_01171zbMATH Open1471.92038DBLPjournals/neco/TiganjGSH19arXiv1802.06426OpenAlexW2964141756WikidataQ91588944 ScholiaQ91588944MaRDI QIDQ5154136FDOQ5154136

Zoran Tiganj, Per B. Sederberg, Marc W. Howard, Samuel J. Gershman

Publication date: 1 October 2021

Published in: Neural Computation (Search for Journal in Brave)

Abstract: Natural learners must compute an estimate of future outcomes that follow from a stimulus in continuous time. Widely used reinforcement learning algorithms discretize continuous time and estimate either transition functions from one step to the next (model-based algorithms) or a scalar value of exponentially-discounted future reward using the Bellman equation (model-free algorithms). An important drawback of model-based algorithms is that computational cost grows linearly with the amount of time to be simulated. On the other hand, an important drawback of model-free algorithms is the need to select a time-scale required for exponential discounting. We present a computational mechanism, developed based on work in psychology and neuroscience, for computing a scale-invariant timeline of future outcomes. This mechanism efficiently computes an estimate of inputs as a function of future time on a logarithmically-compressed scale, and can be used to generate a scale-invariant power-law-discounted estimate of expected future reward. The representation of future time retains information about what will happen when. The entire timeline can be constructed in a single parallel operation which generates concrete behavioral and neural predictions. This computational mechanism could be incorporated into future reinforcement learning algorithms.


Full work available at URL: https://arxiv.org/abs/1802.06426




Recommendations



Cites Work


Cited In (1)





This page was built for publication: Estimating scale-invariant future in continuous time

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5154136)