Framing reinforcement learning from human reward: reward positivity, temporal discounting, episodicity, and performance

From MaRDI portal
Publication:891791

DOI: 10.1016/J.ARTINT.2015.03.009
zbMATH Open: 1343.68199
OpenAlex: W1453801241
MaRDI QID: Q891791
FDO: Q891791

N. E. Zubov

Publication date: 17 November 2015

Published in: Artificial Intelligence

Full work available at URL: https://doi.org/10.1016/j.artint.2015.03.009



zbMATH Keywords

  • reinforcement learning
  • end-user programming
  • human teachers
  • human-agent interaction
  • interactive machine learning
  • modeling user behavior


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)


Cites Work

  • 10.1162/153244303322753616
  • A survey of cross-validation procedures for model selection
  • Learning Representation and Control in Markov Decision Processes: New Frontiers


Cited In (1)

  • TAMER






This page was last edited on 30 January 2024, at 16:04. Warning: Page may not contain recent updates.