Jonathan Baxter

From MaRDI portal
Person:1369060



List of research outcomes

This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!

PublicationDate of PublicationType
Variance reduction techniques for gradient estimates in reinforcement learning2011-10-12Paper
Emmerald: a fast matrix–matrix multiply using Intel's SSE instructions
Concurrency and Computation: Practice and Experience
2003-02-04Paper
scientific article; zbMATH DE number 1753152 (Why is no real title available?)2002-10-13Paper
scientific article; zbMATH DE number 1753153 (Why is no real title available?)2002-10-10Paper
Estimation and approximation bounds for gradient-based reinforcement learning
Journal of Computer and System Sciences
2002-07-04Paper
Improved generalization through explicit optimization of margins
Machine Learning
2001-02-05Paper
Learning to play chess using temporal differences
Machine Learning
2000-11-05Paper
A Bayesian/information theoretic model of learning to learn via multiple task sampling
Machine Learning
1997-01-01Paper
scientific article; zbMATH DE number 811532 (Why is no real title available?)1995-11-29Paper


Research outcomes over time


This page was built for person: Jonathan Baxter