Jonathan Baxter
From MaRDI portal
Person:1369060
List of research outcomes
This list is not complete and representing at the moment only items from zbMATH Open and arXiv. We are working on additional sources - please check back here soon!
| Publication | Date of Publication | Type |
|---|---|---|
| Variance reduction techniques for gradient estimates in reinforcement learning | 2011-10-12 | Paper |
| Emmerald: a fast matrix–matrix multiply using Intel's SSE instructions Concurrency and Computation: Practice and Experience | 2003-02-04 | Paper |
| scientific article; zbMATH DE number 1753152 (Why is no real title available?) | 2002-10-13 | Paper |
| scientific article; zbMATH DE number 1753153 (Why is no real title available?) | 2002-10-10 | Paper |
| Estimation and approximation bounds for gradient-based reinforcement learning Journal of Computer and System Sciences | 2002-07-04 | Paper |
| Improved generalization through explicit optimization of margins Machine Learning | 2001-02-05 | Paper |
| Learning to play chess using temporal differences Machine Learning | 2000-11-05 | Paper |
| A Bayesian/information theoretic model of learning to learn via multiple task sampling Machine Learning | 1997-01-01 | Paper |
| scientific article; zbMATH DE number 811532 (Why is no real title available?) | 1995-11-29 | Paper |
Research outcomes over time
This page was built for person: Jonathan Baxter