Machine learning and nonparametric bandit theory

From MaRDI portal

Publication:4850249

Jump to:navigation, search

DOI10.1109/9.400491MaRDI QIDQ4850249zbMATH OpenOpenAlexFDO

Authors Tze Leung Lai, S. Yakowitz

Publication date 9 October 1995

Published in IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL https://semanticscholar.org/paper/fe611ab14edf4c19fa90e03da42589b0e9c5d5ec

zbMATH Keywords

controlled Markov processes risk growth

Mathematics Subject Classification ID

Nonparametric inference (62G99) Optimal stopping in statistics (62L15) Stochastic learning and adaptive control (93E35)

Recommendations

Cited in

(15)

This page was built for publication: Machine learning and nonparametric bandit theory

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4850249)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4850249&oldid=19194572"