scientific article; zbMATH DE number 3845417
semi-Markov processesadaptive control algorithmsextremal controllersMarkov chains controlYakubovich method
Learning and adaptive systems in artificial intelligence (68T05) Markov chains (discrete-time Markov processes on discrete state spaces) (60J10) Stochastic approximation (62L20) Stationary stochastic processes (60G10) Discrete-time Markov processes on general state spaces (60J05) Adaptive control/observation systems (93C40) Discrete-time control/observation systems (93C55) Model systems in control theory (93C99) Stochastic systems in control theory (general) (93E03) Optimal stochastic control (93E20) Research exposition (monographs, survey articles) pertaining to systems and control theory (93-02)
- Some recent advances of automatic control in China
- scientific article; zbMATH DE number 4187639 (Why is no real title available?)
- scientific article; zbMATH DE number 3168214 (Why is no real title available?)
- scientific article; zbMATH DE number 7596797 (Why is no real title available?)
- scientific article; zbMATH DE number 3918209 (Why is no real title available?)
- scientific article; zbMATH DE number 194362 (Why is no real title available?)
- Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit
- State-of-the-art and prospects of adaptive systems
- Gaussian two-armed bandit and optimization of batch data processing
- New results on the application of the passification method. A survey
- Frequency adaptive control. I
- Two-armed bandit problem and batch version of the mirror descent algorithm
- scientific article; zbMATH DE number 195177 (Why is no real title available?)
- Poissonian two-armed bandit: a new approach
- Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)
- UCB strategies and optimization of batch processing in a one-armed bandit problem
- scientific article; zbMATH DE number 3304836 (Why is no real title available?)
- Two-armed bandit problem for parallel data processing systems
- Eine Überwachungs- und Koordinationsebene für den adaptiven Regelkreis / A “supervision and coordination level” for the adaptive control loop
- Locally optimal adaptive control
- One-armed bandit problem for parallel data processing systems
- One-armed bandit problem and the mirror descent algorithm
- Adaptive control systems
- Parallel design of robust control in the stochastic environment (the two-armed bandit problem)
- scientific article; zbMATH DE number 3978901 (Why is no real title available?)
This page was built for publication:
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3315342)