The following pages link to (Q4315289):
Displaying 50 items.
- Planning and acting in partially observable stochastic domains (Q72343) (← links)
- Optimal cost almost-sure reachability in POMDPs (Q253969) (← links)
- A leader-follower partially observed, multiobjective Markov game (Q256610) (← links)
- Value of information for a leader-follower partially observed Markov game (Q256611) (← links)
- Optimal pricing for a \(\mathrm{GI}/\mathrm{M}/k/N\) queue with several customer types and holding costs (Q257062) (← links)
- Efficient approximation of optimal control for continuous-time Markov games (Q259052) (← links)
- A unified approach to time-aggregated Markov decision processes (Q259403) (← links)
- Approximation metrics based on probabilistic bisimulations for general state-space Markov processes: a survey (Q271706) (← links)
- An evidential approach to SLAM, path planning, and active exploration (Q274441) (← links)
- Adaptive learning via selectionism and Bayesianism. II: The sequential case (Q280320) (← links)
- Optimal control of a multiclass queueing system when customers can change types (Q285963) (← links)
- Nonzero-sum constrained discrete-time Markov games: the case of unbounded costs (Q287649) (← links)
- Convergence of controlled models and finite-state approximation for discounted continuous-time Markov decision processes with constraints (Q296787) (← links)
- Approximate dynamic programming for stochastic linear control problems on compact state spaces (Q299794) (← links)
- Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm (Q300040) (← links)
- Synthesizing efficient systems in probabilistic environments (Q300419) (← links)
- Multiscale Q-learning with linear function approximation (Q312650) (← links)
- Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design (Q313259) (← links)
- Discrete-time control for systems of interacting objects with unknown random disturbance distributions: a mean field approach (Q315779) (← links)
- Modeling and optimization control of a demand-driven, conveyor-serviced production station (Q319221) (← links)
- Optimal minimum bids and inventory scrapping in sequential, single-unit, Vickrey auctions with demand learning (Q319629) (← links)
- A multi-step rolled forward chance-constrained model and a proactive dynamic approach for the wheat crop quality control problem (Q319836) (← links)
- Circumventing the Slater conundrum in countably infinite linear programs (Q319852) (← links)
- Response-adaptive designs for clinical trials: simultaneous learning from multiple patients (Q320737) (← links)
- New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system (Q320866) (← links)
- Modelling adherence behaviour for the treatment of obstructive sleep apnoea (Q321086) (← links)
- Optimal policies of \(M(t)/M/c/c\) queues with two different levels of servers (Q321117) (← links)
- Optimal policies for the berth allocation problem under stochastic nature (Q323540) (← links)
- A multi-period ordering and clearance pricing model considering the competition between new and out-of-season products (Q324300) (← links)
- Design and evaluation of norm-aware agents based on normative Markov decision processes (Q324660) (← links)
- An application-oriented approach to dual control with excitation for closed-loop identification (Q328039) (← links)
- Extreme state aggregation beyond Markov decision processes (Q329613) (← links)
- A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284) (← links)
- Probabilistic inference for determining options in reinforcement learning (Q331688) (← links)
- Decentralized stochastic control (Q333078) (← links)
- Perspectives of approximate dynamic programming (Q333093) (← links)
- Optimal control of queueing systems with non-collaborating servers (Q333457) (← links)
- Online network design with outliers (Q334928) (← links)
- Selecting malaria interventions: a top-down approach (Q336487) (← links)
- Dynamic vehicle allocation control for automated material handling system in semiconductor manufacturing (Q336517) (← links)
- Exploring the economic consequences of letting a supplier hold reserve storage (Q337286) (← links)
- Multi-class, multi-resource advance scheduling with no-shows, cancellations and overbooking (Q342261) (← links)
- Job control in heterogeneous computing systems (Q357030) (← links)
- Constrained Markov decision processes with first passage criteria (Q363565) (← links)
- Optimal assignment of servers to tasks when collaboration is inefficient (Q364078) (← links)
- Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731) (← links)
- (Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751) (← links)
- Adaptive aggregation for reinforcement learning in average reward Markov decision processes (Q378753) (← links)
- Asymptotically optimal Bayesian sequential change detection and identification rules (Q378756) (← links)
- Inventory replenishment control under supply uncertainty (Q378782) (← links)