The following pages link to Abhijit Gosavi (Q421533):
Displaying 14 items.
- On the distribution of the number stranded in bulk-arrival, bulk-service queues of the M/G/1 form (Q421534) (← links)
- Simulation optimization for revenue management of airlines with cancellations and overbooking (Q858602) (← links)
- Reinforcement learning for long-run average cost. (Q1427588) (← links)
- A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis (Q1771225) (← links)
- Simulation-based optimization: Parametric optimization techniques and reinforcement learning (Q1869929) (← links)
- Simulation-based optimization. Parametric optimization techniques and reinforcement learning (Q2253773) (← links)
- Boundedness of iterates in \(Q\)-learning (Q2504669) (← links)
- A risk-sensitive approach to total productive maintenance (Q2507920) (← links)
- Reinforcement Learning: A Tutorial Survey and Recent Advances (Q2901057) (← links)
- Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning (Q3116659) (← links)
- A machine learning approach to optimise the usage of recycled material in a remanufacturing environment (Q3163724) (← links)
- OPTIMAL DESIGN OF PLANS FOR ACCEPTANCE SAMPLING BY VARIABLES WITH INVERSE GAUSSIAN DISTRIBUTION (Q4416935) (← links)
- Variance-penalized Markov decision processes: dynamic programming and reinforcement learning techniques (Q5166474) (← links)
- Maintenance optimization in a digital twin for industry 4.0 (Q6601572) (← links)