Pages that link to "Item:Q1784527"
From MaRDI portal
The following pages link to Adaptive importance sampling for value function approximation in off-policy reinforcement learning (Q1784527):
Displaying 8 items.
- Direct density-ratio estimation with dimensionality reduction via least-squares hetero-distributional subspace search (Q553261) (← links)
- Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297) (← links)
- Semi-supervised speaker identification under covariate shift (Q985576) (← links)
- Dimensionality reduction for density ratio estimation in high-dimensional spaces (Q1784536) (← links)
- Efficient exploration through active learning for value function approximation in reinforcement learning (Q1784573) (← links)
- Least-squares two-sample test (Q1952558) (← links)
- Multivariate error modeling and uncertainty quantification using importance (re-)weighting for Monte Carlo simulations in particle transport (Q2106929) (← links)
- Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning (Q2887009) (← links)