The following pages link to Voot Tangkaratt (Q747243):
Displayed 9 items.
- Direct conditional probability density estimation with sparse feature selection (Q747244)
- Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297)
- Active deep Q-learning with demonstration (Q2217431)
- Model-based reinforcement learning with dimension reduction (Q2281680)
- TD-regularized actor-critic methods (Q2320580)
- Sufficient Dimension Reduction via Direct Estimation of the Gradients of Logarithmic Conditional Densities (Q5157134)
- Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration (Q5378202)
- Conditional Density Estimation with Dimensionality Reduction via Squared-Loss Conditional Entropy Minimization (Q5380192)
- Direct Estimation of the Derivative of Quadratic Mutual Information with Application in Supervised Dimension Reduction (Q5380831)