Pages that link to "Item:Q5218653"
From MaRDI portal
The following pages link to A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play (Q5218653):
- Compact and efficient encodings for planning in factored state and action spaces with learned binarized neural network transition models (Q785238)
- Meta-modeling game for deriving theory-consistent, microstructure-based traction-separation laws via deep reinforcement learning (Q1986877)
- A non-cooperative meta-modeling game for automated third-party calibrating, validating and falsifying constitutive laws with parallelized adversarial attacks (Q2020834)
- Topological properties of the set of functions generated by neural networks of fixed size (Q2031060)
- Automatic discovery of interpretable planning strategies (Q2071410)
- Deep reinforcement learning for FlipIt security game (Q2086680)
- What may lie ahead in reinforcement learning (Q2094025)
- Multi-agent reinforcement learning: a selective overview of theories and algorithms (Q2094040)
- Reinforcement learning: an industrial perspective (Q2094053)
- World-class interpretable poker (Q2102358)
- Construction of symmetric orthogonal designs with deep Q-network and orthogonal complementary design (Q2129592)
- Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management (Q2140199)
- Planning for potential: efficient safe reinforcement learning (Q2163254)
- Deep policy dynamic programming for vehicle routing problems (Q2170197)
- Inductive general game playing (Q2203324)
- On solving the problem of 7-piece chess endgames (Q2216918)
- Making sense of sensory input (Q2238610)
- Deliberative acting, planning and learning with hierarchical operational models (Q2238702)
- Reward is enough (Q2238710)
- Deep reinforcement learning for the optimal placement of cryptocurrency limit orders (Q2242354)
- Dynamic selective maintenance optimization for multi-state systems over a finite horizon: a deep reinforcement learning approach (Q2286928)
- The Hanabi challenge: a new frontier for AI research (Q2302288)
- A cooperative game for automated learning of elasto-plasticity knowledge graphs and models with AI-guided experimentation (Q2319404)
- Comparison of deep neural networks and deep hierarchical models for spatio-temporal data (Q2419837)
- A machine learning framework for LES closure terms (Q2672196)
- Artificial Intelligence, Chaos, Prediction and Understanding in Science (Q4958613)
- (Q4998982)
- Deep Statistical Model Checking (Q5041276)
- Archetypal landscapes for deep neural networks (Q5073145)
- The unreasonable effectiveness of deep learning in artificial intelligence (Q5073209)
- Scalable Online Planning for Multi-Agent MDPs (Q5076326)
- Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets (Q5145843)
- (Q5148991)
- Full Gradient DQN Reinforcement Learning: A Provably Convergent Scheme (Q5153609)
- (Q5214220)
- Benchmark and Survey of Automated Machine Learning Frameworks (Q5856462)
- Constrained Multiagent Markov Decision Processes: a Taxonomy of Problems and Algorithms (Q5856487)
- Induction and Exploitation of Subgoal Automata for Reinforcement Learning (Q5856492)
- Model-based Reinforcement Learning: A Survey (Q5870792)
- Comparative analysis of machine learning methods for active flow control (Q5882038)
- A Proof that Artificial Neural Networks Overcome the Curse of Dimensionality in the Numerical Approximation of Black–Scholes Partial Differential Equations (Q5889064)
- The explanation game: a formal framework for interpretable machine learning (Q6067308)
- Explore and Exploit with Heterotic Line Bundle Models (Q6068101)
- Optimal production ramp‐up in the smartphone manufacturing industry (Q6072174)
- Risk-aware shielding of partially observable Monte Carlo planning policies (Q6088298)
- Smoothing policies and safe policy gradients (Q6097096)
- Is there a role for statistics in artificial intelligence? (Q6103792)
- Spatial state-action features for general games (Q6108764)
- A reinforcement learning approach to the stochastic cutting stock problem (Q6114929)
- Forecasting Hamiltonian dynamics without canonical coordinates (Q6117176)