AlphaZero
From MaRDI portal
Software:43111
swMATH: 31400; MaRDI QID: Q43111; FDO: Q43111
Author name not available
Cited In (35)
- Comparison of deep neural networks and deep hierarchical models for spatio-temporal data
- Meta-modeling game for deriving theory-consistent, microstructure-based traction-separation laws via deep reinforcement learning
- Scalable Online Planning for Multi-Agent MDPs
- On solving the problem of 7-piece chess endgames
- Making sense of sensory input
- The Hanabi challenge: a new frontier for AI research
- Construction of symmetric orthogonal designs with deep Q-network and orthogonal complementary design
- Unsupervised basis function adaptation for reinforcement learning
- Metric entropy limits on recurrent neural network learning of linear dynamical systems
- A non-cooperative meta-modeling game for automated third-party calibrating, validating and falsifying constitutive laws with parallelized adversarial attacks
- A neural network multigrid solver for the Navier-Stokes equations
- Topological properties of the set of functions generated by neural networks of fixed size
- Dynamic selective maintenance optimization for multi-state systems over a finite horizon: a deep reinforcement learning approach
- Deep reinforcement learning for the optimal placement of cryptocurrency limit orders
- A cooperative game for automated learning of elasto-plasticity knowledge graphs and models with AI-guided experimentation
- Efficient multi-objective reinforcement learning via multiple-gradient descent with iteratively discovered weight-vector sets
- Benchmark and survey of automated machine learning frameworks
- Constrained multiagent Markov decision processes: a taxonomy of problems and algorithms
- A machine learning framework for LES closure terms
- Discovering faster matrix multiplication algorithms with reinforcement learning
- Artificial intelligence, chaos, prediction and understanding in science
- Induction and exploitation of subgoal automata for reinforcement learning
- Manifold learning for parameter reduction
- Reinforcement learning for combinatorial optimization: a survey
- Teaching People by Justifying Tree Search Decisions: An Empirical Study in Curling
- Planning for potential: efficient safe reinforcement learning
- Title not available
- A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
- Inductive general game playing
- Deliberative acting, planning and learning with hierarchical operational models
- Reward is enough
- Compact and efficient encodings for planning in factored state and action spaces with learned binarized neural network transition models
- Automatic discovery of interpretable planning strategies
- Sophisticated inference
- Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management