Hyperband: a novel bandit-based approach to hyperparameter optimization
From MaRDI portal
Mathematics Subject Classification:
- Learning and adaptive systems in artificial intelligence (68T05)
- Artificial neural networks and deep learning (68T07)
- Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.) (68T20)
- Online algorithms; streaming algorithms (68W27)
- Approximation methods and heuristics in mathematical programming (90C59)
Abstract: Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through adaptive resource allocation and early-stopping. We formulate hyperparameter optimization as a pure-exploration non-stochastic infinite-armed bandit problem where a predefined resource like iterations, data samples, or features is allocated to randomly sampled configurations. We introduce a novel algorithm, Hyperband, for this framework and analyze its theoretical properties, providing several desirable guarantees. Furthermore, we compare Hyperband with popular Bayesian optimization methods on a suite of hyperparameter optimization problems. We observe that Hyperband can provide over an order-of-magnitude speedup over our competitor set on a variety of deep-learning and kernel-based learning problems.
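The abstract describes allocating a predefined resource (iterations, samples, features) to randomly sampled configurations with early stopping. A minimal Python sketch of that bracket structure follows, assuming a user-supplied `get_config` sampler and a `run_config(cfg, r)` evaluator that returns a loss; both names are placeholders, and this is an illustrative reading of the published pseudocode, not the authors' reference implementation.

```python
import math
import random

def hyperband(get_config, run_config, max_resource=81, eta=3):
    """Sketch of Hyperband: run SuccessiveHalving brackets that trade off
    the number of configurations against the resource each one receives.

    get_config()        -- samples a random hyperparameter configuration
    run_config(cfg, r)  -- trains cfg with resource r, returns a loss
    (both are placeholders for the user's actual problem)
    """
    s_max = int(math.log(max_resource, eta))
    budget = (s_max + 1) * max_resource          # per-bracket budget B
    best = (float("inf"), None)                  # (loss, config)

    for s in range(s_max, -1, -1):
        # Initial number of configurations and per-config resource
        n = int(math.ceil(budget / max_resource * eta**s / (s + 1)))
        r = max_resource * eta**(-s)
        configs = [get_config() for _ in range(n)]

        # SuccessiveHalving: keep roughly the top 1/eta at each rung
        for i in range(s + 1):
            n_i = int(n * eta**(-i))
            r_i = r * eta**i
            losses = [run_config(cfg, r_i) for cfg in configs]
            ranked = sorted(zip(losses, configs), key=lambda t: t[0])
            if ranked and ranked[0][0] < best[0]:
                best = ranked[0]
            configs = [cfg for _, cfg in ranked[: max(1, int(n_i / eta))]]

    return best
```

The outer loop sweeps brackets from most aggressive early stopping (many configurations, little resource each) to plain random search (few configurations, full resource), which is how Hyperband hedges against not knowing in advance how quickly bad configurations reveal themselves.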
Recommendations
- Fast Bayesian hyperparameter optimization on large datasets
- scientific article; zbMATH DE number 6276119
- An efficient modified Hyperband and trust-region-based mode-pursuing sampling hybrid method for hyperparameter optimization
- No-regret Bayesian optimization with unknown hyperparameters
- Hyperparameter tuning methods in automated machine learning
Cites work
- scientific article; zbMATH DE number 6276119
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Bayesian optimization in a billion dimensions via random embeddings
- Efficient multi-start strategies for local search algorithms
- Fast Bayesian hyperparameter optimization on large datasets
- Fast cross-validation via sequential testing
- In defense of one-vs-all classification
- Information rates of nonparametric Gaussian process methods
- Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting
- Nonparametric guidance of autoencoder representations using label information
- On the complexity of best-arm identification in multi-armed bandit models
- Pure exploration in multi-armed bandits problems
- \(X\)-armed bandits
Cited in (53)
- The "black-box" optimization problem: zero-order accelerated stochastic method via kernel approximation
- OpenBox: a Python toolkit for generalized black-box optimization
- Profiling side-channel attacks on Dilithium. A small bit-fiddling leak breaks it all
- Automatic MILP Solver configuration by learning problem similarities
- Stochastic adversarial noise in the "black box" optimization problem
- Progressively strengthening and tuning MIP solvers for reoptimization
- scientific article; zbMATH DE number 7415103
- scientific article; zbMATH DE number 7415127
- Goal-oriented sensitivity analysis of hyperparameters in deep learning
- Business processes resource management using rewriting logic and deep-learning-based predictive monitoring
- Data-driven algorithm selection and tuning in optimization and signal processing
- ML-plan: automated machine learning via hierarchical planning
- Bayesian deep convolutional encoder-decoder networks for surrogate modeling and uncertainty quantification
- Supervised Machine Learning Techniques: An Overview with Applications to Banking
- Investigation of the Lombard effect based on a machine learning approach
- Use of static surrogates in hyperparameter optimization
- Accelerated Componentwise Gradient Boosting Using Efficient Data Representation and Momentum-Based Optimization
- scientific article; zbMATH DE number 7415092
- HEBO: An Empirical Study of Assumptions in Bayesian Optimisation
- Multiobjective Tree-Structured Parzen Estimator
- Learning and meta-learning of stochastic advection–diffusion–reaction systems from sparse measurements
- Hyperparameter optimization in learning systems
- A data-driven explainable case-based reasoning approach for financial risk detection
- Imbalanced regression and extreme value prediction
- Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
- Recurrent and convolutional neural networks in structural dynamics: a modified attention steered encoder-decoder architecture versus LSTM versus GRU versus TCN topologies to predict the response of shock wave-loaded plates
- Automatic model training under restrictive time constraints
- Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
- Computationally efficient integrated design and predictive control of flexible energy systems using multi-fidelity simulation-based Bayesian optimization
- Naive automated machine learning
- Combining Bayesian optimization and Lipschitz optimization
- Fast Bayesian hyperparameter optimization on large datasets
- Research on spatio-temporal network prediction model of parallel-series traffic flow based on transformer and GCAT
- Local surrogate responses in the Schwarz alternating method for elastic problems on random voided domains
- Scalable Gaussian process-based transfer surrogates for hyperparameter optimization
- mlr3hyperband
- Hyperparameter tuning methods in automated machine learning
- scientific article; zbMATH DE number 7625207
- A taxonomy of weight learning methods for statistical relational learning
- scientific article; zbMATH DE number 7370607
- Benchmark and survey of automated machine learning frameworks
- AutonoML: Towards an Integrated Framework for Autonomous Machine Learning
- Automated Deep Learning: Neural Architecture Search Is Not the End
- Neural architecture search: a survey
- A machine learning approach for efficient uncertainty quantification using multiscale methods
- Joint detection of malicious domains and infected clients
- Automated porosity estimation using CT-scans of extracted core data
- Bayesian optimization of pump operations in water distribution systems
- An efficient modified Hyperband and trust-region-based mode-pursuing sampling hybrid method for hyperparameter optimization
- Estimating shape parameters of piecewise linear-quadratic problems
- scientific article; zbMATH DE number 7255176
- Escaping local minima with local derivative-free methods: a numerical investigation
- Best arm identification for contaminated bandits
MaRDI item: Q72746