Video description
Neural architecture search (NAS) is a promising research direction that has the potential to replace expert-designed networks with learned, task-specific architectures. Ameet Talwalkar (Carnegie Mellon University | Determined AI) shares work that aims to help ground the empirical results in this field and proposes new NAS baselines that build off the following observations: NAS is a specialized hyperparameter optimization problem, and random search is a competitive baseline for hyperparameter optimization.
Leveraging these observations, Ameet evaluates both random search with early-stopping and a novel random search with a weight-sharing algorithm on two standard NAS benchmarks: PTB and CIFAR-10. Results show that random search with early-stopping is a competitive NAS baseline that performs at least as well as ENAS, a leading NAS method, on both benchmarks. Additionally, random search with weight-sharing outperforms random search with early-stopping, achieving a state-of-the-art NAS result on PTB and a highly competitive result on CIFAR-10.
Ameet concludes by exploring existing reproducibility issues for published NAS results, noting the lack of source material needed to exactly reproduce these results, and discussing the robustness of published results given the various sources of variability in NAS experimental setups.
All information (code, random seeds, documentation) needed to exactly reproduce our results will be shared, along with random search with weight-sharing results for each benchmark on two independent experimental runs.
This session was recorded at the 2019 O'Reilly Artificial Intelligence Conference in New York.
Product information
- Title: Random search and reproducibility for neural architecture search
- Author(s):
- Release date: October 2019
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 0636920339373
You might also like
video
ODSC Europe 2018 (Open Data Science Conference)
ODSC Europe 2018 Royalties for this video set help fund the ODSC Grant Award for open …
video
Meet the Expert: Dean Wampler on Scaling ML/AI Applications with Ray
Modern ML and AI applications require a lot of compute power, which usually means distribution over …
video
Enhance recommendations in Uber Eats with graph convolutional networks
Uber Eats has become synonymous with online food ordering. With an increasing selection of restaurants and …
audiobook
Transformed
Help transform your business and innovate like the world's top tech companies! Transformed: Moving to the …