Designing Neural Network Architectures using Reinforcement Learning

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learnin...

Full description

Saved in:
Bibliographic Details
Main Authors Baker, Bowen, Gupta, Otkrist, Naik, Nikhil, Raskar, Ramesh
Format Journal Article
LanguageEnglish
Published 07.11.2016
Subjects
Online AccessGet full text
DOI10.48550/arxiv.1611.02167

Cover

Abstract At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using$Q$ -learning with an$\epsilon$ -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.
AbstractList At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using$Q$ -learning with an$\epsilon$ -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.
Author Naik, Nikhil
Baker, Bowen
Gupta, Otkrist
Raskar, Ramesh
Author_xml – sequence: 1
  givenname: Bowen
  surname: Baker
  fullname: Baker, Bowen
– sequence: 2
  givenname: Otkrist
  surname: Gupta
  fullname: Gupta, Otkrist
– sequence: 3
  givenname: Nikhil
  surname: Naik
  fullname: Naik, Nikhil
– sequence: 4
  givenname: Ramesh
  surname: Raskar
  fullname: Raskar, Ramesh
BackLink https://doi.org/10.48550/arXiv.1611.02167$$DView paper in arXiv
BookMark eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzQzNNQzMDI0M-dkcHZJLc5Mz8vMS1fwSy0tSswBUiXl-UXZCo5FyRmZJanJJaVFqcUKpcUgJUGpmXlp-UXJqbmpeSUKPqmJRSCdPAysaYk5xam8UJqbQd7NNcTZQxdsXXxBUWZuYlFlPMjaeLC1xoRVAAA0hDom
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.1611.02167
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 1611_02167
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_1611_021673
IEDL.DBID GOX
IngestDate Tue Sep 30 19:24:42 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_1611_021673
OpenAccessLink https://arxiv.org/abs/1611.02167
ParticipantIDs arxiv_primary_1611_02167
PublicationCentury 2000
PublicationDate 2016-11-07
PublicationDateYYYYMMDD 2016-11-07
PublicationDate_xml – month: 11
  year: 2016
  text: 2016-11-07
  day: 07
PublicationDecade 2010
PublicationYear 2016
Score 3.2188013
SecondaryResourceType preprint
Snippet At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Learning
Title Designing Neural Network Architectures using Reinforcement Learning
URI https://arxiv.org/abs/1611.02167
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQsTRLMzdPtEzSNbFMtdA1STJP1k1MSkrVtUw1SjNINTJKTQWftu_rZ-YRauIVYRrBxKAA2wuTWFSRWQY5HzipWB_YHAEdqWloZs7MwAxsKIA28_pHQCYnwUdxQdUj1AHbmGAhpErCTZCBH9q6U3CERIcQA1NqngiDswt4lQSwjlAAHYUBlPeDrL1WcEQaxS9WAC1BT1cISgWfZZoMHrZTgB5_mi7KIO_mGuLsoQu2Nr4AckZEPMhF8WAXGYsxsAB78qkSDApmoMPrUlMMUgwtkk3MQbdemKYlJxqapKQZJhklm1lIMkjgMkUKt5Q0AxewFjcDb5Azl2FgKSkqTZUF1pQlSXLg4AIAK2NuWw
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Designing+Neural+Network+Architectures+using+Reinforcement+Learning&rft.au=Baker%2C+Bowen&rft.au=Gupta%2C+Otkrist&rft.au=Naik%2C+Nikhil&rft.au=Raskar%2C+Ramesh&rft.date=2016-11-07&rft_id=info:doi/10.48550%2Farxiv.1611.02167&rft.externalDocID=1611_02167