Designing Neural Network Architectures using Reinforcement Learning

At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learnin...

Full description

Saved in:

Bibliographic Details
Main Authors	Baker, Bowen, Gupta, Otkrist, Naik, Nikhil, Raskar, Ramesh
Format	Journal Article
Language	English
Published	07.11.2016
Subjects	Computer Science - Learning
Online Access	Get full text
DOI	10.48550/arxiv.1611.02167

Cover

Abstract	At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using$Q$ -learning with an$\epsilon$ -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.
AbstractList	At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using$Q$ -learning with an$\epsilon$ -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks.
Author	Naik, Nikhil Baker, Bowen Gupta, Otkrist Raskar, Ramesh
Author_xml	– sequence: 1 givenname: Bowen surname: Baker fullname: Baker, Bowen – sequence: 2 givenname: Otkrist surname: Gupta fullname: Gupta, Otkrist – sequence: 3 givenname: Nikhil surname: Naik fullname: Naik, Nikhil – sequence: 4 givenname: Ramesh surname: Raskar fullname: Raskar, Ramesh
BackLink	https://doi.org/10.48550/arXiv.1611.02167$$DView paper in arXiv
BookMark	eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzQzNNQzMDI0M-dkcHZJLc5Mz8vMS1fwSy0tSswBUiXl-UXZCo5FyRmZJanJJaVFqcUKpcUgJUGpmXlp-UXJqbmpeSUKPqmJRSCdPAysaYk5xam8UJqbQd7NNcTZQxdsXXxBUWZuYlFlPMjaeLC1xoRVAAA0hDom
ContentType	Journal Article
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	AKY GOX
DOI	10.48550/arxiv.1611.02167
DatabaseName	arXiv Computer Science arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	1611_02167
GroupedDBID	AKY GOX
ID	FETCH-arxiv_primary_1611_021673
IEDL.DBID	GOX
IngestDate	Tue Sep 30 19:24:42 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-arxiv_primary_1611_021673
OpenAccessLink	https://arxiv.org/abs/1611.02167
ParticipantIDs	arxiv_primary_1611_02167
PublicationCentury	2000
PublicationDate	2016-11-07
PublicationDateYYYYMMDD	2016-11-07
PublicationDate_xml	– month: 11 year: 2016 text: 2016-11-07 day: 07
PublicationDecade	2010
PublicationYear	2016
Score	3.2188013
SecondaryResourceType	preprint
Snippet	At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Computer Science - Learning
Title	Designing Neural Network Architectures using Reinforcement Learning
URI	https://arxiv.org/abs/1611.02167
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQsTRLMzdPtEzSNbFMtdA1STJP1k1MSkrVtUw1SjNINTJKTQWftu_rZ-YRauIVYRrBxKAA2wuTWFSRWQY5HzipWB_YHAEdqWloZs7MwAxsKIA28_pHQCYnwUdxQdUj1AHbmGAhpErCTZCBH9q6U3CERIcQA1NqngiDswt4lQSwjlAAHYUBlPeDrL1WcEQaxS9WAC1BT1cISgWfZZoMHrZTgB5_mi7KIO_mGuLsoQu2Nr4AckZEPMhF8WAXGYsxsAB78qkSDApmoMPrUlMMUgwtkk3MQbdemKYlJxqapKQZJhklm1lIMkjgMkUKt5Q0AxewFjcDb5Azl2FgKSkqTZUF1pQlSXLg4AIAK2NuWw
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Designing+Neural+Network+Architectures+using+Reinforcement+Learning&rft.au=Baker%2C+Bowen&rft.au=Gupta%2C+Otkrist&rft.au=Naik%2C+Nikhil&rft.au=Raskar%2C+Ramesh&rft.date=2016-11-07&rft_id=info:doi/10.48550%2Farxiv.1611.02167&rft.externalDocID=1611_02167