Designing Neural Network Architectures using Reinforcement Learning
At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learnin...
Saved in:
| Main Authors | , , , |
|---|---|
| Format | Journal Article |
| Language | English |
| Published |
07.11.2016
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.48550/arxiv.1611.02167 |
Cover
| Abstract | At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using$Q$ -learning with an$\epsilon$ -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks. |
|---|---|
| AbstractList | At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful experimentation or modified from a handful of existing networks. We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task. The learning agent is trained to sequentially choose CNN layers using$Q$ -learning with an$\epsilon$ -greedy exploration strategy and experience replay. The agent explores a large but finite space of possible architectures and iteratively discovers designs with improved performance on the learning task. On image classification benchmarks, the agent-designed networks (consisting of only standard convolution, pooling, and fully-connected layers) beat existing networks designed with the same layer types and are competitive against the state-of-the-art methods that use more complex layer types. We also outperform existing meta-modeling approaches for network design on image classification tasks. |
| Author | Naik, Nikhil Baker, Bowen Gupta, Otkrist Raskar, Ramesh |
| Author_xml | – sequence: 1 givenname: Bowen surname: Baker fullname: Baker, Bowen – sequence: 2 givenname: Otkrist surname: Gupta fullname: Gupta, Otkrist – sequence: 3 givenname: Nikhil surname: Naik fullname: Naik, Nikhil – sequence: 4 givenname: Ramesh surname: Raskar fullname: Raskar, Ramesh |
| BackLink | https://doi.org/10.48550/arXiv.1611.02167$$DView paper in arXiv |
| BookMark | eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzQzNNQzMDI0M-dkcHZJLc5Mz8vMS1fwSy0tSswBUiXl-UXZCo5FyRmZJanJJaVFqcUKpcUgJUGpmXlp-UXJqbmpeSUKPqmJRSCdPAysaYk5xam8UJqbQd7NNcTZQxdsXXxBUWZuYlFlPMjaeLC1xoRVAAA0hDom |
| ContentType | Journal Article |
| Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| DBID | AKY GOX |
| DOI | 10.48550/arxiv.1611.02167 |
| DatabaseName | arXiv Computer Science arXiv.org |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| ExternalDocumentID | 1611_02167 |
| GroupedDBID | AKY GOX |
| ID | FETCH-arxiv_primary_1611_021673 |
| IEDL.DBID | GOX |
| IngestDate | Tue Sep 30 19:24:42 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-arxiv_primary_1611_021673 |
| OpenAccessLink | https://arxiv.org/abs/1611.02167 |
| ParticipantIDs | arxiv_primary_1611_02167 |
| PublicationCentury | 2000 |
| PublicationDate | 2016-11-07 |
| PublicationDateYYYYMMDD | 2016-11-07 |
| PublicationDate_xml | – month: 11 year: 2016 text: 2016-11-07 day: 07 |
| PublicationDecade | 2010 |
| PublicationYear | 2016 |
| Score | 3.2188013 |
| SecondaryResourceType | preprint |
| Snippet | At present, designing convolutional neural network (CNN) architectures requires both human expertise and labor. New architectures are handcrafted by careful... |
| SourceID | arxiv |
| SourceType | Open Access Repository |
| SubjectTerms | Computer Science - Learning |
| Title | Designing Neural Network Architectures using Reinforcement Learning |
| URI | https://arxiv.org/abs/1611.02167 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQsTRLMzdPtEzSNbFMtdA1STJP1k1MSkrVtUw1SjNINTJKTQWftu_rZ-YRauIVYRrBxKAA2wuTWFSRWQY5HzipWB_YHAEdqWloZs7MwAxsKIA28_pHQCYnwUdxQdUj1AHbmGAhpErCTZCBH9q6U3CERIcQA1NqngiDswt4lQSwjlAAHYUBlPeDrL1WcEQaxS9WAC1BT1cISgWfZZoMHrZTgB5_mi7KIO_mGuLsoQu2Nr4AckZEPMhF8WAXGYsxsAB78qkSDApmoMPrUlMMUgwtkk3MQbdemKYlJxqapKQZJhklm1lIMkjgMkUKt5Q0AxewFjcDb5Azl2FgKSkqTZUF1pQlSXLg4AIAK2NuWw |
| linkProvider | Cornell University |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Designing+Neural+Network+Architectures+using+Reinforcement+Learning&rft.au=Baker%2C+Bowen&rft.au=Gupta%2C+Otkrist&rft.au=Naik%2C+Nikhil&rft.au=Raskar%2C+Ramesh&rft.date=2016-11-07&rft_id=info:doi/10.48550%2Farxiv.1611.02167&rft.externalDocID=1611_02167 |