Model-Free Deep Recurrent Q-Network Reinforcement Learning for Quantum Circuit Architectures Design

Bibliographic Details
Published in	Quantum Reports Vol. 4; no. 4; pp. 380-389
Main Authors	Sogabe, Tomah; Kimura, Tomoaki; Chen, Chih-Chieh; Shiba, Kodai; Kasahara, Nobuhiro; Sogabe, Masaru; Sakamoto, Katsuyoshi
Format	Journal Article
Language	English
Published	Basel: MDPI AG, 01.12.2022
Subjects	Algorithms; Approximation; Artificial intelligence; Circuit design; Circuits; Discount rates; Hilbert space; Learning curves; LSTM; Machine learning; Markov analysis; Markov processes; Neural networks; Q-learning; Quantum circuits; Quantum theory; Qubits (quantum computing); Reinforcement learning; Teaching methods; Time series
ISSN	2624-960X
DOI	10.3390/quantum4040027

Summary:	Artificial intelligence (AI) is yielding new insights into the manipulation of quantum systems in the Noisy Intermediate-Scale Quantum (NISQ) era. Classical agent-based AI algorithms provide a framework for the design or control of quantum systems. Traditional reinforcement learning methods are designed for the Markov Decision Process (MDP) and hence have difficulty with partially observable or quantum observable decision processes. Because building or inferring a model of a specified quantum system is difficult, a model-free control approach is more practical and feasible than a model-based one. In this work, we apply a model-free deep recurrent Q-network (DRQN) reinforcement learning method to qubit-based quantum circuit architecture design problems. This paper is the first attempt to solve the quantum circuit design problem with a recurrent reinforcement learning algorithm using a discrete policy. Simulation results suggest that our long short-term memory (LSTM)-based DRQN method is able to learn quantum circuits for entangled Bell–Greenberger–Horne–Zeilinger (Bell–GHZ) states. This indicates that the DRQN could be a promising method for AI-based quantum circuit design applications; however, because we also observe unstable learning curves in experiments, further investigation of the stability issue is required.
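
To make the problem setting in the summary concrete, the following is a minimal sketch of a circuit-design task with a discrete action set, written as a small environment that a Q-learning agent could interact with. It is not the authors' implementation. The class name BellCircuitEnv, the four-gate action set, the fidelity-based reward, the step limit, and the choice of Pauli-Z expectation values as the observation are all assumptions made for illustration; the record does not specify any of them.

```python
import numpy as np

# Standard gate matrices. Basis ordering is |q0 q1>, index = 2*q0 + q1.
H = np.array([[1.0, 1.0], [1.0, -1.0]]) / np.sqrt(2)
I2 = np.eye(2)
CNOT_01 = np.array([[1, 0, 0, 0],   # control q0, target q1
                    [0, 1, 0, 0],
                    [0, 0, 0, 1],
                    [0, 0, 1, 0]], dtype=float)
CNOT_10 = np.array([[1, 0, 0, 0],   # control q1, target q0
                    [0, 0, 0, 1],
                    [0, 0, 1, 0],
                    [0, 1, 0, 0]], dtype=float)

# Hypothetical discrete action set: each action appends one gate.
ACTIONS = [np.kron(H, I2), np.kron(I2, H), CNOT_01, CNOT_10]


class BellCircuitEnv:
    """Illustrative two-qubit environment (an assumption, not the paper's)."""

    def __init__(self, max_steps=6, threshold=0.99):
        self.max_steps = max_steps
        self.threshold = threshold
        # Target Bell state (|00> + |11>) / sqrt(2).
        self.target = np.array([1.0, 0.0, 0.0, 1.0]) / np.sqrt(2)
        self.reset()

    def reset(self):
        self.state = np.array([1.0, 0.0, 0.0, 0.0])  # start in |00>
        self.steps = 0
        return self._observe()

    def _observe(self):
        # Partial observation: <Z> on each qubit, not the full state vector.
        p = np.abs(self.state) ** 2
        z0 = p[0] + p[1] - p[2] - p[3]
        z1 = p[0] - p[1] + p[2] - p[3]
        return np.array([z0, z1])

    def fidelity(self):
        return float(np.abs(self.target @ self.state) ** 2)

    def step(self, action):
        self.state = ACTIONS[action] @ self.state
        self.steps += 1
        f = self.fidelity()
        done = f >= self.threshold or self.steps >= self.max_steps
        reward = f if done else 0.0  # assumed reward: final fidelity only
        return self._observe(), reward, done


env = BellCircuitEnv()
env.reset()
for a in (0, 2):  # H on qubit 0, then CNOT with control q0
    obs, reward, done = env.step(a)
print(obs, reward, done)
```

In this sketch the gate sequence H on qubit 0 followed by CNOT reaches the target with fidelity 1 up to floating-point rounding. The observation for the Bell state, zero for both Z expectation values, is the same as for the unentangled state produced by applying H to both qubits, so a single observation does not identify the state. That kind of aliasing is one reason a recurrent agent that keeps a history of observations and actions, such as an LSTM-based DRQN, may be better suited than a memoryless Q-network.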