Meta-learning in spiking neural networks with reward-modulated STDP

The human brain constantly learns and rapidly adapts to new situations by integrating acquired knowledge and experiences into memory. Developing this capability in machine learning models is considered an important goal of AI research since deep neural networks perform poorly when there is limited d...

Full description

Saved in:
Bibliographic Details
Published inNeurocomputing (Amsterdam) Vol. 600; p. 128173
Main Authors Gholamzadeh Khoee, Arsham, Javaheri, Alireza, Kheradpisheh, Saeed Reza, Ganjtabesh, Mohammad
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.10.2024
Subjects
Online AccessGet full text
ISSN0925-2312
DOI10.1016/j.neucom.2024.128173

Cover

More Information
Summary:The human brain constantly learns and rapidly adapts to new situations by integrating acquired knowledge and experiences into memory. Developing this capability in machine learning models is considered an important goal of AI research since deep neural networks perform poorly when there is limited data or when they need to adapt quickly to new unseen tasks. Meta-learning models are proposed to facilitate quick learning in low-data regimes by employing absorbed information from the past. Although some models have recently been introduced that reached high-performance levels, they are not biologically plausible. In our research, we have proposed a bio-plausible meta-learning model inspired by the hippocampus and the prefrontal cortex using spiking neural networks with a reward-based learning system. The major contribution of our work lies in the design of a bio-plausible meta-learning framework that incorporates learning rules such as Spike-Timing-Dependent Plasticity (STDP) and Reward-Modulated STDP (R-STDP). This framework not only reflects biological learning mechanisms more accurately but also attains competitive results comparable to those achieved by traditional gradient descent-based approaches in meta-learning. Our proposed model includes a memory designed to prevent catastrophic forgetting, a phenomenon that occurs when meta-learning models forget what they have learned so far as learning the new task begins. Furthermore, our new model can easily be applied to spike-based neuromorphic devices and enables fast learning in neuromorphic hardware. The implications and predictions of various models for solving few-shot classification tasks are extensively analyzed. Base on the results, our model has demonstrated the ability to compete with the existing state-of-the-art meta-learning techniques, representing a significant step towards creating AI systems that emulate the human brain’s ability to learn quickly and efficiently from limited data. •“Higher accuracy & generalization w.r.t SOTA methods in few-shot classification tasks.”•“Improved the generalization of meta-SNNs by simulating an efficient episodic memory.”•“Demonstrating the potential of using reward-modulated STDP in SNNS for meta-learning.”
ISSN:0925-2312
DOI:10.1016/j.neucom.2024.128173