Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks

Bibliographic Details
Main Authors	Pan, Yuhao; Wang, Xiucheng; Cheng, Nan; Qiu, Qi
Format	Journal Article
Language	English
Published	19.06.2024
Subjects	Computer Science - Artificial Intelligence
DOI	10.48550/arxiv.2406.13568

Abstract	Reinforcement learning continues to advance in both theory and practice, but traditional reinforcement learning algorithms often consume considerable energy while interacting with the environment. Spiking neural networks (SNNs), which combine low energy consumption with performance comparable to deep neural networks, have attracted widespread attention. To reduce the energy consumption of reinforcement learning in practical applications, researchers have successively proposed the Pop-SAN and MDC-SAN algorithms. These algorithms, however, use rectangular functions to approximate the spike network during training, which results in low sensitivity and leaves room to improve the training effectiveness of SNNs. We therefore propose a trapezoidal approximation gradient method to replace the spike network; it preserves the original stable learning state while improving the model's adaptability and response sensitivity under various signal dynamics. Simulation results show that the improved algorithm, with the trapezoidal approximation gradient in place of the spike network, converges faster and performs better than the original algorithm, and that it trains stably.
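
The contrast the abstract draws between a rectangular and a trapezoidal approximation can be sketched as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not give the function shapes, so the threshold `v_th`, the half-width `width`, the plateau half-width `flat`, and all function names are assumptions chosen for the example.

```python
import numpy as np

# Hypothetical parameters; none of these values come from the paper.
V_TH = 0.5    # firing threshold (assumed)
WIDTH = 0.5   # half-width of the window where the gradient is non-zero (assumed)
FLAT = 0.25   # half-width of the trapezoid's flat top (assumed)


def spike(v, v_th=V_TH):
    """Forward pass: a neuron fires (1.0) when its membrane potential reaches the threshold."""
    return (v >= v_th).astype(float)


def rect_surrogate(v, v_th=V_TH, width=WIDTH):
    """Rectangular pseudo-gradient: 1 inside the window around v_th, 0 outside."""
    return (np.abs(v - v_th) < width).astype(float)


def trap_surrogate(v, v_th=V_TH, flat=FLAT, width=WIDTH):
    """Trapezoidal pseudo-gradient: 1 on the plateau, then a linear ramp down to 0 at the window edge."""
    d = np.abs(v - v_th)
    return np.clip((width - d) / (width - flat), 0.0, 1.0)


v = np.array([0.5, 0.875, 1.1])
rect = rect_surrogate(v)
trap = trap_surrogate(v)
```

In this sketch the rectangular function returns the same value everywhere inside its window, while the trapezoid scales the gradient by distance from the threshold near the window edge. That graded response is one plausible reading of the "response sensitivity" the abstract mentions.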
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
OpenAccessLink	https://arxiv.org/abs/2406.13568
SecondaryResourceType	preprint