Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks

With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory and practice. However, traditional reinforcement learning algorithms often entail high energy consumption during interactions with the environm...

Full description

Saved in:
Bibliographic Details
Main Authors Pan, Yuhao, Wang, Xiucheng, Cheng, Nan, Qiu, Qi
Format Journal Article
LanguageEnglish
Published 19.06.2024
Subjects
Online AccessGet full text
DOI10.48550/arxiv.2406.13568

Cover

Abstract With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory and practice. However, traditional reinforcement learning algorithms often entail high energy consumption during interactions with the environment. Spiking Neural Network (SNN), with their low energy consumption characteristics and performance comparable to deep neural networks, have garnered widespread attention. To reduce the energy consumption of practical applications of reinforcement learning, researchers have successively proposed the Pop-SAN and MDC-SAN algorithms. Nonetheless, these algorithms use rectangular functions to approximate the spike network during the training process, resulting in low sensitivity, thus indicating room for improvement in the training effectiveness of SNN. Based on this, we propose a trapezoidal approximation gradient method to replace the spike network, which not only preserves the original stable learning state but also enhances the model's adaptability and response sensitivity under various signal dynamics. Simulation results show that the improved algorithm, using the trapezoidal approximation gradient to replace the spike network, achieves better convergence speed and performance compared to the original algorithm and demonstrates good training stability.
AbstractList With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory and practice. However, traditional reinforcement learning algorithms often entail high energy consumption during interactions with the environment. Spiking Neural Network (SNN), with their low energy consumption characteristics and performance comparable to deep neural networks, have garnered widespread attention. To reduce the energy consumption of practical applications of reinforcement learning, researchers have successively proposed the Pop-SAN and MDC-SAN algorithms. Nonetheless, these algorithms use rectangular functions to approximate the spike network during the training process, resulting in low sensitivity, thus indicating room for improvement in the training effectiveness of SNN. Based on this, we propose a trapezoidal approximation gradient method to replace the spike network, which not only preserves the original stable learning state but also enhances the model's adaptability and response sensitivity under various signal dynamics. Simulation results show that the improved algorithm, using the trapezoidal approximation gradient to replace the spike network, achieves better convergence speed and performance compared to the original algorithm and demonstrates good training stability.
Author Qiu, Qi
Pan, Yuhao
Cheng, Nan
Wang, Xiucheng
Author_xml – sequence: 1
  givenname: Yuhao
  surname: Pan
  fullname: Pan, Yuhao
– sequence: 2
  givenname: Xiucheng
  surname: Wang
  fullname: Wang, Xiucheng
– sequence: 3
  givenname: Nan
  surname: Cheng
  fullname: Cheng, Nan
– sequence: 4
  givenname: Qi
  surname: Qiu
  fullname: Qiu, Qi
BackLink https://doi.org/10.48550/arXiv.2406.13568$$DView paper in arXiv
BookMark eNqFjrsOgkAQRbfQwtcHWLk_III8Qq-ohbFQrMkGZs0EmCUDwcfX6xp7q3Nz7y3OWAzIEAgx91wniMPQXSl-YO-sAzdyPD-M4pG4pqwaeBksVCX3rAoE6uQW2txSG5aJ1pB32IM8A9KnyaG22xEUE9JNIslLg6WNJ-juhst2KoZaVS3MfpyIxS5JN4flVyBrGGvFz8yKZF8R___jDehyQKU
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2406.13568
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2406_13568
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2406_135683
IEDL.DBID GOX
IngestDate Tue Jul 22 22:49:06 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2406_135683
OpenAccessLink https://arxiv.org/abs/2406.13568
ParticipantIDs arxiv_primary_2406_13568
PublicationCentury 2000
PublicationDate 2024-06-19
PublicationDateYYYYMMDD 2024-06-19
PublicationDate_xml – month: 06
  year: 2024
  text: 2024-06-19
  day: 19
PublicationDecade 2020
PublicationYear 2024
Score 3.7534623
SecondaryResourceType preprint
Snippet With the rapid development of artificial intelligence technology, the field of reinforcement learning has continuously achieved breakthroughs in both theory...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Artificial Intelligence
Title Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks
URI https://arxiv.org/abs/2406.13568
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQMTEDJgzDNFDfJClV18QQGBeJxonJwD6PWaJBipmJJbB8AK228DPzCDXxijCNYGJQgO2FSSyqyCyDnA-cVKwPqm5ANzOYWTAzMAMbCqDNvP4RkMlJ8FFcUPUIdcA2JlgIqZJwE2Tgh7buFBwh0SHEwJSaJ8IQGgLa4VSVn5kClHIvAq-xKlFwgZyjpABsNCpAjhAGljsKQangk0yTwYN2CtDDT9MVMvMUggsyQaPaCn6QddvFogzybq4hzh66YIfEF0BOjYgHuTEe7EZjMQYWYN8-VYJBwTjFxCzZ1MgiFUiapCYbJppZpgGbbMC8kJpskJJmIskggcsUKdxS0gxcRsC6F7SiydBShoGlpKg0VRZYd5YkyYEDEACyd3RQ
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Trapezoidal+Gradient+Descent+for+Effective+Reinforcement+Learning+in+Spiking+Networks&rft.au=Pan%2C+Yuhao&rft.au=Wang%2C+Xiucheng&rft.au=Cheng%2C+Nan&rft.au=Qiu%2C+Qi&rft.date=2024-06-19&rft_id=info:doi/10.48550%2Farxiv.2406.13568&rft.externalDocID=2406_13568