Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks
| Main Authors | Pan, Yuhao; Wang, Xiucheng; Cheng, Nan; Qiu, Qi |
|---|---|
| Format | Journal Article |
| Language | English |
| Published | 19.06.2024 |
| Subjects | Computer Science - Artificial Intelligence |
| Online Access | https://arxiv.org/abs/2406.13568 |
| DOI | 10.48550/arxiv.2406.13568 |
| Abstract | Reinforcement learning continues to achieve breakthroughs in both theory and practice as artificial intelligence technology develops rapidly. However, traditional reinforcement learning algorithms often consume a large amount of energy while interacting with the environment. Spiking neural networks (SNNs), which combine low energy consumption with performance comparable to deep neural networks, have attracted widespread attention. To reduce the energy consumption of reinforcement learning in practical applications, researchers have proposed the Pop-SAN and MDC-SAN algorithms. These algorithms, however, use rectangular functions to approximate the spike network during training, which results in low sensitivity and leaves room to improve how effectively the SNN is trained. We therefore propose a trapezoidal approximation gradient method to replace the spike network; it preserves the original stable learning state while improving the model's adaptability and response sensitivity under various signal dynamics. Simulation results show that the improved algorithm, which uses the trapezoidal approximation gradient in place of the spike network, converges faster and performs better than the original algorithm, and that it trains stably. |
|---|---|
| Author | Pan, Yuhao; Wang, Xiucheng; Cheng, Nan; Qiu, Qi |
| BackLink | https://doi.org/10.48550/arXiv.2406.13568 (View paper in arXiv) |
| ContentType | Journal Article |
| Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| OpenAccessLink | https://arxiv.org/abs/2406.13568 |
| PublicationDate | 2024-06-19 |
| SecondaryResourceType | preprint |
| SourceID | arxiv |
| SourceType | Open Access Repository |
| SubjectTerms | Computer Science - Artificial Intelligence |
| Title | Trapezoidal Gradient Descent for Effective Reinforcement Learning in Spiking Networks |
| URI | https://arxiv.org/abs/2406.13568 |
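
The abstract in this record contrasts a rectangular approximation with a trapezoidal one but does not give either formula. The sketch below is only an illustration of the general idea, under stated assumptions: the function names, the threshold `v_th`, and the width parameters `width`, `top`, and `base` are all hypothetical and are not taken from the paper. It shows how a trapezoidal surrogate gradient still responds, with reduced strength, to membrane potentials that a rectangular window of similar size ignores.

```python
def spike(v, v_th=1.0):
    """Forward pass: a neuron fires (1.0) when its potential reaches the threshold."""
    return 1.0 if v >= v_th else 0.0


def rect_surrogate(v, v_th=1.0, width=1.0):
    """Rectangular surrogate gradient (assumed form).

    Constant inside a window of size `width` centred on the threshold,
    exactly zero outside it.
    """
    return 1.0 / width if abs(v - v_th) < width / 2 else 0.0


def trapezoid_surrogate(v, v_th=1.0, top=0.5, base=1.5):
    """Trapezoidal surrogate gradient (assumed form, not the paper's).

    Flat plateau of size `top` centred on the threshold, then linear
    ramps that reach zero at distance `base / 2`. The height is chosen
    so the area under the trapezoid is 1.
    """
    if not 0 <= top < base:
        raise ValueError("require 0 <= top < base")
    d = abs(v - v_th)
    height = 2.0 / (top + base)
    if d <= top / 2:
        return height
    if d < base / 2:
        return height * (base / 2 - d) / (base / 2 - top / 2)
    return 0.0


if __name__ == "__main__":
    # Compare both surrogates at a few membrane potentials.
    for v in (1.0, 1.4, 1.6, 2.0):
        print(v, spike(v), rect_surrogate(v), trapezoid_surrogate(v))
```

With these illustrative defaults, a potential of 1.6 lies outside the rectangular window and receives no gradient, while the trapezoid's ramp still passes a small one. This is one plausible reading of the "response sensitivity" the abstract mentions; the paper itself should be consulted for the actual definition.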