Vyzkoušejte nový nástroj s podporou AI
Summon Research Assistant
BETA
A linearly convergent doubly stochastic Gauss–Seidel algorithm for solving linear equations and a certain class of over-parameterized optimization problems
Razaviyayn, Meisam, Hong, Mingyi, Reyhanian, Navid, Luo, Zhi-Quan
Published in Mathematical programming (01.07.2019)
Published in Mathematical programming (01.07.2019)
Get full text
Journal Article
Traffic light control using deep policy-gradient and value-function-based reinforcement learning
Mousavi, Seyed Sajad, Schukat, Michael, Howley, Enda
Published in IET intelligent transport systems (01.09.2017)
Published in IET intelligent transport systems (01.09.2017)
Get full text
Journal Article
Unified reinforcement Q-learning for mean field game and control problems
Angiuli, Andrea, Fouque, Jean-Pierre, Laurière, Mathieu
Published in Mathematics of control, signals, and systems (01.06.2022)
Published in Mathematics of control, signals, and systems (01.06.2022)
Get full text
Journal Article
Multi-agent Reinforcement Learning Aided Sampling Algorithms for a Class of Multiscale Inverse Problems
Chung, Eric, Leung, Wing Tat, Pun, Sai-Mang, Zhang, Zecheng
Published in Journal of scientific computing (01.08.2023)
Published in Journal of scientific computing (01.08.2023)
Get full text
Journal Article
Dual subgradient algorithms for large-scale nonsmooth learning problems
Cox, Bruce, Juditsky, Anatoli, Nemirovski, Arkadi
Published in Mathematical programming (01.12.2014)
Published in Mathematical programming (01.12.2014)
Get full text
Journal Article
Distributed Communication-Sliding Mirror-Descent Algorithm for Nonsmooth Resource Allocation Problem
Wang, Yinghui, Tu, Zhipeng, Qin, Huashu
Published in Journal of systems science and complexity (01.08.2022)
Published in Journal of systems science and complexity (01.08.2022)
Get full text
Journal Article
GPUSGD: A GPU-accelerated stochastic gradient descent algorithm for matrix factorization
Jin, Jing, Lai, Siyan, Hu, Su, Lin, Jing, Lin, Xiaola
Published in Concurrency and computation (25.09.2016)
Published in Concurrency and computation (25.09.2016)
Get full text
Journal Article