Vyzkoušejte nový nástroj s podporou AI
Summon Research Assistant
BETA
Improved Demonstration-Knowledge Utilization in Reinforcement Learning
Liu, Yanyu, Zeng, Yifeng, Ma, Biyang, Pan, Yinghui, Gao, Huifan, Zhang, Yuting
Published in IEEE transactions on artificial intelligence (01.05.2024)
Published in IEEE transactions on artificial intelligence (01.05.2024)
Get full text
Journal Article
DGRO: Enhancing LLM Reasoning via Exploration-Exploitation Control and Reward Variance Management
Su, Xuerui, Guo, Liya, Wang, Yue, Zhu, Yi, Ma, Zhiming, Wang, Zun, Liu, Yuting
Year of Publication 19.05.2025
Year of Publication 19.05.2025
Get full text
Journal Article