GURU: A Reinforcement Learning Framework that Bridges LLM Reasoning Across Six Domains

Limitations of Reinforcement Learning in Narrow Reasoning Domains Reinforcement Learning RL has demonstrated strong potential to enhance the reasoning capabilities of LLMs, particularly in leading systems such as OpenAI-O3 and DeepSeek-R1. However, most RL research has focused narrowly on math…









