About
REAL Lab is a research group at Zhejiang University focused on building intelligent systems that think, act, and learn.
Our mission is to move beyond static pattern matching toward AI that reasons through long-horizon problems, grounds knowledge in embodied interaction, acts as an autonomous agent, and continually self-evolves.
We work across large language models, multimodal foundation models, agent systems, and embodied AI, with an emphasis on the fundamental capabilities that connect them. We believe the next generation of AI will be defined not by model scale alone, but by how these four pillars (reasoning, embodiment, agency, and learnability) come together in a single system.
News
- 2026.04 LAUNCH REAL Lab website goes live. We are recruiting PhD students, master's students, and research interns; see Join Us.
- 2026.04 OPEN SOURCE Released ClawGUI, a unified open-source framework for training, evaluating, and deploying GUI agents. [paper]
- 2026.04 ACL Ten papers accepted to ACL 2026 (8 Main Conference, 2 Findings), spanning GUI agents, reasoning, reinforcement learning, and multimodal models.
- 2026.02 CVPR GUI-SAGE: Enhancing GUI Automation with Self-Explanatory Learning accepted to CVPR 2026.
- 2026.02 ICLR Six papers accepted to ICLR 2026: InftyThink, VerifyBench, MathFimer, Time Is a Feature, IWR-Bench, and SpatialLadder, spanning reasoning, reward modeling, diffusion LMs, and multimodal evaluation.
- 2025.12 AAAI Four papers accepted to AAAI 2026: GUI-G², Test-Time RL for GUI Grounding, Reality vs Counterfactual (Theory of Mind), and more, spanning GUI agents, reasoning, and multimodal understanding.
- 2025.10 OPEN SOURCE Released EasySteer, a unified open-source framework for high-performance and extensible LLM steering. [paper]
- 2025.09 EMNLP Multiple papers accepted at EMNLP 2025: AskToAct (main), Logic (Findings), DB-Explore (Findings).
- 2025.07 ACM MM SVGenius, a benchmark for LLMs in SVG understanding, editing, and generation, accepted to ACM Multimedia 2025.
- 2025.05 ACL STaR-SQL accepted to ACL 2025; Scaling LLMs' Social Reasoning to ACL Findings.
- 2025.03 PREPRINT Released Embodied-Reasoner, synergizing visual search, reasoning, and action for embodied interactive tasks.
- 2024.09 NeurIPS TaskBench: Benchmarking LLMs for Task Automation accepted to NeurIPS 2024 (Datasets and Benchmarks Track).