<aside> <img src="/icons/alien-pixel_green.svg" alt="/icons/alien-pixel_green.svg" width="40px" /> Home


Mission & Vision

Paper

Researcher

Company & Product

Event

Github

Opportunities

WebAgent Review

Podcast

Contributor

</aside>

<aside> <img src="/icons/verified_green.svg" alt="/icons/verified_green.svg" width="40px" /> About PAPER


This channel is dedicated to building and expanding a comprehensive paper database focused on the Web Agent field and the boarder GUI agent field. Let’s collaborate to enrich this database and advance research in the exciting world of web agents!

</aside>

🔥 Newly updated(Dec)

New~Dec 10 |GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection

New~Dec 09 |EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce

New~Dec 09 |MVP: Multiple View Prediction Improves GUI Grounding

New~Dec 07 |An Index-based Approach for Efficient and Effective Web Content Extraction

New~Dec 05 |Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding

New~Dec 04 |AgentBay: A Hybrid Interaction Sandbox for Seamless Human-AI Intervention in Agentic Systems

New~Dec 03 |Evaluating Long-Context Reasoning in LLM-Based WebAgents

New~Dec 02 |LegalWebAgent: Empowering Access to Justice via LLM-Based Web Agents

Dec 02 |PPTArena: A Benchmark for Agentic PowerPoint Editing

Dec 02 |GUI Exploration Lab: Enhancing Screen Navigation in Agents via Multi-Turn Reinforcement Learning

Dec 02 |From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

Dec 01 |HiconAgent: History Context-aware Policy Optimization for GUI Agents

Dec 01 | DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks

Dec 01 |Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback

Nov 30 |MPR-GUI: Benchmarking and Enhancing Multilingual Perception and Reasoning in GUI Agents

Nov 30 |AFRAgent : An Adaptive Feature Renormalization Based High Resolution Aware GUI agent

Untitled