<aside> <img src="/icons/alien-pixel_green.svg" alt="/icons/alien-pixel_green.svg" width="40px" /> Home


Mission & Vision

Paper

Researcher

Company & Product

Event

Github

Opportunities

WebAgent Review

Podcast

Contributor

</aside>

<aside> <img src="/icons/verified_green.svg" alt="/icons/verified_green.svg" width="40px" /> About PAPER


This channel is dedicated to building and expanding a comprehensive paper database focused on the Web Agent field and the boarder GUI agent field. Let’s collaborate to enrich this database and advance research in the exciting world of web agents!

</aside>

🔥 Newly updated(Jan)

New~Jan 22 |

New~Jan 26 |MalURLBench: A Benchmark Evaluating Agents’ Vulnerabilities When Processing Web URLs

New~Jan 26 |SwipeGen: Bridging the Execution Gap in GUI Agents via Human-like Swipe Synthesis

New~Jan 26 |GAIA: A Data Flywheel System for Training GUI Test-Time Scaling Critic Models

New~Jan 25 |EntWorld: A Holistic Environment and Benchmark for Verifiable Enterprise GUI Agents

New~Jan 22 |EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Jan 21 |Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

Jan 21 |CI4A: Semantic Component Interfaces for Agents Empowering Web Automation

Jan 19 |MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux

Jan 18 |Zero-Permission Manipulation: Can We Trust Large Multimodal Model Powered GUI Agents?

Jan 14 |Compress to Focus: Efficient Coordinate Compression for Policy Optimization in Multi-Turn GUI Agents

Jan 14 |CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Jan 14 |PersonalAlign: Hierarchical Implicit Intent Alignment for Personalized GUI Agent with Long-Term User-Centric Records

Jan 14 |GUI-Eyes: Tool-Augmented Perception for Visual Grounding in GUI Agents

Jan 13 |Beyond Clicking:A Step Towards Generalist GUI Grounding via Text Dragging

Jan 13 |WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents

Jan 13 |ExpSeek: Self-Triggered Experience Seeking for Web Agents

Jan 12 |ShowUI-Aloha: Human-Taught GUI Agent

Jan 12 |When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent

Jan 9 |From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation

Jan 9 |FronTalk: Benchmarking Front-End Development as Conversational Code Generation with Multi-Modal Feedback

Jan 8 |Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning

Jan 8 |BackdoorAgent: A Unified Framework for Backdoor Attacks on LLM-based Agents

Jan 8 |InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Jan 8 |GUITester: Enabling GUI Agents for Exploratory Defect Discovery

Jan 7 |MobileDreamer: Generative Sketch World Model for GUI Agent

Jan 7 |FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection

Jan 7 |O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL

Jan 7 |WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Jan 7 |WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning

Jan 5 |AI Agent Systems: Architectures, Applications, and Evaluation

Jan 4 |Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization

Untitled