<aside> <img src="/icons/alien-pixel_green.svg" alt="/icons/alien-pixel_green.svg" width="40px" /> Home
</aside>
<aside> <img src="/icons/verified_green.svg" alt="/icons/verified_green.svg" width="40px" /> About PAPER
This channel is dedicated to building and expanding a comprehensive paper database focused on the Web Agent field and the boarder GUI agent field. Let’s collaborate to enrich this database and advance research in the exciting world of web agents!
</aside>
🔥 Newly updated(Jan)
New~Jan 22 |
New~Jan 26 |MalURLBench: A Benchmark Evaluating Agents’ Vulnerabilities When Processing Web URLs
New~Jan 26 |SwipeGen: Bridging the Execution Gap in GUI Agents via Human-like Swipe Synthesis
New~Jan 26 |GAIA: A Data Flywheel System for Training GUI Test-Time Scaling Critic Models
New~Jan 25 |EntWorld: A Holistic Environment and Benchmark for Verifiable Enterprise GUI Agents
New~Jan 22 |EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
Jan 21 |Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation
Jan 21 |CI4A: Semantic Component Interfaces for Agents Empowering Web Automation
Jan 18 |Zero-Permission Manipulation: Can We Trust Large Multimodal Model Powered GUI Agents?
Jan 14 |Compress to Focus: Efficient Coordinate Compression for Policy Optimization in Multi-Turn GUI Agents
Jan 14 |CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents
Jan 14 |GUI-Eyes: Tool-Augmented Perception for Visual Grounding in GUI Agents
Jan 13 |Beyond Clicking:A Step Towards Generalist GUI Grounding via Text Dragging
Jan 13 |WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents
Jan 13 |ExpSeek: Self-Triggered Experience Seeking for Web Agents
Jan 12 |ShowUI-Aloha: Human-Taught GUI Agent
Jan 9 |From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation
Jan 8 |Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning
Jan 8 |BackdoorAgent: A Unified Framework for Backdoor Attacks on LLM-based Agents
Jan 8 |InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training
Jan 8 |GUITester: Enabling GUI Agents for Exploratory Defect Discovery
Jan 7 |MobileDreamer: Generative Sketch World Model for GUI Agent
Jan 7 |FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection
Jan 7 |O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL
Jan 7 |WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks
Jan 7 |WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning
Jan 5 |AI Agent Systems: Architectures, Applications, and Evaluation