Computer Use
Build agents that can interact with browsers and desktop applications through screenshots, clicks, and keyboard input.Quick Start
Configuration
Available Tools
The computer use module provides these tools:| Tool | Description |
|---|---|
screenshot | Take a screenshot |
click | Click at coordinates |
doubleClick | Double-click at coordinates |
type | Type text |
key | Press a key or combination |
moveMouse | Move mouse to coordinates |
scroll | Scroll in a direction |
wait | Wait for a duration |
execute | Execute shell command |
With Playwright (Browser)
Human Approval
By default, all actions require human approval:Custom Tool Implementations
Agent Example
Safety Best Practices
- Always require approval - Enable
requireApproval: true - Limit capabilities - Only enable needed features
- Sandbox execution - Run in isolated environments
- Log all actions - Track what the agent does
- Set timeouts - Prevent runaway actions
Environment Variables
| Variable | Required | Description |
|---|---|---|
OPENAI_API_KEY | Yes | For the agent |
ANTHROPIC_API_KEY | For Claude | Claude vision |
Related
- Tool Approval - Human-in-the-loop approvals
- Image Agent - Vision capabilities

