AgentKit Browser Automation

Intelligent Task Planning: Breaks down complex tasks into manageable steps
State Management: Tracks browser state and action results
Error Handling: Robust error handling and recovery mechanisms
Event System: Comprehensive event logging and monitoring
Flexible Action System: Extensible action registry for custom behaviors
Validation Framework: Built-in validation for task completion
Memory Management: Maintains context and history of actions

A sophisticated browser automation framework built with AgentKit, featuring a multi-agent system for intelligent web navigation and task execution.

This project implements a multi-agent system for browser automation, where different agents work together to:

The system consists of four specialized agents:

Planning Agent
- Breaks down tasks into actionable steps
- Creates detailed execution plans
- Determines task completion criteria
Navigator Agent
- Determines the next actions to take
- Manages state transitions
- Handles action execution
- Provides detailed logging and feedback
Browser Agent
- Executes browser automation actions
- Interacts with web elements
- Handles page navigation
- Manages browser state
Validation Agent
- Validates task completion
- Verifies results
- Handles error cases
- Provides feedback on success/failure

git clone https://github.com/tmahesh/playwright-agent.git
cd playwright-agent

npm install

cp .env.sample .env
# Edit .env with your OpenAI API key and other configurations

npx @playwright/mcp@latest --port 8931

npx tsx index.ts

npx inngest-cli@latest dev --no-discovery -u http://localhost:3000/api/inngest -v