π Paper β’ π Blog β’ π’ AGI Inc β’ π Leaderboard β’ π API Docs β’ π‘ Examples
Build, evaluate, and level up your AI agents for the real web.
AGI SDK is a production-ready toolkit for building and evaluating AI browser agents in real-world environments.
It powers REAL Bench: a high-fidelity benchmark for AI agents navigating modern websites like Amazon, DoorDash, Airbnb, and more.
πΉ Train agents to browse and interact with real apps πΉ Benchmark agents with robust, standardized tasks πΉ Submit to the leaderboard and see how your agents stack up!
Features:
- β Full TypeScript support with complete type definitions
- β Built-in LLM agent (OpenAI, Anthropic, OpenRouter)
- β 140+ bundled real-world tasks across 11 website clones
- β Flexible custom agent API
- β Production-ready with comprehensive documentation
- β Leaderboard integration for benchmarking
# Install the SDK
npm install @theagicompany/agisdk
# Install Playwright browser dependencies
npx playwright install --force
# Set your LLM API key (for LLM-based agents)
export OPENAI_API_KEY="your-api-key" # or ANTHROPIC_API_KEY, etc.β Supports OpenAI, Anthropic, OpenRouter, and custom models!
import { REAL } from '@theagicompany/agisdk';
// Set your API key
process.env.OPENAI_API_KEY = 'your-api-key';
// Create agent with built-in LLM
const agent = new REAL.DemoAgent({
modelName: 'gpt-4o',
useAxtree: true,
useScreenshot: true,
});
// Create harness
const harness = new REAL.Harness({
agent,
taskName: 'v2.omnizon-1',
headless: false,
});
// Run task
const results = await harness.run();
console.log(results);import { REAL } from '@theagicompany/agisdk';
class MyAgent implements REAL.Agent {
async getAction(obs: REAL.Observation): Promise<string> {
// Your custom logic here
if (obs.url?.includes('product')) {
return "click('add-to-cart')";
}
return "send_msg_to_user('Task complete')";
}
}
const harness = new REAL.Harness({
agent: new MyAgent(),
taskName: 'v2.omnizon-1',
});
const results = await harness.run();See examples/ for more detailed examples.
- Full-stack web replicas of top real-world apps (Amazon, Uber, Gmail, Airbnb, etc.)
- Robust agent API: Observations, Actions, Memory, Errors
- Built-in LLM agent supporting OpenAI, Anthropic, and OpenRouter
- Customizable harness: plug your own agents
- TypeScript support: Full type definitions included
- Bundled tasks: All tasks included in package (no external dependencies)
The AGI SDK includes high-fidelity, fully-deterministic websites for agents to explore:
| App Clone | Task Prefix | Example Use Case |
|---|---|---|
| π Amazon β Omnizon | v2.omnizon-* |
Buy a laptop, find a gift |
| π DoorDash β DashDish | v2.dashdish-* |
Order dinner |
v2.flyunified-* |
Book a flight | |
| π‘ Airbnb β Staynb | v2.staynb-* |
Reserve accommodation |
| π Google Calendar β GoCalendar | v2.gocalendar-* |
Schedule a meeting |
| π¬ Gmail β GoMail | v2.gomail-* |
Compose an email |
| π½οΈ OpenTable β OpenDining | v2.opendining-* |
Book a restaurant |
| π LinkedIn β NetworkIn | v2.networkin-* |
Accept a connection |
| π Uber β Udriver | v2.udriver-* |
Book a ride |
| πΌ UpWork β TopWork | v2.topwork-* |
Find a freelance gig |
| π Zillow β Zilloft | v2.zilloft-* |
Browse houses |
Comprehensive documentation is available:
- API Reference - Complete API documentation for all classes and methods
- Task Guide - Understanding tasks, evaluation, and task configuration
- Agent Guide - Building custom agents and understanding DemoAgent
- Examples - Working examples with detailed explanations
- Contributing - Guidelines for contributing to the project
- Changelog - Version history and release notes
- New to the SDK? Start with example/starter.ts
- Building a custom agent? See example/custom.ts
- Want advanced features? Check out example/hackable.ts
- Submitting to leaderboard? Follow example/leaderboard.ts
We welcome contributions! Please see our Contributing Guide for details.
Ways to contribute:
- π’ Feature requests and ideas
- π Bug reports and fixes
- π Documentation improvements
- π‘ New examples and tutorials
- π§ͺ Test coverage improvements
Development setup:
git clone https://github.com/agi-inc/agisdk-js.git
cd agisdk-js
npm install
npm run buildSee CONTRIBUTING.md for detailed guidelines.
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Website: AGI Inc
- Leaderboard: REAL Bench
If you find this project useful, please consider:
- β Starring the repository
- π¦ Sharing on social media
- π Writing about your experience
- π€ Contributing to the project
Apache 2.0 - see LICENSE file for details.