diff --git a/content/docs/custom-evaluators.md b/content/docs/custom-evaluators.md
index 2192cd7..eb61a9b 100644
--- a/content/docs/custom-evaluators.md
+++ b/content/docs/custom-evaluators.md
@@ -1,108 +1,48 @@
 ---
-title: "Custom Evaluators"
+title: Custom Evaluators
 weight: 3
-description: "Write your own scoring logic in Python, JavaScript, or any language."
+description: Extend agentevals with your own evaluator logic in Python.
 ---
 
-Beyond the built-in metrics, you can write your own evaluators in Python, JavaScript, or any language. An evaluator is any program that reads JSON from stdin and writes a score to stdout.
+If the built-in rubric does not cover your use case, you can write custom evaluators in Python and run them alongside the default scoring pipeline.
 
-> For the comprehensive guide, see [custom-evaluators.md](https://github.com/agentevals-dev/agentevals/blob/main/docs/custom-evaluators.md) in the repository.
-
-## Scaffold an Evaluator
+## Install the evaluator SDK
 
 ```bash
-agentevals evaluator init my_evaluator
+pip install agentevals-evaluator-sdk
 ```
 
-This creates a directory with boilerplate and a manifest:
-
-```
-my_evaluator/
-├── my_evaluator.py    # your scoring logic
-└── evaluator.yaml     # metadata manifest
-```
+## Example
 
-You can also list supported runtimes and generate config snippets:
-
-```bash
-agentevals evaluator runtimes              # show supported languages
-agentevals evaluator config my_evaluator \
-  --path ./evaluators/my_evaluator.py      # generate config snippet
-```
-
-## Implement Scoring Logic
-
-Your function receives an `EvalInput` with the agent's invocations and returns an `EvalResult` with a score between 0.0 and 1.0.
+Create a file such as `my_eval.py`:
 
 ```python
-from agentevals_evaluator_sdk import EvalInput, EvalResult, evaluator
+from agentevals_evaluator import Evaluator, Score
 
-@evaluator
-def my_evaluator(input: EvalInput) -> EvalResult:
-    scores = []
-    for inv in input.invocations:
-        # Your scoring logic here
-        score = 1.0
-        scores.append(score)
-    return EvalResult(
-        score=sum(scores) / len(scores) if scores else 0.0,
-        per_invocation_scores=scores,
-    )
+class PolitenessEvaluator(Evaluator):
+    name = "politeness"
+    description = "Checks whether the agent response is polite and professional"
 
-if __name__ == "__main__":
-    my_evaluator.run()
+    def evaluate(self, trace) -> Score:
+        text = trace.output_text.lower()
+        passed = "please" in text or "thank you" in text
+        return Score(
+            value=1.0 if passed else 0.0,
+            reasoning="Response includes polite phrasing" if passed else "Response is missing polite phrasing",
+        )
 ```
 
-Install the SDK standalone with `pip install agentevals-evaluator-sdk` (no heavy dependencies).
-
-## Reference in Eval Config
-
-```yaml
-# eval_config.yaml
-evaluators:
-  - name: tool_trajectory_avg_score
-    type: builtin
-
-  - name: my_evaluator
-    type: code
-    path: ./evaluators/my_evaluator.py
-    threshold: 0.7
-```
+Then run it with the CLI:
 
 ```bash
-agentevals run trace.json --config eval_config.yaml --eval-set eval_set.json
-```
-
-## Community Evaluators
-
-Community evaluators can be referenced directly from the shared [evaluators repository](https://github.com/agentevals-dev/evaluators) using `type: remote`:
-
-```yaml
-evaluators:
-  - name: response_quality
-    type: remote
-    source: github
-    ref: evaluators/response_quality/response_quality.py
-    threshold: 0.7
-    config:
-      min_response_length: 20
+agentevals run \
+  --otlp-endpoint http://localhost:6006/v1/traces \
+  --evaluator my_eval.py:PolitenessEvaluator
 ```
 
-Browse available community evaluators on the [Evaluators](/evaluators/) page, or contribute your own.
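Before wiring an evaluator into the CLI, it can help to smoke-test the scoring logic in isolation. The sketch below reproduces the `PolitenessEvaluator` against minimal stand-in `Evaluator`, `Score`, and `Trace` types (these stand-ins are assumptions for illustration; the real `agentevals_evaluator` SDK may shape them differently) so it runs without the SDK installed:

```python
from dataclasses import dataclass

# NOTE: Score, Evaluator, and Trace are minimal stand-ins for the SDK types
# used in the docs example above; the real agentevals_evaluator module may
# define them differently.
@dataclass
class Score:
    value: float
    reasoning: str

class Evaluator:
    """Base-class stand-in: subclasses set `name` and implement `evaluate`."""
    name = ""
    description = ""

@dataclass
class Trace:
    output_text: str

class PolitenessEvaluator(Evaluator):
    name = "politeness"
    description = "Checks whether the agent response is polite and professional"

    def evaluate(self, trace) -> Score:
        # Same scoring logic as the example: look for polite phrasing.
        text = trace.output_text.lower()
        passed = "please" in text or "thank you" in text
        return Score(
            value=1.0 if passed else 0.0,
            reasoning="Response includes polite phrasing"
            if passed
            else "Response is missing polite phrasing",
        )

# Exercise the evaluator against two stub traces.
polite = PolitenessEvaluator().evaluate(Trace("Thank you for your patience!"))
blunt = PolitenessEvaluator().evaluate(Trace("Task finished."))
print(polite.value, blunt.value)  # → 1.0 0.0
```

A quick standalone check like this keeps the evaluator's pass/fail boundary obvious before the CLI, a model, or real traces enter the picture.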
-
-## Supported Languages
-
-Evaluators can be written in any language that reads JSON from stdin and writes JSON to stdout.
-
-| Language | Extension | SDK available |
-|---|---|---|
-| Python | `.py` | `pip install agentevals-evaluator-sdk` |
-| JavaScript | `.js` | No SDK yet — just read stdin, write stdout |
-| TypeScript | `.ts` | No SDK yet — just read stdin, write stdout |
-
-## Further Reading
+## Tips
 
-- [Custom Evaluators Guide](https://github.com/agentevals-dev/agentevals/blob/main/docs/custom-evaluators.md) — Full protocol reference
-- [Community Evaluators](/evaluators/) — Browse and submit evaluators
-- [Eval Set Format](https://github.com/agentevals-dev/agentevals/blob/main/docs/eval-set-format.md) — Schema and field reference for eval set JSON files
+- Keep evaluators deterministic when possible
+- Return short, useful reasoning strings for debugging
+- Start with a binary pass/fail score before adding more complex grading
diff --git a/content/docs/faq.md b/content/docs/faq.md
index 258efc9..5c4bbaf 100644
--- a/content/docs/faq.md
+++ b/content/docs/faq.md
@@ -1,47 +1,33 @@
 ---
-title: "FAQ"
+title: FAQ
 weight: 6
-description: "Frequently asked questions about AgentEvals."
+description: Common questions about how agentevals works and how to deploy it.
 ---
 
-## How does this compare to ADK's evaluations?
+## Does agentevals re-run my agent?
 
-Unlike ADK's LocalEvalService, which couples agent execution with evaluation, agentevals only handles scoring: it takes pre-recorded traces and compares them against expected behavior using metrics like tool trajectory matching, response quality, and LLM-based judgments.
+No. agentevals evaluates behavior from existing OpenTelemetry traces, so you can score what actually happened in production or staging without replaying requests.
 
-However, if you're iterating on your agents locally, you can point your agents to agentevals and you will see rich runtime information in your browser. For more details, use the bundled wheel and explore the Local Development option in the UI.
+## What do I need to get started?
 
-## How does this compare to Bedrock AgentCore's evaluation?
+You need:
 
-AgentCore's evaluation integration (via `strands-agents-evals`) also couples agent execution with evaluation. It re-invokes the agent for each test case, converts the resulting OTel spans to AWS's ADOT format, and scores them against 4 built-in evaluators (Helpfulness, Accuracy, Harmfulness, Relevance) via a cloud API call. This means you need an AWS account, valid credentials, and network access for every evaluation.
+- OpenTelemetry traces from your agent or workflow
+- An evaluator model configured for scoring
+- The CLI installed with `pip install agentevals-cli`
 
-agentevals takes a different approach: it scores pre-recorded traces locally without re-running anything. It works with standard Jaeger JSON and OTLP formats from any framework, supports open-ended metrics (tool trajectory matching, LLM-based judges, custom scorers), and ships with a CLI and web UI. No cloud dependency required.
+## Can I write my own evaluators?
 
-## What trace formats are supported?
+Yes. Install the SDK with `pip install agentevals-evaluator-sdk` and register your Python evaluator class with the CLI.
 
-AgentEvals supports **OTLP** (OpenTelemetry Protocol) with `http/protobuf` and `http/json`, plus **Jaeger JSON** trace exports. Works with any OTel-instrumented framework including LangChain, Strands, Google ADK, and others.
+## Where do results show up?
 
-## Do I need to re-run my agent to evaluate it?
+Results can be written back to your backend, exported in CI, or inspected in the agentevals UI.
 
-No. Record once, score as many times as you want. AgentEvals evaluates from existing traces, so you never need to replay expensive LLM calls.
+## Does this work with any tracing backend?
 
-## What frameworks are supported?
+It works anywhere you can access OpenTelemetry-compatible trace data or an OTLP endpoint that exposes the traces agentevals needs.
 
-Any framework that emits OpenTelemetry spans works out of the box. This includes **LangChain**, **Strands**, **Google ADK**, and any other OTel-instrumented framework. The zero-code integration requires no SDK — just point your agent's OTel exporter to agentevals.
+## Is there a web UI?
 
-## Can I write custom evaluators?
-
-Yes. Evaluators can be written in Python, JavaScript, or any language that reads JSON from stdin and writes JSON to stdout. See the [Custom Evaluators](/docs/custom-evaluators/) page for details.
-
-A Python SDK is available (`pip install agentevals-evaluator-sdk`) for convenience, but it's not required.
-
-## Can I use this in CI/CD?
-
-Absolutely. The CLI is designed for CI integration. Use `--output json` for machine-readable results. See the [CLI & CI/CD section](/docs/integrations/#cli--cicd) for a GitHub Actions example.
-
-## Is there a community evaluator registry?
-
-Yes. Browse community-contributed evaluators on the [Evaluators](/evaluators/) page, or contribute your own to the [evaluators repository](https://github.com/agentevals-dev/evaluators).
-
-## Is AgentEvals open source?
-
-Yes. AgentEvals is open source and available on [GitHub](https://github.com/agentevals-dev/agentevals). Contributions are welcome!
+Yes — see the [UI Walkthrough](/docs/ui-walkthrough/) for the current workflow and screenshots.
diff --git a/content/docs/quick-start.md b/content/docs/quick-start.md
index 85532ac..b952648 100644
--- a/content/docs/quick-start.md
+++ b/content/docs/quick-start.md
@@ -1,66 +1,42 @@
 ---
-title: "Quick Start"
+title: Quick Start
 weight: 1
-description: "Get up and running with AgentEvals in under 5 minutes."
+description: Install agentevals, point it at your traces, and run your first evaluation.
 ---
 
-## Installation
+agentevals scores AI agent behavior from existing OpenTelemetry traces — no re-runs required.
 
-Grab a wheel from the [releases page](https://github.com/agentevals-dev/agentevals/releases). The **core** wheel has the CLI and REST API. The **bundle** wheel adds streaming and the embedded web UI.
+## Install the CLI
 
 ```bash
-pip install agentevals-<version>-py3-none-any.whl
-
-# For live streaming support:
-pip install "agentevals-<version>-py3-none-any.whl[live]"
-```
-
-**From source** with `uv` or Nix:
-
-```bash
-uv sync
-# or: nix develop .
+pip install agentevals-cli
 ```
 
-See [DEVELOPMENT.md](https://github.com/agentevals-dev/agentevals/blob/main/DEVELOPMENT.md) for build instructions.
-
-## CLI Quick Start
+## Run your first evaluation
 
-Run an evaluation against a sample trace:
+Point the CLI at an OTLP endpoint and evaluate the traces it finds.
 
 ```bash
-uv run agentevals run samples/helm.json \
-  --eval-set samples/eval_set_helm.json \
-  -m tool_trajectory_avg_score
+agentevals run \
+  --otlp-endpoint http://localhost:6006/v1/traces \
+  --model openai/gpt-4o-mini
 ```
 
-List available evaluators:
+If your collector requires auth, add headers:
 
 ```bash
-uv run agentevals evaluator list
+agentevals run \
+  --otlp-endpoint https://collector.example.com/v1/traces \
+  --otlp-header "Authorization=Bearer <token>" \
+  --model openai/gpt-4o-mini
 ```
 
-## Live UI Quick Start
-
-Start the server with the embedded web UI:
-
-```bash
-agentevals serve
-```
-
-Open `http://localhost:8001` to upload traces and eval sets, select metrics, and view results with interactive span trees.
-
-**From source** (two terminals):
-
-```bash
-uv run agentevals serve --dev       # Terminal 1
-cd ui && npm install && npm run dev # Terminal 2 → http://localhost:5173
-```
+## What happens under the hood
 
-Live-streamed traces appear in the "Local Dev" tab, grouped by session ID.
+agentevals reconstructs each traced agent interaction, sends the relevant context to an evaluator model, and writes back structured scores you can inspect in the UI or export in CI.
 
-## What's Next
+## Next steps
 
-- [Integrations](/docs/integrations/) — Zero-code, SDK, and CLI/CI integration patterns
-- [Custom Evaluators](/docs/custom-evaluators/) — Build your own evaluators
-- [UI Walkthrough](/docs/ui-walkthrough/) — Deep dive into the web UI
+- Learn how traces, models, and outputs are configured in [Advanced](/docs/advanced/)
+- Add your own scoring logic in [Custom Evaluators](/docs/custom-evaluators/)
+- View and compare runs in the [UI Walkthrough](/docs/ui-walkthrough/)
diff --git a/layouts/index.html b/layouts/index.html
index 7d0301f..cd44f0c 100644
--- a/layouts/index.html
+++ b/layouts/index.html
@@ -1,224 +1,82 @@
-{{ define "main" }}
-Ship Agents Reliably
-
-Benchmark your agents before they hit production. AgentEvals scores performance and inference quality from OpenTelemetry traces — no re-runs, no guesswork.
-
-Why AgentEvals?
-Evaluate agent behavior from real traces, not synthetic replays.
-
-🔍 Trace-Based Evaluation
-Parse OTLP streams and Jaeger JSON traces to evaluate agent behavior directly from production or test telemetry data.
-
-No Re-Running Required
-Score agent behavior from existing traces. No need to replay expensive LLM calls or wait for agent re-execution.
-
-🎯 Golden Eval Sets
-Define expected behaviors as golden eval sets and score traces against them using ADK's evaluation framework.
-
-📊 Trajectory Matching
-Compare agent trajectories with strict, unordered, subset, or superset matching modes for flexible evaluation.
-
-🤖 LLM-as-Judge
-Use LLM-powered evaluation for nuanced scoring of agent behavior without requiring reference trajectories.
-
-🛠 CI/CD Integration
-Run evaluations in your pipeline with the CLI. Gate deployments on agent behavior quality scores.
-
-🧩 Custom Evaluators
-Write your own scoring logic in Python, JavaScript, or any language. Share evaluators through the community registry.
-
-How It Works
-Three steps from traces to scores.
-
-1. Collect Traces — Instrument your agent with OpenTelemetry or export Jaeger JSON traces from your observability platform.
-2. Define Eval Sets — Create golden evaluation sets that describe expected agent behaviors, tool calls, and trajectories.
-3. Score & Report — Run evaluations via CLI or Web UI. Get detailed scores and pass/fail results.
-
-Two Ways to Evaluate
-Choose the interface that fits your workflow.
-
-CLI
-Script evaluations and integrate into CI/CD pipelines. Pipe in traces, get scores out. Built for automation.
-
-🖥 Web UI
-Visually inspect traces and interactively evaluate agent behavior. Browse results, compare runs, and drill into details.
-
-Build Your Own Evaluators
-Write custom scoring logic in Python, JavaScript, or any language. Share it with the community through our evaluator registry.
-
-Get Started
-Up and running in seconds.
+{{ .Site.Title }}
+{{ $styles := resources.Get "css/main.css" | minify | fingerprint }}
-
-# Install from release wheel
-pip install agentevals-<version>-py3-none-any.whl
-
-# Run an evaluation against a trace
-agentevals run samples/helm.json \
-  --eval-set samples/eval_set_helm.json \
-  -m tool_trajectory_avg_score
+
+OpenTelemetry-native agent evaluation
+Score agent behavior from traces you already have.
+agentevals evaluates AI agent sessions from OpenTelemetry traces, so you can measure quality, compare runs, and catch regressions without re-running the agent.
-
-# Start the web UI
-agentevals serve
+
+$ pip install agentevals-cli
+$ agentevals run --otlp-endpoint http://localhost:6006/v1/traces --model openai/gpt-4o-mini
+→ scored 128 traces
+→ exported results
-
-Start Evaluating Your Agents
-Open source. Trace-driven. No re-runs needed.
+
+Use existing traces
+Evaluate real agent behavior from OTEL traces captured in staging or production.
+
+No replay required
+Score outcomes and reasoning without re-running your workflows or tools.
+
+Bring your own evaluators
+Start with the built-in flow, then extend it with custom Python evaluators.
-
-{{ end }}