Evaliphy (Beta) - Simplify End-to-End AI Testing | Open Source | No Lock-In

Evaliphy simplifies end-to-end AI testing by treating your AI system like a black box.

Instead of research-focused frameworks or fine-tuning pipelines, Evaliphy lets you:

Test your real AI API (black-box, no internals)
Write assertions in TypeScript
Run in CI like your other tests
Get human-readable reports with actionable reasoning

No Python. No ML overhead. No vendor lock-in.

Works with any black-box AI system — RAG, agents, chatbots, content generation, summarization, and more.

Getting Started

First, run the development server:

npm run dev
# or
yarn dev
# or
pnpm dev
# or
bun dev

Why Evaliphy Exists

Most AI evaluation frameworks are built for research. They're amazing if you're fine-tuning models or benchmarking datasets. But they're not built for developers shipping products.

The Developer Problem

You have:

✅ A RAG system in production (or agents, chatbots, summarizers, etc.)
✅ A CI/CD pipeline
✅ TypeScript/JavaScript code
❌ No way to test your AI system like you test your APIs

The Evaliphy Approach

We took a different path. Evaliphy lets you:

Test via HTTP (your real API)
Write in TypeScript (your language)
Run in CI (your pipeline)
Use simple assertions (your mental model)

Quick Comparison

Feature	Evaliphy	DeepEval / Ragas	Promptfoo
Best For	Developers testing in CI	Researchers / Fine-tuning	LLM Prompt Testing
Language	TypeScript / Node.js	Python	JavaScript / Python
Testing Approach	Black-box API calls	White-box Pipeline	Prompt-focused
Workflow	CI/CD Pipelines	Notebooks / Scripts	CLI / Web UI
Setup	`npm install -g evaliphy`	`pip install deepeval`	`npx promptfoo`
Use Case	"Does my RAG/AI API work?"	"How good is my model?"	"Is my prompt right?"

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
app		app
content		content
lib		lib
public		public
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
README.md		README.md
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaliphy (Beta) - Simplify End-to-End AI Testing | Open Source | No Lock-In

Getting Started

Why Evaliphy Exists

The Developer Problem

The Evaliphy Approach

Quick Comparison

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Evaliphy (Beta) - Simplify End-to-End AI Testing | Open Source | No Lock-In

Getting Started

Why Evaliphy Exists

The Developer Problem

The Evaliphy Approach

Quick Comparison

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages