13point5/rlx

RLX: Hosted RL Training

RLX is a hosted RL training platform built on Verifiers and Prime RL.

It lets you import GitHub repositories, choose an rlx.toml config, provision Prime Intellect GPU pods, launch Prime RL jobs, and inspect the resulting pipeline through logs, Weights & Biases, job output, and run metadata in one place.


What RLX does

  • Imports GitHub repositories as RL projects.
  • Lets you select a branch and named rlx.toml config entry for a run.
  • Provisions GPU compute on Prime Intellect.
  • Boots and prepares the runtime through ordered Celery jobs executed over SSH.
  • Surfaces per-run logs for orchestrator, trainer, and inference processes.
  • Tracks job execution, command output, retries, and run termination from the UI.
  • Stores user SSH keys and integration credentials needed for pod access and monitoring.
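Each run points at a named entry in the repository's rlx.toml. The exact schema isn't documented in this README, but a hypothetical entry might look like the following (all field names are illustrative, not the actual format):

```toml
# Hypothetical rlx.toml entry — field names are illustrative, not the real schema
[configs.math-rl]
config_path = "configs/math_rl.toml"  # Prime RL config resolved inside the repo
gpu = "H100_80GB"                     # GPU shape requested from Prime Intellect
```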

Product Tour


Projects
Browse imported repositories and jump into active work quickly.


New project
Import from GitHub search results or start from a repository URL.


Runs
Scan configs, branches, GPU selections, status, and creation time at a glance.


Jobs
Inspect the setup and execution pipeline with grouped status sections and drill into command output.


Logs
Watch live run output while keeping critical run metadata and access details nearby.


Settings
Manage GitHub connectivity and SSH keys required for provisioning and pod access.


Weights & Biases
Open experiment dashboards for training curves, reward signals, and run-level monitoring.

How it works

At a high level, RLX sits between the browser UI, the application backend, and the systems needed to launch and monitor RL workloads.

flowchart LR
  Web["Next.js web app"] --> Actions["Server actions"]
  Actions --> API["FastAPI API"]
  API --> Postgres["PostgreSQL"]
  API --> Redis["Redis"]
  Redis --> Worker["Celery worker"]
  Redis --> Beat["Celery beat"]
  API --> GitHub["GitHub API"]
  API --> Prime["Prime Intellect API"]
  API --> Secrets["AWS Secrets Manager"]
  Worker --> SSH["SSH into GPU pod"]
  SSH --> PrimeRL["Prime RL process"]

Typical run flow:

  1. Sign in and connect GitHub.
  2. Import a repository as a project.
  3. Pick a branch, config name, and GPU shape.
  4. Create a run and provision a Prime Intellect pod.
  5. Wait for the pod to become active.
  6. Execute the queued setup jobs over SSH.
  7. Launch Prime RL with the resolved config path.
  8. Monitor logs, jobs, and run state from the run details page.

Stack

  • Frontend: Next.js 16, React 19, TypeScript, Tailwind CSS 4, shadcn/ui, Clerk
  • Backend: FastAPI, SQLAlchemy, Alembic, asyncpg, Pydantic
  • Async execution: Redis, Celery worker, Celery beat
  • Infra integrations: Prime Intellect, GitHub, AWS Secrets Manager

Repo layout

rlx/
├── apps/
│   ├── api/        # FastAPI backend
│   └── web/        # Next.js frontend
├── docs/           # Architecture and implementation docs
├── scripts/        # Helper scripts
├── dev.sh          # tmux-based local dev launcher
└── docker-compose.yml

Local development

Prerequisites

  • Node.js 20+
  • pnpm
  • Python 3.13+
  • uv
  • Docker
  • Redis
  • A PostgreSQL database reachable through DATABASE_URL
  • tmux if you want to use ./dev.sh

Environment

Create these local env files before starting the stack:

apps/web/.env.local

NEXT_PUBLIC_CLERK_PUBLISHABLE_KEY=pk_...
CLERK_SECRET_KEY=sk_...
API_BASE_URL=http://localhost:8000

apps/api/.env

DATABASE_URL=postgresql+asyncpg://...
CLERK_SECRET_KEY=sk_...
GITHUB_CLIENT_ID=...
GITHUB_CLIENT_SECRET=...

Quick start

If you already have your env files and external database ready, the fastest local path is:

./dev.sh

This opens a tmux session with:

  • Next.js web app
  • FastAPI API
  • Redis
  • Celery worker
  • Celery beat

Docker Compose

You can also run the app services with Docker:

docker compose up --build

This starts:

  • web
  • api
  • redis
  • worker
  • scheduler

You still need a valid DATABASE_URL configured for the API container.

Manual startup

Frontend:

cd apps/web
pnpm install
pnpm dev

Backend:

cd apps/api
uv sync
uv run uvicorn rlx_api.main:app --reload --port 8000

Redis:

docker run -d --name redis -p 6379:6379 redis:7-alpine

Worker:

cd apps/api
uv run celery -A rlx_api.celery_app:celery_app worker --loglevel=info -Q pod_ops,repo_ops
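The `-Q pod_ops,repo_ops` flag above subscribes the worker to two queues. In Celery, tasks are pinned to queues via the `task_routes` setting; a sketch with hypothetical task paths (not RLX's actual task names):

```python
# Hypothetical Celery routing — task module paths are illustrative only.
# `task_routes` maps task names (or glob patterns) to destination queues.
task_routes = {
    "rlx_api.tasks.pods.*": {"queue": "pod_ops"},    # provisioning, SSH setup jobs
    "rlx_api.tasks.repos.*": {"queue": "repo_ops"},  # GitHub import / sync jobs
}
```

Separating the queues means slow pod operations can't starve repository imports, and each queue can be scaled with dedicated workers if needed.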

Scheduler:

cd apps/api
uv run celery -A rlx_api.celery_app:celery_app beat --loglevel=info

Useful commands

Frontend:

cd apps/web
pnpm build
pnpm lint

Backend:

cd apps/api
uv run alembic upgrade head
uv run alembic current

Additional docs

See docs/ for architecture and implementation notes.
