An OpenEnv benchmark testing the ability of AI agents to act as Site Reliability Engineers (SREs) by diagnosing and filtering raw production failure logs.
-
Updated
Apr 8, 2026 - Python
An OpenEnv benchmark testing the ability of AI agents to act as Site Reliability Engineers (SREs) by diagnosing and filtering raw production failure logs.
An RL environment where an LLM agent learns to curate talking-head video clips for AV LoRA training. No labels exposed, rewards only.
A production-grade OpenEnv environment for benchmarking RL agents on real-world data cleaning and schema engineering tasks.
OpenEnv-compliant RL environment for SQL query debugging. Built for META x PyTorch x SST OpenEnv Hackathon.
A real-world RL environment where AI agents learn to maintain and update test suites when code changes. Includes tasks for unit testing, bug detection, and regression auditing with structured reward signals.
A long horizon incident response environment.
An OpenEnv environment where AI agents triage satellite intelligence reports, classify threats, and make real-time defense decisions.
An OpenEnv-compliant reinforcement learning environment for personalized AI tutoring — simulating real-world EdTech dynamics with psychometric student modeling and multi-objective pedagogical optimization.
Deterministic reinforcement learning environment for simulating open-source issue triage workflows
a reinforcement learning agent built with OpenEnv and Stable-Baselines3 that learns to intelligently manage email workflows. The agent handles tasks ranging from spam filtering to drafting meeting invitations and resolving ambiguous client requests.
An OpenEnv based RL environment that allows agents to learn to clean datasets across 3 levels of difficulties.
Reinforcement Learning system for smart irrigation of Punjab rice farms. Built for the OpenEnv Hackathon.
Data Cleaning Agent for Cleaning Unorganised Dataset
Government Scheme Eligibility Matching - OpenEnv Environment
Fault-injecting OpenEnv training environment for vibe-coded SaaS incidents. 30 scenarios grounded in 2025-26 production failures. Drop-in OpenClaw-RL pool server. Claude Code skill included.
A production-oriented OpenEnv-style environment for evaluating tool-using agents on customer support ticket triage.
OpenEnv code review environment for AI agents.
NeoVentEnv: An OpenEnv neonatal ventilator management simulator for training and evaluating RL/LLM agents on realistic NICU tasks.
Add a description, image, and links to the openenv-environment topic page so that developers can more easily learn about it.
To associate your repository with the openenv-environment topic, visit your repo's landing page and select "manage topics."