Skip to content

Environments

The OpenEnv community has built a catalog of ready-to-run environments that cover deterministic smoke tests, full developer workflows, and multi-step reasoning challenges. Explore the surface area below and jump directly into the guides for each environment.

Community Environments

Echo

Minimal observation/action loop for verifying client integrations, CI pipelines, and onboarding flows in seconds.

Coding

Secure sandbox with filesystem access and evaluation hooks for executing generated code and building autonomous dev workflows.

Chat

Message-driven loop tailored for conversational agents that need structured turns, safety rails, and message attribution.

Atari

Classic Arcade Learning Environment tasks packaged for fast benchmarking of reinforcement-learning style agents.

OpenSpiel

Multi-agent, game-theory workloads powered by DeepMind’s OpenSpiel suite, ideal for search and self-play experiments.

SUMO-RL

Traffic control scenarios with SUMO simulators for agents that reason about continuous control and scheduling.

FinRL

Financial market simulations with portfolio APIs, perfect for RLHF strategies and algorithmic trading experiments.

FinQA

Financial question-answering on SEC 10-K filings with SQL tools, fuzzy numerical matching, and binary reward for RL training.

TextArena

Multi-task text arena for language-game competitions such as Wordle, reasoning puzzles, and program synthesis.

Git

Teaches agents to navigate repositories, inspect diffs, and land changes via Git-native operations.

DIPG Safety

Safety-critical diagnostics from the DIPG benchmark, highlighting guardrails, adversarial prompts, and risk scoring.

Snake

Classic snake game environment for RL research with configurable grids, partial observability, and customizable rewards.

Chess

Chess RL environment powered by the moonfish engine with configurable opponents, PSQT evaluation, and full rules support.

Maze

A grid maze RL environment where the agent must navigate from a start cell to an exit while avoiding walls.

Unity

A wrapper for Unity environments to bring different nice graphical simulation environment from Unity that supports Ml-Agents-envs.

OpenApp

A web application simulation environment for OpenEnv that wraps the OpenApps framework and BrowserGym.

Web Search

Web search environment for RL research with configurable grids, partial observability, and customizable rewards.

BrowserGym

Browser automation environment for web agents with DOM interaction, navigation, and multi-step task completion.

Wildfire

Autonomous wildfire containment simulation with grid-based fire spread, weather dynamics, and resource-constrained control strategies for RL agents.

REPL

Python REPL environment for training language models on code execution tasks. Based on the Recursive Language Models (RLM) paradigm

Terminal-Bench 2

OpenEnv wrapper for Terminal-Bench 2 tasks

Julia

Julia code execution environment for RL training with test result tracking, reward calculation, and process pooling for performance.

kernrl

GPU kernel optimization environment for training LLMs to write fast CUDA/Triton kernels with real hardware feedback.

Grid World

A simple 5x5 grid world environment for testing and learning the OpenEnv framework.

RLVE Gym

A suite of 400 environments that procedurally generate reasoning problems for LM training with configurable difficulty.

Reasoning Gym

Procedural dataset generators providing 100+ reasoning tasks across algebra, arithmetic, geometry, graph theory, logic, and games with adjustable complexity and algorithmic verification.

Calendar Gym

This environment exposes a Calendar Gym tools through the OpenEnv reset/step/state interface. The server runs a FastAPI app that serves the OpenEnv endpoints.

dm_control

MuJoCo-based continuous control tasks from DeepMind's dm_control suite — cartpole, hopper, quadruped, walker, and more.

Want to publish your own environment? Head over to the Build Your Own Environment guide for a step-by-step walkthrough.