Environments¶
The OpenEnv community has built a catalog of ready-to-run environments that cover deterministic smoke tests, full developer workflows, and multi-step reasoning challenges. Explore the surface area below and jump directly into the guides for each environment.
Community Environments¶
Minimal observation/action loop for verifying client integrations, CI pipelines, and onboarding flows in seconds.
Secure sandbox with filesystem access and evaluation hooks for executing generated code and building autonomous dev workflows.
Message-driven loop tailored for conversational agents that need structured turns, safety rails, and message attribution.
Classic Arcade Learning Environment tasks packaged for fast benchmarking of reinforcement-learning style agents.
Multi-agent, game-theory workloads powered by DeepMind’s OpenSpiel suite, ideal for search and self-play experiments.
Traffic control scenarios with SUMO simulators for agents that reason about continuous control and scheduling.
Financial market simulations with portfolio APIs, perfect for RLHF strategies and algorithmic trading experiments.
Financial question-answering on SEC 10-K filings with SQL tools, fuzzy numerical matching, and binary reward for RL training.
Multi-task text arena for language-game competitions such as Wordle, reasoning puzzles, and program synthesis.
Teaches agents to navigate repositories, inspect diffs, and land changes via Git-native operations.
Safety-critical diagnostics from the DIPG benchmark, highlighting guardrails, adversarial prompts, and risk scoring.
Classic snake game environment for RL research with configurable grids, partial observability, and customizable rewards.
Chess RL environment powered by the moonfish engine with configurable opponents, PSQT evaluation, and full rules support.
A grid maze RL environment where the agent must navigate from a start cell to an exit while avoiding walls.
A wrapper for Unity environments to bring different nice graphical simulation environment from Unity that supports Ml-Agents-envs.
A web application simulation environment for OpenEnv that wraps the OpenApps framework and BrowserGym.
Web search environment for RL research with configurable grids, partial observability, and customizable rewards.
Browser automation environment for web agents with DOM interaction, navigation, and multi-step task completion.
Autonomous wildfire containment simulation with grid-based fire spread, weather dynamics, and resource-constrained control strategies for RL agents.
Python REPL environment for training language models on code execution tasks. Based on the Recursive Language Models (RLM) paradigm
Julia code execution environment for RL training with test result tracking, reward calculation, and process pooling for performance.
GPU kernel optimization environment for training LLMs to write fast CUDA/Triton kernels with real hardware feedback.
A suite of 400 environments that procedurally generate reasoning problems for LM training with configurable difficulty.
Procedural dataset generators providing 100+ reasoning tasks across algebra, arithmetic, geometry, graph theory, logic, and games with adjustable complexity and algorithmic verification.
This environment exposes a Calendar Gym tools through the OpenEnv reset/step/state interface. The server runs a FastAPI app that serves the OpenEnv endpoints.
Want to publish your own environment? Head over to the Build Your Own Environment guide for a step-by-step walkthrough.