Skip to content

Commit f3c4564

Browse files
committed
Update README.md
1 parent ad78967 commit f3c4564

File tree

1 file changed

+4
-2
lines changed

1 file changed

+4
-2
lines changed

README.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,8 @@ An e2e framework for creating, deploying and using isolated execution environmen
55
[![PyPI](https://img.shields.io/pypi/v/openenv-core?color=blue)](https://pypi.org/project/openenv-core/)
66
[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/meta-pytorch/OpenEnv/blob/main/examples/OpenEnv_Tutorial.ipynb) **← Try the Interactive Tutorial!**
77

8+
Train LLMs to play BlackJack using torchforge (PyTorch's native RL framework) + OpenEnv: [`examples/grpo_blackjack/`](examples/grpo_blackjack/)
9+
810
## Overview
911

1012
OpenEnv provides a standard for interacting with agentic execution environments via simple Gymnasium style APIs - step(), reset(), state(). Users of agentic execution environments can interact with the environment during RL training loops using these simple APIs.
@@ -171,10 +173,10 @@ client.close() # Stops and removes container
171173
- smolagents (for coding environment)
172174

173175
## Supported RL Tools
174-
The goal of this project is to support a broad set of open and closed tools to help standardize the agentic RL community. If you have a project that supports OpenEnv environments, please put up a PR to add your tool name along with a link to your documentation.
176+
The goal of this project is to support a broad set of open and closed tools to help standardize the agentic RL community. If you have a project that supports OpenEnv environments, please put up a PR to add your tool name along with a link to your documentation.
175177

176178
### torchforge
177-
(coming soon)
179+
See GRPO BlackJack training example: [`examples/grpo_blackjack/`](examples/grpo_blackjack/)
178180

179181
### TRL
180182
(coming soon}

0 commit comments

Comments
 (0)