This repository was archived by the owner on Jul 7, 2023. It is now read-only.

Commit 1a6ebf2

Author: Henryk Michalewski
Changes to the reinforcement learning README
1 parent d7dfe79 commit 1a6ebf2

1 file changed: +11 -0 lines changed

tensor2tensor/rl/README.md

Lines changed: 11 additions & 0 deletions
@@ -10,3 +10,14 @@ Currently the only supported algorithm is Proximy Policy Optimization - PPO.
 ## Sample usage - training in Pendulum-v0 environment.
 
 ```python rl/t2t_rl_trainer.py --problems=Pendulum-v0 --hparams_set continuous_action_base [--output_dir dir_location]```
+
+## Sample usage - training in Pendulum-v0 environment.
+
+```python rl/t2t_rl_trainer.py --problems=PongNoFrameskip-v4 --hparams_set discrete_action_base [--output_dir dir_location]```
+
+## Sample usage - generation of a model
+
+```python tensor2tensor/bin/t2t-trainer --generate_data --data_dir=~/t2t_data --problems=gym_pong_trajectories_from_policy --hparams_set=base_atari --model_path [model]```
+
+```python tensor2tensor/bin/t2t-datagen --data_dir=~/t2t_data --tmp_dir=~/t2t_data/tmp --problem=gym_pong_trajectories_from_policy --model_path [model]```
+
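
The bracketed `[--output_dir dir_location]` in the training commands reads as an optional argument. As a minimal sketch under that assumption, the helper below assembles the `rl/t2t_rl_trainer.py` invocation as an argument list and appends `--output_dir` only when a directory is given. The name `build_rl_command` and the path `/tmp/t2t_rl` are hypothetical and not part of tensor2tensor; the code only builds and prints the command and does not run the trainer.

```python
import shlex


def build_rl_command(problem, hparams_set, output_dir=None):
    """Assemble the t2t_rl_trainer.py command line shown in the README.

    Hypothetical helper for illustration; it uses only the flags that
    appear in the README commands.
    """
    args = [
        "python",
        "rl/t2t_rl_trainer.py",
        f"--problems={problem}",
        "--hparams_set",
        hparams_set,
    ]
    if output_dir is not None:
        # Assumption: the square brackets in the README mark this flag as optional.
        args += ["--output_dir", output_dir]
    return args


# Continuous-action example from the README, without an output directory.
pendulum_cmd = build_rl_command("Pendulum-v0", "continuous_action_base")

# Discrete-action example; "/tmp/t2t_rl" is a placeholder path.
pong_cmd = build_rl_command(
    "PongNoFrameskip-v4", "discrete_action_base", output_dir="/tmp/t2t_rl"
)

print(shlex.join(pendulum_cmd))
print(shlex.join(pong_cmd))
```

Keeping the command as a list until it is printed avoids quoting mistakes if the same list is later passed to `subprocess.run`.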
