Skip to content
This repository was archived by the owner on Jul 7, 2023. It is now read-only.

Commit 62797a0

Browse files
author
Henryk Michalewski
committed
More tiny changes in README
1 parent b2dc7c2 commit 62797a0

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

tensor2tensor/rl/README.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,13 +7,13 @@ for now and under heavy development.
77

88
Currently the only supported algorithm is Proximy Policy Optimization - PPO.
99

10-
## Sample usage - training in Pendulum-v0 environment.
10+
## Sample usage - training in the Pendulum-v0 environment.
1111

1212
```python rl/t2t_rl_trainer.py --problems=Pendulum-v0 --hparams_set continuous_action_base [--output_dir dir_location]```
1313

14-
## Sample usage - training in Pendulum-v0 environment.
14+
## Sample usage - training in the PongNoFrameskip-v0 environment.
1515

16-
```python tensor2tensor/rl/t2t_rl_trainer.py --problem PongNoFrameskip-v4 --hparams_set atari_base --hparams num_agents=3 --output_dir /tmp/pong`date +%Y%m%d_%H%M%S```
16+
```python tensor2tensor/rl/t2t_rl_trainer.py --problem stacked_pong --hparams_set atari_base --hparams num_agents=5 --output_dir /tmp/pong`date +%Y%m%d_%H%M%S````
1717

1818
## Sample usage - generation of a model
1919

0 commit comments

Comments
 (0)