Skip to content
This repository was archived by the owner on Jul 7, 2023. It is now read-only.

Commit b2dc7c2

Browse files
author
Henryk Michalewski
committed
More changes to README
1 parent 1a6ebf2 commit b2dc7c2

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tensor2tensor/rl/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ Currently the only supported algorithm is Proximy Policy Optimization - PPO.
1313

1414
## Sample usage - training in Pendulum-v0 environment.
1515

16-
```python rl/t2t_rl_trainer.py --problems=PongNoFrameskip-v4 --hparams_set discrete_action_base [--output_dir dir_location]```
16+
```python tensor2tensor/rl/t2t_rl_trainer.py --problem PongNoFrameskip-v4 --hparams_set atari_base --hparams num_agents=3 --output_dir /tmp/pong`date +%Y%m%d_%H%M%S```
1717

1818
## Sample usage - generation of a model
1919

0 commit comments

Comments
 (0)