@@ -11,7 +11,7 @@ and this project adheres to
1111
1212#### com.unity.ml-agents / com.unity.ml-agents.extensions (C#)
1313- Added a new feature to replicate training areas dynamically during runtime. (#5568 )
14- - Update Barracuda to 2.3.1-preview
14+ - Update Barracuda to 2.3.1-preview ( # 5591 )
1515
1616#### ml-agents / ml-agents-envs / gym-unity (Python)
1717
@@ -20,36 +20,34 @@ and this project adheres to
2020#### com.unity.ml-agents / com.unity.ml-agents.extensions (C#)
2121- Added the capacity to initialize behaviors from any checkpoint and not just the latest one (#5525 )
2222- Added the ability to get a read-only view of the stacked observations (#5523 )
23+
2324#### ml-agents / ml-agents-envs / gym-unity (Python)
24- - Set gym version in gym-unity to gym release 0.20.0
25+ - Set gym version in gym-unity to gym release 0.20.0 ( # 5540 )
2526- Added support for having ` beta ` , ` epsilon ` , and ` learning rate ` on separate schedules (affects only PPO and POCA). (#5538 )
26-
2727- Changed default behavior to restart crashed Unity environments rather than exiting. (#5553 )
2828 - Rate & lifetime limits on this are configurable via 3 new yaml options
2929 1 . env_params.max_lifetime_restarts (--max-lifetime-restarts) [ default=10]
3030 2 . env_params.restarts_rate_limit_n (--restarts-rate-limit-n) [ default=1]
3131 3 . env_params.restarts_rate_limit_period_s (--restarts-rate-limit-period-s) [ default=60]
32-
33-
3432- Deterministic action selection is now supported during training and inference(#5619 )
3533 - Added a new ` --deterministic ` cli flag to deterministically select the most probable actions in policy. The same thing can
3634 be achieved by adding ` deterministic: true ` under ` network_settings ` of the run options configuration.(#5597 )
3735 - Extra tensors are now serialized to support deterministic action selection in onnx. (#5593 )
3836 - Support inference with deterministic action selection in editor (#5599 )
39- ### Bug Fixes
40- - Fixed a bug where the critics were not being normalized during training. (#5595 )
41- - Fixed the bug where curriculum learning would crash because of the incorrect run_options parsing. (#5586 )
37+ - Added minimal analytics collection to LL-API (#5511 )
4238
39+ ### Bug Fixes
4340#### com.unity.ml-agents / com.unity.ml-agents.extensions (C#)
4441
4542#### ml-agents / ml-agents-envs / gym-unity (Python)
43+ - Fixed a bug where the critics were not being normalized during training. (#5595 )
44+ - Fixed the bug where curriculum learning would crash because of the incorrect run_options parsing. (#5586 )
4645- Fixed a bug in multi-agent cooperative training where agents might not receive all of the states of
4746terminated teammates. (#5441 )
4847- Fixed wrong attribute name in argparser for torch device option (#5433 )(#5467 )
4948- Fixed conflicting CLI and yaml options regarding resume & initialize_from (#5495 )
5049- Fixed failing tests for gym-unity due to gym 0.20.0 release (#5540 )
5150- Fixed a bug in VAIL where the variational bottleneck was not properly passing gradients (#5546 )
52- - Added minimal analytics collection to LL-API (#5511 )
5351
5452## [ 2.1.0-exp.1] - 2021-06-09
5553### Minor Changes
0 commit comments