Sven Mika
|
db058d0fb3
[RLlib] Rename `metrics_smoothing_episodes` into `metrics_num_episodes_for_smoothing` for clarity. (#20983)
|
2 years ago |
Sven Mika
|
60b2219d72
[RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757)
|
2 years ago |
Avnish Narayan
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 years ago |
Julius Frost
|
a88b217d3f
[rllib] Enhancements to Input API for customizing offline datasets (#16957)
|
3 years ago |