Sven Mika
|
60b2219d72
[RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757)
|
2 年之前 |
Sven Mika
|
e6aae61487
[RLlib; testing] Fix bug in stress tests not handling >1 trials per experiment (due to grid-search in IMPALA stress tests). (#18705)
|
3 年之前 |
Sven Mika
|
c5d20849ae
[RLlib] Rename `rllib rollout` into `rllib evaluate` (backward compatible) to match Trainer API. (#18467)
|
3 年之前 |