提交历史

作者 SHA1 备注 提交日期
  Sven Mika 60b2219d72 [RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757) 2 年之前
  Sven Mika e6aae61487 [RLlib; testing] Fix bug in stress tests not handling >1 trials per experiment (due to grid-search in IMPALA stress tests). (#18705) 3 年之前
  Sven Mika c5d20849ae [RLlib] Rename `rllib rollout` into `rllib evaluate` (backward compatible) to match Trainer API. (#18467) 3 年之前