提交历史

作者 SHA1 备注 提交日期
  Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 年之前
  Sven Mika ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 年之前
  Sven Mika c3a15ecc0f [RLlib] Issue #13802: Enhance metrics for `multiagent->count_steps_by=agent_steps` setting. (#14033) 3 年之前
  Sven Mika 2aec77e305 [RLlib] Fix two test cases that only fail on Travis. (#11435) 4 年之前
  Sven Mika 2746fc0476 [RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520) 4 年之前
  Eric Liang 9a83908c46 [rllib] Deprecate policy optimizers (#8345) 4 年之前
  Eric Liang f5d12a958b [rllib] Port Ape-X to distributed execution API (#7497) 4 年之前