Commit History

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| Sven Mika | d5bfb7b7da | [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 years ago |
| Sven Mika | c3e3fc7637 | [RLlib] Issue 18280: A3C/IMPALA multi-agent not working. (#19100) | 3 years ago |
| Sven Mika | ed85f59194 | [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) | 3 years ago |
| Sven Mika | eb0038612f | [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) | 3 years ago |
| Sven Mika | d001af3e59 | [RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718) | 3 years ago |
| Barak Michener | 8e76796fd0 | ci: Redo `format.sh --all` script & backfill lint fixes (#9956) | 4 years ago |
| Sven Mika | fcdf410ae1 | [RLlib] Tf2.x native. (#8752) | 4 years ago |
| Sven Mika | 4da0e542d5 | [RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204) | 4 years ago |
| Sven Mika | 5b2a97597b | [RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211) | 4 years ago |
| Sven Mika | 0d37103f84 | [RLlib] Prototype: Model Trajectory View API, part 0 (#9171) | 4 years ago |
| Sven Mika | 97d524c075 | [RLlib] Issue 8769 broken OOM tests_dir cases (R & S). (#8770) | 4 years ago |
| Sven Mika | 2746fc0476 | [RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520) | 4 years ago |