Sven Mika
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 years ago |
Sven Mika
|
c3e3fc7637
[RLlib] Issue 18280: A3C/IMPALA multi-agent not working. (#19100)
|
3 years ago |
Sven Mika
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 years ago |
Sven Mika
|
eb0038612f
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
3 years ago |
Sven Mika
|
d001af3e59
[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718)
|
3 years ago |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 years ago |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 years ago |
Sven Mika
|
4da0e542d5
[RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204)
|
4 years ago |
Sven Mika
|
5b2a97597b
[RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211)
|
4 years ago |
Sven Mika
|
0d37103f84
[RLlib] Prototype: Model Trajectory View API, part 0 (#9171)
|
4 years ago |
Sven Mika
|
97d524c075
[RLlib] Issue 8769 broken OOM tests_dir cases (R & S). (#8770)
|
4 years ago |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 years ago |