Sven Mika
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 年之前 |
Sven Mika
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 年之前 |
Sven Mika
|
c3a15ecc0f
[RLlib] Issue #13802: Enhance metrics for `multiagent->count_steps_by=agent_steps` setting. (#14033)
|
3 年之前 |
Sven Mika
|
2aec77e305
[RLlib] Fix two test cases that only fail on Travis. (#11435)
|
4 年之前 |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 年之前 |
Eric Liang
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 年之前 |
Eric Liang
|
f5d12a958b
[rllib] Port Ape-X to distributed execution API (#7497)
|
4 年之前 |