Sven Mika | b10d5533be | [RLlib] Issue 20920 (partial solution): contrib/MADDPG + pettingzoo coop-pong-v4 not working. (#21452) | 2 years ago
Sven Mika | 3d2e27485b | [RLlib] Trainer sub-class DQN/SimpleQ/APEX-DQN/R2D2 (instead of using `build_trainer`). (#20633) | 2 years ago
Sven Mika | 73f5c4039b | [RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035) | 3 years ago
Sven Mika | 9c9b482661 | [RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939) | 3 years ago
Sven Mika | 4888d7c9af | [RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999) | 3 years ago
Sven Mika | 8000258333 | [RLlib] R2D2 Implementation. (#13933) | 3 years ago