Sven Mika
|
3d2e27485b
[RLlib] Trainer sub-class DQN/SimpleQ/APEX-DQN/R2D2 (instead of using `build_trainer`). (#20633)
|
2 年之前 |
Sven Mika
|
2d24ef0d32
[RLlib] Add all simple learning tests as `framework=tf2`. (#19273)
|
3 年之前 |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
Sven Mika
|
9c9b482661
[RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939)
|
3 年之前 |
Sven Mika
|
08c09737fa
[RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550)
|
3 年之前 |
Sven Mika
|
494ddd98c1
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928)
|
3 年之前 |
Sven Mika
|
a428f10ebe
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
3 年之前 |
Sven Mika
|
839fc59224
[RLlib] CQL TensorFlow support (#15841)
|
3 年之前 |
Sven Mika
|
732197e23a
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
3 年之前 |
Sven Mika
|
8000258333
[RLlib] R2D2 Implementation. (#13933)
|
3 年之前 |