Avnish Narayan
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 年之前 |
Sven Mika
|
599e589481
[RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065)
|
3 年之前 |
Sven Mika
|
90b21ce27e
[RLlib] De-flake 3 test cases; Fix `config.simple_optimizer` and `SampleBatch.is_training` warnings. (#17321)
|
3 年之前 |
Sven Mika
|
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
3 年之前 |
Sven Mika
|
8a891b3c30
[RLlib] SAC n_step > 1. (#10567)
|
4 年之前 |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 年之前 |
Sven Mika
|
baa053496a
[RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414)
|
4 年之前 |