提交历史

作者 SHA1 备注 提交日期
  Avnish Narayan 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
  Sven Mika 599e589481 [RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065) 3 年之前
  Sven Mika 90b21ce27e [RLlib] De-flake 3 test cases; Fix `config.simple_optimizer` and `SampleBatch.is_training` warnings. (#17321) 3 年之前
  Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 3 年之前
  Sven Mika 8a891b3c30 [RLlib] SAC n_step > 1. (#10567) 4 年之前
  Sven Mika 2746fc0476 [RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520) 4 年之前
  Sven Mika baa053496a [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) 4 年之前