提交历史

作者 SHA1 备注 提交日期
  Sven Mika 63db0e3a7c [RLlib] Fix SAC learning test flakiness introduced in PR: "Sub-class `Trainer` (instead of `build_trainer()`): All remaining classes; soft-deprecate `build_trainer`." (#20985) 2 年之前
  Sven Mika 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) 3 年之前
  Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 3 年之前