Commit History

Author SHA1 Message Date
  Sven Mika 63db0e3a7c [RLlib] Fix SAC learning test flakiness introduced in PR: "Sub-class `Trainer` (instead of `build_trainer()`): All remaining classes; soft-deprecate `build_trainer`." (#20985) 2 years ago
  Sven Mika 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) 3 years ago
  Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 3 years ago