Author | SHA1 Message | Date |
---|---|---|
Sven Mika | 63db0e3a7c [RLlib] Fix SAC learning test flakiness introduced in PR: "Sub-class `Trainer` (instead of `build_trainer()`): All remaining classes; soft-deprecate `build_trainer`." (#20985) | 2 years ago |
Sven Mika | 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) | 3 years ago |
Sven Mika | 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2\|3C action normalization fixes (#16531) | 3 years ago |