Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) | 3 年之前 | |
---|---|---|
.. | ||
enormous.zip | 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) | 3 年之前 |
large.json | e2be41b407 [RLlib] MARWIL + BC: Various fixes and enhancements. (#16218) | 3 年之前 |
small.json | c4a3e1589b [RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761) | 3 年之前 |