Sven Mika
|
cabaa3b3c6
[RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. (#18381)
|
3 years ago |
Michael Luo
|
4cbe13cdfd
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
|
3 years ago |
Michael Luo
|
ec2c10309b
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
3 years ago |
Michael Luo
|
587f207c2f
[RLlib] Support for D4RL + Semi-working CQL Benchmark (#13550)
|
3 years ago |
Michael Luo
|
42cd414e5b
[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118)
|
3 years ago |