gjoliver e7f9e8ceec [RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541) 2 年之前
..
halfcheetah-bc.yaml 4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 3 年之前
halfcheetah-cql.yaml cabaa3b3c6 [RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. (#18381) 3 年之前
hopper-bc.yaml 4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 3 年之前
hopper-cql.yaml 4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 3 年之前
pendulum-cql.yaml e7f9e8ceec [RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541) 2 年之前