Commit History

Author SHA1 Message Date
  Sven Mika cf21c634a3 [RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982) 3 years ago
  Sven Mika b4300dd532 [RLlib] Issue 18812: Torch multi-GPU stats not protected against race conditions. (#18937) 3 years ago
  Sven Mika ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 years ago
  Sven Mika 7f2b3c0824 [RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). (#17742) 3 years ago
  Sven Mika 811d71b368 [RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. (#17657) 3 years ago
  Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 3 years ago
  Sven Mika 2900a06dd7 [RLlib] Issue 14503: SAC not allowing custom action distributions. (#16427) 3 years ago
  Sven Mika 839fc59224 [RLlib] CQL TensorFlow support (#15841) 3 years ago
  Sven Mika 469f5227da [RLlib] CQL bug fix: Normalize actions for atanh in BC part of the CQL loss. (#15814) 3 years ago
  Sven Mika c4a3e1589b [RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761) 3 years ago
  Michael Luo 4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 3 years ago
  Michael Luo ec2c10309b [RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243) 3 years ago
  Sven Mika 2e3655e8a9 [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) 3 years ago
  Michael Luo 42cd414e5b [RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118) 3 years ago