Sven Mika
|
cf21c634a3
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
3 years ago |
Sven Mika
|
b4300dd532
[RLlib] Issue 18812: Torch multi-GPU stats not protected against race conditions. (#18937)
|
3 years ago |
Sven Mika
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 years ago |
Sven Mika
|
7f2b3c0824
[RLlib] Issue 17667: CQL-torch + GPU not working (due to simple_optimizer=False; must use simple optimizer!). (#17742)
|
3 years ago |
Sven Mika
|
811d71b368
[RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. (#17657)
|
3 years ago |
Sven Mika
|
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
3 years ago |
Sven Mika
|
2900a06dd7
[RLlib] Issue 14503: SAC not allowing custom action distributions. (#16427)
|
3 years ago |
Sven Mika
|
839fc59224
[RLlib] CQL TensorFlow support (#15841)
|
3 years ago |
Sven Mika
|
469f5227da
[RLlib] CQL bug fix: Normalize actions for atanh in BC part of the CQL loss. (#15814)
|
3 years ago |
Sven Mika
|
c4a3e1589b
[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761)
|
3 years ago |
Michael Luo
|
4cbe13cdfd
[RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)
|
3 years ago |
Michael Luo
|
ec2c10309b
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243)
|
3 years ago |
Sven Mika
|
2e3655e8a9
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
3 years ago |
Michael Luo
|
42cd414e5b
[RLlib] New Offline RL Algorithm: CQL (based on SAC) (#13118)
|
3 years ago |