a3c | 599e589481 | [RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065) | 3 years ago
ars | 2589309cf0 | [RLlib] Make sure torch and tf behave the same wrt conv2d nets. (#8785) | 4 years ago
cql | e7f9e8ceec | [RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541) | 2 years ago
ddpg | 60b2219d72 | [RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757) | 2 years ago
dqn | d5bfb7b7da | [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 years ago
dreamer | 4e9888ce2f | [RLlib] Dreamer (#10172) | 4 years ago
es | b84575c092 | [RLlib] 2 RLlib Flaky Tests (#14930) | 3 years ago
impala | 026bf01071 | [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) | 3 years ago
maml | 59bc1e6c09 | [RLLib] MAML extension for all models except RNNs (#11337) | 3 years ago
marwil | 4b278c36fc | [RLlib] Behavioral Cloning (from MARWIL). (#10619) | 4 years ago
mbmpo | 6e6c680f14 | MBMPO Cartpole (#11832) | 4 years ago
pg | 853d10871c | [RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376) | 2 years ago
ppo | f3397b6f48 | [RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408) | 3 years ago
qmix | abd3bef63b | [RLlib] QMIX better defaults + added to CI learning tests (#21332) | 2 years ago
sac | 63db0e3a7c | [RLlib] Fix SAC learning test flakiness introduced in PR: "Sub-class `Trainer` (instead of `build_trainer()`): All remaining classes; soft-deprecate `build_trainer`." (#20985) | 2 years ago
cleanup_experiment.py | baa053496a | [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) | 4 years ago
compact-regression-test.yaml | 93c0a5549b | [RLlib] Deprecate `vf_share_layers` in top-level PPO/MAML/MB-MPO configs. (#13397) | 3 years ago
create_plots.py | baa053496a | [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) | 4 years ago