Sven Mika f3397b6f48 [RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408) 3 年之前
..
a3c 599e589481 [RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065) 3 年之前
ars 2589309cf0 [RLlib] Make sure torch and tf behave the same wrt conv2d nets. (#8785) 4 年之前
cql 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
ddpg 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
dqn 08c09737fa [RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550) 3 年之前
dreamer 4e9888ce2f [RLlib] Dreamer (#10172) 4 年之前
es b84575c092 [RLlib] 2 RLlib Flaky Tests (#14930) 3 年之前
impala 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
maml 59bc1e6c09 [RLLib] MAML extension for all models except RNNs (#11337) 3 年之前
marwil 4b278c36fc [RLlib] Behavioral Cloning (from MARWIL). (#10619) 4 年之前
mbmpo 6e6c680f14 MBMPO Cartpole (#11832) 4 年之前
pg 599e589481 [RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065) 3 年之前
ppo f3397b6f48 [RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408) 3 年之前
sac 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
cleanup_experiment.py baa053496a [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) 4 年之前
compact-regression-test.yaml 93c0a5549b [RLlib] Deprecate `vf_share_layers` in top-level PPO/MAML/MB-MPO configs. (#13397) 3 年之前
create_plots.py baa053496a [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) 4 年之前