Sven Mika
|
fd13bac9b3
[RLlib] Add `worker` arg (optional) to `policy_mapping_fn`. (#18184)
|
3 年之前 |
Sven Mika
|
1520c3d147
[RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to `Trainer.add_policy()` (#18428)
|
3 年之前 |
Sven Mika
|
45f60e51a9
[RLlib] DDPPO fixes and benchmarks. (#18390)
|
3 年之前 |
Sven Mika
|
f18213712f
[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895)
|
3 年之前 |
Amog Kamsetty
|
77f28f1c30
Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566)" (#17709)
|
3 年之前 |
Sven Mika
|
3b447265d8
[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566)
|
3 年之前 |
Sven Mika
|
7bc4376466
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077)
|
3 年之前 |