Sven Mika
|
fd13bac9b3
[RLlib] Add `worker` arg (optional) to `policy_mapping_fn`. (#18184)
|
3 年之前 |
Sven Mika
|
649580d735
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
3 年之前 |
Amog Kamsetty
|
38b5b6d24c
Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)" (#17036)
|
3 年之前 |
Sven Mika
|
e4123fff27
[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)
|
3 年之前 |
Sven Mika
|
be6db06485
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
3 年之前 |
Amog Kamsetty
|
bd3cbfc56a
Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359)" (#16543)
|
3 年之前 |
Sven Mika
|
e78ec370a9
[RLlib] Allow policies to be added/deleted on the fly. (#16359)
|
3 年之前 |
Sven Mika
|
d2c755ccef
[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832)
|
3 年之前 |
Sven Mika
|
41968512ca
[RLlib] Partial GPU examples (for learner and workers). (#15334)
|
3 年之前 |
Sven Mika
|
c8ca4d03ad
[RLlib] Issue with agent-id -> pol-id mapping not required to be fixed across different episodes. (#15020)
|
3 年之前 |
Sven Mika
|
732197e23a
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
3 年之前 |
Sven Mika
|
841d93d366
[RLlib] Issue 12233 shared tf layers example not really shared (only works for tf1.x, not tf2.x). (#12399)
|
3 年之前 |
Sven Mika
|
c17169dc11
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
d8a081a185
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
4 年之前 |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 年之前 |
Sven Mika
|
57544b1ff9
[RLlib] Examples folder restructuring (Model examples; final part). (#8278)
|
4 年之前 |
Sven Mika
|
42991d723f
[RLlib] rllib/examples folder restructuring (#8250)
|
4 年之前 |
Sven Mika
|
bf25aee392
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
4 年之前 |
Sven Mika
|
d537e9f0d8
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
4 年之前 |