Artur Niederfahrenhorst
|
00152a4b4b
[RLlib] Clean up some signatures for compute_actions. (#31241)
|
1 年之前 |
Sven Mika
|
8e680c483c
[RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)
|
1 年之前 |
Jun Gong
|
20499548e6
[RLlib] Policy mapping fn can not be called with keyword arguments. (#31141)
|
1 年之前 |
Artur Niederfahrenhorst
|
ef62802353
[RLlib] Unify policy mapping function usage (#30216)
|
1 年之前 |
Jun Gong
|
51ea841033
[RLlib] fix episode tests. (#30185)
|
1 年之前 |
Sven Mika
|
e7a614f388
Revert "Revert "[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, Rollo…" (#29747)
|
2 年之前 |
Kai Fricke
|
12b579d95e
Revert "[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, RolloutWorker, PolicyMap, WorkerSet use AlgorithmConfig objects under the hood. (#29395)" (#29742)
|
2 年之前 |
Sven Mika
|
182744bbd1
[RLlib] AlgorithmConfig: Next steps (volume 01); Algos, RolloutWorker, PolicyMap, WorkerSet use AlgorithmConfig objects under the hood. (#29395)
|
2 年之前 |
Sven Mika
|
130b7eeaba
[RLlib] `Trainer` to `Algorithm` renaming. (#25539)
|
2 年之前 |
Avnish Narayan
|
740def0a13
[RLlib] Put env-checker on critical path. (#22191)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Avnish Narayan
|
12b087acb8
[RLlib] Base env pre-checker. (#21569)
|
2 年之前 |
Sven Mika
|
9c73871da0
[RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783)
|
3 年之前 |
mvindiola1
|
62f5da0b65
[RLlib] Add unit tests for updating episode data in base_env (#17137)
|
3 年之前 |