Sven Mika
|
827ab91741
[RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)
|
1 年之前 |
SangBin Cho
|
0c69020432
Revert "Simplify logging configuration. (#30863)" (#31858)
|
1 年之前 |
Sven Mika
|
8e680c483c
[RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)
|
1 年之前 |
Peyton Murray
|
608276bb96
Simplify logging configuration. (#30863)
|
1 年之前 |
Jun Gong
|
20499548e6
[RLlib] Policy mapping fn can not be called with keyword arguments. (#31141)
|
1 年之前 |
Sven Mika
|
e715a8b761
[RLlib] AlgorithmConfig: Replace more occurrences of old config dicts; Make all Algorithms use the non-dict lookup for config properties. (#30096)
|
1 年之前 |
Sven Mika
|
42864d711d
[RLlib; AIR; Tune] Replace all remaining `tune.run()` with `tune.Tuner(...).fit()`. (#28325)
|
2 年之前 |
xwjiang2010
|
fcf897ee72
[air] update rllib example to use Tuner API. (#26987)
|
2 年之前 |
Sven Mika
|
130b7eeaba
[RLlib] `Trainer` to `Algorithm` renaming. (#25539)
|
2 年之前 |
Sven Mika
|
7c39aa5fac
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2 年之前 |
Sven Mika
|
b5bc2b93c3
[RLlib] Move all remaining algos into `algorithms` directory. (#25366)
|
2 年之前 |
Yi Cheng
|
fd0f967d2e
Revert "[RLlib] Move (A/DD)?PPO and IMPALA algos to `algorithms` dir and rename policy and trainer classes. (#25346)" (#25420)
|
2 年之前 |
Sven Mika
|
e4ceae19ef
[RLlib] Move (A/DD)?PPO and IMPALA algos to `algorithms` dir and rename policy and trainer classes. (#25346)
|
2 年之前 |
Jun Gong
|
eaf9c941ae
[RLlib] Migrate PPO Impala and APPO policies to use sub-classing implementation. (#25117)
|
2 年之前 |
Siyuan (Ryans) Zhuang
|
0c74ecad12
[Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Sven Mika
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 年之前 |
Sven Mika
|
be6db06485
[RLlib] Re-do: Trainer: Support add and delete Policies. (#16569)
|
3 年之前 |
Amog Kamsetty
|
bd3cbfc56a
Revert "[RLlib] Allow policies to be added/deleted on the fly. (#16359)" (#16543)
|
3 年之前 |
Sven Mika
|
e78ec370a9
[RLlib] Allow policies to be added/deleted on the fly. (#16359)
|
3 年之前 |
Sven Mika
|
d2c755ccef
[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832)
|
3 年之前 |
Sven Mika
|
732197e23a
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
3 年之前 |
Lara Codeca
|
e735add268
[RLlib] Integration with SUMO Simulator (#11710)
|
4 年之前 |