Sven Mika
|
8e680c483c
[RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)
|
1 年之前 |
xwjiang2010
|
fcf897ee72
[air] update rllib example to use Tuner API. (#26987)
|
2 年之前 |
Sven Mika
|
b5bc2b93c3
[RLlib] Move all remaining algos into `algorithms` directory. (#25366)
|
2 年之前 |
Yi Cheng
|
fd0f967d2e
Revert "[RLlib] Move (A/DD)?PPO and IMPALA algos to `algorithms` dir and rename policy and trainer classes. (#25346)" (#25420)
|
2 年之前 |
Sven Mika
|
e4ceae19ef
[RLlib] Move (A/DD)?PPO and IMPALA algos to `algorithms` dir and rename policy and trainer classes. (#25346)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Stefan Schneider
|
489febc6b2
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038)
|
3 年之前 |
Sven Mika
|
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
3 年之前 |
Stefan Schneider
|
55709bac7a
[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib (#15897)
|
3 年之前 |