Jun Gong
|
2317c693cf
[RLlib] Use SampleBrach instead of input dict whenever possible (#20746)
|
2 年之前 |
gjoliver
|
99a0088233
[RLlib] Unify the way we create local replay buffer for all agents (#19627)
|
3 年之前 |
gjoliver
|
c3c42278e4
[RLlib] clean up all the SampleBatch['is_training'] deprecation warnings (#19652)
|
3 年之前 |
Sven Mika
|
1f0646f658
[RLlib] Issue 18418: SAC w/ dict space not working. (#19101)
|
3 年之前 |
Julius Frost
|
d7a5ec1830
[RLlib] SAC tuple observation space fix (#17356)
|
3 年之前 |
Sven Mika
|
eb0038612f
[RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584)
|
3 年之前 |
Sven Mika
|
52c94b7ee9
[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522)
|
3 年之前 |
Sven Mika
|
56878221ed
[RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363)
|
3 年之前 |
Kai Fricke
|
25f10a947a
Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339)" (#13361)
|
3 年之前 |
Sven Mika
|
e2b2abb88b
[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339)
|
3 年之前 |
Sven Mika
|
291c172d83
[RLlib] Support Simplex action spaces for SAC (torch and tf). (#11909)
|
4 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
f7e4dae852
[RLlib] DQN and SAC Atari benchmark fixes. (#7962)
|
4 年之前 |
Sven Mika
|
428516056a
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
4 年之前 |