Sven Mika | 0b308719f8 | [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) | 3 years ago
Sven Mika | 5107d16ae5 | [RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) | 3 years ago
Sven Mika | 56878221ed | [RLlib] Redo: Make TFModelV2 fully modular like TorchModelV2 (soft-deprecate register_variables, unify var names wrt torch). (#13363) | 3 years ago
Kai Fricke | 25f10a947a | Revert "[RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339)" (#13361) | 3 years ago
Sven Mika | e2b2abb88b | [RLlib] Make TFModelV2 behave more like TorchModelV2: Obsolete register_variables. Unify variable dicts. (#13339) | 3 years ago
Sven Mika | 8726521604 | [RLlib] JAXPolicy prep PR #2 (move get_activation_fn (backward-compatibly), minor fixes and preparations). (#13091) | 3 years ago
Sven Mika | d537e9f0d8 | [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) | 4 years ago
Sven Mika | 6e1c3ea824 | [RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974) | 4 years ago
Sven Mika | c957ed58ed | [RLlib] Implement PPO torch version. (#6826) | 4 years ago
Sven | 60d4d5e1aa | Remove future imports (#6724) | 4 years ago
Sven | 8b16847c02 | Get utils ready for better Agent torch support. (#6561) | 4 years ago