Sven Mika
|
d0014cd351
[RLlib] Policies get/set_state fixes and enhancements. (#16354)
|
3 years ago |
Sven Mika
|
57544b1ff9
[RLlib] Examples folder restructuring (Model examples; final part). (#8278)
|
4 years ago |
Sven Mika
|
1775e89f26
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
|
4 years ago |
Sven Mika
|
e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
4 years ago |
Sven Mika
|
0db2046b0a
[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)
|
4 years ago |
Sven Mika
|
d537e9f0d8
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
4 years ago |
Sven Mika
|
6e1c3ea824
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
4 years ago |
Sven Mika
|
303547f119
[RLlib] Policy-classes cleanup and torch/tf unification. (#6770)
|
4 years ago |