Sven Mika
|
2d24ef0d32
[RLlib] Add all simple learning tests as `framework=tf2`. (#19273)
|
3 年之前 |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
Sven Mika
|
59f796edf3
[RLlib] Fix crash when using StochasticSampling exploration (most PG-style algos) w/ tf and numpy > 1.19.5 (#18366)
|
3 年之前 |
Sven Mika
|
3013d9b341
[RLlib] Fix "Cannot convert a symbolic Tensor (default_policy/strided_slice_3:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported" (#17587)
|
3 年之前 |
Sven Mika
|
d0014cd351
[RLlib] Policies get/set_state fixes and enhancements. (#16354)
|
3 年之前 |
Sven Mika
|
199e5d0f75
[RLlib] Exploration class type annotations. (#11251)
|
4 年之前 |
Sven Mika
|
ce96b03b07
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
4 年之前 |
Michael Luo
|
4d7bd8c892
[RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409)
|
4 年之前 |
Sven Mika
|
ff9c1dac88
[RLlib] Issue 9667 DDPG Torch bugs and enhancements. (#9680)
|
4 年之前 |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 年之前 |
Sven Mika
|
4da0e542d5
[RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
a00144f746
[RLlib] Fix issue 8135 (DDPG inf actions when using [-inf,inf] action space). (#8302)
|
4 年之前 |
Sven Mika
|
d0fab84e4d
[RLlib] DDPG PyTorch version. (#7953)
|
4 年之前 |
Sven Mika
|
428516056a
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
4 年之前 |
Sven Mika
|
e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
4 年之前 |
Sven Mika
|
83e06cd30a
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
|
4 年之前 |