Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
Sven Mika
|
ac3371a148
[RLlib] Discussion 3644: Fix bug for complex obs spaces containing `Box([2D shape])` and discrete component. (#18917)
|
3 年之前 |
Sven Mika
|
61a1274619
[RLlib] No Preprocessors (part 2). (#18468)
|
3 年之前 |
Sven Mika
|
59f796edf3
[RLlib] Fix crash when using StochasticSampling exploration (most PG-style algos) w/ tf and numpy > 1.19.5 (#18366)
|
3 年之前 |
Sven Mika
|
8a844ff840
[RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use `ray.get_gpu_ids()` (b/c no GPUs assigned by ray). (#17444)
|
3 年之前 |
Sven Mika
|
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
3 年之前 |
Sven Mika
|
18d173b172
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031)
|
3 年之前 |
Sven Mika
|
cecfc3b43b
[RLlib] Multi-GPU support for Torch algorithms. (#14709)
|
3 年之前 |
Sven Mika
|
2e3655e8a9
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
3 年之前 |
Sven Mika
|
fb318addcb
[RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945)
|
3 年之前 |
Sven Mika
|
0df55a139c
[RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447)
|
3 年之前 |
Sven Mika
|
592c161032
[RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397)
|
3 年之前 |
Sven Mika
|
62c7ab5182
[RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747)
|
4 年之前 |
Sven Mika
|
5b788ccb13
[RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717)
|
4 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 年之前 |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
4fd8977eaf
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
4 年之前 |
Sven Mika
|
7008902cff
[RLlib] Minor `rllib.utils` cleanup. (#8932)
|
4 年之前 |
Sven Mika
|
bf25aee392
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
4 年之前 |
Sven Mika
|
22ccc43670
[RLlib] DQN torch version. (#7597)
|
4 年之前 |
Sven
|
60d4d5e1aa
Remove future imports (#6724)
|
4 年之前 |
Eric Liang
|
34fbc7fb4c
rllib] Fix leak of TensorFlow assign operations in DQN/DDPG
|
5 年之前 |
gehring
|
b520f6141e
[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436)
|
5 年之前 |
Eric Liang
|
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324)
|
5 年之前 |