提交历史

作者 SHA1 备注 提交日期
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
  Sven Mika ac3371a148 [RLlib] Discussion 3644: Fix bug for complex obs spaces containing `Box([2D shape])` and discrete component. (#18917) 3 年之前
  Sven Mika 61a1274619 [RLlib] No Preprocessors (part 2). (#18468) 3 年之前
  Sven Mika 59f796edf3 [RLlib] Fix crash when using StochasticSampling exploration (most PG-style algos) w/ tf and numpy > 1.19.5 (#18366) 3 年之前
  Sven Mika 8a844ff840 [RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use `ray.get_gpu_ids()` (b/c no GPUs assigned by ray). (#17444) 3 年之前
  Sven Mika 5a313ba3d6 [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 3 年之前
  Sven Mika 18d173b172 [RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031) 3 年之前
  Sven Mika cecfc3b43b [RLlib] Multi-GPU support for Torch algorithms. (#14709) 3 年之前
  Sven Mika 2e3655e8a9 [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) 3 年之前
  Sven Mika fb318addcb [RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945) 3 年之前
  Sven Mika 0df55a139c [RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447) 3 年之前
  Sven Mika 592c161032 [RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397) 3 年之前
  Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) 4 年之前
  Sven Mika 5b788ccb13 [RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717) 4 年之前
  Sven Mika 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 年之前
  Barak Michener 8e76796fd0 ci: Redo `format.sh --all` script & backfill lint fixes (#9956) 4 年之前
  Sven Mika fcdf410ae1 [RLlib] Tf2.x native. (#8752) 4 年之前
  Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136) 4 年之前
  Sven Mika 4fd8977eaf [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 4 年之前
  Sven Mika 7008902cff [RLlib] Minor `rllib.utils` cleanup. (#8932) 4 年之前
  Sven Mika bf25aee392 [RLlib] Deprecate all Model(v1) usage. (#8146) 4 年之前
  Sven Mika 22ccc43670 [RLlib] DQN torch version. (#7597) 4 年之前
  Sven 60d4d5e1aa Remove future imports (#6724) 4 年之前
  Eric Liang 34fbc7fb4c rllib] Fix leak of TensorFlow assign operations in DQN/DDPG 5 年之前
  gehring b520f6141e [rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436) 5 年之前
  Eric Liang 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 5 年之前