提交历史

作者 SHA1 备注 提交日期
  kourosh hakhamaneshi b7717df6f6 [RLlib] Fix connector issue in `Algorithm.compute_single_action()`. (#30555) 1 年之前
  Sven Mika 2ed09c5445 [RLlib] Move all config validation logic into AlgorithmConfig classes. (#29854) 1 年之前
  Sven Mika 432f023642 [RLlib] Deprecate `AlgorithmConfig.framework("tfe")`: Use `tf2` instead. (#29755) 2 年之前
  Sven Mika 130b7eeaba [RLlib] `Trainer` to `Algorithm` renaming. (#25539) 2 年之前
  Sven Mika b5bc2b93c3 [RLlib] Move all remaining algos into `algorithms` directory. (#25366) 2 年之前
  kourosh hakhamaneshi 3815e52a61 [RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896) 2 年之前
  Balaji Veeramani 7f1bacc7dc [CI] Format Python code with Black (#21975) 2 年之前
  Avnish Narayan 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
  Sven Mika 18d173b172 [RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031) 3 年之前
  Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 3 年之前
  Sven Mika 8698cf9bc8 [RLlib] Fix param noise test case on CI. (#14926) 3 年之前
  Sven Mika 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 年之前
  Sven Mika ce96b03b07 [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 4 年之前
  Barak Michener 8e76796fd0 ci: Redo `format.sh --all` script & backfill lint fixes (#9956) 4 年之前
  Sven Mika fcdf410ae1 [RLlib] Tf2.x native. (#8752) 4 年之前
  Sven Mika 4da0e542d5 [RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204) 4 年之前
  Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136) 4 年之前
  Sven Mika 2746fc0476 [RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520) 4 年之前
  Sven Mika d0fab84e4d [RLlib] DDPG PyTorch version. (#7953) 4 年之前
  Sven Mika 1b31c11806 [RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934) 4 年之前