提交历史

作者 SHA1 备注 提交日期
  Sven Mika f82880eda1 Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417) 2 年之前
  Amog Kamsetty 90dc5460d4 Revert "[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399) 2 年之前
  Sven Mika 5b1c8e46e1 [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) 2 年之前
  Sven Mika 9c73871da0 [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) 3 年之前
  Sven Mika e3e6ed7aaa [RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358) 3 年之前
  Sven Mika e973b726c2 [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 3 年之前
  Sven Mika bb8a286cbc [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) 3 年之前
  Sven Mika 04bc0a9828 [RLlib] Remove all non-trajectory view API code. (#14860) 3 年之前
  Sven Mika 69202c6a7d [RLlib] Obsolete usage tracking dict via sample batch. (#13065) 3 年之前
  Sven Mika 2e3655e8a9 [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) 3 年之前
  Sven Mika b2bcab711d [RLlib] Attention Nets: tf (#12753) 3 年之前
  Sven Mika ce96b03b07 [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 4 年之前
  Sven Mika 47eb6613b5 [RLlib] Remove unnecessary copies in `compute_advantages`. (#10897) 4 年之前
  Sven Mika d14b501692 [RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) 4 年之前
  Sven Mika b0b0463161 [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 4 年之前
  Eric Liang 1e0e1a45e6 [rllib] Add type annotations for evaluation/, env/ packages (#9003) 4 年之前
  roireshef 3c60caa448 [rllib] implemented compute_advantages without gae (#6941) 4 年之前
  Sven Mika c957ed58ed [RLlib] Implement PPO torch version. (#6826) 4 年之前
  Sven 60d4d5e1aa Remove future imports (#6724) 4 年之前
  Robert Nishihara 39a3459886 Remove (object) from class declarations. (#6658) 4 年之前
  Neil Lugovoy 1376f1ae60 [tune] Reporter crash fix (#5426) 5 年之前
  Eric Liang 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 5 年之前