Commit history

Author SHA1 Message Commit date
  Sven Mika 9c5a0cfd7a [RLlib] Issue 14385: `Policy.compute_actions_from_input_dict` does not properly track accessed fields for Policy's view requirements. (#14386) 3 years ago
  Sven Mika 69202c6a7d [RLlib] Obsolete usage tracking dict via sample batch. (#13065) 3 years ago
  Sven Mika 8000258333 [RLlib] R2D2 Implementation. (#13933) 3 years ago
  Sven Mika 81e7434091 [RLlib] TFPolicy.export_model: Add timestep placeholder to model's signature, if needed. (#13988) 3 years ago
  Sven Mika eb0038612f [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) 3 years ago
  Sven Mika 6f342a2221 [RLlib] Preparatory PR for: Documentation on Model Building. (#13260) 3 years ago
  Sven Mika 391cdfae8c [RLlib] Trajectory view API docs. (#12718) 3 years ago
  Sven Mika b2bcab711d [RLlib] Attention Nets: tf (#12753) 3 years ago
  Sven Mika 74c98ac38e [RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786) 3 years ago
  Sven Mika 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 years ago
  Sven Mika 9021f15b2a [RLlib] Fix setup-dev.py error when creating a softlink for new_dashboard. (#12442) 3 years ago
  Sven Mika 6da4342822 [RLlib] Add on_learn_on_batch (Policy) callback to DefaultCallbacks. (#12070) 3 years ago
  Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) 4 years ago
  Sven Mika d9f1874e34 [RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609) 4 years ago
  Sven Mika 8ea1bc5ff9 [RLlib] Allow for more than 2^31 policy timesteps. (#11301) 4 years ago
  Sven Mika ce96b03b07 [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 4 years ago
  Sven Mika 2256047876 [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) 4 years ago
  Barak Michener 8e76796fd0 ci: Redo `format.sh --all` script & backfill lint fixes (#9956) 4 years ago
  Sven Mika fcdf410ae1 [RLlib] Tf2.x native. (#8752) 4 years ago
  Sven Mika f43d934817 [RLlib] Type annotations for policy. (#9248) 4 years ago
  Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136) 4 years ago
  Sven Mika 25c0974543 [RLlib] Issue 8412 (Adam vars not stored in ModelV2). (#8480) 4 years ago
  Sven Mika 6c2b9a4cfa [RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304) 4 years ago
  Sven Mika 5537fe13b0 [RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814) 4 years ago
  Sven Mika e153e3179f [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 4 years ago
  Sven Mika 66df8b8c35 [RLlib] Working/learning example: PPO + torch + LSTM. (#7797) 4 years ago
  Sven Mika e4bd5db4d8 [RLlib] Minimal ParamNoise PR. (#7772) 4 years ago
  Sven Mika 369a3417c4 [RLlib] Add tf-graph by default when doing `Policy.export_model()`. (#7759) 4 years ago
  Sven Mika 1138f2ebed [RLlib] Issue 7046 cannot restore keras model from h5 file. (#7482) 4 years ago
  Sven Mika 2fb219a658 [Ray RLlib] Fix tree import (#7662) 4 years ago