提交历史

作者 SHA1 备注 提交日期
  Sven Mika a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) 3 年之前
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
  Sven Mika b213565783 [RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693) 3 年之前
  gjoliver c3c42278e4 [RLlib] clean up all the SampleBatch['is_training'] deprecation warnings (#19652) 3 年之前
  Sven Mika 61a1274619 [RLlib] No Preprocessors (part 2). (#18468) 3 年之前
  Sven Mika 8a066474d4 [RLlib] No Preprocessors; preparatory PR #1 (#18367) 3 年之前
  Sven Mika 494ddd98c1 [RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928) 3 年之前
  Sven Mika 7eb1a29426 [RLlib] Fix ModelV2 custom metrics for torch. (#16734) 3 年之前
  Sven Mika e973b726c2 [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 3 年之前
  Sven Mika bb8a286cbc [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) 3 年之前
  Sven Mika cecfc3b43b [RLlib] Multi-GPU support for Torch algorithms. (#14709) 3 年之前
  Sven Mika 69202c6a7d [RLlib] Obsolete usage tracking dict via sample batch. (#13065) 3 年之前
  Sven Mika ee4b6e7e3b [RLlib] Unity3D example broken due to change in ML-Agents API. Attention-net prev-n-a/r. Attention-wrapper works with images. (#14569) 3 年之前
  Sven Mika 95ef04b71a [RLlib] Implement `TorchPolicy.export_model`. (#13989) 3 年之前
  Sven Mika 52c94b7ee9 [RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522) 3 年之前
  Sven Mika 6f342a2221 [RLlib] Preparatory PR for: Documentation on Model Building. (#13260) 3 年之前
  Sven Mika 391cdfae8c [RLlib] Trajectory view API docs. (#12718) 3 年之前
  Sven Mika a5318961de [RLlib] Preprocessor fixes (multi-discrete) and tests. (#13083) 3 年之前
  Sven Mika 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077) 3 年之前
  Sven Mika b2bcab711d [RLlib] Attention Nets: tf (#12753) 3 年之前
  Sven Mika 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 年之前
  Sven Mika 3ad9365e1d [RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449) 3 年之前
  Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) 4 年之前
  Sven Mika d9f1874e34 [RLlib] Minor fixes (torch GPU bugs + some cleanup). (#11609) 4 年之前
  Sven Mika 36bda8432b [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 4 年之前
  Sven Mika 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 年之前
  Sven Mika e968b52cb7 [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950) 4 年之前
  Sven Mika 2256047876 [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) 4 年之前
  Sven Mika 57690a3a9f [RLlib] Trajectory view API - 02 actual API scaffold (#9753) 4 年之前
  Eric Liang 590943a499 [rllib] Type annotations for model classes (#9646) 4 年之前