提交历史

作者 SHA1 备注 提交日期
  Sven Mika 0f62ccc6ee [RLlib] Clean up some deprecation messages (they shouldn't be there) and make others `error=True` (from `error=False`) (#38555) 1 年之前
  Sven Mika 8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369) 1 年之前
  Sven Mika 130b7eeaba [RLlib] `Trainer` to `Algorithm` renaming. (#25539) 2 年之前
  Balaji Veeramani 7f1bacc7dc [CI] Format Python code with Black (#21975) 2 年之前
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
  Sven Mika 5107d16ae5 [RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) 3 年之前
  Michael Luo 4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 3 年之前
  Amog Kamsetty ebc44c3d76 [CI] Upgrade flake8 to 3.9.1 (#15527) 3 年之前
  Sven Mika 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077) 3 年之前
  Sven Mika 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 年之前
  Sven Mika 19c8033df2 [RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366) 3 年之前
  Sven Mika fb318addcb [RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945) 3 年之前
  Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) 4 年之前
  Sven Mika 5b788ccb13 [RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717) 4 年之前
  Sven Mika 0c0f67c14d [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 4 年之前
  Sven Mika c17169dc11 [RLlib] Fix all example scripts to run on GPUs. (#11105) 4 年之前
  Sven Mika 36bda8432b [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 4 年之前
  Sven Mika 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 年之前
  desktable 799318d7d7 [RLlib] Add type annotations for agents/dqn (#10626) 4 年之前
  Sven Mika e968b52cb7 [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950) 4 年之前
  Sven Mika d14b501692 [RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115) 4 年之前
  Sven Mika 2cbe29a7fa [RLlib] Curiosity minor fixes, do-overs, and testing. (#10143) 4 年之前
  Eric Liang ca133e2699 [rllib] Remove extra model config kwargs passed incorrectly for Torch models (#10055) 4 年之前
  Sven Mika 2256047876 [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) 4 年之前
  Tanay Wakhare 1826b29757 [RLlib] Curiosity (intrinsic motivation) Exploration module. (#9912) 4 年之前
  Barak Michener 8e76796fd0 ci: Redo `format.sh --all` script & backfill lint fixes (#9956) 4 年之前
  Sven Mika 57690a3a9f [RLlib] Trajectory view API - 02 actual API scaffold (#9753) 4 年之前
  Sven Mika 9b90f7db67 [RLlib] Missing type annotations policy templates. (#9846) 4 年之前
  Eric Liang 5acd3e66dd [rllib] Fix torch TD error, IMPALA LR updates (#9477) 4 年之前
  Sven Mika 4fd8977eaf [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 4 年之前