提交历史

作者 SHA1 备注 提交日期
  Sven Mika 32c73a319d [RLlib] Issue 39031: SlateQ example script bug. (#39550) 1 年之前
  Sven Mika b0045799e3 [RLlib] DreamerV3: Make 200M (XL model) work; mixed float16 option (#38461) 1 年之前
  Artur Niederfahrenhorst 1c29b98c71 [RLlib] Fix issues with action masking examples. (#38095) 1 年之前
  Sven Mika 8c055af084 [RLlib] DreamerV3: Add CI Testing. (#37979) 1 年之前
  Rohan Potdar 6f88b0c674 [RLlib] Bump Gymnasium to 0.28.1 (#35698) 1 年之前
  Artur Niederfahrenhorst dcf499f768 [RLlib][doc-test] Improved doc examples inside core folder (#37558) 1 年之前
  Artur Niederfahrenhorst ab131bb8c2 [RLlib] Early improvements to Catalogs and RL Modules docs + Catalogs improvements (#37245) 1 年之前
  Artur Niederfahrenhorst 960032a15f [RLlib][RLModules] RNNs and RLModules (#32723) 1 年之前
  kourosh hakhamaneshi c7b2c6c40d [RLlib][Torch 2.0 compile] Inference benchmarks (#36534) 1 年之前
  Sven Mika e14c9b1da5 [RLlib] Remove `vtrace_drop_last_ts` option and add proper vf bootstrapping to IMPALA and APPO. (#36013) 1 年之前
  Artur Niederfahrenhorst 2a12cf5eff [RLlib] Compile update logic on learner and use cudagraphs (#35759) 1 年之前
  Sven Mika 827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557) 1 年之前
  Sven Mika 656fe0703c Revert revert [RLlib] DreamerV3 Main Algo. (#36571) 1 年之前
  Kai Fricke 42e06e3948 Revert "[RLlib] DreamerV3: Main algo code and required changes to some RLlib APIs (RolloutWorker). (#35386)" (#36564) 1 年之前
  Sven Mika 8290bd112c [RLlib] DreamerV3: Main algo code and required changes to some RLlib APIs (RolloutWorker). (#35386) 1 年之前
  Balaji Veeramani 2125b18f37 [RLlib][Docs] Enable doctests for RLLib (#35931) 1 年之前
  Sven Mika ce6a1ee9be [RLlib] Unflake `single_agent_env_runner` test (fix bug and change test size from small to medium). (#36075) 1 年之前
  Avnish Narayan 773757cbc4 [RLLIB] Example of finetuning with bc and tuning with PPO on cartpole (#35966) 1 年之前
  Avnish Narayan 7acdbadf7b [Rllib] ci reduce flakiness reduce test time (#36047) 1 年之前
  Sven Mika 61f2dc1b05 [RLlib] Enhance `run_regression_tests.py`: Allow overriding `--env` and `--framework` on command line. (#35985) 1 年之前
  Sven Mika baf2d72cac [RLlib] Cleanups: Learner API and Catalog. (#35982) 1 年之前
  Avnish Narayan e6b5b8bcf3 [RLlib] break up the learner group tests into shorter tests (#35926) 1 年之前
  Sven Mika a794320dff [RLlib] RLlib light: EnvRunner API. (#35872) 1 年之前
  Sven Mika f1f714c69e [RLlib] Learner API enhancements and cleanups (prep. for DreamerV3). (#35877) 1 年之前
  Avnish Narayan afdfb3e911 Revert "[RLlib] Add more gpu/multicpu jobs, make supported spaces tests exclusive (#35735)" (#35891) 1 年之前
  Balaji Veeramani d4a8900895 [CI] Migrate from `sphinx.ext.doctest` to `pytest-sphinx` (#35286) 1 年之前
  Avnish Narayan 83179ab1db [RLlib] Load state from load_state_path for rlmodule spec. (#35180) 1 年之前
  Avnish Narayan 3d124ccaba [RLlib] Add more gpu/multicpu jobs, make supported spaces tests exclusive (#35735) 1 年之前
  Artur Niederfahrenhorst 999fbf9e3f [RLlib] Increase the required time for PPO learner tests (#35651) 1 年之前
  Michael Möbius 601a3ea96e [RLlib] Fix IMPALA/APPO when using multi GPU setup and Multi-Agent Env (#35120) 1 年之前