Sven Mika
|
32c73a319d
[RLlib] Issue 39031: SlateQ example script bug. (#39550)
|
1 year ago |
Sven Mika
|
b0045799e3
[RLlib] DreamerV3: Make 200M (XL model) work; mixed float16 option (#38461)
|
1 year ago |
Artur Niederfahrenhorst
|
1c29b98c71
[RLlib] Fix issues with action masking examples. (#38095)
|
1 year ago |
Sven Mika
|
8c055af084
[RLlib] DreamerV3: Add CI Testing. (#37979)
|
1 year ago |
Rohan Potdar
|
6f88b0c674
[RLlib] Bump Gymnasium to 0.28.1 (#35698)
|
1 year ago |
Artur Niederfahrenhorst
|
dcf499f768
[RLlib][doc-test] Improved doc examples inside core folder (#37558)
|
1 year ago |
Artur Niederfahrenhorst
|
ab131bb8c2
[RLlib] Early improvements to Catalogs and RL Modules docs + Catalogs improvements (#37245)
|
1 year ago |
Artur Niederfahrenhorst
|
960032a15f
[RLlib][RLModules] RNNs and RLModules (#32723)
|
1 year ago |
kourosh hakhamaneshi
|
c7b2c6c40d
[RLlib][Torch 2.0 compile] Inference benchmarks (#36534)
|
1 year ago |
Sven Mika
|
e14c9b1da5
[RLlib] Remove `vtrace_drop_last_ts` option and add proper vf bootstrapping to IMPALA and APPO. (#36013)
|
1 year ago |
Artur Niederfahrenhorst
|
2a12cf5eff
[RLlib] Compile update logic on learner and use cudagraphs (#35759)
|
1 year ago |
Sven Mika
|
827ab91741
[RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)
|
1 year ago |
Sven Mika
|
656fe0703c
Revert revert [RLlib] DreamerV3 Main Algo. (#36571)
|
1 year ago |
Kai Fricke
|
42e06e3948
Revert "[RLlib] DreamerV3: Main algo code and required changes to some RLlib APIs (RolloutWorker). (#35386)" (#36564)
|
1 year ago |
Sven Mika
|
8290bd112c
[RLlib] DreamerV3: Main algo code and required changes to some RLlib APIs (RolloutWorker). (#35386)
|
1 year ago |
Balaji Veeramani
|
2125b18f37
[RLlib][Docs] Enable doctests for RLLib (#35931)
|
1 year ago |
Sven Mika
|
ce6a1ee9be
[RLlib] Unflake `single_agent_env_runner` test (fix bug and change test size from small to medium). (#36075)
|
1 year ago |
Avnish Narayan
|
773757cbc4
[RLLIB] Example of finetuning with bc and tuning with PPO on cartpole (#35966)
|
1 year ago |
Avnish Narayan
|
7acdbadf7b
[Rllib] ci reduce flakiness reduce test time (#36047)
|
1 year ago |
Sven Mika
|
61f2dc1b05
[RLlib] Enhance `run_regression_tests.py`: Allow overriding `--env` and `--framework` on command line. (#35985)
|
1 year ago |
Sven Mika
|
baf2d72cac
[RLlib] Cleanups: Learner API and Catalog. (#35982)
|
1 year ago |
Avnish Narayan
|
e6b5b8bcf3
[RLlib] break up the learner group tests into shorter tests (#35926)
|
1 year ago |
Sven Mika
|
a794320dff
[RLlib] RLlib light: EnvRunner API. (#35872)
|
1 year ago |
Sven Mika
|
f1f714c69e
[RLlib] Learner API enhancements and cleanups (prep. for DreamerV3). (#35877)
|
1 year ago |
Avnish Narayan
|
afdfb3e911
Revert "[RLlib] Add more gpu/multicpu jobs, make supported spaces tests exclusive (#35735)" (#35891)
|
1 year ago |
Balaji Veeramani
|
d4a8900895
[CI] Migrate from `sphinx.ext.doctest` to `pytest-sphinx` (#35286)
|
1 year ago |
Avnish Narayan
|
83179ab1db
[RLlib] Load state from load_state_path for rlmodule spec. (#35180)
|
1 year ago |
Avnish Narayan
|
3d124ccaba
[RLlib] Add more gpu/multicpu jobs, make supported spaces tests exclusive (#35735)
|
1 year ago |
Artur Niederfahrenhorst
|
999fbf9e3f
[RLlib] Increase the required time for PPO learner tests (#35651)
|
1 year ago |
Michael Möbius
|
601a3ea96e
[RLlib] Fix IMPALA/APPO when using multi GPU setup and Multi-Agent Env (#35120)
|
1 year ago |