Avnish Narayan
|
684e28b380
[RLlib] RLlib deprecation Notices Part 1 (algorithm/, evaluation/, execution/, models/jax/) (#36826)
|
1 年之前 |
Sven Mika
|
e5d8b28f53
[RLlib] Remove `policy_config` property from RolloutWorker (not needed). (#35878)
|
1 年之前 |
Sven Mika
|
a9667e7b9d
[RLlib] Fix flakey 100-policies LRU cache test. (#30823)
|
1 年之前 |
Archit Kulkarni
|
c42b080617
[RLlib] [Tune] [Autoscaler] Remove `six` dependency (#30815)
|
1 年之前 |
Sven Mika
|
b38f0c8454
[RLlib] Only sync policy weights on RolloutWorkers of those policies that were actually updated. (#29973)
|
1 年之前 |
Sven Mika
|
432f023642
[RLlib] Deprecate `AlgorithmConfig.framework("tfe")`: Use `tf2` instead. (#29755)
|
2 年之前 |
Sven Mika
|
59a967a3a0
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2 年之前 |
Sven Mika
|
130b7eeaba
[RLlib] `Trainer` to `Algorithm` renaming. (#25539)
|
2 年之前 |
Eric Liang
|
905258dbc1
Clean up docstyle in python modules and add LINT rule (#25272)
|
2 年之前 |
Artur Niederfahrenhorst
|
fb2915d26a
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2 年之前 |
Sven Mika
|
25001f6d8d
[RLlib] APPO Training iteration fn. (#24545)
|
2 年之前 |
Avnish Narayan
|
f2bb6f6806
[RLlib] Impala training iteration fn (#23454)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Jun Gong
|
8ebc50f844
[RLlib] Issue 21334: Fix APPO when kl_loss is enabled. (#21855)
|
2 年之前 |
Sven Mika
|
49cd7ea6f9
[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)
|
2 年之前 |
Artur Niederfahrenhorst
|
d07e50e957
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552)
|
2 年之前 |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
Sven Mika
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 年之前 |
Chris Bamford
|
58a73821fb
[RLlib] IMPALA sample throughput calculation and full queue slowdown fixes (#17822)
|
3 年之前 |
Sven Mika
|
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
3 年之前 |
Sven Mika
|
4b8dadccbd
[RLlib] Fix PR 16162: Having added sleep to `_NextValueNotReady` causes TD3 tests to become flakey. (#16309)
|
3 年之前 |
Chris Bamford
|
1e3721ef4a
[RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. (#16162)
|
3 年之前 |
Sven Mika
|
d001af3e59
[RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718)
|
3 年之前 |
Michael Luo
|
a2d1215200
[RLlib] Execution Annotation (#13036)
|
3 年之前 |
Edward Oakes
|
cde711aaf1
Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886)
|
3 年之前 |
Michael Luo
|
becca1424d
[RLLib] Execution-Folder Type Annotations (#12760)
|
3 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 年之前 |
Eric Liang
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 年之前 |