提交历史

作者 SHA1 备注 提交日期
  Avnish Narayan 684e28b380 [RLlib] RLlib deprecation Notices Part 1 (algorithm/, evaluation/, execution/, models/jax/) (#36826) 1 年之前
  Sven Mika e5d8b28f53 [RLlib] Remove `policy_config` property from RolloutWorker (not needed). (#35878) 1 年之前
  Sven Mika a9667e7b9d [RLlib] Fix flakey 100-policies LRU cache test. (#30823) 1 年之前
  Archit Kulkarni c42b080617 [RLlib] [Tune] [Autoscaler] Remove `six` dependency (#30815) 1 年之前
  Sven Mika b38f0c8454 [RLlib] Only sync policy weights on RolloutWorkers of those policies that were actually updated. (#29973) 1 年之前
  Sven Mika 432f023642 [RLlib] Deprecate `AlgorithmConfig.framework("tfe")`: Use `tf2` instead. (#29755) 2 年之前
  Sven Mika 59a967a3a0 [RLlib] Cleanup some deprecated metric keys and classes. (#26036) 2 年之前
  Sven Mika 130b7eeaba [RLlib] `Trainer` to `Algorithm` renaming. (#25539) 2 年之前
  Eric Liang 905258dbc1 Clean up docstyle in python modules and add LINT rule (#25272) 2 年之前
  Artur Niederfahrenhorst fb2915d26a [RLlib] Replay Buffer API and Ape-X. (#24506) 2 年之前
  Sven Mika 25001f6d8d [RLlib] APPO Training iteration fn. (#24545) 2 年之前
  Avnish Narayan f2bb6f6806 [RLlib] Impala training iteration fn (#23454) 2 年之前
  Balaji Veeramani 7f1bacc7dc [CI] Format Python code with Black (#21975) 2 年之前
  Jun Gong 8ebc50f844 [RLlib] Issue 21334: Fix APPO when kl_loss is enabled. (#21855) 2 年之前
  Sven Mika 49cd7ea6f9 [RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571) 2 年之前
  Artur Niederfahrenhorst d07e50e957 [RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552) 2 年之前
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
  Sven Mika ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 年之前
  Chris Bamford 58a73821fb [RLlib] IMPALA sample throughput calculation and full queue slowdown fixes (#17822) 3 年之前
  Sven Mika 5a313ba3d6 [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 3 年之前
  Sven Mika 4b8dadccbd [RLlib] Fix PR 16162: Having added sleep to `_NextValueNotReady` causes TD3 tests to become flakey. (#16309) 3 年之前
  Chris Bamford 1e3721ef4a [RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. (#16162) 3 年之前
  Sven Mika d001af3e59 [RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718) 3 年之前
  Michael Luo a2d1215200 [RLlib] Execution Annotation (#13036) 3 年之前
  Edward Oakes cde711aaf1 Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886) 3 年之前
  Michael Luo becca1424d [RLLib] Execution-Folder Type Annotations (#12760) 3 年之前
  Sven Mika 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 年之前
  Sven Mika fcdf410ae1 [RLlib] Tf2.x native. (#8752) 4 年之前
  Eric Liang 9a83908c46 [rllib] Deprecate policy optimizers (#8345) 4 年之前