提交历史

作者 SHA1 备注 提交日期
  Sven Mika 49cd7ea6f9 [RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571) 2 年之前
  Artur Niederfahrenhorst d07e50e957 [RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552) 2 年之前
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 年之前
  Sven Mika ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 年之前
  Chris Bamford 58a73821fb [RLlib] IMPALA sample throughput calculation and full queue slowdown fixes (#17822) 3 年之前
  Sven Mika 5a313ba3d6 [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 3 年之前
  Sven Mika 4b8dadccbd [RLlib] Fix PR 16162: Having added sleep to `_NextValueNotReady` causes TD3 tests to become flakey. (#16309) 3 年之前
  Chris Bamford 1e3721ef4a [RLlib] Remove bad spinlocks to allow pytorch GPU scheduler to interrupt. (#16162) 3 年之前
  Sven Mika d001af3e59 [RLlib] Allow `rllib rollout` to run distributed via evaluation workers. (#13718) 3 年之前
  Michael Luo a2d1215200 [RLlib] Execution Annotation (#13036) 3 年之前
  Edward Oakes cde711aaf1 Revert "[RLLib] Execution-Folder Type Annotations (#12760)" (#12886) 3 年之前
  Michael Luo becca1424d [RLLib] Execution-Folder Type Annotations (#12760) 3 年之前
  Sven Mika 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 年之前
  Sven Mika fcdf410ae1 [RLlib] Tf2.x native. (#8752) 4 年之前
  Eric Liang 9a83908c46 [rllib] Deprecate policy optimizers (#8345) 4 年之前