Sven Mika | eea6b40a3e | [RLlib] Minor cleanups in `Trainer`; better tf/tf2 info messages about possible tracing speedups. (#20109) | 2 years ago
Sven Mika | a931076f59 | [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) | 3 years ago
Sven Mika | 2d24ef0d32 | [RLlib] Add all simple learning tests as `framework=tf2`. (#19273) | 3 years ago
Sven Mika | 0b308719f8 | [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) | 3 years ago
Sven Mika | bab9c0f670 | [RLlib; Docs overhaul] Redo: Docstring cleanup: Trainer, trainer_template, Callbacks. (#19830) | 3 years ago
Sven Mika | 9c73871da0 | [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) | 3 years ago
Sven Mika | 4a82d3ea6c | Revert "[RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. (#19758)" (#19806) | 3 years ago
Sven Mika | 80eeb13175 | [RLlib; Docs overhaul] Docstring cleanup: Trainer, trainer_template, Callbacks. (#19758) | 3 years ago
gjoliver | 99a0088233 | [RLlib] Unify the way we create local replay buffer for all agents (#19627) | 3 years ago
Sven Mika | fd438d5630 | [RLlib] Issue 18104: Cannot set remote_worker_envs=True for non local-mode and MultiAgentEnv. (#19133) | 3 years ago
Sven Mika | 828f5d26b7 | [RLlib] Custom view requirements (e.g. for prev-n-obs) work with `compute_single_action` and `compute_actions_from_input_dict`. (#18921) | 3 years ago
Sven Mika | 61a1274619 | [RLlib] No Preprocessors (part 2). (#18468) | 3 years ago
Sven Mika | 698b4eeed3 | [RLlib] POC: Separate losses for APPO/IMPALA. Enable TFPolicy to handle multiple optimizers/losses (like TorchPolicy). (#18669) | 3 years ago
Sven Mika | 3f89f35e52 | [RLlib] Better error messages and hints; + failure-mode tests; (#18466) | 3 years ago
Sven Mika | 8a066474d4 | [RLlib] No Preprocessors; preparatory PR #1 (#18367) | 3 years ago
Sven Mika | 1520c3d147 | [RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to `Trainer.add_policy()` (#18428) | 3 years ago
Sven Mika | 56f142cac1 | [RLlib] Add support for evaluation_num_episodes=auto (run eval for as long as the parallel train step takes). (#18380) | 3 years ago
Sven Mika | ba58f5edb1 | [RLlib] Strictly run `evaluation_num_episodes` episodes each evaluation run (no matter the other eval config settings). (#18335) | 3 years ago
Sven Mika | a772c775cd | [RLlib] Set random seed (if provided) to Trainer process as well. (#18307) | 3 years ago
Sven Mika | 82465f9342 | [RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254) | 3 years ago
Sven Mika | 599e589481 | [RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065) | 3 years ago
Sven Mika | 4888d7c9af | [RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999) | 3 years ago
Sven Mika | 9883505e84 | [RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) | 3 years ago
Sven Mika | 8248ba531b | [RLlib] Redo #17410: Example script: Remote worker envs with inference done on main node. (#17960) | 3 years ago
Alex Wu | 318ba6fae0 | Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)" (#17951) | 3 years ago
Sven Mika | 8fc16b9a18 | [RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410) | 3 years ago
Sven Mika | f18213712f | [RLlib] Redo: "fix self play example scripts" PR (17566) (#17895) | 3 years ago
Amog Kamsetty | 77f28f1c30 | Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566)" (#17709) | 3 years ago
Sven Mika | 3b447265d8 | [RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566) | 3 years ago
Sven Mika | 5107d16ae5 | [RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) | 3 years ago