Sven Mika | d5bfb7b7da | [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 years ago
Sven Mika | c4636c7c05 | [RLlib] Issue 21633: SimpleQ should not use a prio. replay buffer. (#21665) | 2 years ago
Jun Gong | 7517aefe05 | [RLlib] Bring back BC and Marwil learning tests. (#21574) | 2 years ago
Sven Mika | 90c6b10498 | [RLlib] Decentralized multi-agent learning; PR #01 (#21421) | 2 years ago
Sven Mika | 188324c5c7 | [RLlib] Issue 21552: `unsquash_action` and `clip_action` (when None) cause wrong actions computed by `Trainer.compute_single_action`. (#21553) | 2 years ago
Matti Picus | ec6a33b736 | [tune] fixes to allow tune/tests/test_commands.py to run on windows (#21342) | 2 years ago
Sven Mika | f94bd99ce4 | [RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448) | 2 years ago
Sven Mika | 92f030331e | [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) | 2 years ago
Sven Mika | 853d10871c | [RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376) | 2 years ago
Sven Mika | 9e6b871739 | [RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330) | 2 years ago
Sven Mika | 62dbf26394 | [RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984) | 2 years ago
Jun Gong | 767f78eaf8 | [RLlib] Always attach latest eval metrics. (#21011) | 2 years ago
Sven Mika | db058d0fb3 | [RLlib] Rename `metrics_smoothing_episodes` into `metrics_num_episodes_for_smoothing` for clarity. (#20983) | 2 years ago
Sven Mika | 596c8e2772 | [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) | 2 years ago
Sven Mika | f814c2af89 | [RLlib; Docs] Docs API reference pages: `rllib/execution`, `rllib/evaluation`, `rllib/models`, `rllib/offline`. (#20538) | 2 years ago
Sven Mika | b4790900f5 | [RLlib] Sub-class `Trainer` (instead of `build_trainer()`): All remaining classes; soft-deprecate `build_trainer`. (#20725) | 2 years ago
Sven Mika | 60b2219d72 | [RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757) | 2 years ago
Sven Mika | 9e38f6f613 | [RLlib] Trainer sub-class DDPG/TD3/APEX-DDPG (instead of `build_trainer`). (#20636) | 2 years ago
Sven Mika | 3d2e27485b | [RLlib] Trainer sub-class DQN/SimpleQ/APEX-DQN/R2D2 (instead of using `build_trainer`). (#20633) | 2 years ago
Sven Mika | c07d8c4c22 | [RLlib] Trainer sub-class A2C/A3C (instead of `build_trainer`). (#20635) | 2 years ago
Sven Mika | 49cd7ea6f9 | [RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571) | 2 years ago
Sven Mika | 9d2fe5756c | [RLlib] Trainer sub-class for APPO (instead of using `build_trainer()`). (#20424) | 2 years ago
Artur Niederfahrenhorst | d07e50e957 | [RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552) | 2 years ago
Sven Mika | 56619b955e | [RLlib; Documentation] Some docstring cleanups; Rename RemoteVectorEnv into RemoteBaseEnv for clarity. (#20250) | 2 years ago
Avnish Narayan | dc17f0a241 | Add error messages for missing tf and torch imports (#20205) | 2 years ago
Kai Fricke | 3e6ba5d6d2 | Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285) | 2 years ago
Kai Fricke | 246787cdd9 | Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284) | 2 years ago
Sven Mika | 6f85af435f | [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055) | 2 years ago
Sven Mika | ebd56b57db | [RLlib; documentation] "RLlib in 60sec" overhaul. (#20215) | 2 years ago
Kai Fricke | 9c2b8c8501 | [tune] Deprecate DurableTrainable (#19880) | 2 years ago