Sven Mika 8c28fe265a [RLlib] Cleanup examples folder vol. 23: Add example script for custom metrics on `EnvRunners` (using `MetricsLogger` API). (#47969) 2 天之前
..
_docs cf7a09daa2 [RLlib] Provide more constants for common result dict keys, e.g. `EPISODE_RETURN_MEAN`. (#45330) 5 月之前
_old_api_stack a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
actions c9fa046438 [RLlib] Discontinue support for "hybrid" API stack (using RLModule + Learner, but still on RolloutWorker and Policy) (#46085) 3 周之前
algorithms cbde03cf8c [RLlib] New API stack: (Multi)RLModule overhaul vol 02 (VPG RLModule, Algo, and Learner example classes). (#47885) 2 周之前
catalogs ed5b3821b2 [RLlib] Add "shuffle batch per epoch" option. (#47458) 1 月之前
checkpoints a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
connectors a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
curiosity a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
curriculum a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
debugging ed5b3821b2 [RLlib] Add "shuffle batch per epoch" option. (#47458) 1 月之前
envs 8c28fe265a [RLlib] Cleanup examples folder vol. 23: Add example script for custom metrics on `EnvRunners` (using `MetricsLogger` API). (#47969) 2 天之前
evaluation 63233ecfb3 [RLlib; new API stack by default] Switch on new API stack by default for SAC and DQN. (#47217) 3 周之前
fault_tolerance e75f5e7aa9 [RLlib] Add restart-failed-env option to new api stack. (#47608) 1 月之前
gpus ed5b3821b2 [RLlib] Add "shuffle batch per epoch" option. (#47458) 1 月之前
hierarchical c84bf37cb2 [RLlib] Cleanup examples folder #13. Fix main examples docs page for RLlib. (#45382) 4 月之前
inference a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
learners 8c28fe265a [RLlib] Cleanup examples folder vol. 23: Add example script for custom metrics on `EnvRunners` (using `MetricsLogger` API). (#47969) 2 天之前
metrics 8c28fe265a [RLlib] Cleanup examples folder vol. 23: Add example script for custom metrics on `EnvRunners` (using `MetricsLogger` API). (#47969) 2 天之前
multi_agent a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
offline_rl e182e192c1 [RLlib] New API stack: (Multi)RLModule overhaul vol 03 (Introduce generic `_forward` to further simplify the user experience). (#47889) 2 周之前
ray_serve 2ca00e0596 [RLlib] AlgorithmConfig cleanup 03: Cleaner names and structuring of API-stack config settings. (#44920) 5 月之前
ray_tune 710f557308 [RLlib] Cleanup, rename, clarify: Algorithm.workers/evaluation_workers, local_worker(), etc.. (#46726) 3 月之前
rl_modules a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
__init__.py 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 5 年之前
autoregressive_action_dist.py c8aa7f1f6b [RLlib] Update autoregressive actions example. (#47829) 3 周之前
centralized_critic.py c84bf37cb2 [RLlib] Cleanup examples folder #13. Fix main examples docs page for RLlib. (#45382) 4 月之前
centralized_critic_2.py c84bf37cb2 [RLlib] Cleanup examples folder #13. Fix main examples docs page for RLlib. (#45382) 4 月之前
compute_adapted_gae_on_postprocess_trajectory.py c84bf37cb2 [RLlib] Cleanup examples folder #13. Fix main examples docs page for RLlib. (#45382) 4 月之前
custom_recurrent_rnn_tokenizer.py 616eef8f8a [RLlib] New API stack: (Multi)RLModule overhaul vol 05 (deprecate Specs, SpecDict, TensorSpec). (#47915) 1 周之前
quadx_waypoints.py a62d4bfd3a [RLlib] New API stack: (Multi)RLModule overhaul vol 04 (deprecate RLModuleConfig; cleanups, DefaultModelConfig dataclass). (#47908) 1 周之前
replay_buffer_api.py c84bf37cb2 [RLlib] Cleanup examples folder #13. Fix main examples docs page for RLlib. (#45382) 4 月之前