码涯-AIGC代码仓库-openoker/ray: 一个针对强化学习和深度学习所设计的大规模分布式计算框架。 @ data-dashboard

Sven Mika 1fbb143950 [RLlib] Issue 39453: PettingZoo wrappers should use correct multi-agent dict action- and observation spaces. (#39459)		1 年之前
..
bandit	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
catalog	ab131bb8c2 [RLlib] Early improvements to Catalogs and RL Modules docs + Catalogs improvements (#37245)	1 年之前
connectors	2ffd7e49bd [rllib] Fix storage-path related tests (#38947)	1 年之前
documentation	2ffd7e49bd [rllib] Fix storage-path related tests (#38947)	1 年之前
env	586f1b5139 [RLlib] Fix MB-MPO bug. (#39654)	1 年之前
export	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
inference_and_serving	2ffd7e49bd [rllib] Fix storage-path related tests (#38947)	1 年之前
learner	2913e9b971 [train] Legacy interface cleanup (`air.Checkpoint`, `LegacyExperimentAnalysis`) (#39289)	1 年之前
models	ab131bb8c2 [RLlib] Early improvements to Catalogs and RL Modules docs + Catalogs improvements (#37245)	1 年之前
multi_agent_and_self_play	c17a44cdfa Revert "Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni…" (#22153)	2 年之前
policy	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
rl_module	1c29b98c71 [RLlib] Fix issues with action masking examples. (#38095)	1 年之前
serving	2ffd7e49bd [rllib] Fix storage-path related tests (#38947)	1 年之前
simulators	0c69020432 Revert "Simplify logging configuration. (#30863)" (#31858)	1 年之前
tune	054e00f8b8 [AIR] Remove head node syncing as the default storage option (#37142)	1 年之前
__init__.py	5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324)	5 年之前
action_masking.py	1c29b98c71 [RLlib] Fix issues with action masking examples. (#38095)	1 年之前
attention_net.py	9a2d5f443e [RLlib][RLModule][build_base] Disabling RLModule API for LSTM / Attn (#33662)	1 年之前
attention_net_supervised.py	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
autoregressive_action_dist.py	2df2428cdf [RLlib] Put notices and error out on invalid ModelV2/Policy related configs for RL Modules (#37526)	1 年之前
bare_metal_policy_with_custom_view_reqs.py	2ed09c5445 [RLlib] Move all config validation logic into AlgorithmConfig classes. (#29854)	1 年之前
batch_norm_model.py	2df2428cdf [RLlib] Put notices and error out on invalid ModelV2/Policy related configs for RL Modules (#37526)	1 年之前
cartpole_lstm.py	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
centralized_critic.py	17596b03d1 [RLlib][RLModule][build_base] Disabled Centralized Critic and Curiosity tests with RLModules (#33663)	1 年之前
centralized_critic_2.py	17596b03d1 [RLlib][RLModule][build_base] Disabled Centralized Critic and Curiosity tests with RLModules (#33663)	1 年之前
checkpoint_by_custom_criteria.py	a3ec4a936e [RLlib] Enable `eager_tracing=True` by default. (#36556)	1 年之前
coin_game_env.py	223b39611e [RLlib] Deprecate/cleanup: AlgorithmConfig["multiagent"] access and usage in tests and examples. (#35879)	1 年之前
complex_struct_space.py	5af66e66cc [RLlib] AlgorithmConfigs: Broad rollout; Example scripts. (#29700)	2 年之前
compute_adapted_gae_on_postprocess_trajectory.py	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
curriculum_learning.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
custom_env.py	2df2428cdf [RLlib] Put notices and error out on invalid ModelV2/Policy related configs for RL Modules (#37526)	1 年之前
custom_eval.py	d127273ec7 [RLlib] Fix: Recovered eval worker should use eval-config's policy_mapping_fn and policy_to_train fn, not the main train workers' ones. (#33648)	1 年之前
custom_experiment.py	2ffd7e49bd [rllib] Fix storage-path related tests (#38947)	1 年之前
custom_fast_model.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
custom_input_api.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
custom_keras_model.py	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
custom_logger.py	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
custom_metrics_and_callbacks.py	7c07659224 Update episode_v2.py with last_info_for (#37382)	1 年之前
custom_model_api.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
custom_model_loss_and_metrics.py	2df2428cdf [RLlib] Put notices and error out on invalid ModelV2/Policy related configs for RL Modules (#37526)	1 年之前
custom_observation_filters.py	5af66e66cc [RLlib] AlgorithmConfigs: Broad rollout; Example scripts. (#29700)	2 年之前
custom_recurrent_rnn_tokenizer.py	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
custom_rnn_model.py	9a2d5f443e [RLlib][RLModule][build_base] Disabling RLModule API for LSTM / Attn (#33662)	1 年之前
custom_tf_policy.py	2ed09c5445 [RLlib] Move all config validation logic into AlgorithmConfig classes. (#29854)	1 年之前
custom_torch_policy.py	2ed09c5445 [RLlib] Move all config validation logic into AlgorithmConfig classes. (#29854)	1 年之前
custom_train_fn.py	2ffd7e49bd [rllib] Fix storage-path related tests (#38947)	1 年之前
custom_vector_env.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
deterministic_training.py	5b5d83cef9 [RLlib] Fix rest of PPO RL Modules tests (#35672)	1 年之前
dmlab_watermaze.py	7f1bacc7dc [CI] Format Python code with Black (#21975)	2 年之前
eager_execution.py	a3ec4a936e [RLlib] Enable `eager_tracing=True` by default. (#36556)	1 年之前
env_rendering_and_recording.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
fractional_gpus.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
hierarchical_training.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
iterated_prisoners_dilemma_env.py	223b39611e [RLlib] Deprecate/cleanup: AlgorithmConfig["multiagent"] access and usage in tests and examples. (#35879)	1 年之前
lstm_auto_wrapping.py	72fefc3a40 [RLlib] AlgorithmConfig: Replace more of the old-style config dicts across codebase. (#29799)	2 年之前
mobilenet_v2_with_lstm.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
multi-agent-leela-chess-zero.py	56b79117af [RLlib] Contribution of LeelaChessZero algorithm for playing chess in a MultiAgent env. (#31480)	1 年之前
multi_agent_cartpole.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
multi_agent_custom_policy.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
multi_agent_different_spaces_for_agents.py	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
multi_agent_independent_learning.py	12ff13dda1 [RLlib] Fix waterworld example and test (#32117)	1 年之前
multi_agent_parameter_sharing.py	223b39611e [RLlib] Deprecate/cleanup: AlgorithmConfig["multiagent"] access and usage in tests and examples. (#35879)	1 年之前
multi_agent_two_trainers.py	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
nested_action_spaces.py	89ac80d883 [RLlib] Issue 39421: MultiDiscrete action spaces not supported on new stack. (#39534)	1 年之前
offline_rl.py	794cfd9725 [RLlib] `AlgorithmConfig.overrides()` to replace `multiagent->policies->config` and `evaluation_config` dicts. (#30879)	1 年之前
parallel_evaluation_and_training.py	a3ec4a936e [RLlib] Enable `eager_tracing=True` by default. (#36556)	1 年之前
parametric_actions_cartpole.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
parametric_actions_cartpole_embeddings_learnt_by_model.py	05eea3844a [RLlib] Fix `env_check` for parametric actions (with action mask). (#34790)	1 年之前
partial_gpus.py	0c74ecad12 [Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128)	2 年之前
preprocessing_disabled.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
random_parametric_agent.py	2ed09c5445 [RLlib] Move all config validation logic into AlgorithmConfig classes. (#29854)	1 年之前
re3_exploration.py	e715a8b761 [RLlib] AlgorithmConfig: Replace more occurrences of old config dicts; Make all Algorithms use the non-dict lookup for config properties. (#30096)	1 年之前
recommender_system_with_recsim_and_slateq.py	32c73a319d [RLlib] Issue 39031: SlateQ example script bug. (#39550)	1 年之前
remote_base_env_with_custom_api.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
remote_envs_with_inference_done_on_main_node.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
replay_buffer_api.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
restore_1_of_n_agents_from_checkpoint.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
rnnsac_stateless_cartpole.py	c477124a09 [air] fix one rllib example that still uses `log_dir` (#39067)	1 年之前
rock_paper_scissors_multiagent.py	1fbb143950 [RLlib] Issue 39453: PettingZoo wrappers should use correct multi-agent dict action- and observation spaces. (#39459)	1 年之前
rollout_worker_custom_workflow.py	9ed2fa250a [tune] Deprecate `tune.report`, `tune.checkpoint_dir`, `checkpoint_dir`, and `reporter` (#39093)	1 年之前
saving_experiences.py	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
sb2rllib_rllib_example.py	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
sb2rllib_sb_example.py	8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369)	1 年之前
self_play_league_based_with_open_spiel.py	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
self_play_with_open_spiel.py	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
sumo_env_local.py	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
trajectory_view_api.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
trajectory_view_api_rlm.py	960032a15f [RLlib][RLModules] RNNs and RLModules (#32723)	1 年之前
two_step_game.py	8d2dc9a399 [RLlib] Change default framework from tf to torch (#33604)	1 年之前
two_trainer_workflow.py	ba04d01ee0 [RLlib][RLModule] Disabled RLModule in Two trainer workflow example (#33727)	1 年之前
unity3d_env_local.py	827ab91741 [RLlib] Replace remaining mentions of "trainer" by "algorithm". (#36557)	1 年之前
vizdoom_with_attention_net.py	a3ec4a936e [RLlib] Enable `eager_tracing=True` by default. (#36556)	1 年之前