码涯-AIGC代码仓库-openoker/ray: 一个针对强化学习和深度学习所设计的大规模分布式计算框架。 @ ubranch-1

Sven Mika a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981)		3 年之前
..
tests	9a7fbd3cdf [RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)	3 年之前
utils	9a7fbd3cdf [RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208)	3 年之前
__init__.py	42991d723f [RLlib] rllib/examples folder restructuring (#8250)	4 年之前
action_mask_env.py	ea4a22249c [RLlib] Add simple action-masking example script/env/model (tf and torch). (#18494)	3 年之前
ant_rand_goal.py	c95dea51e9 [RLlib] External env enhancements + more examples. (#16583)	3 年之前
cartpole_mass.py	c95dea51e9 [RLlib] External env enhancements + more examples. (#16583)	3 年之前
coin_game_non_vectorized_env.py	7588bfd315 [Lint] Add flake8-bugbear (#19053)	3 年之前
coin_game_vectorized_env.py	7588bfd315 [Lint] Add flake8-bugbear (#19053)	3 年之前
correlated_actions_env.py	eab9c25856 [RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (autoregressive_action_dist.py) (#17705)	3 年之前
curriculum_capable_env.py	026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)	3 年之前
d4rl_env.py	4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603)	3 年之前
debug_counter_env.py	d5604eaba3 [RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029)	3 年之前
dm_control_suite.py	e74947cc94 [RLlib] Env directory cleanup and tests. (#13082)	3 年之前
env_using_remote_actor.py	2357bbc0c8 [RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249)	3 年之前
env_with_subprocess.py	5dc4b6686e [RLlib] Implement DQN PyTorch distributional head. (#9589)	4 年之前
fast_image_env.py	42991d723f [RLlib] rllib/examples folder restructuring (#8250)	4 年之前
gpu_requiring_env.py	41968512ca [RLlib] Partial GPU examples (for learner and workers). (#15334)	3 年之前
halfcheetah_rand_direc.py	c95dea51e9 [RLlib] External env enhancements + more examples. (#16583)	3 年之前
look_and_push.py	796a834c48 [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)	4 年之前
matrix_sequential_social_dilemma.py	7588bfd315 [Lint] Add flake8-bugbear (#19053)	3 年之前
mbmpo_env.py	026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)	3 年之前
mock_env.py	902e854af2 [RLlib; Docs overhaul] Docstring cleanup: Environments. (#19784)	3 年之前
multi_agent.py	026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)	3 年之前
nested_space_repeat_after_me_env.py	a428f10ebe [RLlib] Add multi-GPU learning tests to nightly. (#17778)	3 年之前
parametric_actions_cartpole.py	026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)	3 年之前
pendulum_mass.py	c95dea51e9 [RLlib] External env enhancements + more examples. (#16583)	3 年之前
random_env.py	a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981)	3 年之前
repeat_after_me_env.py	a428f10ebe [RLlib] Add multi-GPU learning tests to nightly. (#17778)	3 年之前
repeat_initial_obs_env.py	42991d723f [RLlib] rllib/examples folder restructuring (#8250)	4 年之前
simple_corridor.py	41968512ca [RLlib] Partial GPU examples (for learner and workers). (#15334)	3 年之前
simple_rpg.py	be26a7b1b0 [rllib] Support for complex / variable-length observation spaces (#8393)	4 年之前
stateless_cartpole.py	8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591)	3 年之前
stateless_pendulum.py	8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591)	3 年之前
transformed_action_space_env.py	026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)	3 年之前
two_step_game.py	10fd7111b3 [rllib] Improve test learning check, fix flaky two step qmix (#16843)	3 年之前
windy_maze_env.py	ebc44c3d76 [CI] Upgrade flake8 to 3.9.1 (#15527)	3 年之前