Sven Mika a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) 3 年之前
..
tests 9a7fbd3cdf [RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208) 3 年之前
utils 9a7fbd3cdf [RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208) 3 年之前
__init__.py 42991d723f [RLlib] rllib/examples folder restructuring (#8250) 4 年之前
action_mask_env.py ea4a22249c [RLlib] Add simple action-masking example script/env/model (tf and torch). (#18494) 3 年之前
ant_rand_goal.py c95dea51e9 [RLlib] External env enhancements + more examples. (#16583) 3 年之前
cartpole_mass.py c95dea51e9 [RLlib] External env enhancements + more examples. (#16583) 3 年之前
coin_game_non_vectorized_env.py 7588bfd315 [Lint] Add flake8-bugbear (#19053) 3 年之前
coin_game_vectorized_env.py 7588bfd315 [Lint] Add flake8-bugbear (#19053) 3 年之前
correlated_actions_env.py eab9c25856 [RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (autoregressive_action_dist.py) (#17705) 3 年之前
curriculum_capable_env.py 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
d4rl_env.py 4cbe13cdfd [RLlib] CQL loss fn fixes, MuJoCo + Pendulum benchmarks, offline-RL example script w/ json file. (#15603) 3 年之前
debug_counter_env.py d5604eaba3 [RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029) 3 年之前
dm_control_suite.py e74947cc94 [RLlib] Env directory cleanup and tests. (#13082) 3 年之前
env_using_remote_actor.py 2357bbc0c8 [RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249) 3 年之前
env_with_subprocess.py 5dc4b6686e [RLlib] Implement DQN PyTorch distributional head. (#9589) 4 年之前
fast_image_env.py 42991d723f [RLlib] rllib/examples folder restructuring (#8250) 4 年之前
gpu_requiring_env.py 41968512ca [RLlib] Partial GPU examples (for learner and workers). (#15334) 3 年之前
halfcheetah_rand_direc.py c95dea51e9 [RLlib] External env enhancements + more examples. (#16583) 3 年之前
look_and_push.py 796a834c48 [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 4 年之前
matrix_sequential_social_dilemma.py 7588bfd315 [Lint] Add flake8-bugbear (#19053) 3 年之前
mbmpo_env.py 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
mock_env.py 902e854af2 [RLlib; Docs overhaul] Docstring cleanup: Environments. (#19784) 3 年之前
multi_agent.py 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
nested_space_repeat_after_me_env.py a428f10ebe [RLlib] Add multi-GPU learning tests to nightly. (#17778) 3 年之前
parametric_actions_cartpole.py 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
pendulum_mass.py c95dea51e9 [RLlib] External env enhancements + more examples. (#16583) 3 年之前
random_env.py a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) 3 年之前
repeat_after_me_env.py a428f10ebe [RLlib] Add multi-GPU learning tests to nightly. (#17778) 3 年之前
repeat_initial_obs_env.py 42991d723f [RLlib] rllib/examples folder restructuring (#8250) 4 年之前
simple_corridor.py 41968512ca [RLlib] Partial GPU examples (for learner and workers). (#15334) 3 年之前
simple_rpg.py be26a7b1b0 [rllib] Support for complex / variable-length observation spaces (#8393) 4 年之前
stateless_cartpole.py 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) 3 年之前
stateless_pendulum.py 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) 3 年之前
transformed_action_space_env.py 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 年之前
two_step_game.py 10fd7111b3 [rllib] Improve test learning check, fix flaky two step qmix (#16843) 3 年之前
windy_maze_env.py ebc44c3d76 [CI] Upgrade flake8 to 3.9.1 (#15527) 3 年之前