Commit History

Author SHA1 Message Date
  Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
  Sven Mika 90c6b10498 [RLlib] Decentralized multi-agent learning; PR #01 (#21421) 2 years ago
  Sven Mika 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 years ago
  Sven Mika daa4304a91 [RLlib] Switch off preprocessors by default for PGTrainer. (#21008) 2 years ago
  Amog Kamsetty 611bfc1352 [ML] Move `find_free_port` to `ml_utils` (#20828) 2 years ago
  Avnish Narayan 74dd0e4085 [RLlib] Make `to_base_env()` a method of all RLlib-supported Env classes (#20811) 2 years ago
  Avnish Narayan 3ddc09544d [rllib] Env to base env refactor (#20785) 2 years ago
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
  Sven Mika ea2bea7e30 [RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808) 3 years ago
  Sven Mika 9c73871da0 [RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783) 3 years ago
  Sven Mika 902e854af2 [RLlib; Docs overhaul] Docstring cleanup: Environments. (#19784) 3 years ago
  Sven Mika c3e3fc7637 [RLlib] Issue 18280: A3C/IMPALA multi-agent not working. (#19100) 3 years ago
  Sven Mika 61a1274619 [RLlib] No Preprocessors (part 2). (#18468) 3 years ago
  Sven Mika fd13bac9b3 [RLlib] Add `worker` arg (optional) to `policy_mapping_fn`. (#18184) 3 years ago
  Sven Mika 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) 3 years ago
  Sven Mika 3f89f35e52 [RLlib] Better error messages and hints; + failure-mode tests; (#18466) 3 years ago
  Sven Mika 8a066474d4 [RLlib] No Preprocessors; preparatory PR #1 (#18367) 3 years ago
  Sven Mika 1520c3d147 [RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to `Trainer.add_policy()` (#18428) 3 years ago
  Sven Mika a772c775cd [RLlib] Set random seed (if provided) to Trainer process as well. (#18307) 3 years ago
  Sven Mika 9a8ca6a69d [RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306) 3 years ago
  gjoliver 336e79956a [RLlib] Make MultiAgentEnv inherit gym.Env to avoid direct class type manipulation (#18156) 3 years ago
  Sven Mika 2357bbc0c8 [RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249) 3 years ago
  gjoliver 6621bb5611 [RLlib] Minor renaming and cleanups related to last rollout worker seed fix. (#18155) 3 years ago
  gjoliver a8813675f4 [RLlib] Issue 17900: Set `seed` in single vectorized sub-envs properly, if `num_envs_per_worker > 1` (#18110) 3 years ago
  Sven Mika f18213712f [RLlib] Redo: "fix self play example scripts" PR (17566) (#17895) 3 years ago
  Sven Mika 2bd2ee7a73 [RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485) 3 years ago
  akern40 0cb2c602db [rllib] Fixes typo in RolloutWorker.__init__ (#17583) 3 years ago
  Amog Kamsetty 77f28f1c30 Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566)" (#17709) 3 years ago
  Sven Mika 3b447265d8 [RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566) 3 years ago
  Kai Fricke 5d56a8aac5 [RLlib] Fix ignoring "sample_collector" config key (#17460) 3 years ago