Sven Mika
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 years ago |
Sven Mika
|
90c6b10498
[RLlib] Decentralized multi-agent learning; PR #01 (#21421)
|
2 years ago |
Sven Mika
|
92f030331e
[RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420)
|
2 years ago |
Sven Mika
|
daa4304a91
[RLlib] Switch off preprocessors by default for PGTrainer. (#21008)
|
2 years ago |
Amog Kamsetty
|
611bfc1352
[ML] Move `find_free_port` to `ml_utils` (#20828)
|
2 years ago |
Avnish Narayan
|
74dd0e4085
[RLlib] Make `to_base_env()` a method of all RLlib-supported Env classes (#20811)
|
2 years ago |
Avnish Narayan
|
3ddc09544d
[rllib] Env to base env refactor (#20785)
|
2 years ago |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 years ago |
Sven Mika
|
ea2bea7e30
[RLlib; Docs overhaul] Docstring cleanup: Offline. (#19808)
|
3 years ago |
Sven Mika
|
9c73871da0
[RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783)
|
3 years ago |
Sven Mika
|
902e854af2
[RLlib; Docs overhaul] Docstring cleanup: Environments. (#19784)
|
3 years ago |
Sven Mika
|
c3e3fc7637
[RLlib] Issue 18280: A3C/IMPALA multi-agent not working. (#19100)
|
3 years ago |
Sven Mika
|
61a1274619
[RLlib] No Preprocessors (part 2). (#18468)
|
3 years ago |
Sven Mika
|
fd13bac9b3
[RLlib] Add `worker` arg (optional) to `policy_mapping_fn`. (#18184)
|
3 years ago |
Sven Mika
|
8a72824c63
[RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591)
|
3 years ago |
Sven Mika
|
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; (#18466)
|
3 years ago |
Sven Mika
|
8a066474d4
[RLlib] No Preprocessors; preparatory PR #1 (#18367)
|
3 years ago |
Sven Mika
|
1520c3d147
[RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to `Trainer.add_policy()` (#18428)
|
3 years ago |
Sven Mika
|
a772c775cd
[RLlib] Set random seed (if provided) to Trainer process as well. (#18307)
|
3 years ago |
Sven Mika
|
9a8ca6a69d
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306)
|
3 years ago |
gjoliver
|
336e79956a
[RLlib] Make MultiAgentEnv inherit gym.Env to avoid direct class type manipulation (#18156)
|
3 years ago |
Sven Mika
|
2357bbc0c8
[RLlib] Issue 18231: Better (earlier) env validation and error message improvement. (#18249)
|
3 years ago |
gjoliver
|
6621bb5611
[RLlib] Minor renaming and cleanups related to last rollout worker seed fix. (#18155)
|
3 years ago |
gjoliver
|
a8813675f4
[RLlib] Issue 17900: Set `seed` in single vectorized sub-envs properly, if `num_envs_per_worker > 1` (#18110)
|
3 years ago |
Sven Mika
|
f18213712f
[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895)
|
3 years ago |
Sven Mika
|
2bd2ee7a73
[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485)
|
3 years ago |
akern40
|
0cb2c602db
[rllib] Fixes typo in RolloutWorker.__init__ (#17583)
|
3 years ago |
Amog Kamsetty
|
77f28f1c30
Revert "[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566)" (#17709)
|
3 years ago |
Sven Mika
|
3b447265d8
[RLlib] Fix `Trainer.add_policy` for num_workers>0 (self play example scripts). (#17566)
|
3 years ago |
Kai Fricke
|
5d56a8aac5
[RLlib] Fix ignoring "sample_collector" config key (#17460)
|
3 years ago |