Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 年之前
..
tests 12b087acb8 [RLlib] Base env pre-checker. (#21569) 2 年之前
README.md 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 5 年之前
__init__.py 9a83908c46 [rllib] Deprecate policy optimizers (#8345) 4 年之前
mixers.py 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 年之前
model.py 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 年之前
qmix.py d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 年之前
qmix_policy.py f3397b6f48 [RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408) 3 年之前

README.md

Code in this package is adapted from https://github.com/oxwhirl/pymarl.