Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 年之前 | |
---|---|---|
.. | ||
tests | 12b087acb8 [RLlib] Base env pre-checker. (#21569) | 2 年之前 |
README.md | 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) | 5 年之前 |
__init__.py | 9a83908c46 [rllib] Deprecate policy optimizers (#8345) | 4 年之前 |
mixers.py | 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) | 4 年之前 |
model.py | 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) | 3 年之前 |
qmix.py | d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 年之前 |
qmix_policy.py | f3397b6f48 [RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408) | 3 年之前 |
Code in this package is adapted from https://github.com/oxwhirl/pymarl.