Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 years ago | |
---|---|---|
.. | ||
__init__.py | 5af745c90d [RLlib] Implement the SlateQ algorithm (#11450) | 4 years ago |
slateq.py | d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 years ago |
slateq_torch_policy.py | 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077) | 3 years ago |