Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 年之前 | |
---|---|---|
.. | ||
__init__.py | 5af745c90d [RLlib] Implement the SlateQ algorithm (#11450) | 4 年之前 |
slateq.py | d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 年之前 |
slateq_torch_policy.py | 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077) | 3 年之前 |