Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
..
__init__.py 5af745c90d [RLlib] Implement the SlateQ algorithm (#11450) 4 years ago
slateq.py d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
slateq_torch_policy.py 99ae7bae05 [RLlib] JAXPolicy prep. PR #1. (#13077) 3 years ago