Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
..
tests d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
__init__.py bec719d823 [RLlib] Trainer sub-class IMPALA (instead of using `build_trainer()`). (#20570) 2 years ago
impala.py d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
vtrace_tf.py ef18893fb5 [RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420) 4 years ago
vtrace_tf_policy.py e6ae08f416 [RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601) 3 years ago
vtrace_torch.py cf21c634a3 [RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982) 3 years ago
vtrace_torch_policy.py e6ae08f416 [RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601) 3 years ago