.. |
tests
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 years ago |
__init__.py
|
bec719d823
[RLlib] Trainer sub-class IMPALA (instead of using `build_trainer()`). (#20570)
|
2 years ago |
impala.py
|
d5bfb7b7da
[RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652)
|
2 years ago |
vtrace_tf.py
|
ef18893fb5
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
|
4 years ago |
vtrace_tf_policy.py
|
e6ae08f416
[RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601)
|
3 years ago |
vtrace_torch.py
|
cf21c634a3
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
3 years ago |
vtrace_torch_policy.py
|
e6ae08f416
[RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601)
|
3 years ago |