Sven Mika | d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) | 2 years ago
Sven Mika | 853d10871c [RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376) | 2 years ago
Sven Mika | c07d8c4c22 [RLlib] Trainer sub-class A2C/A3C (instead of `build_trainer`). (#20635) | 2 years ago
gjoliver | 99a0088233 [RLlib] Unify the way we create local replay buffer for all agents (#19627) | 3 years ago
Sven Mika | 9883505e84 [RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) | 3 years ago
Sven Mika | 5a313ba3d6 [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) | 3 years ago
Michael Luo | 474f04e322 [RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707) | 3 years ago
Sven Mika | 732197e23a [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) | 3 years ago
Sven Mika | e40b14d255 [RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420) | 3 years ago
Sven Mika | 19d785b947 [LINT] Except RLlib from checking for flake8 error F821 (#9946) | 4 years ago
Eric Liang | 9a83908c46 [rllib] Deprecate policy optimizers (#8345) | 4 years ago
Eric Liang | 31b40b00f6 [rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958) | 4 years ago
Eric Liang | dd70720578 [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) | 4 years ago
Eric Liang | f5d12a958b [rllib] Port Ape-X to distributed execution API (#7497) | 4 years ago
Eric Liang | 0f88444686 [rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338) | 4 years ago
Sven | 60d4d5e1aa Remove future imports (#6724) | 4 years ago
Eric Liang | 243b1b7281 [rllib] Add microbatch optimizer with A2C example (#6161) | 5 years ago
Eric Liang | 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) | 5 years ago