Commit History

Author SHA1 Message Date
  Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
  Sven Mika 853d10871c [RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376) 2 years ago
  Sven Mika c07d8c4c22 [RLlib] Trainer sub-class A2C/A3C (instead of `build_trainer`). (#20635) 2 years ago
  gjoliver 99a0088233 [RLlib] Unify the way we create local replay buffer for all agents (#19627) 3 years ago
  Sven Mika 9883505e84 [RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) 3 years ago
  Sven Mika 5a313ba3d6 [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 3 years ago
  Michael Luo 474f04e322 [RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707) 3 years ago
  Sven Mika 732197e23a [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 3 years ago
  Sven Mika e40b14d255 [RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420) 3 years ago
  Sven Mika 19d785b947 [LINT] Except RLlib from checking for flake8 error F821 (#9946) 4 years ago
  Eric Liang 9a83908c46 [rllib] Deprecate policy optimizers (#8345) 4 years ago
  Eric Liang 31b40b00f6 [rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958) 4 years ago
  Eric Liang dd70720578 [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 4 years ago
  Eric Liang f5d12a958b [rllib] Port Ape-X to distributed execution API (#7497) 4 years ago
  Eric Liang 0f88444686 [rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338) 4 years ago
  Sven 60d4d5e1aa Remove future imports (#6724) 4 years ago
  Eric Liang 243b1b7281 [rllib] Add microbatch optimizer with A2C example (#6161) 5 years ago
  Eric Liang 5d7afe8092 [rllib] Try moving RLlib to top level dir (#5324) 5 years ago