Commit History

Author SHA1 Message Date
  Sven Mika 9e6b871739 [RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330) 2 years ago
  Sven Mika 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 years ago
  Sven Mika 839fc59224 [RLlib] CQL TensorFlow support (#15841) 3 years ago
  Sven Mika cecfc3b43b [RLlib] Multi-GPU support for Torch algorithms. (#14709) 3 years ago
  Sven Mika b267f1f1ba [RLlib] Add support for Int-Box action spaces. (#15012) 3 years ago
  Sven Mika bcaff63909 [RLlib] SquashedGaussians should throw error when entropy or kl are called. (#13126) 3 years ago
  Michael Luo 59ccbc0fc7 [RLlib] Model Annotations: Tensorflow (#11964) 4 years ago
  Sven Mika 291c172d83 [RLlib] Support Simplex action spaces for SAC (torch and tf). (#11909) 4 years ago
  Sven Mika ce96b03b07 [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 4 years ago
  Sven Mika 2256047876 [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) 4 years ago
  Eric Liang 590943a499 [rllib] Type annotations for model classes (#9646) 4 years ago
  Sven Mika 935d8308fb [RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497) 4 years ago
  Sven Mika 5b2a97597b [RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211) 4 years ago
  Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136) 4 years ago
  Sven Mika 4fd8977eaf [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 4 years ago
  Victor Le aee01133cd Fix dict/tuple hybrid action space for tensorflow eager execution (#8781) 4 years ago
  Sven Mika 6d196197bc [RLlib] utils/spaces ... (#8608) 4 years ago
  Sven Mika d7eaacb5fe [RLlib] Issue 8319 DDPG (MA or num_envs_per_worker > 1) broken. (#8324) 4 years ago
  Sven Mika eea75ac623 [RLlib] Beta distribution. (#8229) 4 years ago
  Sven Mika 1775e89f26 [RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143) 4 years ago
  Sven Mika e9ee5c4e5f [RLlib] Nested action space PR (minimally invasive; torch only + test). (#8101) 4 years ago
  Sven Mika 165a86f1ab [RLlib] SAC MuJoCo instability issues (tf and torch versions). (#8063) 4 years ago
  Sven Mika 428516056a [RLlib] SAC Torch (incl. Atari learning) (#7984) 4 years ago
  Sven Mika e153e3179f [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 4 years ago
  Sven Mika 1138f2ebed [RLlib] Issue 7046 cannot restore keras model from h5 file. (#7482) 4 years ago
  Sven Mika 510c850651 [RLlib] SAC add discrete action support. (#7320) 4 years ago
  Sven Mika 4198db5038 Torch multicat support (7419) 4 years ago
  Sven Mika 83e06cd30a [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) 4 years ago
  Sven Mika 0db2046b0a [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) 4 years ago
  Sven Mika d537e9f0d8 [RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155) 4 years ago