Commit History

Author SHA1 Message Date
  Sven Mika 9e6b871739 [RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330) 2 years ago
  Sven Mika cf21c634a3 [RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982) 3 years ago
  Sven Mika 9eba1871bb [RLlib] Support easy `use_attention=True` flag for using the GTrXL model. (#11698) 3 years ago
  Sven Mika 391cdfae8c [RLlib] Trajectory view API docs. (#12718) 3 years ago
  Sven Mika c524f86785 [RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064) 3 years ago
  Sven Mika 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 years ago
  Sven Mika 3ad9365e1d [RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449) 3 years ago
  Sven Mika 592c161032 [RLlib] Issue 12118: LSTM prev-a/r should be separately configurable. Fix missing prev-a one-hot encoding. (#12397) 3 years ago
  Sven Mika 62c7ab5182 [RLlib] Trajectory view API: Enable by default for PPO, IMPALA, PG, A3C (tf and torch). (#11747) 4 years ago
  Michael Luo b2984d1c34 [RLlib] Model Annotations to Torch Models (#9749) 4 years ago
  Jiajie Xiao 0b07af374a allow tuple action space (#11429) 4 years ago
  Sven Mika 36bda8432b [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 4 years ago
  Sven Mika e968b52cb7 [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950) 4 years ago
  Sven Mika 2cbe29a7fa [RLlib] Curiosity minor fixes, do-overs, and testing. (#10143) 4 years ago
  Sven Mika 57690a3a9f [RLlib] Trajectory view API - 02 actual API scaffold (#9753) 4 years ago
  Sven Mika 5c6d5d4ab1 This PR fixes the currently broken lstm_use_prev_action_reward flag for default lstm models (model.use_lstm=True). (#8970) 4 years ago
  Sven Mika 0ba7472da9 [Testing] Fix LINT/sphinx errors. (#8874) 4 years ago
  Sven Mika c74dc58f8b [RLlib] Fix `use_lstm` flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. (#8734) 4 years ago
  Sven Mika 796a834c48 [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 4 years ago