Commit History

Author SHA1 Message Date
  Sven Mika 2d24ef0d32 [RLlib] Add all simple learning tests as `framework=tf2`. (#19273) 3 years ago
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
  gjoliver d81885c1f1 [RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests (#19809) 3 years ago
  Sven Mika b213565783 [RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693) 3 years ago
  gjoliver c3c42278e4 [RLlib] clean up all the SampleBatch['is_training'] deprecation warnings (#19652) 3 years ago
  Sven Mika 8a066474d4 [RLlib] No Preprocessors; preparatory PR #1 (#18367) 3 years ago
  Sven Mika e3e6ed7aaa [RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358) 3 years ago
  Sven Mika a428f10ebe [RLlib] Add multi-GPU learning tests to nightly. (#17778) 3 years ago
  Sven Mika 5a313ba3d6 [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 3 years ago
  Michael Luo 474f04e322 [RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707) 3 years ago
  Sven Mika 839fc59224 [RLlib] CQL TensorFlow support (#15841) 3 years ago
  Antoine Galataud ce1c001b1d [RLlib] DQN: Place LearningRateSchedule mixin at the right moment (#15558) 3 years ago
  Sven Mika 04bc0a9828 [RLlib] Remove all non-trajectory view API code. (#14860) 3 years ago
  Sven Mika 69202c6a7d [RLlib] Obsolete usage tracking dict via sample batch. (#13065) 3 years ago
  Sven Mika 732197e23a [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 3 years ago
  Sven Mika 8000258333 [RLlib] R2D2 Implementation. (#13933) 3 years ago
  Kai Fricke d9e5d5f47a [RLlib] Cast fcnet_hiddens to list for DQN models (list vs tuple mismatch error) (#14308) 3 years ago
  Sven Mika 2e3655e8a9 [RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238) 3 years ago
  desktable 4ccfd07a61 [RLlib] Add docstrings for agents/dqn (#10710) 4 years ago
  desktable 799318d7d7 [RLlib] Add type annotations for agents/dqn (#10626) 4 years ago
  Sven Mika 8a891b3c30 [RLlib] SAC n_step > 1. (#10567) 4 years ago
  Barak Michener 8e76796fd0 ci: Redo `format.sh --all` script & backfill lint fixes (#9956) 4 years ago
  Sven Mika 5dc4b6686e [RLlib] Implement DQN PyTorch distributional head. (#9589) 4 years ago
  Sven Mika 935d8308fb [RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497) 4 years ago
  Sven Mika fcdf410ae1 [RLlib] Tf2.x native. (#8752) 4 years ago
  Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136) 4 years ago
  Sven Mika 4fd8977eaf [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 4 years ago
  Sven Mika 428516056a [RLlib] SAC Torch (incl. Atari learning) (#7984) 4 years ago
  Sven Mika 22ccc43670 [RLlib] DQN torch version. (#7597) 4 years ago