Commit History

Author SHA1 Message Date
  Sven Mika 0f62ccc6ee [RLlib] Clean up some deprecation messages (they shouldn't be there) and make others `error=True` (from `error=False`) (#38555) 1 year ago
  Avnish Narayan 49342d58ec [RLlib] RLlib deprecation Notices Part 2 (models/tf, models/torch, base_mode, catalog, modelv2, models/temp_spec_classes, policy/) (#36840) 1 year ago
  Sven Mika 8e680c483c [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369) 1 year ago
  Sven Mika 756321145d [RLlib] Add metrics to IMPALA/APPO/PPO (prototype) to measure off-policy'ness for performed updates. (#29983) 1 year ago
  Peyton Murray d6b6dc560d [AIR] Address multiple warnings in AIR examples (#28800) 2 years ago
  Sven Mika 199dc8cff0 Revert "Revert "[RLlib] @deprecate(error=True|False) escalation."" (#28807) 2 years ago
  Amog Kamsetty e501654925 Revert "[RLlib] @deprecate(error=True|False) escalation. (#28733)" (#28795) 2 years ago
  Sven Mika c4348c1889 [RLlib] @deprecate(error=True|False) escalation. (#28733) 2 years ago
  Sven Mika acda25d184 [RLlib] Fix gym.spaces.Box receiving TensorDims from RLlib as shape (instead or list of ints). (#28334) 2 years ago
  Artur Niederfahrenhorst e10876604d [RLlib] Include SampleBatch.T column in all collected batches. (#25926) 2 years ago
  Sven Mika 130b7eeaba [RLlib] `Trainer` to `Algorithm` renaming. (#25539) 2 years ago
  Eric Liang 905258dbc1 Clean up docstyle in python modules and add LINT rule (#25272) 2 years ago
  Eric Liang 4963dfaae0 [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) 2 years ago
  Ishant Mrinal 0248c60387 [RLlib] Add additional return values to `action_sampler_fn`. (#22721) 2 years ago
  Sven Mika b1cda46681 [RLlib] SlateQ (tf GPU + multi-GPU) + Bandit fixes (#23276) 2 years ago
  Siyuan (Ryans) Zhuang 0c74ecad12 [Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128) 2 years ago
  Balaji Veeramani 7f1bacc7dc [CI] Format Python code with Black (#21975) 2 years ago
  Sven Mika 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 years ago
  Ishant Mrinal 2868d1a2cf [RLlib] Support for RE3 exploration algorithm (for tf) (#19551) 2 years ago
  Sven Mika 6ff4061f3a [RLlib] Issue 20269: Offline RL example not working due to new_obs not being written to file. (#20366) 2 years ago
  Sven Mika a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) 3 years ago
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
  Sven Mika f2cb2ed203 [RLlib; Docs overhaul] Docstring cleanup: Policies, policy_templates. (#19759) 3 years ago
  Sven Mika b213565783 [RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693) 3 years ago
  Sven Mika 61a1274619 [RLlib] No Preprocessors (part 2). (#18468) 3 years ago
  Sven Mika 698b4eeed3 [RLlib] POC: Separate losses for APPO/IMPALA. Enable TFPolicy to handle multiple optimizers/losses (like TorchPolicy). (#18669) 3 years ago
  Sven Mika 9883505e84 [RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) 3 years ago
  Sven Mika 494ddd98c1 [RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928) 3 years ago
  Sven Mika a428f10ebe [RLlib] Add multi-GPU learning tests to nightly. (#17778) 3 years ago
  Sven Mika 924f11cd45 [RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371) 3 years ago