Sven Mika | 0f62ccc6ee | [RLlib] Clean up some deprecation messages (they shouldn't be there) and make others `error=True` (from `error=False`) (#38555) | 1 year ago
Avnish Narayan | 49342d58ec | [RLlib] RLlib deprecation Notices Part 2 (models/tf, models/torch, base_mode, catalog, modelv2, models/temp_spec_classes, policy/) (#36840) | 1 year ago
Sven Mika | 8e680c483c | [RLlib] gymnasium support (new `Env.reset()/step()/seed()/render()` APIs). (#28369) | 1 year ago
Sven Mika | 756321145d | [RLlib] Add metrics to IMPALA/APPO/PPO (prototype) to measure off-policy'ness for performed updates. (#29983) | 1 year ago
Peyton Murray | d6b6dc560d | [AIR] Address multiple warnings in AIR examples (#28800) | 2 years ago
Sven Mika | 199dc8cff0 | Revert "Revert "[RLlib] @deprecate(error=True|False) escalation."" (#28807) | 2 years ago
Amog Kamsetty | e501654925 | Revert "[RLlib] @deprecate(error=True|False) escalation. (#28733)" (#28795) | 2 years ago
Sven Mika | c4348c1889 | [RLlib] @deprecate(error=True|False) escalation. (#28733) | 2 years ago
Sven Mika | acda25d184 | [RLlib] Fix gym.spaces.Box receiving TensorDims from RLlib as shape (instead or list of ints). (#28334) | 2 years ago
Artur Niederfahrenhorst | e10876604d | [RLlib] Include SampleBatch.T column in all collected batches. (#25926) | 2 years ago
Sven Mika | 130b7eeaba | [RLlib] `Trainer` to `Algorithm` renaming. (#25539) | 2 years ago
Eric Liang | 905258dbc1 | Clean up docstyle in python modules and add LINT rule (#25272) | 2 years ago
Eric Liang | 4963dfaae0 | [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) | 2 years ago
Ishant Mrinal | 0248c60387 | [RLlib] Add additional return values to `action_sampler_fn`. (#22721) | 2 years ago
Sven Mika | b1cda46681 | [RLlib] SlateQ (tf GPU + multi-GPU) + Bandit fixes (#23276) | 2 years ago
Siyuan (Ryans) Zhuang | 0c74ecad12 | [Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128) | 2 years ago
Balaji Veeramani | 7f1bacc7dc | [CI] Format Python code with Black (#21975) | 2 years ago
Sven Mika | 596c8e2772 | [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) | 2 years ago
Ishant Mrinal | 2868d1a2cf | [RLlib] Support for RE3 exploration algorithm (for tf) (#19551) | 2 years ago
Sven Mika | 6ff4061f3a | [RLlib] Issue 20269: Offline RL example not working due to new_obs not being written to file. (#20366) | 2 years ago
Sven Mika | a931076f59 | [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) | 3 years ago
Sven Mika | 0b308719f8 | [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) | 3 years ago
Sven Mika | f2cb2ed203 | [RLlib; Docs overhaul] Docstring cleanup: Policies, policy_templates. (#19759) | 3 years ago
Sven Mika | b213565783 | [RLlib] Fix failing test cases: Soft-deprecate ModelV2.from_batch (in favor of ModelV2.__call__). (#19693) | 3 years ago
Sven Mika | 61a1274619 | [RLlib] No Preprocessors (part 2). (#18468) | 3 years ago
Sven Mika | 698b4eeed3 | [RLlib] POC: Separate losses for APPO/IMPALA. Enable TFPolicy to handle multiple optimizers/losses (like TorchPolicy). (#18669) | 3 years ago
Sven Mika | 9883505e84 | [RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) | 3 years ago
Sven Mika | 494ddd98c1 | [RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928) | 3 years ago
Sven Mika | a428f10ebe | [RLlib] Add multi-GPU learning tests to nightly. (#17778) | 3 years ago
Sven Mika | 924f11cd45 | [RLlib] Torch algos use now-framework-agnostic MultiGPUTrainOneStep execution op (~33% speedup for PPO-torch + GPU). (#17371) | 3 years ago