Sven Mika
|
9e6b871739
[RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330)
|
2 years ago |
Sven Mika
|
596c8e2772
[RLlib] Experimental no-flatten option for actions/prev-actions. (#20918)
|
2 years ago |
Sven Mika
|
839fc59224
[RLlib] CQL TensorFlow support (#15841)
|
3 years ago |
Sven Mika
|
cecfc3b43b
[RLlib] Multi-GPU support for Torch algorithms. (#14709)
|
3 years ago |
Sven Mika
|
b267f1f1ba
[RLlib] Add support for Int-Box action spaces. (#15012)
|
3 years ago |
Sven Mika
|
bcaff63909
[RLlib] SquashedGaussians should throw error when entropy or kl are called. (#13126)
|
3 years ago |
Michael Luo
|
59ccbc0fc7
[RLlib] Model Annotations: Tensorflow (#11964)
|
4 years ago |
Sven Mika
|
291c172d83
[RLlib] Support Simplex action spaces for SAC (torch and tf). (#11909)
|
4 years ago |
Sven Mika
|
ce96b03b07
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
4 years ago |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 years ago |
Eric Liang
|
590943a499
[rllib] Type annotations for model classes (#9646)
|
4 years ago |
Sven Mika
|
935d8308fb
[RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497)
|
4 years ago |
Sven Mika
|
5b2a97597b
[RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211)
|
4 years ago |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 years ago |
Sven Mika
|
4fd8977eaf
[RLlib] Minor cleanup in preparation to tf2.x support. (#9130)
|
4 years ago |
Victor Le
|
aee01133cd
Fix dict/tuple hybrid action space for tensorflow eager execution (#8781)
|
4 years ago |
Sven Mika
|
6d196197bc
[RLlib] utils/spaces ... (#8608)
|
4 years ago |
Sven Mika
|
d7eaacb5fe
[RLlib] Issue 8319 DDPG (MA or num_envs_per_worker > 1) broken. (#8324)
|
4 years ago |
Sven Mika
|
eea75ac623
[RLlib] Beta distribution. (#8229)
|
4 years ago |
Sven Mika
|
1775e89f26
[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143)
|
4 years ago |
Sven Mika
|
e9ee5c4e5f
[RLlib] Nested action space PR (minimally invasive; torch only + test). (#8101)
|
4 years ago |
Sven Mika
|
165a86f1ab
[RLlib] SAC MuJoCo instability issues (tf and torch versions). (#8063)
|
4 years ago |
Sven Mika
|
428516056a
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
4 years ago |
Sven Mika
|
e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
|
4 years ago |
Sven Mika
|
1138f2ebed
[RLlib] Issue 7046 cannot restore keras model from h5 file. (#7482)
|
4 years ago |
Sven Mika
|
510c850651
[RLlib] SAC add discrete action support. (#7320)
|
4 years ago |
Sven Mika
|
4198db5038
Torch multicat support (7419)
|
4 years ago |
Sven Mika
|
83e06cd30a
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
|
4 years ago |
Sven Mika
|
0db2046b0a
[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)
|
4 years ago |
Sven Mika
|
d537e9f0d8
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
4 years ago |