Sven Mika
|
f82880eda1
Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417)
|
2 年之前 |
Amog Kamsetty
|
90dc5460d4
Revert "[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399)
|
2 年之前 |
Sven Mika
|
5b1c8e46e1
[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)
|
2 年之前 |
Sven Mika
|
9c73871da0
[RLlib; Docs overhaul] Docstring cleanup: Evaluation (#19783)
|
3 年之前 |
Sven Mika
|
e3e6ed7aaa
[RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358)
|
3 年之前 |
Sven Mika
|
e973b726c2
[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273)
|
3 年之前 |
Sven Mika
|
bb8a286cbc
[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684)
|
3 年之前 |
Sven Mika
|
04bc0a9828
[RLlib] Remove all non-trajectory view API code. (#14860)
|
3 年之前 |
Sven Mika
|
69202c6a7d
[RLlib] Obsolete usage tracking dict via sample batch. (#13065)
|
3 年之前 |
Sven Mika
|
2e3655e8a9
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
3 年之前 |
Sven Mika
|
b2bcab711d
[RLlib] Attention Nets: tf (#12753)
|
3 年之前 |
Sven Mika
|
ce96b03b07
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
4 年之前 |
Sven Mika
|
47eb6613b5
[RLlib] Remove unnecessary copies in `compute_advantages`. (#10897)
|
4 年之前 |
Sven Mika
|
d14b501692
[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115)
|
4 年之前 |
Sven Mika
|
b0b0463161
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
4 年之前 |
Eric Liang
|
1e0e1a45e6
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
4 年之前 |
roireshef
|
3c60caa448
[rllib] implemented compute_advantages without gae (#6941)
|
4 年之前 |
Sven Mika
|
c957ed58ed
[RLlib] Implement PPO torch version. (#6826)
|
4 年之前 |
Sven
|
60d4d5e1aa
Remove future imports (#6724)
|
4 年之前 |
Robert Nishihara
|
39a3459886
Remove (object) from class declarations. (#6658)
|
4 年之前 |
Neil Lugovoy
|
1376f1ae60
[tune] Reporter crash fix (#5426)
|
5 年之前 |
Eric Liang
|
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324)
|
5 年之前 |