kourosh hakhamaneshi
|
b7717df6f6
[RLlib] Fix connector issue in `Algorithm.compute_single_action()`. (#30555)
|
1 年之前 |
Sven Mika
|
2ed09c5445
[RLlib] Move all config validation logic into AlgorithmConfig classes. (#29854)
|
1 年之前 |
Sven Mika
|
432f023642
[RLlib] Deprecate `AlgorithmConfig.framework("tfe")`: Use `tf2` instead. (#29755)
|
2 年之前 |
Sven Mika
|
130b7eeaba
[RLlib] `Trainer` to `Algorithm` renaming. (#25539)
|
2 年之前 |
Sven Mika
|
b5bc2b93c3
[RLlib] Move all remaining algos into `algorithms` directory. (#25366)
|
2 年之前 |
kourosh hakhamaneshi
|
3815e52a61
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896)
|
2 年之前 |
Balaji Veeramani
|
7f1bacc7dc
[CI] Format Python code with Black (#21975)
|
2 年之前 |
Avnish Narayan
|
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
|
3 年之前 |
Sven Mika
|
18d173b172
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031)
|
3 年之前 |
Sven Mika
|
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
3 年之前 |
Sven Mika
|
8698cf9bc8
[RLlib] Fix param noise test case on CI. (#14926)
|
3 年之前 |
Sven Mika
|
99c81c6795
[RLlib] Attention Net prep PR #3. (#12450)
|
3 年之前 |
Sven Mika
|
ce96b03b07
[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033)
|
4 年之前 |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 年之前 |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 年之前 |
Sven Mika
|
4da0e542d5
[RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 年之前 |
Sven Mika
|
d0fab84e4d
[RLlib] DDPG PyTorch version. (#7953)
|
4 年之前 |
Sven Mika
|
1b31c11806
[RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934)
|
4 年之前 |