Sven Mika
|
4bc257f4fb
[RLlib] Fix custom multi action distr (#13681)
|
3 年之前 |
Sven Mika
|
a5318961de
[RLlib] Preprocessor fixes (multi-discrete) and tests. (#13083)
|
3 年之前 |
Sven Mika
|
3f4bc16276
[RLlib] Add a minimal JAX ModelV2 (FCNet) to RLlib. (#12502)
|
3 年之前 |
Sven Mika
|
1ebcdf236f
[RLlib] Add support for custom MultiActionDistributions. (#11311)
|
4 年之前 |
Barak Michener
|
8e76796fd0
ci: Redo `format.sh --all` script & backfill lint fixes (#9956)
|
4 年之前 |
Sven Mika
|
fcdf410ae1
[RLlib] Tf2.x native. (#8752)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
0422e9c5a8
[RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624)
|
4 年之前 |
Sven Mika
|
796a834c48
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)
|
4 年之前 |
Sven Mika
|
bf25aee392
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
4 年之前 |
Sven Mika
|
428516056a
[RLlib] SAC Torch (incl. Atari learning) (#7984)
|
4 年之前 |
Sven Mika
|
20ef4a8603
[RLlib] Cleanup/unify all test cases. (#7533)
|
4 年之前 |
Sven Mika
|
6e1c3ea824
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
4 年之前 |
Robert Nishihara
|
39a3459886
Remove (object) from class declarations. (#6658)
|
4 年之前 |
Eric Liang
|
97ccd75952
[rllib] Enable object store memory limit by default (#5534)
|
5 年之前 |
Eric Liang
|
cc86271cf8
[hotfix] fix Travis action dist test (#5428)
|
5 年之前 |
Matthew A. Wright
|
e3c9f7e83a
Custom action distributions (#5164)
|
5 年之前 |
Eric Liang
|
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324)
|
5 年之前 |