Sven Mika
|
f3397b6f48
[RLlib] Minor fixes/cleanups; chop_into_sequences now handles nested data. (#19408)
|
3 years ago |
Sven Mika
|
494ddd98c1
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928)
|
3 years ago |
Sven Mika
|
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
3 years ago |
Sven Mika
|
4f66309e19
[RLlib] Redo issue 14533 tf enable eager exec (#14984)
|
3 years ago |
SangBin Cho
|
fa5f961d5e
Revert "[RLlib] Issue 14533: `tf.enable_eager_execution()` must be called at beginning. (#14737)" (#14918)
|
3 years ago |
Sven Mika
|
3e389d5812
[RLlib] Issue 14533: `tf.enable_eager_execution()` must be called at beginning. (#14737)
|
3 years ago |
Sven Mika
|
732197e23a
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
3 years ago |
Sven Mika
|
93c0a5549b
[RLlib] Deprecate `vf_share_layers` in top-level PPO/MAML/MB-MPO configs. (#13397)
|
3 years ago |
Sven Mika
|
b2bcab711d
[RLlib] Attention Nets: tf (#12753)
|
3 years ago |
Sven Mika
|
28ab797cf5
[RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544)
|
4 years ago |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 years ago |
Sven Mika
|
796a834c48
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)
|
4 years ago |
Sven Mika
|
bf25aee392
[RLlib] Deprecate all Model(v1) usage. (#8146)
|
4 years ago |
Eric Liang
|
dd70720578
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
|
4 years ago |
Sven Mika
|
20ef4a8603
[RLlib] Cleanup/unify all test cases. (#7533)
|
4 years ago |
Sven
|
60d4d5e1aa
Remove future imports (#6724)
|
4 years ago |
Eric Liang
|
f7bda0abad
[rllib] Fix rnn shape with multi-dimensional data (#5939)
|
5 years ago |
Richard Liaw
|
0010f54378
Update Cloudpickle (#5643)
|
5 years ago |
Eric Liang
|
5d7afe8092
[rllib] Try moving RLlib to top level dir (#5324)
|
5 years ago |