Sven Mika
|
f82880eda1
Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417)
|
2 年之前 |
Amog Kamsetty
|
90dc5460d4
Revert "[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399)
|
2 年之前 |
Sven Mika
|
5b1c8e46e1
[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)
|
2 年之前 |
Sven Mika
|
82465f9342
[RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254)
|
3 年之前 |
Sven Mika
|
8248ba531b
[RLlib] Redo #17410: Example script: Remote worker envs with inference done on main node. (#17960)
|
3 年之前 |
Alex Wu
|
318ba6fae0
Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)" (#17951)
|
3 年之前 |
Sven Mika
|
8fc16b9a18
[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)
|
3 年之前 |
Stefan Schneider
|
489febc6b2
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038)
|
3 年之前 |
Sven Mika
|
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
3 年之前 |
Sven Mika
|
d2c755ccef
[RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832)
|
3 年之前 |
Sven Mika
|
e973b726c2
[RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273)
|
3 年之前 |
Sven Mika
|
e961d2f4b2
[RLlib] Improve example scripts for attention nets, CartPole LSTM, and custom RNN-models. (#15329)
|
3 年之前 |
Sven Mika
|
e98808ce11
[RLlib] Fix 2 flakey test cases. (#14892)
|
3 年之前 |
Sven Mika
|
78c64ca151
[RLlib] Attention net example script: Clarifications on how to use with Trainer.compute_action. (#14864)
|
3 年之前 |
Sven Mika
|
9eba1871bb
[RLlib] Support easy `use_attention=True` flag for using the GTrXL model. (#11698)
|
3 年之前 |
Sven Mika
|
d5604eaba3
[RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029)
|
3 年之前 |
Sven Mika
|
b2bcab711d
[RLlib] Attention Nets: tf (#12753)
|
3 年之前 |
Sven Mika
|
c17169dc11
[RLlib] Fix all example scripts to run on GPUs. (#11105)
|
4 年之前 |
Sven Mika
|
43043ee4d5
[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)
|
4 年之前 |
Sven Mika
|
7008902cff
[RLlib] Minor `rllib.utils` cleanup. (#8932)
|
4 年之前 |
Sven Mika
|
d8a081a185
[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590)
|
4 年之前 |
Sven Mika
|
2746fc0476
[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520)
|
4 年之前 |
Sven Mika
|
0422e9c5a8
[RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624)
|
4 年之前 |
Sven Mika
|
796a834c48
[RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371)
|
4 年之前 |