Commit History

Author SHA1 Message Commit Date
  Sven Mika f82880eda1 Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417) 2 years ago
  Amog Kamsetty 90dc5460d4 Revert "[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399) 2 years ago
  Sven Mika 5b1c8e46e1 [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) 2 years ago
  Sven Mika 82465f9342 [RLlib] Better PolicyServer example (w/ or w/o tune) and add printing out actual listen port address in log-level=INFO. (#18254) 3 years ago
  Sven Mika 8248ba531b [RLlib] Redo #17410: Example script: Remote worker envs with inference done on main node. (#17960) 3 years ago
  Alex Wu 318ba6fae0 Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)" (#17951) 3 years ago
  Sven Mika 8fc16b9a18 [RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410) 3 years ago
  Stefan Schneider 489febc6b2 [RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038) 3 years ago
  Sven Mika 53206dd440 [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 3 years ago
  Sven Mika d2c755ccef [RLlib] Examples scripts add argparse help and replace `--torch` with `--framework`. (#15832) 3 years ago
  Sven Mika e973b726c2 [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) 3 years ago
  Sven Mika e961d2f4b2 [RLlib] Improve example scripts for attention nets, CartPole LSTM, and custom RNN-models. (#15329) 3 years ago
  Sven Mika e98808ce11 [RLlib] Fix 2 flakey test cases. (#14892) 3 years ago
  Sven Mika 78c64ca151 [RLlib] Attention net example script: Clarifications on how to use with Trainer.compute_action. (#14864) 3 years ago
  Sven Mika 9eba1871bb [RLlib] Support easy `use_attention=True` flag for using the GTrXL model. (#11698) 3 years ago
  Sven Mika d5604eaba3 [RLlib] Attention nets PyTorch support and cleanup (using traj. view API). (#12029) 3 years ago
  Sven Mika b2bcab711d [RLlib] Attention Nets: tf (#12753) 3 years ago
  Sven Mika c17169dc11 [RLlib] Fix all example scripts to run on GPUs. (#11105) 4 years ago
  Sven Mika 43043ee4d5 [RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136) 4 years ago
  Sven Mika 7008902cff [RLlib] Minor `rllib.utils` cleanup. (#8932) 4 years ago
  Sven Mika d8a081a185 [RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590) 4 years ago
  Sven Mika 2746fc0476 [RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520) 4 years ago
  Sven Mika 0422e9c5a8 [RLlib] Add 2 Transformer learning test cases on StatelessCartPole (PPO and IMPALA). (#8624) 4 years ago
  Sven Mika 796a834c48 [RLlib] Attention Net integration into ModelV2 and learning RL example. (#8371) 4 years ago