Commit History

Author SHA1 Message Commit Date
  Sven Mika d5bfb7b7da [RLlib] Preparatory PR for multi-agent multi-GPU learner (alpha-star style) #03 (#21652) 2 years ago
  Sven Mika 3ac4daba07 [RLlib] Discussion 4351: Conv2d default filter tests and add default setting for 96x96 image obs space. (#21560) 2 years ago
  Avnish Narayan f7a5fc36eb [rllib] Give rnnsac_stateless cartpole gpu, increase timeout (#21407) 2 years ago
  Sven Mika 9e6b871739 [RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330) 2 years ago
  Sven Mika abd3bef63b [RLlib] QMIX better defaults + added to CI learning tests (#21332) 2 years ago
  Sven Mika daa4304a91 [RLlib] Switch off preprocessors by default for PGTrainer. (#21008) 2 years ago
  Sven Mika 596c8e2772 [RLlib] Experimental no-flatten option for actions/prev-actions. (#20918) 2 years ago
  Eric Liang 6f93ea437e Remove the flaky test tag (#21006) 2 years ago
  Avnish Narayan 6996eaa986 [RLlib] Add necessary fields to Base Envs, and BaseEnv wrapper classes (#20832) 2 years ago
  Ishant Mrinal 2868d1a2cf [RLlib] Support for RE3 exploration algorithm (for tf) (#19551) 2 years ago
  Sven Mika 60b2219d72 [RLlib] Allow for evaluation to run by `timesteps` (alternative to `episodes`) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757) 2 years ago
  Jun Gong 65bd8e29f8 [RLlib] Update a few things to get rid of the `remote_vector_env` deprecation warning. (#20753) 2 years ago
  mvindiola1 8cee0c03bf [RLlib] Update `max_seq_len` in pad_batch_to_sequences_of_same_size (#20743) 2 years ago
  Sven Mika 7a585fb275 [RLlib; Documentation] RLlib README overhaul. (#20249) 2 years ago
  Sven Mika 56619b955e [RLlib; Documentation] Some docstring cleanups; Rename RemoteVectorEnv into RemoteBaseEnv for clarity. (#20250) 2 years ago
  Avnish Narayan dc17f0a241 Add error messages for missing tf and torch imports (#20205) 2 years ago
  Sven Mika f82880eda1 Revert "Revert [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) (#20399)" (#20417) 2 years ago
  Amog Kamsetty 90dc5460d4 Revert "[RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061)" (#20399) 2 years ago
  Sven Mika 5b1c8e46e1 [RLlib] POC: Deprecate `build_policy` (policy template) for torch only; PPOTorchPolicy (#20061) 2 years ago
  Sven Mika ebd56b57db [RLlib; documentation] "RLlib in 60sec" overhaul. (#20215) 2 years ago
  Sven Mika 143d23a278 [RLlib] Issue 20062: Action inference examples missing (#20144) 2 years ago
  Sven Mika 76f8a9f125 [RLlib; testing] Increase size of two time-out'ing test cases from medium to large. (#20128) 3 years ago
  Sven Mika a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) 3 years ago
  Sven Mika 4cb23d1c95 [Tune; Testing] Revert to 3.7 (undone by accident by previous PR); + some minor comment cleanups. (#20031) 3 years ago
  gjoliver 2c1fa459d4 [RLlib] Add an RLlib Tune experiment to UserTest suite. (#19807) 3 years ago
  Avnish Narayan 026bf01071 [RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535) 3 years ago
  Sven Mika e6ae08f416 [RLlib] Optionally don't drop last ts in v-trace calculations (APPO and IMPALA). (#19601) 3 years ago
  Sven Mika 2d24ef0d32 [RLlib] Add all simple learning tests as `framework=tf2`. (#19273) 3 years ago
  Sven Mika 4d945fe651 [RLlib] Issue 19878: Re-instate bare_metal_policy example script (#19881) 3 years ago
  Rohan138 b9c9cc5946 [RLlib] Updated PettingZoo+RLlib tutorial; Removed pettingzoo example script (#19069) 3 years ago