Commit History

Author SHA1 Message Date
  Sven Mika e74947cc94 [RLlib] Env directory cleanup and tests. (#13082) 3 years ago
  Sven Mika d49c3fae0b [RLlib] Trajectory View API: Atari framestacking. (#13315) 3 years ago
  Sven Mika 391cdfae8c [RLlib] Trajectory view API docs. (#12718) 3 years ago
  Sven Mika e40b14d255 [RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420) 3 years ago
  Sven Mika 99c81c6795 [RLlib] Attention Net prep PR #3. (#12450) 3 years ago
  Sven Mika 19c8033df2 [RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366) 3 years ago
  Sven Mika 3ad9365e1d [RLlib] Attention Net prep PR #2: Smaller cleanups. (#12449) 3 years ago
  Sven Mika 0df55a139c [RLlib] Attention Net prep PR #1: Smaller cleanups. (#12447) 3 years ago
  Sven Mika dab241dcc6 [RLlib] Fix inconsistency wrt batch size in SampleCollector (traj. view API). Makes DD-PPO work with traj. view API. (#12063) 3 years ago
  Sven Mika 5b788ccb13 [RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717) 4 years ago
  Sven Mika 36bda8432b [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 4 years ago
  Sven Mika 805dad3bc4 [RLlib] SAC algo cleanup. (#10825) 4 years ago
  Sven Mika 4b278c36fc [RLlib] Behavioral Cloning (from MARWIL). (#10619) 4 years ago
  Sven Mika 244aafdcf8 [RLlib] Curiosity enhancements. (#10373) 4 years ago
  Sven Mika 715ee8dfc9 [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) 4 years ago
  Sven Mika e968b52cb7 [RLlib] Trajectory view API - 03 Fast LSTM + prev actions/rewards (#9950) 4 years ago
  Sven Mika 2256047876 [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) 4 years ago
  Barak Michener 8e76796fd0 ci: Redo `format.sh --all` script & backfill lint fixes (#9956) 4 years ago
  Miguel Morales 372114b4ed Update sampler.py (#9805) 4 years ago
  Sven Mika b0b0463161 [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 4 years ago
  Sven Mika 03ab86567f [RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269) 4 years ago
  Sven Mika 5b2a97597b [RLlib] Retire `try_import_tree` (should be installed along with other requirements). (#9211) 4 years ago
  Eric Liang 1e0e1a45e6 [rllib] Add type annotations for evaluation/, env/ packages (#9003) 4 years ago
  Sven Mika 8d1ccfd0f7 [RLlib] Issue 8889: action clipping bug ppo not learning mujoco (#8898) 4 years ago
  Sven Mika 368088be85 [RLlib] Sample batch docs and cleanup. (#8778) 4 years ago
  Sven Mika d8a081a185 [RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590) 4 years ago
  Sven Mika 6d196197bc [RLlib] utils/spaces ... (#8608) 4 years ago
  Sven Mika d76578700d [RLlib] `Policy.compute_single_action()` broken for nested actions (Issue 8411). (#8514) 4 years ago
  Eric Liang f48da50e1c [rllib] observation function api for multi-agent (#8236) 4 years ago
  Sven Mika 1775e89f26 [RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143) 4 years ago