Commit History

Author SHA1 Message Date
  Jun Gong 7517aefe05 [RLlib] Bring back BC and Marwil learning tests. (#21574) 2 years ago
  Sven Mika 188324c5c7 [RLlib] Issue 21552: `unsquash_action` and `clip_action` (when None) cause wrong actions computed by `Trainer.compute_single_action`. (#21553) 2 years ago
  Sven Mika f94bd99ce4 [RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448) 2 years ago
  Sven Mika 92f030331e [RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420) 2 years ago
  Sven Mika 62dbf26394 [RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984) 2 years ago
  gjoliver e7f9e8ceec [RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541) 2 years ago
  gjoliver 724a140795 [rllib] Make sure json can serialize result dict (#20439) 2 years ago
  gjoliver 6e787f70e0 [Rllib/release] Disable throughput check (#20387) 2 years ago
  Kai Fricke 3e6ba5d6d2 Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285) 2 years ago
  Kai Fricke 246787cdd9 Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284) 2 years ago
  Sven Mika 6f85af435f [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055) 2 years ago
  gjoliver d8a61f801f [RLlib] Create a set of performance benchmark tests to run nightly. (#19945) 2 years ago
  Sven Mika a931076f59 [RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981) 3 years ago
  Sven Mika 2d24ef0d32 [RLlib] Add all simple learning tests as `framework=tf2`. (#19273) 3 years ago
  Sven Mika 0b308719f8 [RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829) 3 years ago
  Carlo Grisetti 5cee8a1985 [release tests] Switch from yaml.load to yaml.safe_load (#19365) 3 years ago
  Sven Mika d439fd7f17 [RLlib] TF2/eager memory leak fixes. (#19198) 3 years ago
  Sven Mika c3e3fc7637 [RLlib] Issue 18280: A3C/IMPALA multi-agent not working. (#19100) 3 years ago
  Sven Mika 73f5c4039b [RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035) 3 years ago
  Sven Mika 16ad46a654 [RLlib] Fix broken test_r2d2.py. (#19017) 3 years ago
  Sven Mika ed85f59194 [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 3 years ago
  Sven Mika 828f5d26b7 [RLlib] Custom view requirements (e.g. for prev-n-obs) work with `compute_single_action` and `compute_actions_from_input_dict`. (#18921) 3 years ago
  Sven Mika e6aae61487 [RLlib; testing] Fix bug in stress tests not handling >1 trials per experiment (due to grid-search in IMPALA stress tests). (#18705) 3 years ago
  Sven Mika ba1c489b79 [RLlib Testing] Lower `--smoke-test` "time_total_s" to make sure it doesn't time out. (#18670) 3 years ago
  Sven Mika 8a72824c63 [RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591) 3 years ago
  Sven Mika 45f60e51a9 [RLlib] DDPPO fixes and benchmarks. (#18390) 3 years ago
  Sven Mika cabaa3b3c6 [RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. (#18381) 3 years ago
  Sven Mika 5292b70fc6 [RLlib] Add multi-GPU attention net tests to nightly test suite (+ R2D2 tests for LSTM and attention nets). (#18368) 3 years ago
  Sven Mika e3e6ed7aaa [RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358) 3 years ago
  Sven Mika 9a8ca6a69d [RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306) 3 years ago