Jun Gong
|
7517aefe05
[RLlib] Bring back BC and Marwil learning tests. (#21574)
|
2 年之前 |
Sven Mika
|
188324c5c7
[RLlib] Issue 21552: `unsquash_action` and `clip_action` (when None) cause wrong actions computed by `Trainer.compute_single_action`. (#21553)
|
2 年之前 |
Sven Mika
|
f94bd99ce4
[RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448)
|
2 年之前 |
Sven Mika
|
92f030331e
[RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420)
|
2 年之前 |
Sven Mika
|
62dbf26394
[RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984)
|
2 年之前 |
gjoliver
|
e7f9e8ceec
[RLlib] Report total_train_steps correctly for offline agents like CQL. (#20541)
|
2 年之前 |
gjoliver
|
724a140795
[rllib] Make sure json can serialize result dict (#20439)
|
2 年之前 |
gjoliver
|
6e787f70e0
[Rllib/release] Disable throughput check (#20387)
|
2 年之前 |
Kai Fricke
|
3e6ba5d6d2
Revert "Revert [RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`." (#20285)
|
2 年之前 |
Kai Fricke
|
246787cdd9
Revert "[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)" (#20284)
|
2 年之前 |
Sven Mika
|
6f85af435f
[RLlib] POC: `PGTrainer` class that works by sub-classing, not `trainer_template.py`. (#20055)
|
2 年之前 |
gjoliver
|
d8a61f801f
[RLlib] Create a set of performance benchmark tests to run nightly. (#19945)
|
2 年之前 |
Sven Mika
|
a931076f59
[RLlib] Tf2 + eager-tracing same speed as framework=tf; Add more test coverage for tf2+tracing. (#19981)
|
3 年之前 |
Sven Mika
|
2d24ef0d32
[RLlib] Add all simple learning tests as `framework=tf2`. (#19273)
|
3 年之前 |
Sven Mika
|
0b308719f8
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
3 年之前 |
Carlo Grisetti
|
5cee8a1985
[release tests] Switch from yaml.load to yaml.safe_load (#19365)
|
3 年之前 |
Sven Mika
|
d439fd7f17
[RLlib] TF2/eager memory leak fixes. (#19198)
|
3 年之前 |
Sven Mika
|
c3e3fc7637
[RLlib] Issue 18280: A3C/IMPALA multi-agent not working. (#19100)
|
3 年之前 |
Sven Mika
|
73f5c4039b
[RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035)
|
3 年之前 |
Sven Mika
|
16ad46a654
[RLlib] Fix broken test_r2d2.py. (#19017)
|
3 年之前 |
Sven Mika
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 年之前 |
Sven Mika
|
828f5d26b7
[RLlib] Custom view requirements (e.g. for prev-n-obs) work with `compute_single_action` and `compute_actions_from_input_dict`. (#18921)
|
3 年之前 |
Sven Mika
|
e6aae61487
[RLlib; testing] Fix bug in stress tests not handling >1 trials per experiment (due to grid-search in IMPALA stress tests). (#18705)
|
3 年之前 |
Sven Mika
|
ba1c489b79
[RLlib Testing] Lower `--smoke-test` "time_total_s" to make sure it doesn't time out. (#18670)
|
3 年之前 |
Sven Mika
|
8a72824c63
[RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591)
|
3 年之前 |
Sven Mika
|
45f60e51a9
[RLlib] DDPPO fixes and benchmarks. (#18390)
|
3 年之前 |
Sven Mika
|
cabaa3b3c6
[RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. (#18381)
|
3 年之前 |
Sven Mika
|
5292b70fc6
[RLlib] Add multi-GPU attention net tests to nightly test suite (+ R2D2 tests for LSTM and attention nets). (#18368)
|
3 年之前 |
Sven Mika
|
e3e6ed7aaa
[RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358)
|
3 年之前 |
Sven Mika
|
9a8ca6a69d
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306)
|
3 年之前 |