Sven Mika
|
90c6b10498
[RLlib] Decentralized multi-agent learning; PR #01 (#21421)
|
2 年之前 |
Sven Mika
|
f94bd99ce4
[RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448)
|
2 年之前 |
Sven Mika
|
853d10871c
[RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376)
|
2 年之前 |
Sven Mika
|
62dbf26394
[RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984)
|
2 年之前 |
Sven Mika
|
ed85f59194
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
3 年之前 |
Sven Mika
|
05a55a9335
[RLlib] Issue 18668: Unity3D env client/server example not working (fix + add to test cases). (#18942)
|
3 年之前 |
Chris Bamford
|
58a73821fb
[RLlib] IMPALA sample throughput calculation and full queue slowdown fixes (#17822)
|
3 年之前 |
Sven Mika
|
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
3 年之前 |
Sven Mika
|
c3a15ecc0f
[RLlib] Issue #13802: Enhance metrics for `multiagent->count_steps_by=agent_steps` setting. (#14033)
|
3 年之前 |
Sven Mika
|
e40b14d255
[RLlib] Batch-size for truncate_episode batch_mode should be confgurable in agent-steps (rather than env-steps), if needed. (#12420)
|
3 年之前 |
Eric Liang
|
daa03ba6e6
[rllib] Add execution module to package ref (#10941)
|
4 年之前 |
Sven Mika
|
805dad3bc4
[RLlib] SAC algo cleanup. (#10825)
|
4 年之前 |
Sven Mika
|
2256047876
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
4 年之前 |
Eric Liang
|
1e0e1a45e6
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
4 年之前 |
Eric Liang
|
9a83908c46
[rllib] Deprecate policy optimizers (#8345)
|
4 年之前 |
Eric Liang
|
9f04a65922
[rllib] Add PPO+DQN two trainer multiagent workflow example (#8334)
|
4 年之前 |
Eric Liang
|
ee0eb44a32
Rename async_queue_depth -> num_async (#8207)
|
4 年之前 |
Eric Liang
|
baadbdf8d4
[rllib] Execute PPO using training workflow (#8206)
|
4 年之前 |
Eric Liang
|
31b40b00f6
[rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958)
|
4 年之前 |